About Bitdeer:
Bitdeer is a world-leading technology company for Bitcoin mining and AI cloud.
Bitdeer is committed to providing comprehensive Bitcoin mining solutions for its customers. Apart from designing industry-leading ASIC chips and manufacturing mining rigs, the Group handles complex processes involved in computing across the value chain. This includes equipment procurement, transport logistics, datacenter design and construction, equipment management, and network and facility operations. Bitdeer also offers advanced cloud capabilities to customers with a high demand for artificial intelligence.
Headquartered in Singapore, Bitdeer operates globally with a diversified 3 GW energy portfolio and deploys Bitcoin mining and HPC datacenters in the United States, Bhutan, Norway, Canada, Malaysia, and Ethiopia.
About the Role
Architecting, deploying, and maintaining the infrastructure required for large-scale AI compute environments in a high-density AI data center.
What You Will Be Responsible For
- Deploy and manage large-scale GPU clusters using orchestration platforms such as Kubernetes or Slurm
- Optimize high-speed, low-latency networking (e.g., InfiniBand, RoCE v2) for distributed compute
- Work closely with Project teams and operations to plan and monitor rack density across AI infrastructure
- Implement and maintain high-throughput storage systems (e.g., Lustre, BeeGFS, WekaIO) supporting GPU-intensive workloads
- Automate infrastructure provisioning and configuration using Infrastructure as Code (IaC) tools such as Terraform or Ansible
- Troubleshoot and optimize performance across compute, networking, and storage infrastructure in a mission-critical environment
- This role ensures stable, scalable, and high-performance infrastructure supporting next-generation AI compute operation.
How You Will Stand Out
- Degree in Computer Science, Data Engineering, or related technical field (Master’s preferred)
- Strong experience with Linux administration, containerization, and GPU infrastructure
- Experience with orchestration and workload management platforms such as Kubernetes or Slurm
- Familiarity with the NVIDIA infrastructure stack (CUDA, NCCL, Triton Inference Server)
- Deep understanding of NVIDIA GB300 and VR NVL72 Scalable Units’ functionality
- Experience working in HPC, AI infrastructure, or large-scale distributed compute environments
- Experience with InfiniBand, RoCE v2, or high-performance networking architectures
- Familiarity with AI-focused storage platforms such as Lustre, BeeGFS, or WekaIO
- Experience with Terraform, Ansible, or other IaC frameworks
- NVIDIA, Kubernetes, or cloud infrastructure certifications
Bitdeer is committed to providing equal employment opportunities in accordance with country, state, and local laws. Bitdeer does not discriminate against employees or applicants based on conditions such as race, color, gender identity and/or expression, sexual orientation, marital and/or parental status, religion, political opinion, nationality, ethnic background or social origin, social status, disability, age, indigenous status, and union.