Research Engineering, Inference

Reposted 2 Days Ago
Be an Early Applicant
Singapore, SGP
In-Office
Mid level
Software
The Role
The role focuses on ML systems research for optimizing large-scale inference, extending serving stacks, and improving AI inference infrastructure on GPUs.
Summary Generated by Built In

About Bitdeer:

Bitdeer is a world-leading technology company for Bitcoin mining and AI cloud.
Bitdeer is committed to providing comprehensive Bitcoin mining solutions for its customers. Apart from designing industry-leading ASIC chips and manufacturing mining rigs, the Group handles complex processes involved in computing across the value chain. This includes equipment procurement, transport logistics, datacenter design and construction, equipment management, and network and facility operations. Bitdeer also offers advanced cloud capabilities to customers with a high demand for artificial intelligence.
Headquartered in Singapore, Bitdeer operates globally with a diversified 3 GW energy portfolio, and deploys Bitcoin mining and HPC datacenters in the United States, Bhutan, Norway, Canada, Malaysia, and Ethiopia.

About Bitdeer AI Lab:

Bitdeer AI Lab is a frontier AI lab under Bitdeer, a global-leading computing power solutions provider. Guided by long-termism, we are committed to exploring the frontiers of artificial intelligence with the ambition, courage, and determination to build technologies that can truly change the world.

We believe that transformative breakthroughs in AI require both long-horizon thinking and relentless execution. Our mission is twofold: first, to effectively transform energy into intelligence; second, to push the limits of intelligence by rethinking AI systems and architectures that can learn more efficiently, reason more deeply, and scale more effectively.

Our vision is to create intelligence that learns more like humans do: efficiently, adaptively, and recursively, turning finite parameters and finite compute into unbounded potential. We pursue this work with a deep sense of purpose, believing that the most meaningful advances in AI will not only push the frontier of research, but also reshape the future of the world.

Our lab is equipped with thousands of cutting-edge GPUs dedicated to AI research, and we are committed to continuously investing in and expanding our computational infrastructure to support world-class research and engineering in artificial intelligence.

We are looking for exceptional talent to join us, helping build the inference foundation for frontier AI models.

    What you will be responsible for:

    This role is centered on ML systems research for large-scale inference optimization. You will focus on improving and extending SGLang- and vLLM-based serving stacks, optimizing inference for both open-source LLMs and reasoning models as well as future in-house models developed by Bitdeer AI Lab. Your work will span GPU kernel-level optimization, high-performance inter-node communication, and end-to-end serving system design on real production clusters of hundreds to thousands of GPUs, helping define the next generation of efficient AI inference infrastructure.

      How you will stand out:

      • Strong interest in ML systems research for large-scale inference optimization on hundreds to thousands of GPUs.
      • Hands-on experience with or strong familiarity with SGLang and vLLM, especially for large-scale model serving and inference optimization.
      • Understanding of inference optimization for open-source LLMs and reasoning models, with the ability to establish strong serving baselines for frontier model architectures.
      • Ability to work closely with research teams to develop efficient inference solutions for future in-house opensource models, and to translate new model ideas into practical deployment from an early stage.
      • Strong understanding of key inference metrics such as latency, throughput, stability, and GPU efficiency, and practical experience optimizing them through batching, scheduling, KV cache, memory usage, and runtime execution.
      • Ability to identify, analyze, and resolve bottlenecks in large-scale inference workloads through ML systems research, experimentation, and performance optimization.
      • Familiarity with distributed inference communication patterns and networking fundamentals, including tensor/pipeline parallelism communication, collective operations (e.g., NCCL), and awareness of interconnect topologies (NVLink, InfiniBand) and their impact on serving performance at scale.

      What you will experience working with us:

      • A culture that values authenticity and diversity of thoughts and backgrounds;
      • An inclusive and respectable environment with open workspaces and exciting start-up spirit;
      • Fast-growing company with the chance to network with industrial pioneers and enthusiasts;
      • Ability to contribute directly and make an impact on the future of the digital asset industry;
      • Involvement in new projects, developing processes/systems;
      • Personal accountability, autonomy, fast growth, and learning opportunities;
      • Attractive welfare benefits and developmental opportunities such as training and mentoring.

      Skills Required

      • Hands-on experience with SGLang and vLLM for inference optimization
      • Strong understanding of inference optimization metrics such as latency and GPU efficiency
      • Ability to analyze and resolve bottlenecks in large-scale inference workloads
      • Familiarity with distributed inference communication patterns and networking fundamentals
      Am I A Good Fit?
      beta
      Get Personalized Job Insights.
      Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

      The Company
      214 Employees

      What We Do

      Bitdeer Technologies Group (Nasdaq: BTDR) is a leader in the blockchain and high-performance computing industry. It is one of the world’s largest holders of proprietary hash rate and suppliers of hash rate. Bitdeer is committed to providing comprehensive computing solutions for its customers. The company was founded by Jihan Wu, an early advocate and pioneer in cryptocurrency who cofounded multiple leading companies serving the blockchain economy. Mr. Wu leads the company as Founder, Chairman, and CEO. Linghui Kong serves as Bitdeer’s CBO and provides leadership through deep industry knowledge and technology expertise. Headquartered in Singapore, Bitdeer has deployed mining datacenters in the United States, Norway, and Bhutan. It offers specialized mining infrastructure, high-quality hash rate sharing products, and reliable hosting services to global users. The company also offers advanced cloud capabilities for customers with high demands for artificial intelligence. Dedication, authenticity, and trustworthiness are foundational to our mission of becoming the world’s most reliable provider of full-spectrum blockchain and high-performance computing solutions. We welcome global talent to join us in shaping the future

      Similar Jobs

      Wise Logo Wise

      Complaints Officer

      Fintech • Mobile • Payments • Software • Financial Services
      Hybrid
      Singapore, SGP
      9000 Employees

      Tapestry - Coach and Kate Spade Logo Tapestry - Coach and Kate Spade

      Network Analyst

      eCommerce • Fashion • Retail • Sales • Wearables • Design
      Hybrid
      Singapore, SGP
      16000 Employees

      Airwallex Logo Airwallex

      Technical Program Manager

      Artificial Intelligence • Fintech • Payments • Business Intelligence • Financial Services • Generative AI
      In-Office or Remote
      Singapore, SGP
      2200 Employees

      CSC Logo CSC

      Accountant

      Fintech • Legal Tech • Software • Financial Services • Cybersecurity • Data Privacy
      Hybrid
      Singapore, SGP
      8500 Employees

      Similar Companies Hiring

      Hanover Park Thumbnail
      Artificial Intelligence • Fintech • Software • Financial Services
      New York, New York
      31 Employees
      Kepler  Thumbnail
      Fintech • Software
      New York, New York
      6 Employees
      Onshore Thumbnail
      Artificial Intelligence • Fintech • Software • Financial Services
      New York, New York
      60 Employees

      Sign up now Access later

      Create Free Account

      Please log in or sign up to report this job.

      Create Free Account