Data/AI Engineer Intern

Posted 26 Days Ago
Be an Early Applicant
Singapore, SGP
In-Office
Internship
Gaming • Software • Metaverse
The Role
The role involves developing and optimizing AI computing infrastructure, focusing on distributed training, compute scheduling, and end-to-end model engineering for gaming applications.
Summary Generated by Built In
About the Hiring TeamTencent Overseas IT has the mission to empower Tencent’s rapid global growth with future ready, global IT platforms, applications and services. We are chartered to lead the Overseas IT strategy, architecture, roadmap and execution. Satisfying our internal/external customers and becoming a world class global IT team are our top aspirations.What the Role Entails

Centered on the deployment needs of Tencent's overseas gaming business in large language model (LLM) and reinforcement learning scenarios, this role is responsible for the development, performance optimization, and engineering implementation of high-quality AI computing infrastructure. Specific responsibilities include:

1. Distributed Training Engineering: Participate in the implementation of large-scale distributed training solutions; own the engineering delivery of data parallelism, model parallelism (Tensor Parallelism / Pipeline Parallelism), and ZeRO techniques; continuously tune GPU utilization and ensure the stability of ultra-large-scale training jobs.
2. Compute Scheduling Optimization: Take a deep role in developing and optimizing AI job scheduling logic; Address compute bottlenecks in complex gaming scenarios through fine-grained resource management, fault self-healing mechanisms, and efficient checkpointing strategies.
3. End-to-End Model Engineering: Own the full engineering pipeline from model training to inference serving; participate in operator profiling, model quantization, and the construction of high-performance inference pipelines to support rapid AI iteration within gaming products.
4. AI-Driven Engineering Evolution: Actively embrace AI Coding tools to boost development efficiency; drive Harness Engineering practices — including automated testing and engineering governance — to ensure extreme reliability of the underlying infrastructure.
 

Who We Look For

1. Bachelor's degree or above; majors in Computer Science, Computer Architecture, High-Performance Computing, or related fields preferred. 
2. Core Tech Stack: Proficient in at least one of Python / C++ / Go; deep understanding of the PyTorch framework; hands-on experience engineering distributed training with DeepSpeed, Megatron-LM, or equivalent frameworks.
3. Solid understanding of distributed systems principles; Familiarity with NCCL, RDMA networking, or high-performance storage is a plus; working knowledge of containerized infrastructure (Docker / Kubernetes).
4. Demonstrable experience with AI Coding tools (e.g., GitHub Copilot, Cursor) is a strong plus; prior work in Harness Engineering — engineering governance, automated benchmarking, or system stress testing — is highly valued.
5. Exceptional learning agility, clear logical thinking, and the ability to collaborate effectively with cross-functional teams on complex systems engineering challenges; Fluent proficiency in English 
6. Bonus: Background in high-performance backend architecture, or real project experience in LLM training / inference engineering.

Equal Employment Opportunity at Tencent

As an equal opportunity employer, we firmly believe that diverse voices fuel our innovation and allow us to better serve our users and the community. We foster an environment where every employee of Tencent feels supported and inspired to achieve individual and common goals.

Skills Required

  • Bachelor's degree or above
  • Proficient in at least one of Python / C++ / Go
  • Deep understanding of the PyTorch framework
  • Hands-on experience engineering distributed training with DeepSpeed or Megatron-LM
  • Solid understanding of distributed systems principles
  • Familiarity with NCCL, RDMA networking, or high-performance storage is a plus
  • Working knowledge of containerized infrastructure (Docker / Kubernetes)
  • Experience with AI Coding tools (e.g., GitHub Copilot) is a strong plus
  • Prior work in Harness Engineering is highly valued
  • Fluent proficiency in English
  • Background in high-performance backend architecture is a bonus
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
107,879 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account