Machine Learning Engineer (Training Optimization)

Posted 15 Hours Ago
Be an Early Applicant
Beijing, CHN
Hybrid
Entry level
Digital Media • Information Technology • Software • Design
The Role
Design, implement, and optimize large-scale distributed training systems for multimodal and foundation models. Improve GPU utilization, communication, and memory efficiency; build custom CUDA/Triton kernels; profile and fine-tune training workflows; collaborate with research teams to align systems with algorithmic needs.
Summary Generated by Built In
Company Description

该岗位现面向所有经验阶段的候选人开放,包括社会招聘、应届毕业生,同时开放实习生岗位。工作地点为北京。欢迎申请,期待你的加入!

Notice: This position is open to candidates at all experience levels, including experienced candidates, graduates, as well as internship opportunities. The role is based in Beijing. We welcome your application and look forward to having you on board!

Job Description

At Canva, we're building a future powered by AI that's as magical as it is impactful. As a Research Scientist at Canva, you'll be responsible for advancing the future of AI by experimenting with cutting-edge techniques, as well as improving models for real-world quality and performance.

 

About the Group/Team

We're the CORE team within the Generative AI supergroup. Our mission is to invent foundational technologies that will power the future of AI-assisted design. From large-scale models to groundbreaking research, our team builds the technical core of Canva’s creative intelligence engine. We collaborate globally to ship research that makes a real impact—from smart editing to AI video tools—at massive scale.

 

About the Role/Specialty

As a Machine Learning Engineer, you’ll lead efforts to scale and optimize the training system for our large-scale multimodal and foundation models. You’ll design distributed training systems using Megatron-LM, NVIDIA NeMo, FSDP, and Triton—pushing the limits of performance across compute, memory, and communication layers. You'll sit at the intersection of systems and AI research, directly shaping how we train the models that will power Canva’s next generation of products.

 

What you’ll do (responsibilities)

  • You’ll design, implement, and optimize large-scale machine learning systems for training
  • You’ll improve all aspects of performance, including GPU utilization, communication overhead, and memory efficiency.
  • You’ll partner with research and modeling teams to align systems with algorithmic needs.
  • You’ll evaluate and apply best practices for distributed training using industry-leading frameworks.
  • You’ll dive deep into low-level optimization, including custom CUDA or Triton kernels.

• • You’ll debug, profile, and fine-tune training workflows to unlock new levels of scalability.

Qualifications

What we're looking for

We’re looking for a systems-first engineer who thrives in fast-paced, high-impact environments. You’re deeply familiar with distributed model training at scale and understand the nuances of optimizing compute at every level of the stack. You're excited by challenges that stretch current boundaries, and you’re a strong collaborator who communicates clearly across domains.

  • Strong background in LLMs, multimodal AI, or diffusion models.
  • Proficiency in Python. Familiarity with a system programming language (e.g. C++ or Rust) is a plus.
  • Deep knowledge of PyTorch or JAX as well as libraries such as Megatron-LM, NeMo, or DeepSpeed.
  • Familiarity with common optimization techniques such as FSDP/ZeRO, gradient checkpointing, or low-precision data types.
  • Hands-on experience writing custom GPU kernels in CUDA or Triton.
  • Excellent communication and problem-solving skills, incl. full proficiency in English.

Additional Information

大模型训练优化工程师(多模态/图像生成),技术要求:算子优化/分布式训练/GPU集群/训练框架。该岗位面向所有经验阶段的候选人开放,包括社会招聘、2026年及2027年应届毕业生,同时开放实习生岗位。

Skills Required

  • Strong background in LLMs, multimodal AI, or diffusion models
  • Proficiency in Python
  • Familiarity with a systems programming language (e.g., C++ or Rust)
  • Deep knowledge of PyTorch or JAX
  • Experience with libraries such as Megatron-LM, NVIDIA NeMo, or DeepSpeed
  • Familiarity with distributed optimization techniques (FSDP/ZeRO, gradient checkpointing, low-precision datatypes)
  • Hands-on experience writing custom GPU kernels in CUDA or Triton
  • Experience designing and optimizing large-scale training systems and GPU clusters
  • Excellent communication skills and full proficiency in English
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
5,500 Employees
Year Founded: 2013

What We Do

Canva is an online graphic design platform with a mission to empower everyone to design anything and publish anywhere, offering a free-to-use tool for creating social media posts, presentations, posters, videos, logos, and more.

Similar Jobs

Ericsson Logo Ericsson

Lab Management Intern

Cloud • Information Technology • Internet of Things • Machine Learning • Software • Cybersecurity • Infrastructure as a Service (IaaS)
In-Office
Beijing, CHN
88000 Employees

Ericsson Logo Ericsson

AI Development Intern

Cloud • Information Technology • Internet of Things • Machine Learning • Software • Cybersecurity • Infrastructure as a Service (IaaS)
In-Office
Beijing, CHN
88000 Employees

Ericsson Logo Ericsson

AI Development Intern

Cloud • Information Technology • Internet of Things • Machine Learning • Software • Cybersecurity • Infrastructure as a Service (IaaS)
In-Office
Beijing, CHN
88000 Employees

Tapestry - Coach and Kate Spade Logo Tapestry - Coach and Kate Spade

Management Trainee

eCommerce • Fashion • Retail • Sales • Wearables • Design
Hybrid
Beijing, CHN
16000 Employees

Similar Companies Hiring

Golden Pet Brands Thumbnail
Digital Media • eCommerce • Information Technology • Marketing Tech • Pet • Retail • Social Media
El Segundo, California
178 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account