ML Performance Engineer

Posted 5 Days Ago
Be an Early Applicant
Mountain View, CA
Entry level
Artificial Intelligence • Hardware • Software
The Role
The ML Performance Engineer will build performance models and tools for ML model scheduling, create production-grade libraries for efficient distributed training, and collaborate with teams to enhance ML solutions from architecture to implementation.
Summary Generated by Built In

MatX's mission is to make the world’s best AI models run as efficiently as allowed by physics, bringing the world years ahead in AI quality and availability. We are developing vertically integrated full-stack solutions from silicon to systems including hardware and software to train and run the largest ML workloads for AGI. We are looking for people who are excited about systems-focused ML research.

Responsibilities include:

  • Build performance models and tooling to validate and guide scheduling decisions for current and future ML models.
  • Write production-grade libraries for efficient distributed training and serving.
  • Collaborate with architects, hardware, and software teams to drive solutions from model to metal.

Requirements:

  • Bachelor's or Master's degree in Computer Science, Engineering, or a related field
  • Strong programming skills in Python
  • Expertise in ML frameworks such as JAX, PyTorch, or Tensorflow.
  • Deep knowledge of the Transformer architecture.
  • Experience with distributed computing, high performance networking, or large-scale ML systems,

Preferred Skills: 
Any of the following:

  • Hands on experience with flash attention, quantization, pruning, or other systems performance optimizations.
  • Fluency in parallelism strategies that balance computation, communication, and memory to improve throughput and latency of large models.
  • Experience with performance analysis tools and profilers for large scale systems.
  • Solid understanding of computer architecture and low-level optimization techniques
  • A track record of impactful ML Systems Research, through publication and/or industrial practice.
  • Experiences in pre-silicon exploration, post-silicon bringup, and fleet debug.

Compensation: The US base salary for this full-time position is $120,000 - $400,000 + equity + benefits

As part of our dedication to the diversity of our team and our focus on creating an inviting and inclusive work experience, MatX is committed to a policy of Equal Employment Opportunity and will not discriminate against an applicant or employee on the basis of race, color, religion, creed, national origin or ancestry, sex, gender, gender identity, gender expression, sexual orientation, age, physical or mental disability, medical condition, marital/domestic partner status, military and veteran status, genetic information or any other legally recognized protected basis under federal, state or local laws, regulations or ordinances.

All candidates must be authorized to work in the United States and work from our offices in Mountain View Tuesdays-Thursdays.

This position requires access to information that is subject to U.S. export controls. This offer of employment is contingent upon the applicants capacity to perform job functions in compliance with U.S. export control laws without obtaining a license from U.S. export control authorities.

Top Skills

Python
The Company
HQ: Mountain View, CA
19 Employees
On-site Workplace

What We Do

MatX designs hardware tailored for the world’s best AI models: We dedicate every transistor to maximizing performance for large models. For these models, we deliver 10× more computing power, enabling AI labs to make models an order of magnitude smarter and more useful. Our hardware would make it possible to train GPT-4 and run ChatGPT, but on the budget of a small startup.

A world with more widely available intelligence is a happier and more prosperous world—picture people of all socioeconomic levels having access to an AI staff of specialist MDs, tutors, coaches, advisors, and assistants.

Similar Jobs

San Francisco, CA, USA
154 Employees
190K Annually

Zoox Logo Zoox

Senior/Staff Software Engineer, ML Performance Optimization

Artificial Intelligence • Machine Learning • Robotics • Software • Transportation • Design • Manufacturing
Foster City, CA, USA
2500 Employees
234K-342K Annually

The Walt Disney Company Logo The Walt Disney Company

Lead Machine Learning Engineer, Ad Platforms

AdTech • Digital Media • News + Entertainment
Hybrid
Santa Monica, CA, USA
200000 Employees
168K-246K Annually
Los Angeles, CA, USA
53 Employees
120K-180K Annually

Similar Companies Hiring

Jobba Trade Technologies, Inc. Thumbnail
Software • Professional Services • Productivity • Information Technology • Cloud
Chicago, IL
45 Employees
RunPod Thumbnail
Software • Infrastructure as a Service (IaaS) • Cloud • Artificial Intelligence
Charlotte, North Carolina
53 Employees
Hedra Thumbnail
Software • News + Entertainment • Marketing Tech • Generative AI • Enterprise Web • Digital Media • Consumer Web
San Francisco, CA
14 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account