MatX

ML Performance Engineer

Reposted 20 Days Ago

Mountain View, CA

Mid level

Artificial Intelligence • Hardware • Software

The Role

The ML Performance Engineer develops performance models, writes production libraries for ML, and collaborates on solutions from model to hardware.

Summary Generated by Built In

MatX's mission is to make the world’s best AI models run as efficiently as allowed by physics, bringing the world years ahead in AI quality and availability. We are developing vertically integrated, full-stack solutions from silicon to systems, including hardware and software, to train and run the largest ML workloads for AGI. We are looking for people who are excited about systems-focused ML research.

Responsibilities include:

Build performance models and tooling to validate and guide scheduling decisions for current and future ML models.
Write production-grade libraries for efficient distributed training and serving.
Collaborate with architects, hardware, and software teams to drive solutions from model to metal.

Requirements:

Bachelor's or Master's degree in Computer Science, Engineering, or a related field.
Strong programming skills in Python.
Expertise in ML frameworks such as JAX, PyTorch, or TensorFlow.
Deep knowledge of the Transformer architecture.
Experience with distributed computing, high-performance networking, or large-scale ML systems.

Preferred Skills:
Any of the following:

Hands-on experience with flash attention, quantization, pruning, or other systems performance optimizations.
Fluency in parallelism strategies that balance computation, communication, and memory to improve throughput and latency of large models.
Experience with performance analysis tools and profilers for large-scale systems.
Solid understanding of computer architecture and low-level optimization techniques.
A track record of impactful ML systems research, through publication and/or industrial practice.
Experience in pre-silicon exploration, post-silicon bring-up, and fleet debugging.

Compensation: The US base salary for this full-time position is $120,000 to $400,000, plus equity and benefits.

As part of our dedication to the diversity of our team and our focus on creating an inviting and inclusive work experience, MatX is committed to a policy of Equal Employment Opportunity and will not discriminate against an applicant or employee on the basis of race, color, religion, creed, national origin or ancestry, sex, gender, gender identity, gender expression, sexual orientation, age, physical or mental disability, medical condition, marital/domestic partner status, military and veteran status, genetic information or any other legally recognized protected basis under federal, state or local laws, regulations or ordinances.

All candidates must be authorized to work in the United States and must work from our offices in Mountain View on Tuesdays through Thursdays.

This position requires access to information that is subject to U.S. export controls. This offer of employment is contingent upon the applicant's capacity to perform job functions in compliance with U.S. export control laws without obtaining a license from U.S. export control authorities.

MatX does not accept unsolicited resumes from individual recruiters or third-party recruiting agencies in response to job postings. No fee will be paid to third parties who submit unsolicited candidates directly to our hiring managers or People team, and any resumes submitted are deemed to be the property of MatX.

Top Skills

JAX

Python

PyTorch

TensorFlow

The Company

HQ: Mountain View, CA

19 Employees

On-site Workplace

What We Do

MatX designs hardware tailored for the world’s best AI models: We dedicate every transistor to maximizing performance for large models. For these models, we deliver 10× more computing power, enabling AI labs to make models an order of magnitude smarter and more useful. Our hardware would make it possible to train GPT-4 and run ChatGPT, but on the budget of a small startup.

A world with more widely available intelligence is a happier and more prosperous world—picture people of all socioeconomic levels having access to an AI staff of specialist MDs, tutors, coaches, advisors, and assistants.