Senior AI Systems Performance Engineer

Reposted 2 Days Ago
Palo Alto, CA
In-Office
Mid level
Artificial Intelligence • Hardware • Machine Learning • Natural Language Processing • Software • Semiconductor • Generative AI
SambaNova is the #1 platform for business AI.
The Role
As a Senior AI Systems Performance Engineer, optimize and scale foundation models, enhance performance across various system layers, and deliver high-performance AI applications.
Summary Generated by Built In

The era of pervasive AI has arrived. In this era, organizations will use generative AI to unlock hidden value in their data, accelerate processes, reduce costs, drive efficiency and innovation to fundamentally transform their businesses and operations at scale.

SambaNova Suite™ is the first full-stack, generative AI platform, from chip to model, optimized for enterprise and government organizations. Powered by the intelligent SN40L chip, the SambaNova Suite is a fully integrated platform, delivered on-premises or in the cloud, combined with state-of-the-art open-source models that can be easily and securely fine-tuned using customer data for greater accuracy. Once adapted with customer data, customers retain model ownership in perpetuity, so they can turn generative AI into one of their most valuable assets.

About the role

We are seeking a talented and driven ML performance engineer to optimize and scale state-of-the-art foundation models on SambaNova's reconfigurable dataflow platform. You'll work hands-on with some of the most advanced models in the world — such as DeepSeek R1, GPT OSS, and other frontier architectures — to push the limits of throughput, latency, and efficiency. In this role, you'll bridge the gap between deep learning and systems performance, collaborating across compiler, runtime, and hardware layers to deliver world-record performance for large-scale AI inference.

Responsibilities
  • Bring up and optimize cutting-edge foundation models (e.g., DeepSeek, Llama, Qwen, and others) on the SambaNova platform through the SambaNova software stack.
  • Profile and enhance model performance across compiler, runtime, and hardware layers to achieve SOTA throughput and latency.
  • Collaborate with machine learning, compiler, runtime, and hardware teams to deliver co-designed, high-performance AI applications.
  • Integrate the latest advances in model architecture, quantization, scheduling, and memory optimization from both academia and industry.
  • Develop robust, scalable, and efficient end-to-end inference solutions aligned with customer needs.
  • Identify performance bottlenecks and propose dataflow or scheduling optimizations for both single-node and distributed systems.
Basic Qualifications
  • Bachelor's or higher degree in computer science, electrical engineering, or a related field (e.g., applied mathematics, physics, or statistics).
  • 3+ years of experience in one or more of the following areas:
  • Deep learning model development and performance optimization
  • Compiler, runtime, or kernel-level optimization
  • Software–hardware co-design or systems performance tuning
  • Proficiency in Python or C++, with strong foundations in algorithms, data structures, and numerical computing.
  • Experience with at least one major ML framework — PyTorch, TensorFlow, or JAX.
  • Demonstrated ability to analyze and optimize performance in real-world ML pipelines.
Preferred Qualifications
  • Hands-on experience with LLM or multimodal model training and inference.
  • Background in large-scale distributed training, continuous batching, and high-throughput inference systems.
  • Familiarity with quantization, graph optimization, kernel fusion, and model partitioning.
  • Experience with frameworks such as DeepSpeed, Megatron, vLLM, or TensorRT.
  • Strong GPU programming skills (CUDA, Triton, or OpenCL); experience with cuDNN, cuBLAS, or similar libraries is a plus.
  • Knowledge of memory hierarchy optimization, caching, and scheduling for large-scale model execution.
  • Publication record or open-source contributions in ML systems or performance optimization is a plus.

Submission Guidelines
Please note that in order to be considered an applicant for any position at SambaNova Systems, you must submit an application form for each position for which you believe you are qualified. 

EEO Policy
SambaNova Systems is an Equal Opportunity/Affirmative Action Employer. All qualified applicants will receive consideration for employment without regard basis of age (40 and over), color, disability, gender identity, genetic information, marital status, military or veteran status, national origin/ancestry, race, religion, creed, sex (including pregnancy, childbirth, breastfeeding), sexual orientation, and any other applicable status protected by federal, state, or local laws.

Benefits Summary for US-Based, Full-Time Employment Positions
SambaNova offers a competitive total rewards package, including the base salary, plus equity and benefits. We cover 95% premium coverage for employee medical insurance, and 77% premium coverage for dependents and offer a Health Savings Account (HSA) with employer contribution. We also offer Dental, Vision, Short/Long term Disability, Basic Life, Voluntary Life, and AD&D insurance plans in addition to Flexible Spending Account (FSA) options like Health Care, Limited Purpose, and Dependent Care. Our library of well-being benefits available to you and your dependents includes a full subscription to Headspace, Gympass+ membership with access to physical gyms, One Medical membership, counseling services with an Employee Assistance Program, and much more.

Top Skills

C++
Cuda
Deepspeed
Jax
Megatron
Opencl
Python
PyTorch
TensorFlow
Tensorrt
Triton
Vllm
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Palo Alto, CA
500 Employees
Year Founded: 2017

What We Do

AI is changing the world and at SambaNova, we believe that you don’t need unlimited resources to take advantage of the most advanced, valuable AI capabilities - capabilities that are helping organizations explore the universe, find cures for cancer, and giving companies access to insights that provide a competitive edge.

We deliver the world’s fastest and only complete AI solution for enterprises and governments with world-record inference performance and accuracy. Powered by the SambaNova SN40L Reconfigurable Dataflow Unit (RDU), organizations can build a technology backbone for the next decade of AI innovation with SambaNova Suite. Our fully integrated hardware-software system, DataScale®, enables organizations to train, fine-tune, and deploy the most demanding AI workloads using the largest and most challenging models. Most recently, with the launch of our newest offering, SambaNova Cloud, developers can supercharge AI-powered applications on Llama 3.2 models.

SambaNova was founded in 2017 in Palo Alto, California, by a group of industry luminaries, business leaders, and world-class innovators who understand AI. Today, we’ve built an incredibly smart and motivated team dedicated to making a lasting impact on the industry and equipping our customers to thrive in the new era of AI.

Why Work With Us

As a talent first company, we aim to hire the greatest and most innovative minds in the industry- driving the next generation of AI computing where no barrier is too high and the possibilities are truly limitless. We encourage our peers to take risks and take the initiative to make a lasting impact on the AI and ML industries.

Gallery

Gallery

Similar Jobs

NVIDIA Logo NVIDIA

Senior Systems Engineer

Artificial Intelligence • Computer Vision • Hardware • Robotics • Metaverse
In-Office or Remote
Santa Clara, CA, USA
21960 Employees
184K-357K Annually

Metropolis Technologies Logo Metropolis Technologies

Staff Software Engineer

Artificial Intelligence • Computer Vision • Machine Learning • Payments • Real Estate • PropTech
Easy Apply
In-Office
Los Angeles, CA, USA
23100 Employees
220K-250K Annually

DigitalOcean Logo DigitalOcean

Software Engineer

Artificial Intelligence • Cloud • Software • Infrastructure as a Service (IaaS)
In-Office
San Francisco, CA, USA
1400 Employees
200K-250K Annually

Samsara Logo Samsara

Staff Software Engineer

Artificial Intelligence • Cloud • Computer Vision • Hardware • Internet of Things • Software
Easy Apply
Remote or Hybrid
6 Locations
4000 Employees
162K-290K Annually

Similar Companies Hiring

Milestone Systems Thumbnail
Software • Security • Other • Big Data Analytics • Artificial Intelligence • Analytics
Lake Oswego, OR
1500 Employees
Idler Thumbnail
Artificial Intelligence
San Francisco, California
6 Employees
Fairly Even Thumbnail
Software • Sales • Robotics • Other • Hospitality • Hardware
New York, NY

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account