Senior Software Engineer, ML Infrastructure

Posted 20 Days Ago
Hiring Remotely in United States
Remote
200K-275K Annually
Senior level
Artificial Intelligence • Hardware • Machine Learning • Natural Language Processing • Software • Generative AI
SambaNova is the #1 platform for business AI.
The Role
Lead development and optimization of the compiler stack for ML systems, collaborate across teams, integrate and deploy products, map ML operations to hardware, and drive compiler infrastructure innovation and performance debugging.
Summary Generated by Built In

The era of pervasive AI has arrived. In this era, organizations will use generative AI to unlock hidden value in their data, accelerate processes, reduce costs, and drive efficiency and innovation, fundamentally transforming their businesses and operations at scale.

SambaNova Suite™ is the first full-stack, generative AI platform, from chip to model, optimized for enterprise and government organizations. Powered by the intelligent SN40L chip, the SambaNova Suite is a fully integrated platform, delivered on-premises or in the cloud, combined with state-of-the-art open-source models that can be easily and securely fine-tuned using customer data for greater accuracy. Once the models are adapted with customer data, customers retain model ownership in perpetuity, so they can turn generative AI into one of their most valuable assets.

Overview

The Senior Software Engineer, ML Infrastructure will be responsible for designing, building, and operating the production-grade inference infrastructure that powers SambaNova's serving stack on our Reconfigurable Dataflow Unit (RDU) architecture. SambaNova is an inference-first company, and this role sits at the heart of that mission: turning state-of-the-art inference techniques into reliable, high-throughput, low-latency services exposed to customers through SambaStack and SambaCloud. The engineer will own end-to-end systems spanning request scheduling, advanced decoding algorithms, caching layers, API surfaces, and the accuracy infrastructure that keeps the stack trustworthy. This role partners closely with ML, compiler, runtime, and product teams to ship inference features from prototype to production.

Qualifications

  • Bachelor's degree in Computer Science, Electrical Engineering, or related field
  • 5+ years of industry experience building and operating large-scale distributed systems, ideally in ML serving
  • Strong software engineering fundamentals: algorithms, data structures, concurrency, and systems design
  • Experience designing and maintaining production services with strict latency, throughput, and availability requirements
  • Working knowledge of modern LLM inference techniques and familiarity with open-source serving stacks such as vLLM, TensorRT-LLM, or SGLang
  • Proficiency in Python
  • Experience collaborating across teams to deliver complex, system-level engineering solutions

Key responsibilities

  • Design and productionize advanced inference techniques on RDU to optimize for performance and cost. Key areas include speculative decoding, constrained decoding, function/tool calling, prompt caching, and long-context inference.
  • Own SambaNova's integration with vLLM and adjacent serving frameworks, adapting them to RDU's architecture.
  • Own the public inference API surface exposed through SambaStack and SambaCloud.
  • Build and maintain the accuracy verification and regression infrastructure that gates every inference feature shipped to customers.
  • Partner with ML, compiler, runtime, and product teams to take inference features from prototype to production.
  • Contribute to technical design discussions, code reviews, and architectural decisions as a senior individual contributor.
 

Base Salary Range:

$200,000-$275,000 USD

Submission Guidelines
Please note that in order to be considered an applicant for any position at SambaNova Systems, you must submit an application form for each position for which you believe you are qualified. 

EEO Policy
SambaNova Systems is an Equal Opportunity/Affirmative Action Employer. All qualified applicants will receive consideration for employment without regard to age (40 and over), color, disability, gender identity, genetic information, marital status, military or veteran status, national origin/ancestry, race, religion, creed, sex (including pregnancy, childbirth, breastfeeding), sexual orientation, or any other applicable status protected by federal, state, or local laws.

Benefits Summary for US-Based, Full-Time Employment Positions
SambaNova offers a competitive total rewards package, including the base salary, plus equity and benefits. We cover 95% of the premium for employee medical insurance and 77% of the premium for dependents, and we offer a Health Savings Account (HSA) with employer contribution. We also offer Dental, Vision, Short/Long-Term Disability, Basic Life, Voluntary Life, and AD&D insurance plans, in addition to Flexible Spending Account (FSA) options such as Health Care, Limited Purpose, and Dependent Care. Our library of well-being benefits available to you and your dependents includes a full subscription to Headspace, Gympass+ membership with access to physical gyms, One Medical membership, counseling services with an Employee Assistance Program, and much more.

Skills Required

  • Bachelor's or Master's degree in Computer Science, Computer Engineering, or equivalent, with 5-10 years of industry experience
  • Deep theoretical understanding of compiler fundamentals
  • Experience building and deploying software products
  • Experience with common compiler development practices and methodologies
  • Excitement about high-performance systems engineering and performance debugging
  • Appreciation for process and developing cross-disciplinary collaboration
  • Experience with one or more deep learning frameworks (e.g., TensorFlow, PyTorch)
  • Experience with MLIR
  • Familiarity with machine learning models and frameworks
  • Familiarity with accelerated computing
  • Exposure to dataflow architectures

The Company
HQ: Palo Alto, CA
500 Employees
Year Founded: 2017

What We Do

AI is changing the world and at SambaNova, we believe that you don’t need unlimited resources to take advantage of the most advanced, valuable AI capabilities - capabilities that are helping organizations explore the universe and find cures for cancer, and giving companies access to insights that provide a competitive edge. We deliver the world’s fastest and only complete AI solution for enterprises and governments with world-record inference performance and accuracy. Powered by the SambaNova SN40L Reconfigurable Dataflow Unit (RDU), organizations can build a technology backbone for the next decade of AI innovation with SambaNova Suite. Our fully integrated hardware-software system, DataScale®, enables organizations to train, fine-tune, and deploy the most demanding AI workloads using the largest and most challenging models. Most recently, with the launch of our newest offering, SambaNova Cloud, developers can supercharge AI-powered applications on Llama 3.2 models. SambaNova was founded in 2017 in Palo Alto, California, by a group of industry luminaries, business leaders, and world-class innovators who understand AI. Today, we’ve built an incredibly smart and motivated team dedicated to making a lasting impact on the industry and equipping our customers to thrive in the new era of AI.

Why Work With Us

As a talent-first company, we aim to hire the greatest and most innovative minds in the industry, driving the next generation of AI computing where no barrier is too high and the possibilities are truly limitless. We encourage our peers to take risks and take the initiative to make a lasting impact on the AI and ML industries.
