Principal Cloud Backend Engineer

Reposted 6 Days Ago
Be an Early Applicant
Palo Alto, CA
In-Office
Senior level
Artificial Intelligence • Hardware • Machine Learning • Natural Language Processing • Software • Semiconductor • Generative AI
SambaNova is the #1 platform for business AI.
The Role
Lead the architecture and implementation of cloud-based AI inference services, focusing on monetization strategies, systems design, and cross-functional collaboration.
Summary Generated by Built In

The era of pervasive AI has arrived. In this era, organizations will use generative AI to unlock hidden value in their data, accelerate processes, reduce costs, drive efficiency and innovation to fundamentally transform their businesses and operations at scale.

SambaNova Suite™ is the first full-stack, generative AI platform, from chip to model, optimized for enterprise and government organizations. Powered by the intelligent SN40L chip, the SambaNova Suite is a fully integrated platform, delivered on-premises or in the cloud, combined with state-of-the-art open-source models that can be easily and securely fine-tuned using customer data for greater accuracy. Once adapted with customer data, customers retain model ownership in perpetuity, so they can turn generative AI into one of their most valuable assets.

About The Role

We are seeking a highly skilled and experienced Principal or Senior Principal Cloud Backend Engineer to architect and build the core platform that powers our large-scale AI inference services, with a critical focus on enabling flexible billing and monetization strategies. You will own the design and implementation of the systems that not only ensure reliability and scalability but also directly unlock new revenue streams and business models for our AI services.

This is a high-impact role where you will solve complex challenges at the intersection of cloud-native AI infrastructure, metering, and monetization. You will build the foundational systems for usage-based pricing, subscription plans, and dynamic entitlements that serve as the economic engine for our business. If you are passionate about building platforms that are both technically robust and commercially critical, we want to hear from you.


Key Responsibilities
  • Platform Architecture & Strategy: Lead the technical vision and architecture for our inference serving and monetization platform. Design systems that are fault-tolerant, highly available, and can scale to meet growing demand while accurately tracking usage for billing.
  • Monetization Platform Design: Architect the core systems for flexible monetization, including:
    • Entitlements & Quota Management: Designing a flexible system to define and enforce complex usage plans, rate limits, and access policies.
    • Usage Metering & Aggregation: Building a highly reliable and accurate system to meter usage (e.g., tokens, requests) at scale and prepare data for billing.
    • Billing Integration: Designing clean abstractions and APIs to seamlessly integrate with external billing and payment providers (e.g., Stripe, Metronome).
  • Distributed Systems Design: Architect and implement complex distributed systems involving real-time rate limiting, quota enforcement, and fair-share scheduling for a multi-tenant environment.
  • Performance & Cost Optimization: Identify and eliminate bottlenecks in the end-to-end system, ensuring low-latency request handling while maintaining precise financial accuracy.
  • Technical Leadership: Serve as a technical leader and mentor. Establish best practices in code quality, testing, and observability for business-critical financial data pipelines.
  • Cross-Functional Collaboration: Work closely with Product Management, Finance, and GTM teams to translate business requirements for new pricing models (e.g., subscriptions, pay-as-you-go, custom enterprise plans) into scalable technical solutions.
Required Qualifications (Senior Principal Level)
  • 10 + years of experience in software engineering, with a significant focus on designing and building large-scale, distributed backend systems in cloud environments.
  • 5 + years in a Principal or Lead Engineer role, with a proven track record of architecting, delivering, and operating business-critical platforms.
  • Expert proficiency in one or more of the following: Go, Rust and C++. Deep understanding of concurrency, performance optimization, and systems programming.
  • Deep, hands-on experience with cloud-native technologies (Kubernetes, Docker, etc.) and major cloud providers (AWS, GCP, Azure).
  • Extensive experience with both SQL and NoSQL databases (e.g., PostgreSQL, Redis) and designing data models for high-throughput, low-latency applications.
  • Strong foundation in API design (REST, gRPC), event-driven architecture, and building resilient microservices.
  • Excellent communication and leadership skills, with the ability to drive technical consensus and articulate complex concepts to a diverse audience.
Preferred Qualifications
  • Direct Monetization/Billing Experience: Proven experience building or significantly extending platforms for usage-based metering, subscription management, entitlements, or billing systems. Experience with billing providers (e.g., Stripe,Metronome) is a strong plus.
  • Experience in AI/ML Infrastructure: Direct experience building or operating platforms for serving, scaling, and managing AI models (e.g., inference servers, model deployment pipelines).
What You'll Work On

As a key leader on our team, you will be at the forefront of building the economic backbone of our inference platform. Your work will directly impact our ability to:

  • Launch New Business Models: Enable product-led growth through self-service plans, automatic upgrades, pay-as-you-go pricing, and custom enterprise agreements.
  • Monetize Efficiently: Create a flexible platform that allows our business to experiment with and deploy new pricing strategies rapidly without complex engineering changes.
  • Ensure Financial Accuracy: Build robust, auditable systems for metering usage and generating billing events with high reliability.
  • Scale Economically: Design systems that dynamically manage resources and costs, tying infrastructure efficiency directly to business metrics.

You will be solving challenging problems at the intersection of distributed systems, cloud infrastructure, and commercial strategy, making our monetization platform a key competitive advantage.

How to Apply

Please submit your resume along with a cover letter. In your cover letter, we encourage you to describe your experience with a large-scale system you've architected, particularly any involving billing, entitlements, or monetization. Highlight the challenges you faced in ensuring scalability, reliability, and accuracy, and how you overcame them.

Submission Guidelines
Please note that in order to be considered an applicant for any position at SambaNova Systems, you must submit an application form for each position for which you believe you are qualified. 

EEO Policy
SambaNova Systems is an Equal Opportunity/Affirmative Action Employer. All qualified applicants will receive consideration for employment without regard basis of age (40 and over), color, disability, gender identity, genetic information, marital status, military or veteran status, national origin/ancestry, race, religion, creed, sex (including pregnancy, childbirth, breastfeeding), sexual orientation, and any other applicable status protected by federal, state, or local laws.

Benefits Summary for US-Based, Full-Time Employment Positions
SambaNova offers a competitive total rewards package, including the base salary, plus equity and benefits. We cover 95% premium coverage for employee medical insurance, and 77% premium coverage for dependents and offer a Health Savings Account (HSA) with employer contribution. We also offer Dental, Vision, Short/Long term Disability, Basic Life, Voluntary Life, and AD&D insurance plans in addition to Flexible Spending Account (FSA) options like Health Care, Limited Purpose, and Dependent Care. Our library of well-being benefits available to you and your dependents includes a full subscription to Headspace, Gympass+ membership with access to physical gyms, One Medical membership, counseling services with an Employee Assistance Program, and much more.

Top Skills

AWS
Azure
C++
Docker
GCP
Go
Kubernetes
Postgres
Redis
Rust
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Palo Alto, CA
500 Employees
Year Founded: 2017

What We Do

AI is changing the world and at SambaNova, we believe that you don’t need unlimited resources to take advantage of the most advanced, valuable AI capabilities - capabilities that are helping organizations explore the universe, find cures for cancer, and giving companies access to insights that provide a competitive edge.

We deliver the world’s fastest and only complete AI solution for enterprises and governments with world-record inference performance and accuracy. Powered by the SambaNova SN40L Reconfigurable Dataflow Unit (RDU), organizations can build a technology backbone for the next decade of AI innovation with SambaNova Suite. Our fully integrated hardware-software system, DataScale®, enables organizations to train, fine-tune, and deploy the most demanding AI workloads using the largest and most challenging models. Most recently, with the launch of our newest offering, SambaNova Cloud, developers can supercharge AI-powered applications on Llama 3.2 models.

SambaNova was founded in 2017 in Palo Alto, California, by a group of industry luminaries, business leaders, and world-class innovators who understand AI. Today, we’ve built an incredibly smart and motivated team dedicated to making a lasting impact on the industry and equipping our customers to thrive in the new era of AI.

Why Work With Us

As a talent first company, we aim to hire the greatest and most innovative minds in the industry- driving the next generation of AI computing where no barrier is too high and the possibilities are truly limitless. We encourage our peers to take risks and take the initiative to make a lasting impact on the AI and ML industries.

Gallery

Gallery

Similar Jobs

Remote or Hybrid
9 Locations
213000 Employees
27-41 Hourly

Cash App Logo Cash App

Data Analyst

Blockchain • Fintech • Mobile • Payments • Software • Financial Services
Remote or Hybrid
8 Locations
3500 Employees
108K-203K Annually

CrowdStrike Logo CrowdStrike

Consultant

Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
Remote or Hybrid
11 Locations
10000 Employees
140K-195K Annually

Square Logo Square

Senior Ios Engineer

eCommerce • Fintech • Hardware • Payments • Software • Financial Services
Remote or Hybrid
8 Locations
12000 Employees
185K-327K Annually

Similar Companies Hiring

Scotch Thumbnail
Software • Retail • Payments • Fintech • eCommerce • Artificial Intelligence • Analytics
US
25 Employees
Milestone Systems Thumbnail
Software • Security • Other • Big Data Analytics • Artificial Intelligence • Analytics
Lake Oswego, OR
1500 Employees
Idler Thumbnail
Artificial Intelligence
San Francisco, California
6 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account