Baseten

Software Engineer - Capacity

Posted Yesterday

Be an Early Applicant

San Francisco, CA, USA

Hybrid

220K-285K Annually

Senior level

Software

The Role

Own and build the internal Capacity product end-to-end, creating full-stack features, translating operational requirements into tooling, eliminating manual processes, and ensuring observability and operational alignment with SRE and infrastructure teams.

Summary Generated by Built In

ABOUT BASETEN

Baseten powers mission-critical inference for the world's most dynamic AI companies, like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma and Writer. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies operating at the frontier of AI to bring cutting-edge models into production. We're growing quickly and recently raised our $300M Series E, backed by investors including BOND, IVP, Spark Capital, Greylock, and Conviction. Join us and help build the platform engineers turn to to ship AI products.

THE ROLE

As a Software Engineer on the Capacity team, you will own the internal operating system that sits at the heart of how Baseten operates. Capacity helps unlock revenue by carefully balancing supply and demand. The operating system manages all aspects of the customer lifecycle: from onboarding to managing complex customer SLA requirements.

This role is for engineers who want to own a product end to end, not just implement tickets. You will work directly with the Capacity, Sales, and Engineering teams to understand requirements, define solutions, and ship software that removes friction from some of the most high-stakes workflows in the company. If something is slow, manual, or error-prone in the capacity fulfillment lifecycle, you will be the one to fix it.

You are a strong fit if you have strong product intuition, move fast without sacrificing quality, and take satisfaction in building tools that make the people around you measurably more effective.

RESPONSIBILITIES

Own the Capacity product end to end: scoping, design, implementation, and iteration based on feedback from internal stakeholders
Translate complex operational requirements from Capacity, Sales, and SRE teams into clean, ergonomic product experiences
Build and maintain full-stack features across the Capacity toolchain, including UI surfaces, APIs, and backend services
Identify workflow bottlenecks and manual processes across the capacity lifecycle and drive their elimination through tooling
Instrument your work with observability and monitoring so issues surface before they become incidents
Partner with SRE and Infra teams to ensure Capacity reflects the operational reality of the fleet, not just the desired state

REQUIREMENTS

Bachelor's degree or higher in Computer Science or a related field
5+ years of software engineering experience, with meaningful time spent owning a product or internal platform end to end
Strong full-stack proficiency; experience with NextJS, Javascript, Postgres / Drizzle, Tailwind and AWS. Experience with SST.dev is a plus but not a requirement
Demonstrated ability to work directly with non-engineering stakeholders to understand requirements and translate them into product decisions
Experience building internal tooling, operational platforms, or developer infrastructure — particularly in high-growth or infra-heavy environments
High ownership mindset: you notice what is broken, you fix it, and you follow through
Interest in AI/ML infrastructure; familiarity with GPU infrastructure, or capacity planning is a meaningful plus

BENEFITS

Competitive compensation, including meaningful equity.
100% coverage of medical, dental, and vision insurance for employee and dependents
Flexible PTO policy including company wide Winter Break (our offices are closed from Christmas Eve to New Year's Day!)
Paid parental leave
Fertility and family-building stipend through Carrot
Company-facilitated 401(k)
Exposure to a variety of ML startups, offering unparalleled learning and networking opportunities.

Apply now to embark on a rewarding journey in shaping the future of AI! If you are a motivated individual with a passion for machine learning and a desire to be part of a collaborative and forward-thinking team, we would love to hear from you.

At Baseten, we are committed to fostering a diverse and inclusive workplace. We provide equal employment opportunities to all employees and applicants without regard to race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, or veteran status.

We are an Equal Opportunity Employer and will consider qualified applicants with criminal histories in a manner consistent with applicable law (by example, the requirements of the San Francisco Fair Chance Ordinance, where applicable).

Skills Required

Bachelor's degree or higher in Computer Science or a related field
5+ years of software engineering experience with meaningful time owning a product or internal platform end to end
Strong full-stack proficiency; experience with NextJS, JavaScript, Postgres/Drizzle, Tailwind, and AWS
Experience with SST.dev
Demonstrated ability to work directly with non-engineering stakeholders to translate requirements into product decisions
Experience building internal tooling, operational platforms, or developer infrastructure
High ownership mindset; ability to identify and fix broken workflows and follow through
Interest in AI/ML infrastructure; familiarity with GPU infrastructure or capacity planning

View all jobs at Baseten

View Baseten Profile

Report Job

Am I A Good Fit?

beta

Get Personalized Job Insights.

Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company

59 Employees

What We Do

At Baseten we provide all the infrastructure you need to deploy and serve ML models performantly, scalably, and cost-efficiently. Get started in minutes, and avoid getting tangled in complex deployment processes. You can deploy best-in-class open-source models and take advantage of optimized serving for your own models. We also utilize horizontally scalable services that take you from prototype to production, with light-speed inference on infra that autoscales with your traffic. Best in class doesn't mean breaking the bank. Run your models on the best infrastructure without running up costs by taking advantage of our scaled-to-zero feature