Software Engineer - Capacity

Posted Yesterday
Be an Early Applicant
San Francisco, CA, USA
Hybrid
220K-285K Annually
Senior level
Software
The Role
Own and build the internal Capacity product end-to-end, creating full-stack features, translating operational requirements into tooling, eliminating manual processes, and ensuring observability and operational alignment with SRE and infrastructure teams.
Summary Generated by Built In

ABOUT BASETEN

Baseten powers mission-critical inference for the world's most dynamic AI companies, like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma and Writer. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies operating at the frontier of AI to bring cutting-edge models into production. We're growing quickly and recently raised our $300M Series E, backed by investors including BOND, IVP, Spark Capital, Greylock, and Conviction. Join us and help build the platform engineers turn to to ship AI products.

THE ROLE

As a Software Engineer on the Capacity team, you will own the internal operating system that sits at the heart of how Baseten operates. Capacity helps unlock revenue by carefully balancing supply and demand. The operating system manages all aspects of the customer lifecycle: from onboarding to managing complex customer SLA requirements.

This role is for engineers who want to own a product end to end, not just implement tickets. You will work directly with the Capacity, Sales, and Engineering teams to understand requirements, define solutions, and ship software that removes friction from some of the most high-stakes workflows in the company. If something is slow, manual, or error-prone in the capacity fulfillment lifecycle, you will be the one to fix it.

You are a strong fit if you have strong product intuition, move fast without sacrificing quality, and take satisfaction in building tools that make the people around you measurably more effective.

RESPONSIBILITIES

  • Own the Capacity product end to end: scoping, design, implementation, and iteration based on feedback from internal stakeholders

  • Translate complex operational requirements from Capacity, Sales, and SRE teams into clean, ergonomic product experiences

  • Build and maintain full-stack features across the Capacity toolchain, including UI surfaces, APIs, and backend services

  • Identify workflow bottlenecks and manual processes across the capacity lifecycle and drive their elimination through tooling

  • Instrument your work with observability and monitoring so issues surface before they become incidents

  • Partner with SRE and Infra teams to ensure Capacity reflects the operational reality of the fleet, not just the desired state

REQUIREMENTS

  • Bachelor's degree or higher in Computer Science or a related field

  • 5+ years of software engineering experience, with meaningful time spent owning a product or internal platform end to end

  • Strong full-stack proficiency; experience with NextJS, Javascript, Postgres / Drizzle, Tailwind and AWS. Experience with SST.dev is a plus but not a requirement

  • Demonstrated ability to work directly with non-engineering stakeholders to understand requirements and translate them into product decisions

  • Experience building internal tooling, operational platforms, or developer infrastructure — particularly in high-growth or infra-heavy environments

  • High ownership mindset: you notice what is broken, you fix it, and you follow through

  • Interest in AI/ML infrastructure; familiarity with GPU infrastructure, or capacity planning is a meaningful plus

BENEFITS

  • Competitive compensation, including meaningful equity.

  • 100% coverage of medical, dental, and vision insurance for employee and dependents

  • Flexible PTO policy including company wide Winter Break (our offices are closed from Christmas Eve to New Year's Day!)

  • Paid parental leave

  • Fertility and family-building stipend through Carrot

  • Company-facilitated 401(k)

  • Exposure to a variety of ML startups, offering unparalleled learning and networking opportunities.

Apply now to embark on a rewarding journey in shaping the future of AI! If you are a motivated individual with a passion for machine learning and a desire to be part of a collaborative and forward-thinking team, we would love to hear from you.

At Baseten, we are committed to fostering a diverse and inclusive workplace. We provide equal employment opportunities to all employees and applicants without regard to race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, or veteran status.

We are an Equal Opportunity Employer and will consider qualified applicants with criminal histories in a manner consistent with applicable law (by example, the requirements of the San Francisco Fair Chance Ordinance, where applicable).

Skills Required

  • Bachelor's degree or higher in Computer Science or a related field
  • 5+ years of software engineering experience with meaningful time owning a product or internal platform end to end
  • Strong full-stack proficiency; experience with NextJS, JavaScript, Postgres/Drizzle, Tailwind, and AWS
  • Experience with SST.dev
  • Demonstrated ability to work directly with non-engineering stakeholders to translate requirements into product decisions
  • Experience building internal tooling, operational platforms, or developer infrastructure
  • High ownership mindset; ability to identify and fix broken workflows and follow through
  • Interest in AI/ML infrastructure; familiarity with GPU infrastructure or capacity planning
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
59 Employees

What We Do

At Baseten we provide all the infrastructure you need to deploy and serve ML models performantly, scalably, and cost-efficiently. Get started in minutes, and avoid getting tangled in complex deployment processes. You can deploy best-in-class open-source models and take advantage of optimized serving for your own models. We also utilize horizontally scalable services that take you from prototype to production, with light-speed inference on infra that autoscales with your traffic. Best in class doesn't mean breaking the bank. Run your models on the best infrastructure without running up costs by taking advantage of our scaled-to-zero feature

Similar Jobs

In-Office or Remote
San Francisco, CA, USA
177K-365K Annually

OpenAI Logo OpenAI

Software Engineer

Artificial Intelligence • Machine Learning • Generative AI
In-Office
San Francisco, CA, USA
4500 Employees
293K-385K Annually
In-Office
4 Locations
26259 Employees
75K-160K Annually
In-Office
3 Locations
2359 Employees
251K-310K Annually

Similar Companies Hiring

Hanover Park Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
42 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account