Engineering Manager - Forward Deployed Engineering (LLM)

San Francisco, CA
In-Office
220K-285K Annually
Mid level
Software
The Role
Lead and mentor a team of Forward Deployed Engineers in building and optimizing LLM inference workloads, delivering high-performance AI applications, and collaborating with cross-functional teams.

ABOUT BASETEN

Baseten powers mission-critical inference for the world's most dynamic AI companies, like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma, and Writer. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies operating at the frontier of AI to bring cutting-edge models into production. We're growing quickly and recently raised our $150M Series D, backed by investors including BOND, IVP, Spark Capital, Greylock, and Conviction. Join us and help build the platform engineers turn to when they ship AI products.

THE ROLE

As an Engineering Manager (Player & Coach), you will lead and mentor a team of Forward Deployed Engineers focused on building, scaling, and optimizing LLM inference workloads for Baseten customers. Combining hands-on technical ownership with managerial leadership, you will guide your team through designing, deploying, and managing high-performance, low-latency AI applications on Baseten’s platform. FDE at Baseten is not a sales function – we are a mix of engineers, product builders, and customer architects who contribute to the core Baseten codebase, drive large portions of our feature roadmap, and execute on complicated customer engagements.

You will also partner with product, infrastructure, and other customer engineering teams to ensure that large language models (LLMs) and other generative AI systems deliver best-in-class performance, reliability, and cost efficiency in production environments.

EXAMPLE INITIATIVES

Take a look at these blog posts written by members of our Forward Deployed Engineering team:

  • Forward Deployed Engineering on the frontier of AI

  • The fastest, most accurate Whisper transcription

  • Deploy production-ready model servers from Docker images

  • Deploy custom ComfyUI workflows as APIs

RESPONSIBILITIES

Leadership & Team Management

  • Lead, mentor, and grow a team of Forward Deployed Engineers, providing guidance on technical direction, project execution, and professional development.

  • Set clear goals and ensure timely, high-quality delivery across multiple customer-facing projects involving LLM deployment and inference optimization.

  • Collaborate with leadership to align team priorities with company and customer goals, balancing short-term delivery, widely varying customer priorities, and long-term technical initiatives.

  • Player-coach – While much of this role will be leading the team, you will also be expected to be a key driver on strategic product initiatives and customer engagements. The best managers derive credibility from being able to be hands-on when needed.

Technical Ownership

  • Develop and maintain software systems and product features using one or more general-purpose programming languages in a production-level environment, with a preference for Python due to its relevance in ML projects.

  • Drive customer impact by designing, implementing, and deploying Baseten solutions end-to-end (problem framing → evaluation → production deployment → monitoring). This involves working with customers’ engineering teams at every stage of the customer journey, including sales, implementation, and expansion.

  • Deliver with velocity: turn vague objectives into clear specs and well-defined PoCs so we can rapidly ship well-tested services and outcomes for our customers.

  • Optimize and enhance AI/ML projects, contributing to the continuous improvement of our technical stack. This includes developing features and PRDs with other engineering and product orgs.

  • Own products and customer projects end-to-end, functioning as an engineer, project manager, and product manager, with a focus on user empathy, project specification, and end-to-end execution.

REQUIREMENTS

  • Bachelor’s, Master’s, or Ph.D. in Computer Science, Engineering, or related field.

  • 4+ years of professional software engineering experience, including 1+ year in a leadership or mentorship capacity.

  • Strong programming skills in Python, with production experience in building or optimizing ML inference systems.

  • Proven experience with LLMs, inference optimization, or serving frameworks (e.g., vLLM, TensorRT, Triton, Hugging Face, Ray Serve).

  • Familiarity with observability, profiling, and cost/performance tradeoffs in production ML systems.

  • Excellent communication and collaboration skills—able to lead cross-functional efforts and drive outcomes in ambiguous, fast-paced environments.

BONUS POINTS

  • Experience leading customer-facing engineering teams or working directly with enterprise partners.

  • Deep understanding of GPU infrastructure, distributed inference, or model compression techniques.

BENEFITS

  • Competitive compensation, including meaningful equity.

  • 100% coverage of medical, dental, and vision insurance for employees and their dependents.

  • Generous PTO policy, including a company-wide Winter Break (our offices are closed from Christmas Eve to New Year's Day!)

  • Paid parental leave

  • Company-facilitated 401(k)

  • Exposure to a variety of ML startups, offering unparalleled learning and networking opportunities.

Apply now to embark on a rewarding journey in shaping the future of AI! If you are a motivated individual with a passion for machine learning and a desire to be part of a collaborative and forward-thinking team, we would love to hear from you.

At Baseten, we are committed to fostering a diverse and inclusive workplace. We provide equal employment opportunities to all employees and applicants without regard to race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, or veteran status.

Top Skills

Hugging Face
LLM
Python
Ray Serve
TensorRT
Triton

The Company
59 Employees

What We Do

At Baseten we provide all the infrastructure you need to deploy and serve ML models performantly, scalably, and cost-efficiently.

Get started in minutes, and avoid getting tangled in complex deployment processes. You can deploy best-in-class open-source models and take advantage of optimized serving for your own models.

We also utilize horizontally scalable services that take you from prototype to production, with light-speed inference on infra that autoscales with your traffic.

Best in class doesn't mean breaking the bank. Run your models on the best infrastructure without running up costs by taking advantage of our scale-to-zero feature.
