Staff Software Engineer, Ads ML Inference Infrastructure

Posted 9 Days Ago
Be an Early Applicant
3 Locations
In-Office
208K-365K Annually
Senior level
Social Media
Our mission is to bring everyone the inspiration to create a life they love.
The Role
Lead the development of model inference and feature serving systems for Ads, optimize pipelines, partner with teams for productionization, and mentor engineers.
Summary Generated by Built In

About Pinterest:

Millions of people around the world come to our platform to find creative ideas, dream about new possibilities and plan for memories that will last a lifetime. At Pinterest, we’re on a mission to bring everyone the inspiration to create a life they love, and that starts with the people behind the product.

Discover a career where you ignite innovation for millions, transform passion into growth opportunities, celebrate each other’s unique experiences and embrace the flexibility to do your best work. Creating a career you love? It’s Possible.

Staff Software Engineer, Ads ML Inference Infrastructure


The Ads ML Inference Infra team owns the online inference and feature serving systems that power real-time model scoring and delivery for all Ads models at Pinterest. The team is looking for a staff engineer with strong hands-on experience in large-scale ML inference systems, as well as capabilities in solving ambiguous technical problems and driving strategic, cross-functional efforts.


What you’ll do:

  • Lead and drive efforts to build next-generation model inference and feature serving systems that power up to 100x larger models and directly uplevel Pinterest’s monetization business.
  • Design and optimize low-latency, high-throughput inference pipelines to meet strict SLOs while improving performance, efficiency, and cost.
  • Partner with Ads ML and product teams to productionize new model architectures (including LLMs and multi-stage ranking models) and scale them reliably to global traffic.
  • Evolve the online feature platform (feature computation, caching, and retrieval) to improve coverage, freshness, and consistency for Ads models.
  • Evaluate and integrate new technologies (e.g., GPU acceleration, model compression, Triton, vLLM, Dynamo) to advance our inference stack.
  • Build strong partnerships with other infra and ML teams to improve end-to-end reliability, observability, and developer velocity for Ads ML.
  • Mentor and coach other engineers, guiding them through technical decisions, system design, and career development.

What we’re looking for:

  • BS (or higher) degree in Computer Science or a related field.
  • ~8+ years of relevant industry experience designing and operating large-scale, production ML or distributed infra systems.
  • Deep knowledge of at least one programming language (Java, C++, Python).
  • Deep experience with distributed systems or recommendation / ads serving infrastructure (e.g., request routing, online storage, caching, feature serving, APIs).
  • Hands-on experience with at least one deep learning framework (PyTorch or TensorFlow) and bringing models from offline experimentation to production.
  • [Preferred] Experience with model / hardware accelerator libraries (e.g., CUDA, quantization, distillation, low-precision inference).
  • [Preferred] Experience with inference optimization and serving frameworks such as Triton, vLLM, or Dynamo.
  • Proven track record of leading complex projects, setting technical direction, and collaborating across functions and orgs; experience mentoring and coaching other engineers.

In-Office Requirement Statement:

  • We let the type of work you do guide the collaboration style. That means we're not always working in an office, but we continue to gather for key moments of collaboration and connection.
  • This role will need to be in the office for in-person collaboration 1-2 times per week and therefore needs to be in a commutable distance from one of the following offices Palo Alto, CA; San Francisco, CA; Seattle, WA.

Relocation Statement:

  • This position is not eligible for relocation assistance. Visit our PinFlex page to learn more about our working model.

#LI-HYBRID

#LI-AG8

At Pinterest we believe the workplace should be equitable, inclusive, and inspiring for every employee. In an effort to provide greater transparency, we are sharing the base salary range for this position. The position is also eligible for equity. Final salary is based on a number of factors including location, travel, relevant prior experience, or particular skills and expertise.

Information regarding the culture at Pinterest and benefits available for this position can be found here.

US based applicants only
$208,454$364,795 USD

Our Commitment to Inclusion:

Pinterest is an equal opportunity employer and makes employment decisions on the basis of merit. We want to have the best qualified people in every job. All qualified applicants will receive consideration for employment without regard to race, color, ancestry, national origin, religion or religious creed, sex (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender, gender identity, gender expression, age, marital status, status as a protected veteran, physical or mental disability, medical condition, genetic information or characteristics (or those of a family member) or any other consideration made unlawful by applicable federal, state or local laws. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. If you require a medical or religious accommodation during the job application process, please complete this form for support.
 

Top Skills

C++
Cuda
Dynamo
Java
Python
PyTorch
TensorFlow
Triton
Vllm
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: San Francisco, CA
0 Employees

What We Do

Pinterest is the visual inspiration platform people around the world use to shop products personalized to their taste, find ideas to do offline and discover the most inspiring creators. Today, more than 460 million people come to the platform every month to explore and experience billions of ideas that have been saved. We’re proud to help people to discover and do what they love.

Similar Jobs

Anduril Logo Anduril

Senior Program Manager

Aerospace • Artificial Intelligence • Hardware • Robotics • Security • Software • Defense
In-Office
Bellevue, WA, USA
6000 Employees
146K-194K Annually

Anduril Logo Anduril

Senior Site Reliability Engineer

Aerospace • Artificial Intelligence • Hardware • Robotics • Security • Software • Defense
In-Office
Seattle, WA, USA
6000 Employees
166K-220K Annually

Zscaler Logo Zscaler

Platform Engineer

Cloud • Information Technology • Security • Software • Cybersecurity
Easy Apply
Hybrid
Bellevue, WA, USA
8697 Employees
154K-220K Annually

ZS Logo ZS

Senior Director - Product Management

Artificial Intelligence • Healthtech • Professional Services • Analytics • Consulting
Hybrid
5 Locations
13000 Employees

Similar Companies Hiring

Babylist Thumbnail
Social Media • Retail • Kids + Family • Healthtech • eCommerce
Emeryville, CA
300 Employees
Digible Thumbnail
Social Media • PropTech • Marketing Tech • Digital Media • Artificial Intelligence • Agency • AdTech
PH
145 Employees
Posh Thumbnail
Software • Social Media • Events
New York, New York
65 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account