Principal Engineer, Compute Platform

Posted 5 Days Ago
Be an Early Applicant
Hiring Remotely in San Francisco, CA, USA
In-Office or Remote
243K-500K Annually
Expert/Leader
Social Media
Our mission is to bring everyone the inspiration to create a life they love.
The Role
Lead consolidation and modernization of a large-scale shared compute platform (PinCompute). Design Kubernetes-based solutions for stateful and GPU-heavy AI workloads, improve utilization via bin-packing/stacking and oversubscription, enable multi-cloud deployments, partner with internal customers for migrations, and drive production quality, observability, performance, and automation.
Summary Generated by Built In

About Pinterest:

Millions of people around the world come to our platform to find creative ideas, dream about new possibilities and plan for memories that will last a lifetime. At Pinterest, we’re on a mission to bring everyone the inspiration to create a life they love, and that starts with the people behind the product.

Discover a career where you ignite innovation for millions, transform passion into growth opportunities, celebrate each other’s unique experiences and embrace the flexibility to do your best work. Creating a career you love? It’s Possible.

At Pinterest, AI isn't just a feature, it's a powerful partner that augments our creativity and amplifies our impact, and we’re looking for candidates who are excited to be a part of that. To get a complete picture of your experience and abilities, we’ll explore your foundational skills and how you collaborate with AI.

Through our interview process, what matters most is that you can always explain your approach, showing us not just what you know, but how you think. You can read more about our AI interview philosophy and how we use AI in our recruiting process here.

Pinterest serves over 600 million users through sophisticated visual and social capabilities which connect inspiration, advertisement, and shopping. Compute Platform provides the underlying compute capabilities to run jobs and processes for all of the systems and workloads needed behind the scenes to create the best experience for our users and advertisers.  This includes distributed processing, data systems, search, experimentation, monetization, AI/ML for ranking and recommendations, GenAI, and internal systems.

We are looking for a Principal Engineer who can lead and scale the consolidation and modernization of this infrastructure under what we call PinCompute, with an emphasis on some of the largest and most challenging stateful workloads, as well as GPU-heavy AI workloads. The scale and scope of the effort will require designing and building around Kubernetes and solving its scaling limitations, handling stateful systems and data-intensive workloads, formalizing mechanisms to stack and bin pack workloads, working with multiple internal customers and giving them migration paths, and working through ambiguous and unforeseen situations which arise from workload requirements, production and operability requirements, and unique multi-tenancy challenges.


What you'll do:

  • Solving the challenges of replacing isolated pools of dedicated compute resources with a very large scale shared compute platform, shifting from machine-based designs to container-based designs.
  • Working with leads across various platforms, especially stateful and data platforms, to build the right features and migration paths that work for them.
  • Owning and driving up utilization on the shared compute platform by designing and implementing workload stacking, optimizing and bin packing, safe oversubscription, etc.
  • Work with multiple customers with unique requirements to make sure the platform will address their needs and is not only a viable but a desirable solution for running their workloads.
  • Leading a group of engineers around design topics, execution, trade offs, migration paths, observability, performance, and operability for the platform.
  • Evolving the platform towards a multi-cloud abstraction layer to enable running workloads across multiple cloud providers.
  • Being a role model for setting a high bar for production quality and engineering excellence in delivering a foundational technology which empowers the entire company.
  • Working closely with partners around capacity planning, cost visibility, fungibility of virtual machine instance types, and efficiency.
  • Putting special focus on the delivery of GPU resources through the platform, to enable and expedite AI workloads.
  • Leverage AI tools to increase the velocity and ease of migrations, and create self service solutions for the customers of the platform as needed
  • Help the team apply AI to the operational aspects of running the cluster, discovering issues, and investigating and root causing issues.
  • Expedite feature development using AI coding tools and be a thought leader on creating the right balance between speed and safety by designing safeguards and layers of defense.


What we’re looking for:

  • Bachelor’s degree in Computer Science, Engineering, or a related field, or equivalent experience.
  • 12+ years of relevant industry experience with large scale, production distributed systems.
  • 5+ years of experience with Kubernetes in production.
  • Experience working across SWE and SRE or Production Engineering teams to deliver robust production systems.
  • Experience with running distributed data systems and migrating them to Kubernetes is highly preferred.
  • Ability to work with cross-functional partners across multiple organizations.
  • Passion for automation, reducing toil, and building proper tooling for getting the job done.

In-Office Requirement Statement: 

  • We recognize that the ideal environment for work is situational and may differ across departments. What this looks like day-to-day can vary based on the needs of each organization or role.
  • This role will need to be in the office for in-person collaboration 1-2 times/quarter and therefore can be situated anywhere in the country. 


Relocation Statement:

  • This position is not eligible for relocation assistance. Visit our PinFlex page to learn more about our working model.

#LI-REMOTE

#LI-JT1

At Pinterest we believe the workplace should be equitable, inclusive, and inspiring for every employee. In an effort to provide greater transparency, we are sharing the base salary range for this position. The position is also eligible for equity. Final salary is based on a number of factors including location, travel, relevant prior experience, or particular skills and expertise.

Information regarding the culture at Pinterest and benefits available for this position can be found here.

US based applicants only
$242,634$499,541 USD

Our Commitment to Inclusion:

Pinterest is an equal opportunity employer and makes employment decisions on the basis of merit. We want to have the best qualified people in every job. All qualified applicants will receive consideration for employment without regard to race, color, ancestry, national origin, religion or religious creed, sex (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender, gender identity, gender expression, age, marital status, status as a protected veteran, physical or mental disability, medical condition, genetic information or characteristics (or those of a family member) or any other consideration made unlawful by applicable federal, state or local laws. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. If you require a medical or religious accommodation during the job application process, please complete this form for support.
 
By submitting this application, I certify that all information submitted in my application and throughout the hiring process is true, accurate, and complete to the best of my knowledge. I understand that any false statement, omission, or misrepresentation may disqualify me from employment consideration or result in termination if discovered after hire.

Skills Required

  • Bachelor's degree in Computer Science, Engineering, or related field, or equivalent experience.
  • 12+ years relevant industry experience with large scale, production distributed systems.
  • 5+ years of experience with Kubernetes in production.
  • Experience working across software engineering and SRE/Production Engineering teams to deliver robust production systems.
  • Experience with running distributed data systems and migrating them to Kubernetes.
  • Ability to work with cross-functional partners across multiple organizations.
  • Passion for automation, reducing toil, and building proper tooling.
  • Experience delivering GPU resources through a shared platform to enable AI workloads.
  • Experience designing and implementing workload stacking, bin packing, and safe oversubscription for utilization.
  • Experience evolving platforms toward multi-cloud abstraction and running workloads across multiple cloud providers.

Pinterest Compensation & Benefits Highlights

The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about Pinterest and has not been reviewed or approved by Pinterest.

  • Fair & Transparent Compensation Compensation is considered competitive in core technical roles, with clearly defined base, bonus, and RSU components. Vesting schedules and pay elements are articulated clearly, supporting visibility into total rewards.
  • Parental & Family Support Policies include substantial paid parental leave globally alongside fertility and family‑building benefits, adoption assistance, and dedicated caregiver supports. Communications also highlight a phased return‑to‑work approach.
  • Flexible Benefits PinFlex enables role‑dependent hybrid/remote work with home‑office, connectivity, and commuter support, plus the option to work internationally for a limited period with approval. Time away is reinforced by a paid company shutdown and generous vacation framing.

Pinterest Insights

Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: San Francisco, CA
0 Employees

What We Do

Pinterest is the visual inspiration platform people around the world use to shop products personalized to their taste, find ideas to do offline and discover the most inspiring creators. Today, more than 460 million people come to the platform every month to explore and experience billions of ideas that have been saved. We’re proud to help people to discover and do what they love.

Similar Jobs

Atlassian Logo Atlassian

Solution Sales, Specialist Sales, Strategy Collection

Cloud • Information Technology • Productivity • Security • Software • App development • Automation
In-Office or Remote
San Francisco, CA, USA
11000 Employees

Atlassian Logo Atlassian

Solution Sales, Specialist Sales, Strategy Collection

Cloud • Information Technology • Productivity • Security • Software • App development • Automation
In-Office or Remote
San Francisco, CA, USA
11000 Employees

Nexthink Logo Nexthink

Enterprise Account Executive

Artificial Intelligence • Big Data • Cloud • Information Technology • Machine Learning • Software
Remote or Hybrid
San Diego, CA, USA
1200 Employees
150K-360K Annually

Nexthink Logo Nexthink

Enterprise Account Executive

Artificial Intelligence • Big Data • Cloud • Information Technology • Machine Learning • Software
Remote or Hybrid
San Francisco, CA, USA
1200 Employees
150K-360K Annually

Similar Companies Hiring

Digible Thumbnail
Social Media • PropTech • Marketing Tech • Digital Media • Artificial Intelligence • Agency • AdTech
PH
145 Employees
Posh Thumbnail
Events • Social Media • Software
New York, New York
70 Employees
Golden Pet Brands Thumbnail
Digital Media • eCommerce • Information Technology • Marketing Tech • Pet • Retail • Social Media
El Segundo, California
178 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account