R&D Principal Software Engineer

Posted 2 Days Ago
2 Locations
In-Office
127K-226K Annually
Expert/Leader
Software • Semiconductor • Manufacturing
The Role
Lead design and implementation of an AI Virtualization Stack for ESXi, integrating PyTorch/JAX backends via OpenXLA. Optimize ML acceleration (LLM inference, KV-caching, FlashAttention), troubleshoot GPU/XPU runtime issues, collaborate with driver/device teams and vendors, and deliver production-quality code and documentation.
Summary Generated by Built In

Please Note:

1. If you are a first-time user, please create your candidate login account before you apply for a job. (Click Sign In > Create Account)

2. If you already have a Candidate Account, please Sign-In before you apply.

Job Description:

About Us:

Broadcom is a global leader in semiconductor and infrastructure software solutions. As part of our commitment to innovation and excellence, our VMware subsidiary is dedicated to shaping the future of virtualization technology. We are seeking talented individuals to join the GPU Virtualization Team, which is responsible for integrating GPUs in the ESXi Operating System and providing acceleration to AI/ML and Graphics applications running inside the Virtual Machines. The GPU Virtualization Team is part of the VMware Cloud Foundation (VCF) Division which enables readily deployable, easily managed solutions with GPUs to unleash the power of heterogeneous computing for modern applications.

Job Summary:

We are seeking an experienced Principal Software Engineer with a track record of leading initiatives. As a Principal Engineer, you will focus on developing and integrating our AI Virtualization Stack to provide hardware-agnostic acceleration for AI/ML workloads on Virtual Machines. This role is critical in enabling multi-vendor GPU and XPU support using ML compilation technologies.

Responsibilities:

  • Research, design, and develop the AI Virtualization Stack for our ESXi server product.

  • Implement and optimize PyTorch and JAX backends using the OpenXLA framework to ensure high-performance AI/ML workload execution across GPUs and XPUs.

  • Analyze and re-architect performance-critical sections of the ML acceleration code, focusing on optimization techniques for LLM inference such as KV-caching and FlashAttention.

  • Troubleshoot and address bugs related to AI/ML acceleration functionality.

  • Deliver software that meets the coding guidelines and quality standards set by the VCF Division.

  • Develop and maintain technical documentation for delivered features.

  • Work closely with the larger team, including the virtual driver and device teams, as well as external GPU/XPU vendors, to provide end-to-end support for ML frameworks.

  • Stay up to date with the latest GPU/XPU hardware architectures and AI/ML compiler technologies.

Qualifications:

  • Bachelor's degree in Computer Science or a related field and 12+ years of related experience, or Master's degree and 10+ years of related experience.

  • 5+ years of experience in ML framework/runtime development and GPU/XPU backend engineering.

  • Strong understanding of and direct experience with ML frameworks (PyTorch, JAX) and graph/ML compiler technologies (e.g., OpenXLA).

  • Experience with C++ and Python programming languages.

  • Strong problem-solving skills and ability to troubleshoot complex issues.

  • Excellent communication and collaboration skills.

  • Experience with version control systems such as Git.

  • Ability to thrive in a fast-paced and dynamic work environment.

  • Familiarity with enterprise coding standards and best practices.

Nice to Have:

  • Experience with inference servers such as vLLM or Triton.

  • Experience with low-level GPU kernel development and writing custom kernels (e.g., CUDA, ROCm, or similar).

Must have legal authorization to work in the US

Additional Job Description:

Compensation and Benefits

The annual base salary range for this position is $127,100 - $226,000.

As a valued member of our team, you'll be eligible for a discretionary annual bonus and the opportunity to receive not only a competitive new hire equity grant, but also annual equity awards, connecting your success directly to the company's growth, all subject to relevant plan documents and award agreements.

Broadcom offers a competitive and comprehensive benefits package: medical, dental and vision plans, 401(k) participation including company matching, Employee Stock Purchase Program (ESPP), Employee Assistance Program (EAP), company-paid holidays, paid sick leave and vacation time. The company follows all applicable laws for Paid Family Leave and other leaves of absence.

Broadcom is proud to be an equal opportunity employer. We will consider qualified applicants without regard to race, color, creed, religion, sex, sexual orientation, national origin, citizenship, disability status, medical condition, pregnancy, protected veteran status or any other characteristic protected by federal, state, or local law. We will also consider qualified applicants with arrest and conviction records consistent with local law.

If you are located outside the USA, please be sure to fill out a home address, as this will be used for future correspondence.

Broadcom Compensation & Benefits Highlights

The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about Broadcom and has not been reviewed or approved by Broadcom.

  • Equity Value & Accessibility: Equity is used broadly through RSUs with quarterly or annual vesting, and an ESPP with a discount and look-back that can add meaningful upside. Company disclosures show ongoing equity grants, including inducement RSUs tied to acquisitions, underscoring equity's central role in total rewards.
  • Retirement Support: A 401(k) plan with a competitive company match and immediate vesting is consistently highlighted, supporting long-term savings. Tax-advantaged accounts like HSA/FSA further strengthen the financial wellness toolkit.
  • Pay Growth & Progression: Compensation ceilings in technical tracks are described as high, with wide ranges and very strong totals for experienced engineers. Sales compensation is also characterized as competitive, supporting attractive on-target earnings.

The Company
HQ: San Jose, CA
38,985 Employees
Year Founded: 1991

What We Do

Broadcom Inc. (NASDAQ: AVGO) is a global technology leader that designs, develops and supplies semiconductor and infrastructure software solutions.
