Senior Product Quality Engineer

Posted 3 Hours Ago
Be an Early Applicant
Santa Clara, CA, USA
In-Office
116K-236K Annually
Senior level
Artificial Intelligence • Computer Vision • Hardware • Robotics • Metaverse
The Role
Lead system-level power failure analysis for data center systems and compute modules, driving customer-return investigations from symptom confirmation to root cause and corrective action. Use lab instruments and Linux-based diagnostics to reproduce, isolate, and analyze complex power issues, correlate field data and telemetry, partner with cross-functional teams, and deliver concise technical and executive reports.
Summary Generated by Built In

NVIDIA is widely considered to be one of the technology world's most desirable employers. We have some of the most experienced and hardworking people in the world working for us. If you're creative, autonomous, and energized by deep technical problem solving, we want to hear from you!

We are looking for a Product Quality Engineer to join our Systems Product Quality team as the system-level power debug domain expert for customer returns and field failures. This role will lead technical failure analysis for NVIDIA data center systems, compute trays, and compute modules, with special focus on customer-reported field issues involving power delivery, power sequencing, and intermittent power events.

Your hands-on debug capability, structured root-cause mindset, and ability to connect board-level signals with system-level behavior will be essential to driving customer-return investigations from symptom confirmation through technical root cause and corrective action closure. Own customer-return and field-failure power investigations from symptom confirmation through containment, root cause, corrective action, and quality learning closure.

What you'll be doing:

  • Lead system-level power failure analysis for customer returns and field failures across data center systems, compute trays, and compute modules.

  • Confirm, reproduce, and isolate complex power failures such as no power, intermittent boot, unexpected shutdown, brown-out, rail droop, over-current protection, under-voltage protection, sequencing faults, hot-plug events, and margin-related failures.

  • Analyze system power architecture from AC/DC input through PSU, PDU, hot-swap, eFuse, VR, regulator, current-sense, and board-level power rails to determine the true failure boundary.

  • Use oscilloscopes, current probes, DMMs, BMC-reported voltage/current readings, system event logs, and Linux-based diagnostics to build fact-based debug conclusions.

  • Correlate field return data, customer logs, firmware behavior, board schematics, PCB layout, BOM history, and telemetry trends to identify root cause and assess risk.

  • Partner with hardware design, power design, firmware, customer quality, reliability, manufacturing, and supplier quality teams to resolve critical customer and field issues.

  • Drive containment, failure analysis, corrective and preventive actions, and defect-prevention feedback with clear ownership and closure criteria.

  • Create concise technical reports, quality updates, and executive-ready summaries that communicate failure mechanism, impact, risk, mitigation, and next steps.

What we need to see:

  • Bachelor's degree or equivalent experience in Electrical Engineering, Electronic Engineering, or a related field; Master's degree preferred.

  • 5+ years of hands-on experience in hardware debug, customer return analysis, field failure analysis, or power electronics support for complex electronic systems.

  • Strong understanding of system power delivery, DC-DC converters, multiphase VRs, regulators, power sequencing, current sharing, sense circuits, protection circuits, and high-current low-voltage rails.

  • Proven ability to debug power issues at system, board, and component level by reading schematics, PCB layouts, power trees, design specifications, and test logs.

  • Experience with Linux systems, Linux shell scripts, BMC/IPMI/Redfish-style logs or telemetry, and basic automation for data collection and debug efficiency.

  • Strong analytical and problem-solving skills, including structured troubleshooting, design of experiments, root cause analysis, statistical process control, and quality data analysis.

  • Ability to work across engineering, customer quality, supplier, and customer-facing teams while maintaining clear technical ownership and urgency.

  • Excellent written and spoken English, strong documentation habits, and the ability to explain complex debug findings to both technical and non-technical audiences.

  • High sense of responsibility, self-motivation, collaborative working style, and comfort driving ambiguous technical issues to closure.

Ways to stand out from the crowd:

  • Experience debugging high-power server or data center platforms in customer-return or field-failure analysis workflows.

  • Hands-on familiarity with PSU/PDU behavior, rack-level power distribution, power capping, power transients, or data center deployment conditions observed in field returns.

  • Experience with board-level power design, hardware verification, power integrity measurement, or design-for-debug improvements.

  • Knowledge of quality and reliability concepts, 8D problem solving, customer failure reporting, RMA/FA workflow, and supplier corrective action processes.

  • Ability to confirm, bound, and translate power-related field failures into corrective actions, debug playbooks, and prevention feedback for design, customer quality, and supplier teams.

With highly competitive salaries and a comprehensive benefits package, NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people on the planet working for us. If you're creative and autonomous, with a genuine passion for technology, we want to hear from you.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 116,000 USD - 184,000 USD for Level 3, and 148,000 USD - 235,750 USD for Level 4.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until June 18, 2026.

This posting is for an existing vacancy. 

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering an inclusive work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Skills Required

  • Bachelor's degree or equivalent experience in Electrical Engineering, Electronic Engineering, or related field
  • Master's degree
  • 5+ years hands-on experience in hardware debug, customer return analysis, field failure analysis, or power electronics support for complex electronic systems
  • Strong understanding of system power delivery, DC-DC converters, multiphase VRs, regulators, power sequencing, current sharing, sense circuits, protection circuits, and high-current low-voltage rails
  • Proven ability to debug power issues at system, board, and component level by reading schematics, PCB layouts, power trees, design specifications, and test logs
  • Experience with Linux systems, Linux shell scripts, BMC/IPMI/Redfish-style logs or telemetry, and basic automation for data collection and debug efficiency
  • Strong analytical and problem-solving skills including structured troubleshooting, design of experiments, root cause analysis, statistical process control, and quality data analysis
  • Ability to work across engineering, customer quality, supplier, and customer-facing teams with clear technical ownership
  • Excellent written and spoken English and strong documentation habits
  • High sense of responsibility, self-motivation, and collaborative working style
  • Experience debugging high-power server or data center platforms in customer-return or field-failure analysis workflows
  • Hands-on familiarity with PSU/PDU behavior, rack-level power distribution, power capping, power transients, or data center deployment conditions
  • Experience with board-level power design, hardware verification, power integrity measurement, or design-for-debug improvements
  • Knowledge of quality and reliability concepts, 8D problem solving, customer failure reporting, RMA/FA workflow, and supplier corrective action processes
  • Ability to translate power-related field failures into corrective actions, debug playbooks, and prevention feedback

NVIDIA Compensation & Benefits Highlights

The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about NVIDIA and has not been reviewed or approved by NVIDIA.

  • Equity Value & Accessibility Equity awards and a discounted ESPP are highlighted as core parts of total compensation, enabling employees to share in the company’s success. Stock-based compensation and the two-year lookback ESPP are consistently described as especially valuable.
  • Healthcare Strength Health coverage is portrayed as robust, with comprehensive medical, dental, and vision options alongside mental health support and on-site care resources. Employer HSA contributions and wellness perks reinforce the depth of the offering.
  • Retirement Support Retirement programs are depicted as strong, featuring a meaningful 401(k) match with Roth options and support for Mega Backdoor Roth contributions. These elements position long-term savings as a notable advantage of the total rewards package.

NVIDIA Insights

Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Santa Clara, CA
21,960 Employees
Year Founded: 1993

What We Do

NVIDIA’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI — the next era of computing — with the GPU acting as the brain of computers, robots, and self-driving cars that can perceive and understand the world. Today, NVIDIA is increasingly known as “the AI computing company.”

Similar Jobs

Applied Materials Logo Applied Materials

Reliability Engineer

Artificial Intelligence • Semiconductor • Manufacturing
In-Office
Santa Clara, CA, USA
23282 Employees
192K-264K Annually
In-Office
San Jose, CA, USA
283 Employees
120K-155K Annually

Epirus Logo Epirus

Senior Product Quality Engineer

Aerospace • Hardware • Machine Learning • Robotics • Software
Hybrid
Torrance, CA, USA
240 Employees
114K-131K Annually

Skydio Logo Skydio

Senior NPI Product Quality Engineer

Artificial Intelligence • Hardware • Robotics • Software
Hybrid
San Mateo, CA, USA
250 Employees
147K-200K Annually

Similar Companies Hiring

Fairly Even Thumbnail
Hardware • Robotics • Sales • Software • Hospitality
New York, NY
30 Employees
Hanover Park Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
42 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account