HPC Systems Engineer

Posted 4 Days Ago
Milpitas, CA, USA
In-Office
160K-271K Annually
Senior level
Hardware
The Role
Lead design, deployment, and operational support of global HPC cluster platforms. Architect scalable compute/storage/network solutions, drive implementations to production, ensure reliability and performance, provide troubleshooting and lifecycle management, and promote automation, DevOps, and standardization across the platform.
Summary Generated by Built In

Company Overview

KLA is a global leader in diversified electronics for the semiconductor manufacturing ecosystem. Virtually every electronic device in the world is produced using our technologies. No laptop, smartphone, wearable device, voice-controlled gadget, flexible screen, VR device or smart car would have made it into your hands without us. KLA invents systems and solutions for the manufacturing of wafers and reticles, integrated circuits, packaging, printed circuit boards and flat panel displays. The innovative ideas and devices that are advancing humanity all begin with inspiration, research and development. KLA places an above-average emphasis on innovation, investing 15% of sales back into R&D. Our expert teams of physicists, engineers, data scientists and problem-solvers work together with the world’s leading technology providers to accelerate the delivery of tomorrow’s electronic devices. Life here is exciting and our teams thrive on tackling really hard problems. There is never a dull moment with us.

Group/Division

Enabling the movement toward advanced chip design, KLA's Measurement, Analytics and Control group (MACH) is looking for the best and brightest research scientists, software engineers, application development engineers and senior product technology process engineers to join our team. The MACH team's mission is to collaborate with our customers to innovate technologies and solutions that detect and control highly complex process variations at their source, rather than compensate for them at later stages of the manufacturing process. Chipmakers around the globe rely on KLA, with its more than 40 years of semiconductor process control experience, to ensure that their fabs ramp next-generation devices to volume production quickly and cost-effectively. Our MACH team develops leading-edge solutions for patterning process analytics and control technologies, thereby providing customers with critical insight at the feature, field and cross-wafer levels. Our teams also develop advanced modeling simulation, data analytics and process control modeling technologies. As a member of the MACH team, you’ll be joining the most sophisticated and successful process-control company in the semiconductor industry, working across functions to solve the most complex technical problems in the digital age.

Job Description/Preferred Qualifications

Role Overview

In this role, you will lead the architecture, deployment, and operational support of a high-performance computing (HPC) cluster platform used across IC fabrication facilities and mask shops globally.

You will partner with engineering stakeholders to gather requirements, design scalable solutions, and drive implementation from concept through production. This role requires a strong balance of systems architecture, hands-on engineering, and operational excellence in complex HPC environments.

Key Responsibilities

  • Design and architect scalable, high-performance HPC cluster solutions for global manufacturing environments
  • Lead deployment, configuration, and lifecycle management of cluster infrastructure
  • Collaborate with developers and cross-functional teams to understand requirements and translate them into technical solutions
  • Drive solutions from design through production, including implementation, validation, and support
  • Ensure system reliability, performance, and availability across compute, storage, and networking layers
  • Support ongoing operations, troubleshooting, and continuous improvement of HPC systems
  • Contribute to automation, standardization, and DevOps best practices across the platform

Qualifications & Experience

Systems & Infrastructure

  • Deep expertise in Linux operating systems (SUSE, Red Hat, Rocky Linux, Ubuntu)
  • Strong experience architecting and maintaining robust storage systems
  • Solid understanding of HPC hardware ecosystems, including servers, GPUs, networking, storage, schedulers, BIOS, and BMC
  • Experience with virtualization technologies such as VMware, Proxmox, or XCP-ng

Networking & Core Services

  • Strong understanding of TCP/IP fundamentals and network protocols (DNS, DHCP, HTTP, LDAP, SMTP)
  • Experience with file sharing technologies (NFS, CIFS)
  • Familiarity with net boot/PXE and high-availability Linux configurations

Automation & DevOps

  • Proficiency in scripting and development using Shell and Python
  • Experience with configuration management tools (Ansible, Salt, Chef, Puppet)
  • Strong DevOps mindset, including CI/CD pipelines and Git-based repositories

Platforms & Tools

  • Experience with HPC schedulers (SGE, SLURM)
  • Familiarity with web servers and traffic management (Apache, Nginx, reverse proxy, load balancing via HAProxy)
  • Monitoring and observability tools (Prometheus, Grafana, Nagios)
  • Database experience with MySQL

Minimum Qualifications

Doctorate (Academic) Degree and related work experience of 3+ years; Master's Level Degree and related work experience of 6+ years; Bachelor's Level Degree and related work experience of 8+ years

Base Pay Range: $159,500.00 - $271,200.00 Annually

Primary Location: USA-CA-Milpitas-KLA

KLA’s total rewards package for employees may also include participation in performance incentive programs and eligibility for additional benefits including but not limited to: medical, dental, vision, life, and other voluntary benefits, 401(K) including company matching, employee stock purchase program (ESPP), student debt assistance, tuition reimbursement program, development and career growth opportunities and programs, financial planning benefits, wellness benefits including an employee assistance program (EAP), paid time off and paid company holidays, and family care and bonding leave.

Interns are eligible for some of the benefits listed. Our pay ranges are determined by role, level, and location. The range displayed reflects the pay for this position in the primary location identified in this posting. Actual pay depends on several factors, including state minimum wage rates, location, job-related skills, experience, and relevant education level or training. We are committed to complying with all applicable federal and state minimum wage requirements. If applicable, your recruiter can share more about the specific pay range for your preferred location during the hiring process.

KLA is proud to be an Equal Opportunity Employer. We will ensure that qualified individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment. Please contact us at [email protected] or at +1-408-352-2808 to request accommodation.

Be aware of potentially fraudulent job postings or suspicious recruiting activity by persons posing as KLA employees. KLA never asks for any financial compensation to be considered for an interview, to become an employee, or for equipment. Further, KLA does not work with any recruiters or third parties who charge such fees either directly or on behalf of KLA. Please ensure that you have searched KLA’s Careers website for legitimate job postings. KLA follows a recruiting process that involves multiple interviews, in person or by video conference, with our hiring managers. If you are concerned that a communication, an interview, an offer of employment, or an employee is not legitimate, please send an email to [email protected] to confirm the person you are communicating with is an employee. We take your privacy very seriously and handle your information confidentially.

KLA Compensation & Benefits Highlights

The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about KLA and has not been reviewed or approved by KLA.

  • Retirement Support: Retirement offerings include a 401(k) plan with company matching and financial planning support. Student debt assistance and related financial benefits reinforce long-term savings and security.
  • Equity Value & Accessibility: Ownership programs include an Employee Stock Purchase Plan and broad-based RSU participation that extend equity beyond a narrow group. These elements complement competitive pay and bonuses to strengthen total rewards.
  • Leave & Time Off Breadth: Time-off programs span paid time off, paid company holidays, and paid volunteer time. Family care and bonding leave and back-up care services add flexibility during life events.

The Company
HQ: Milpitas, CA
10,001 Employees

What We Do

KLA develops industry-leading equipment and services that enable innovation throughout the electronics industry. We provide advanced process control and process-enabling solutions for manufacturing wafers and reticles. In close collaboration with leading customers across the globe, our expert teams of physicists, engineers, data scientists and problem-solvers design solutions that move the world forward.

