HPC Engineer

Reposted 14 Days Ago
Be an Early Applicant
Fremont, CA, USA
In-Office
Senior level
Information Technology
The Role
The HPC Engineer will design, implement, and support high-performance computing solutions, optimize applications, manage security protocols, and troubleshoot issues.
Summary Generated by Built In

We are seeking a highly skilled and motivated HPC Engineer to join our Engineering team. This individual will design, implement, optimize, and support high-performance computing solutions tailored to customer needs. This role requires strong technical experience in HPC architectures, parallel computing, and cluster management, as well as a passion for working with emerging technologies to solve complex problems.

Primary responsibilities include:

    • Design, configure, and deploy HPC clusters and solutions including hardware, networking, storage, and software stack
    • Monitor and maintain the performance of HPC resources.
    • Troubleshoot hardware, software, and network issues within HPC environments.
    • Optimize and tune applications for performance in HPC systems.
    • Introduce and apply new technologies for improving computational effectiveness.
    • Manage security protocols and ensure data integrity and confidentiality.

Requirements
    • 5+ years in HPC engineering, cluster administration or technical role
    • Bachelor's or Master's degree in Computer Science, Electrical Engineering, or a related field.
    • Experience with MPI, CUDA, or other parallel computing
    • Hands-on experience with job schedulers (Kubernetes, SLURM, PBS,)
    • Proficiency in Linux operating systems and shell scripting (Python, Bash)
    • Familiar with containerization technologies (docker, singularity) in HPC environment
    • Strong troubleshooting, analytical, and problem-solving skills
    • Excellent communication skills
    • Ability to work effectively in a team environment.
    • Ability to prioritize tasks and manage time effectively, especially when dealing with urgent issues

Benefits
  • Medical Insurance
  • Dental Insurance
  • Vision Insurance
  • 401(k)
  • Flexible spending account
  • Commuter benefits
  • Disability insurance

We also have a perfect location for all types of commuters: AMAX is located right between I-680 and I-880. Warm Springs/South Fremont BART station and bus stops are within a 10-minute walking distance. 5 grocery stores, 6+ coffee/tea places, and numerous restaurants within 1 mile. Try the delicious fusions or grab your daily groceries after work.

About AMAX

Established in 1979, AMAX is a globally recognized leader in GPU-accelerated IT infrastructure, specializing in transforming standard IT systems into advanced, high-performance computing solutions. Catering to industries such as AI, cloud computing, autonomous vehicles, and high-performance computing, AMAX has set benchmarks in innovation, including pioneering liquid-cooled HPC systems for the semiconductor industry. With a global footprint spanning North America, Europe, and Asia, AMAX offers end-to-end services from design and manufacturing to deployment. Committed to addressing the growing demands of AI, AMAX delivers advanced solutions that help organizations achieve their technology goal and drive progress on a global scale. To learn more about AMAX’s advanced AI solutions, visit amax.com.

Join Us

Become part of a diverse and inclusive team that values your technical expertise and innovative thinking. Together, we’ll push the boundaries of what’s possible in the hardware industry.

AMAX is proud to be an equal-opportunity employer. We welcome all applicants and provide equal employment opportunities regardless of age, race, gender, or other legally protected characteristics.

Skills Required

  • 5+ years in HPC engineering, cluster administration or technical role
  • Bachelor's or Master's degree in Computer Science, Electrical Engineering, or a related field
  • Experience with MPI, CUDA, or other parallel computing
  • Hands-on experience with job schedulers (Kubernetes, SLURM, PBS)
  • Proficiency in Linux operating systems and shell scripting (Python, Bash)
  • Familiar with containerization technologies (docker, singularity) in HPC environment
  • Strong troubleshooting, analytical, and problem-solving skills
  • Excellent communication skills
  • Ability to work effectively in a team environment
  • Ability to prioritize tasks and manage time effectively, especially when dealing with urgent issues
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Fremont, CA
375 Employees
Year Founded: 1979

What We Do

We are an IT infrastructure design company delivering solutions by transforming IT components into specialized products through intelligent engineering design. Our quality-first approach is at the core of AMAX’s solution development process that spans across concept ideation, material management, validation, manufacturing, deployment, and service. We provide advanced computing solutions through focused business models of OEM, AMAX-designed and branded workstations, servers, rack solutions, and professional services. AMAX brings a high-touch, high-value approach to engaging with our customers to deliver optimized computing solutions. From new product introductions (NPI) to full-scale production and after-market services, our blending of people, processes, and technology across North America, Europe, and Asia enables our customers to move product personalization and manufacturing closer to the edge of business consumption.

Similar Jobs

CoreWeave Logo CoreWeave

Operations Engineer, HPC Networking

Cloud • Information Technology • Machine Learning
In-Office
4 Locations
1450 Employees
110K-179K Annually

Hewlett Packard Enterprise Logo Hewlett Packard Enterprise

HPC & AI Senior Performance Engineer

Artificial Intelligence • Cloud • Information Technology • Consulting
In-Office
2 Locations
85422 Employees
120K-275K Annually

San Francisco Compute Company Logo San Francisco Compute Company

Network Engineer

Artificial Intelligence • Cloud • Information Technology • Infrastructure as a Service (IaaS)
In-Office
San Francisco, CA, USA
30 Employees
250K-325K Annually

NVIDIA Logo NVIDIA

Senior AI and ML HPC Cluster Engineer

Artificial Intelligence • Computer Vision • Hardware • Robotics • Metaverse
In-Office or Remote
2 Locations
21960 Employees
152K-288K Annually

Similar Companies Hiring

Scrunch  Thumbnail
Artificial Intelligence • Information Technology • Marketing Tech • Software • SEO
Salt Lake City, Utah
Standard Template Labs Thumbnail
Artificial Intelligence • Information Technology • Software
New York, NY
25 Employees
Golden Pet Brands Thumbnail
Digital Media • eCommerce • Information Technology • Marketing Tech • Pet • Retail • Social Media
El Segundo, California
178 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account