Director, AI Cluster Engineering

Sorry, this job was removed at 04:20 p.m. (CST) on Thursday, Aug 28, 2025
Be an Early Applicant
San Jose, CA, USA
In-Office
180K-300K Annually
Information Technology • Semiconductor
The Role
Job Title: Director, AI Cluster Engineering
Office Location: San Jose, CA
Work Model: Onsite

About SK hynix America
At SK hynix America, we're at the forefront of semiconductor innovation, developing advanced memory solutions that power everything from smartphones to data centers. As a global leader in DRAM and NAND flash technologies, we drive the evolution of advancing mobile technology, empowering cloud computing, and pioneering future technologies. Our cutting-edge memory technologies are essential in today's most advanced electronic devices and IT infrastructure, enabling enhanced performance and user experiences across the digital landscape.
We're looking for innovative minds to join our mission of shaping the future of technology. At SK hynix America, you'll be part of a team that's pioneering breakthrough memory solutions while maintaining a strong commitment to sustainability. We're not just adapting to technological change – we're driving it, with significant investments in artificial intelligence, machine learning, and eco-friendly solutions and operational practices. As we continue to expand our market presence and push the boundaries of what's possible in semiconductor technology, we invite you to be part of our journey to creating the next generation of memory solutions that will define the future of computing.

Job Overview:

As the Director of AI Cluster Engineering, you will lead the design, development, and operation of high-performance computing (HPC) clusters for AI/ML workloads. You will be responsible for architecting and optimizing AI data center and IT environments to ensure scalability, performance, reliability, and cost-effectiveness. This role requires collaboration with cross-functional teams to align computing infrastructure with the organization's strategic direction and future needs. This role offers an exciting opportunity to drive the future of AI/ML computing infrastructure, leveraging cutting-edge technologies to build scalable and efficient high-performance computing environments.

Responsibilities:

Computing Cluster Architecture

  • Design and architect high-performance computing clusters optimized for AI/ML model training and inference, including scientific computing applications and transformer-based AI models.
  • Optimize cluster performance through hardware selection, network topology design, equipment configuration, and performance analysis.
  • Evaluate, implement, and manage job scheduling and workload management systems to enhance efficiency.
  • Deploy and operate data center networking infrastructure using software system for automation for design validation, deployment, and operational support.

Infrastructure Planning & Implementation

  • Develop and maintain mid-to-long term infrastructure roadmaps aligned with business objectives.
  • Evaluate emerging technologies and recommend solutions to enhance AI data center capabilities, identifying marketable AI DC solutions.
  • Oversee large-scale infrastructure projects, ensuring timely execution, budget adherence, and performance optimization.

Team Leadership & Collaboration

  • Lead and mentor engineering teams in designing, implementing, and maintaining AI/ML infrastructure solutions.
  • Collaborate with cross-functional teams, including strategy, security, and application development, to align infrastructure with organizational goals.
  • Engage with technology vendors and partners to evaluate new solutions, negotiate contracts, and drive innovation in AI computing infrastructure.

Qualification:   

  • Bachelor’s degree in Computer Science, Engineering, or a related field (Master’s degree preferred).
  • 15+ years of hands-on experience in computing cluster architecture and backend server systems supporting global-scale consumer applications, including mobile devices.
  • 10+ years of experience in cloud computing (AWS, Azure, GCP), with deep expertise in virtualization and cloud platform architectures.
  • Strong understanding of data center networking, including egress/ingress traffic, interconnection points, and exchange services.
  • Extensive experience in high-performance and distributed computing cluster design, management, and optimization.
  • Strong familiarity with AI/ML infrastructure requirements, best practices, and industry trends.
  • Knowledge of data center security standards and regulatory compliance requirements.

Benefits:

  • Top Tier health insurance at no employee cost
  • Paid day offs: PTO + Company Holidays + Happy Fridays
  • Paid Parental Leave Program
  • 401k MatchingEducational reimbursement up to $10,000 per year
  • Donation Matching and volunteering opportunities
  • Corporate discount programs
  • Free Breakfast/Lunch/Dinner provided to employees

Equal Employment Opportunity:

SKHYA is an Equal Employment Opportunity Employer. We provide equal employment opportunities to all qualified applicants and employees and prohibit discrimination and harassment of any type without regard to race, sex, pregnancy, sexual orientation, religion, age, gender identity, national origin, color, protected veteran or disability status, genetic information or any other status protected under federal, state, or local applicable laws. 


Compensation:

Our compensation reflects the cost of labor across several U.S. geographic markets, and we pay differently based on those defined markets. Pay within the provided range varies by work location and may also depend on job-related skills and experience. Your Recruiter can share more about the specific salary range for the job location during the hiring process.

Pay Range
$180,000$300,000 USD

Similar Jobs

SailPoint Logo SailPoint

Customer Success Manager

Artificial Intelligence • Cloud • Sales • Security • Software • Cybersecurity • Data Privacy
Remote or Hybrid
United States
2461 Employees
125K-210K Annually

SailPoint Logo SailPoint

Enterprise Identity & IT Security - Intern

Artificial Intelligence • Cloud • Sales • Security • Software • Cybersecurity • Data Privacy
Remote or Hybrid
United States
2461 Employees
15-35 Hourly

SailPoint Logo SailPoint

Answer Engine Optimization (AEO/GEO) Manager

Artificial Intelligence • Cloud • Sales • Security • Software • Cybersecurity • Data Privacy
Remote or Hybrid
2 Locations
2461 Employees
101K-171K Annually

SoFi Logo SoFi

Bank Teller, Golden Pacific Bank, Live Oak

Fintech • Mobile • Software • Financial Services
Easy Apply
Hybrid
Live Oak, CA, USA
4500 Employees
17-32 Hourly
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: San Jose, CA
328 Employees
Year Founded: 1983

What We Do

Semiconductors are essential to all IT products, and its performance often determines the performance of the final products. SK hynix is a global leader in producing semiconductor, such as DRAM, NAND Flash and CMOS Image Sensors. With these technology driven semiconductor products, SK hynix has consistently led the industry and is now the second largest memory chip maker worldwide. IT devices become more pervasive as new imaginative and innovative IT products continue to grab imagination and desires of consumers. SK hynix has enhanced its competency with the best level of technology and a wide range of business portfolios in order to satisfy all those demand from customers. As a member of SK Group*, SK hynix is aiming at becoming the world’s best semiconductor company. SK hynix America Inc. operates as a subsidiary of SK Hynix Inc. *SK Group is one of South Korea's top five industrial conglomerates. It has about 40 affiliated companies, ranging from energy, telecommunications, finance, to construction.

Similar Companies Hiring

Standard Template Labs Thumbnail
Artificial Intelligence • Information Technology • Software
New York, NY
25 Employees
Amalgamated Sugar Thumbnail
Food • Greentech • Agriculture • Industrial • Manufacturing
Boise, Idaho
768 Employees
Golden Pet Brands Thumbnail
Digital Media • eCommerce • Information Technology • Marketing Tech • Pet • Retail • Social Media
El Segundo, California
178 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account