HPC & Cloud Engineer

Posted 3 Days Ago
Bengaluru, Bengaluru Urban, Karnataka, IND
In-Office
Senior level
Hardware • Information Technology • Semiconductor • Manufacturing
The Role
Designs, deploys, and operates large-scale HPC and hybrid/multi-cloud environments. Automates infrastructure with IaC and CI/CD, supports AI/ML pipelines and GPU workloads, implements observability and security, and optimizes cluster performance for research and engineering teams.
Summary Generated by Built In
Company Description

Sandisk understands how people and businesses consume data and we relentlessly innovate to deliver solutions that enable today’s needs and tomorrow’s next big ideas. With a rich history of groundbreaking innovations in Flash and advanced memory technologies, our solutions have become the beating heart of the digital world we’re living in and that we have the power to shape.

Sandisk meets people and businesses at the intersection of their aspirations and the moment, enabling them to keep moving and pushing possibility forward. We do this through the balance of our powerhouse manufacturing capabilities and our industry-leading portfolio of products that are recognized globally for innovation, performance and quality.

Sandisk has two facilities recognized by the World Economic Forum as part of the Global Lighthouse Network for advanced 4IR innovations. These facilities were also recognized as Sustainability Lighthouses for breakthroughs in efficient operations. With our global reach, we ensure the global supply chain has access to the Flash memory it needs to keep our world moving forward.

Job Description

Cloud Architecture & Operations

  • Build and operate HPC environments on cloud platforms such as:
    • Amazon Web Services (AWS)
    • Microsoft Azure
    • Google Cloud Platform
  • Design hybrid-cloud and multi-cloud architectures for HPC workloads.
  • Implement cloud-native storage, networking, security, and disaster recovery solutions.

Infrastructure Automation & DevOps

  • Develop Infrastructure as Code (IaC) using:
    • Terraform
    • CloudFormation
    • Ansible
    • Python

  • Build CI/CD pipelines for infrastructure and platform deployments.
  • Automate cluster provisioning, configuration management, monitoring, and patch management.
  • Develop self-service provisioning frameworks for research and engineering teams.

AI & Data Engineering

  • Design and implement scalable AI/ML data pipelines.
  • Build data ingestion, transformation, and orchestration frameworks.
  • Support distributed AI training and inference workloads.
  • Optimize GPU utilization for deep learning applications.
  • Collaborate with Data Scientists and ML Engineers to deploy production AI solutions.

Platform Monitoring & Reliability

  • Implement observability solutions using Prometheus, Grafana, the ELK Stack, and OpenTelemetry.
  • Monitor system performance and SLA compliance, and carry out capacity planning.
  • Troubleshoot performance bottlenecks across compute, storage, network, and AI frameworks.

HPC Infrastructure Engineering

  • Design, deploy, and manage large-scale HPC clusters across on-premises and cloud environments.
  • Administer compute, storage, networking, and GPU resources for AI/ML and data-intensive workloads.
  • Optimize cluster performance, scheduling, and resource utilization using workload managers such as Slurm, LSF, PBS Pro, and Kubernetes.

Security & Governance

  • Implement security best practices for HPC and cloud environments.
  • Manage IAM, secrets management, encryption, and compliance controls.
  • Support regulatory requirements and enterprise governance standards.

Qualifications

5+ years of experience in DevOps and Cloud infrastructure management

Technical Skills

  • Bachelor's or Master's degree in Computer Science, Engineering, Information Systems, or related field.
  • Strong experience with Linux system administration (RHEL, Rocky Linux, Ubuntu).
  • Experience managing HPC clusters and distributed computing environments.
  • Proficiency in Python, Bash, or Go.
  • Hands-on experience with: Terraform, Ansible, Git, Jenkins/GitHub Actions
  • Experience with container technologies: Docker, Kubernetes, Singularity/Apptainer
  • Knowledge of AI/ML frameworks: TensorFlow, PyTorch, Ray, Spark
  • Experience with GPU technologies and accelerator platforms.

Cloud Skills

  • AWS, Azure, or GCP architecture and operations.
  • Cloud networking, storage, and security services.
  • Hybrid cloud and HPC workload migration experience.

Additional Information

Sandisk thrives on the power and potential of diversity. As a global company, we believe the most effective way to embrace the diversity of our customers and communities is to mirror it from within. We believe the fusion of various perspectives results in the best outcomes for our employees, our company, our customers, and the world around us. We are committed to an inclusive environment where every individual can thrive through a sense of belonging, respect and contribution.

Sandisk is committed to offering opportunities to applicants with disabilities and ensuring all candidates can successfully navigate our careers website and our hiring process. Please contact us at [email protected] to advise us of your accommodation request. In your email, please include a description of the specific accommodation you are requesting as well as the job title and requisition number of the position for which you are applying.

Skills Required

  • 5+ years of experience in DevOps and cloud infrastructure management
  • Bachelor's or Master's degree in Computer Science, Engineering, Information Systems, or related field
  • Strong experience with Linux system administration (RHEL, Rocky Linux, Ubuntu)
  • Experience managing HPC clusters and distributed computing environments
  • Proficiency in Python, Bash, or Go
  • Hands-on experience with Terraform, CloudFormation, Ansible
  • Experience with Git, Jenkins or GitHub Actions (CI/CD)
  • Experience with container technologies: Docker, Kubernetes, Singularity/Apptainer
  • Knowledge of AI/ML frameworks such as TensorFlow, PyTorch, Ray, Spark
  • Experience with GPU technologies and accelerator platforms
  • Experience designing hybrid-cloud and multi-cloud architectures for HPC workloads
  • Experience implementing observability (Prometheus, Grafana, ELK Stack, OpenTelemetry)
  • Experience with workload managers and schedulers such as Slurm, LSF, PBS Pro, Kubernetes
  • Experience building CI/CD pipelines and Infrastructure as Code (IaC) for platform deployments

Sandisk Corporation Compensation & Benefits Highlights

The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about Sandisk Corporation and has not been reviewed or approved by Sandisk Corporation.

  • Fair & Transparent Compensation: Pay is considered competitive across many roles, with compensation often described as fair for the work. Technical roles are characterized as aligned with market expectations in semiconductor and storage.
  • Healthcare Strength: The official program highlights comprehensive medical, dental, and vision coverage with wellness and caregiving support. Employer-verified materials describe a generally strong U.S. health benefits package.
  • Retirement Support: A 401(k) plan with company match is explicitly called out and positioned as a core benefit. Retirement and savings options are emphasized as part of a well-rounded total rewards offering.

Sandisk Corporation Insights

The Company
11,000 Employees
Year Founded: 1988

What We Do

Sandisk is a leading developer, manufacturer, and provider of data storage devices and solutions based on NAND flash technology, including memory cards, USB flash drives, and solid-state drives (SSDs).
