Supercompute Infrastructure Engineer

Reposted 9 Days Ago
Hiring Remotely in Menlo Park, CA
In-Office or Remote
Senior level
Artificial Intelligence • Hardware • Information Technology • Robotics
From bits to atoms.
The Role
Lead and manage large-scale compute clusters for AI scientific research, focusing on orchestration, resource allocation, and lifecycle automation.
Summary Generated by Built In

About Periodic Labs

We are an AI + physical sciences lab building state of the art models to make novel scientific discoveries. We are well funded and growing rapidly. Team members are owners who identity and solve problems without boundaries or bureaucracy. We eagerly learn new tools and new science to push forward our mission.

About the Role

You will lead, design, build, and operate large-scale compute clusters to power AI scientific research.

You will write software that orchestrates large GPU and CPU clusters, manages resource allocation and automates cluster lifecycle operations. You will work on bringup, operations and maintenance of all aspects of these clusters.

You will build tools and get directly involved in large scale frontier research experiments to make Periodic Labs the world's best AI + science lab for physicists, computational materials scientists, AI researchers, and engineers.

We’re looking for distributed systems engineers with experience in managing large-scale compute environments, high-performance clusters, or similar hyperscale infrastructure.

You might thrive in this role if you have experience with:

  • >=5,000 GPU clusters

  • Cluster scheduling and orchestration tools like k8s and slurm

  • Cloud environments such as GCP, AWS, or Azure

  • Observability and monitoring tools like DataDog, Prometheus, Grafana, or VictoriaMetrics

  • IaC tools like terraform and ansible

  • GitOps tools like Github CI and ArgoCD

Top Skills

Ansible
Argocd
AWS
Azure
Cpu
Datadog
GCP
Github Ci
Gpu
Grafana
K8S
Prometheus
Slurm
Terraform
Victoriametrics
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
32 Employees
Year Founded: 2025

What We Do

We're building AI scientists and the autonomous laboratories for them to operate.

Similar Jobs

ServiceNow Logo ServiceNow

Staff Software Engineer

Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Remote or Hybrid
San Diego, CA, USA
28000 Employees
147K-258K Annually
Remote or Hybrid
9 Locations
213000 Employees
38-67 Hourly

Cox Enterprises Logo Cox Enterprises

Business Services Specialist II (Plus One)

Automotive • Cloud • Greentech • Information Technology • Other • Software • Cybersecurity
Remote or Hybrid
United States
50000 Employees
18-27 Hourly

Cox Enterprises Logo Cox Enterprises

Business Services Specialist II (Plus One)

Automotive • Cloud • Greentech • Information Technology • Other • Software • Cybersecurity
Remote or Hybrid
CA, USA
50000 Employees
18-27 Hourly

Similar Companies Hiring

Milestone Systems Thumbnail
Software • Security • Other • Big Data Analytics • Artificial Intelligence • Analytics
Lake Oswego, OR
1500 Employees
Idler Thumbnail
Artificial Intelligence
San Francisco, California
6 Employees
Fairly Even Thumbnail
Software • Sales • Robotics • Other • Hospitality • Hardware
New York, NY

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account