Supercompute Infrastructure Engineer

Posted 2 Days Ago
Hiring Remotely in Menlo Park, CA
In-Office or Remote
Senior level
Artificial Intelligence • Hardware • Information Technology • Robotics
From bits to atoms.
The Role
Lead and manage large-scale compute clusters for AI scientific research, focusing on orchestration, resource allocation, and lifecycle automation.

About Periodic Labs

We are an AI + physical sciences lab building state-of-the-art models to make novel scientific discoveries. We are well funded and growing rapidly. Team members are owners who identify and solve problems without boundaries or bureaucracy. We eagerly learn new tools and new science to push forward our mission.

About the Role

You will lead, design, build, and operate large-scale compute clusters to power AI scientific research.

You will write software that orchestrates large GPU and CPU clusters, manages resource allocation, and automates cluster lifecycle operations. You will work on the bring-up, operation, and maintenance of all aspects of these clusters.

You will build tools and get directly involved in large-scale frontier research experiments to make Periodic Labs the world's best AI + science lab for physicists, computational materials scientists, AI researchers, and engineers.

We’re looking for distributed systems engineers with experience in managing large-scale compute environments, high-performance clusters, or similar hyperscale infrastructure.

You might thrive in this role if you have experience with:

  • Clusters of 5,000 or more GPUs

  • Cluster scheduling and orchestration tools like Kubernetes (k8s) and Slurm

  • Cloud environments such as GCP, AWS, or Azure

  • Observability and monitoring tools like Datadog, Prometheus, Grafana, or VictoriaMetrics

  • Infrastructure-as-code (IaC) tools like Terraform and Ansible

  • GitOps tools like GitHub CI and ArgoCD

Top Skills

Ansible
ArgoCD
AWS
Azure
CPU
Datadog
GCP
GitHub CI
GPU
Grafana
Kubernetes (k8s)
Prometheus
Slurm
Terraform
VictoriaMetrics

The Company
32 Employees
Year Founded: 2025

What We Do

We're building AI scientists and the autonomous laboratories for them to operate.
