Staff Software Engineer, Managed Orchestration (Kubernetes)

Reposted 22 Days Ago
Be an Early Applicant
San Francisco, CA
Hybrid
204K-247K Annually
Senior level
Cloud • Greentech • Other • Energy
We're on a mission to eliminate flaring and emissions in the oil field.
The Role
The Staff Software Engineer will lead the technical direction of a managed Kubernetes service, focusing on architecture, GPU integration, and advanced networking capabilities for mission-critical workloads.
Summary Generated by Built In

Crusoe's mission is to accelerate the abundance of energy and intelligence. We’re crafting the engine that powers a world where people can create ambitiously with AI — without sacrificing scale, speed, or sustainability.

Be a part of the AI revolution with sustainable technology at Crusoe. Here, you'll drive meaningful innovation, make a tangible impact, and join a team that’s setting the pace for responsible, transformative cloud infrastructure.

About the Role

Crusoe is building its next-generation orchestration platform to power GPU-accelerated and high-performance computing at scale. As a Staff Software Engineer on the Managed Orchestration team, you will shape the technical direction of our managed Kubernetes service, delivering systems that allow customers to run advanced workloads across CPUs, NVIDIA and AMD GPUs, and high-performance networking environments.

You’ll drive architecture and design for complex, distributed systems that integrate GPU operators, network operators, and CNI technologies (Cilium, Calico, Multus) with Kubernetes, while also supporting high-performance fabrics such as InfiniBand and RoCE. This role requires a blend of deep technical expertise, architectural leadership, and the ability to influence cross-functional teams to deliver reliable, scalable, and secure orchestration for mission-critical workloads.

What You'll Be Working On:

  • Lead architecture and design for core features of Crusoe’s Managed Kubernetes platform (multi-tenancy, control plane scalability, cluster lifecycle, and high availability).

  • Drive integration of GPU acceleration in Kubernetes, including device plugin architecture, GPU operators, scheduling, autoscaling, and monitoring.

  • Guide development of advanced container networking capabilities, including CNI plugins, network operators, service meshes, and high-performance fabrics (InfiniBand, RoCE).

  • Define and enforce best practices for security, multi-cluster deployments, and workload isolation across compute, GPU, and networking layers.

  • Partner with product and engineering leadership to set long-term technical strategy and roadmap for CMK.

  • Mentor engineers across the organization, providing technical guidance and elevating standards for design, code quality, and operational excellence.

  • Troubleshoot and resolve complex distributed systems challenges spanning compute, networking, and GPU acceleration.

  • Contribute to and represent Crusoe in open-source communities (Kubernetes SIGs, CNCF projects, GPU and networking ecosystem).

What You'll Bring to the Team:

  • 8+ years of software engineering experience in distributed systems, cloud, or HPC.

  • Proven track record of technical leadership and driving architecture in production systems.

  • Deep expertise in Kubernetes internals (control plane, operators, API machinery, scheduling).

  • Strong proficiency in Go (preferred) or another systems language (Rust, C++, Python for HPC tooling).

  • Extensive experience with GPU integration in Kubernetes (device plugins, GPU operators, resource allocation).

  • Strong knowledge of container networking (Cilium, Calico, Multus, service meshes) and Linux networking fundamentals.

  • Familiarity with high-performance networking technologies (InfiniBand, RoCE) and accelerator-aware scheduling.

  • Excellent debugging, systems design, and problem-solving skills in distributed systems.

Bonus Points

  • Familiarity with both NVIDIA and AMD GPU stacks (CUDA, ROCm, NCCL).

  • Experience with Slurm, MPI, Ray, or distributed ML frameworks (TensorFlow, PyTorch, JAX).

  • Contributions to open-source projects in the Kubernetes, GPU, or networking ecosystems.

  • Experience scaling multi-cluster environments and managing interconnects across data centers.

  • Background in security for Kubernetes and GPU workloads (RBAC, PodSecurity, runtime scanning).

Benefits:

  • Industry competitive pay

  • Restricted Stock Units in a fast growing, well-funded technology company

  • Health insurance package options that include HDHP and PPO, vision, and dental for you and your dependents

  • Employer contributions to HSA accounts

  • Paid Parental Leave

  • Paid life insurance, short-term and long-term disability

  • Teladoc

  • 401(k) with a 100% match up to 4% of salary

  • Generous paid time off and holiday schedule

  • Cell phone reimbursement

  • Tuition reimbursement

  • Subscription to the Calm app

  • MetLife Legal

  • Company paid commuter benefit; $300 per month

Compensation Range:

Compensation will be paid in the range of $204,000 - $247,000. Restricted Stock Units are included in all offers. Compensation to be determined by the applicants knowledge, education, and abilities, as well as internal equity and alignment with market data.

Crusoe is an Equal Opportunity Employer. Employment decisions are made without regard to race, color, religion, disability, genetic information, pregnancy, citizenship, marital status, sex/gender, sexual preference/ orientation, gender identity, age, veteran status, national origin, or any other status protected by law or regulation.

Top Skills

C++
Calico
Cilium
Go
Infiniband
Kubernetes
Multus
Python
Roce
Rust
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Denver, CO
667 Employees
Year Founded: 2018

What We Do

Crusoe is on a mission to eliminate routine flaring of natural gas and reduce the cost of cloud computing. We are passionate about our goals to help the oil industry operate more efficiently, achieve better relationships with communities and regulators, and improve environmental performance. Crusoe repurposes otherwise wasted energy to fuel the growing demand for computational power in the expanding digital economy.

Why Work With Us

Crusoe has five core values with each value grounded in a set of actionable practices. The combination of philosophical values and actionable practices creates a decision-making framework for each employee to achieve success at Crusoe.

Gallery

Gallery

Similar Jobs

Crusoe Energy Systems Logo Crusoe Energy Systems

Staff Software Engineer

Cloud • Greentech • Other • Energy
Hybrid
Sunnyvale, CA, USA
667 Employees
220K-250K Annually
Hybrid
6 Locations
213000 Employees
23-31
Hybrid
Perris, CA, USA
213000 Employees
20-26
Hybrid
Ventura, CA, USA
213000 Employees
23-31

Similar Companies Hiring

Yooz Thumbnail
Software • Machine Learning • Fintech • Financial Services • Cloud • Automation • Artificial Intelligence
Aimargues, FR
470 Employees
Compa Thumbnail
Software • Other • HR Tech • Business Intelligence • Artificial Intelligence
Irvine, CA
60 Employees
Amplify Platform Thumbnail
Fintech • Financial Services • Consulting • Cloud • Business Intelligence • Big Data Analytics
Scottsdale, AZ
62 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account