Site Reliability Engineer

Reposted 3 Days Ago
Hiring Remotely in USA
Remote
Senior level
Artificial Intelligence • Cloud • Software
The Role
The Senior SRE Engineer will design, build, and maintain resilient infrastructure systems, manage infrastructure-as-code, and write tooling in various languages.
Summary Generated by Built In

Our mission at Tensorwave Cloud is to build seamless, secure, reliable, and resilient AI infrastructure at scale, eliminating barriers and challenging the status quo to empower builders and support AI innovation.

About the role

We are seeking a Site Reliability Engineer with a strong background in software engineering to build and maintain highly scalable, secure, and resilient infrastructure.

You’ll play a critical role in designing low-level systems, automating infrastructure with modern tooling, and ensuring platform reliability.

This role is ideal for someone who’s comfortable working at the intersection of systems programming and DevOps - writing code in Go, Javascript, Rust, C, or Zig while also managing infrastructure with NixOS, Kubernetes, and Terraform.

Responsibilities

  • Design, build, and maintain infrastructure systems using Linux and NixOS

  • Manage infrastructure-as-code with Terraform to provision and scale resources

  • Architect and operate Kubernetes clusters with a focus on performance, security, and automation

  • Write high-performance tooling and internal utilities in Go, Javascript, Rust

  • Develop and maintain CI/CD pipelines for infrastructure and code deployments

  • Monitor system performance, resolve issues, and improve reliability through observability tooling

  • Collaborate closely with engineering teams to support deployment strategies and development workflows

Required Experience

  • Bachelor of Science in Computer Science, Computer Engineering, or a related technical field, or equivalent practical experience

  • 5+ years in DevOps, Site Reliability, or Infrastructure Engineering roles

  • Proficiency in one or more low-level languages: Rust, C, Zig, Javascript, and Go

  • Deep experience with Linux systems and configuration management

  • Hands-on experience with Terraform, Kubernetes, and containerized environments

  • Strong understanding of systems programming, performance tuning, and operating system internals

  • Familiarity with CI/CD practices and infrastructure monitoring/alerting tools

What We Bring

  • Mission driven company

  • Competitive Salary

  • Stock Options

  • 100% paid Medical, Dental, and Vision insurance

  • Flexible PTO

  • Paid Holidays

  • 401(k)

  • Parental Leave

  • Flexible Spending Account

  • Short Term Disability Insurance

  • Life and Voluntary Supplemental Insurance

  • Mental Health Benefits through Spring Health

We’re looking for resilient, adaptable people to join our team, people who believe in the mission and think at massive scale. The solutions that worked on a handful of devices will not work at Exascale. Be prepared to be pushed daily, to learn a lot, and literally build the future.

Tensorwave is an equal opportunity employer, committed to fostering an inclusive and supportive workplace. All qualified applicants and candidates will receive consideration for employment without regard to race, color, religion, sex, disability, age, national origin, or veteran status.

Top Skills

C
Go
JavaScript
Kubernetes
Nixos
Rust
Terraform
Zig
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Las Vegas, Nevada
56 Employees

What We Do

TensorWave is a cutting-edge cloud platform designed specifically for AI workloads. Offering AMD MI300X accelerators and a best-in-class inference engine, TensorWave is a top-choice for training, fine-tuning, and inference. Visit tensorwave.com to learn more.
Send us a message to try it for free.

Similar Jobs

Coinbase Logo Coinbase

Site Reliability Engineer

Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
Easy Apply
Remote
USA
4000 Employees
152K-179K Annually

Milestone Systems Logo Milestone Systems

Site Reliability Engineer

Artificial Intelligence • Other • Security • Software • Analytics • Big Data Analytics
Remote or Hybrid
2 Locations
1500 Employees
160K-180K Annually

Coinbase Logo Coinbase

Senior Software Engineer

Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
Easy Apply
Remote
USA
4000 Employees
181K-212K Annually

Circle Logo Circle

Site Reliability Engineer

Blockchain • Fintech • Payments • Financial Services • Cryptocurrency • Web3
Remote
United States of America
1050 Employees
153K-205K Annually

Similar Companies Hiring

Milestone Systems Thumbnail
Software • Security • Other • Big Data Analytics • Artificial Intelligence • Analytics
Lake Oswego, OR
1500 Employees
Idler Thumbnail
Artificial Intelligence
San Francisco, California
6 Employees
Fairly Even Thumbnail
Software • Sales • Robotics • Other • Hospitality • Hardware
New York, NY

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account