Senior DevOps Engineer

Sorry, this job was removed at 02:14 p.m. (CST) on Tuesday, Oct 14, 2025
Easy Apply
San Francisco, CA
In-Office
Artificial Intelligence • Information Technology
The Role

About Together AI

Together AI is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together we are on a mission to significantly lower the cost of modern AI systems by co-designing software, hardware, algorithms, and models. We have contributed to leading open-source research, models, and datasets to advance the frontier of AI, and our team has been behind technological advancement such as FlashAttention, Hyena, FlexGen, and RedPajama. We invite you to join a passionate group of researchers and engineers in our journey in building the next generation AI infrastructure.

We are hiring a talented Senior DevOps Engineer to develop the software and processes for orchestration of AI workloads over large fleets of distributed GPU hardware. In this role, you'll be part of a cloud engineering organization that aims to automate everything and build failure-resistant and horizontally scalable cloud infrastructure for GPU-resident applications.

As a Senior DevOps Engineer, you'll build deep understanding of Together AI’s services and use that knowledge to optimize and evolve our infrastructure's reliability, availability, serviceability, and profitability.

The best applicants for this role are deeply technical, enthusiastic, great collaborators, and intrinsically motivated to deliver high quality infrastructure. You have experience practicing infrastructure-as-code, including the use of tools like Terraform and Ansible. You also have strong software development fundamentals, systems knowledge, troubleshooting abilities, and a deep sense of responsibility.

Requirements

  • Minimum of 5 years of prior relevant experience in DevOps, cloud computing, data center operations and Linux systems administration
  • Experience in programming in at least one of the following languages: Go, Python, Java, and C++
  • Experience designing and building advanced CI/CD pipeline frameworks
  • Experience with cloud computing toolsets like Terraform, Vault, and Packer
  • Experience with configuration management tools like Ansible, Pulumi, Chef and Puppet
  • Experience with Kubernetes and containerization 
  • Strong sense of ownership and desire to build great tools for others

Responsibilities

  • Introduce tools to facilitate greater automation and operability of services
  • Design, build, and maintain CI/CD infrastructure
  • Architect, deploy, and scale observability infrastructure
  • Create runtime tools/processes that optimize cloud triaging and limit downtime
  • Define best practices to make our systems and services measurable
  • Work closely with internal teams to ensure best practices are appropriately applied
  • Build tools to help engineering and research teams measure and improve their velocity
  • Analyze and decompose complex software systems
  • Collaborate with and influence others to improve the overall design

About Together AI

Together AI is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together we are on a mission to significantly lower the cost of modern AI systems by co-designing software, hardware, algorithms, and models. We have contributed to leading open-source research, models, and datasets to advance the frontier of AI, and our team has been behind technological advancement such as FlashAttention, Hyena, FlexGen, and RedPajama. We invite you to join a passionate group of researchers in our journey in building the next generation AI infrastructure.

Compensation

We offer competitive compensation, startup equity, health insurance and other competitive benefits. The US base salary range for this full-time position is: $160,000 - $230,000 + equity + benefits. Our salary ranges are determined by location, level and role. Individual compensation will be determined by experience, skills, and job-related knowledge.

Equal Opportunity

Together AI is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more.

Please see our privacy policy at https://www.together.ai/privacy  

Similar Jobs

EliseAI Logo EliseAI

Senior Devops Engineer

Artificial Intelligence • Healthtech • Machine Learning • Natural Language Processing • Real Estate
In-Office
2 Locations
400 Employees
230K-320K Annually

BuildOps Logo BuildOps

Senior Devops Engineer

Cloud • Mobile • Software
Easy Apply
Hybrid
San Francisco, CA, USA
500 Employees
120K-150K Annually

RoboForce Logo RoboForce

Senior Devops Engineer

Artificial Intelligence • Machine Learning • Robotics
Easy Apply
In-Office
Milpitas, CA, USA
14 Employees

Anduril Logo Anduril

Senior Devops Engineer

Aerospace • Artificial Intelligence • Hardware • Robotics • Security • Software • Defense
In-Office
Costa Mesa, CA, USA
6000 Employees
166K-220K Annually
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
San Francisco, California
84 Employees
Year Founded: 2022

What We Do

Together AI is a research-driven artificial intelligence company. We contribute leading open-source research, models, and datasets to advance the frontier of AI. Our decentralized cloud services empower developers and researchers at organizations of all sizes to train, fine-tune, and deploy generative AI models. We believe open and transparent AI systems will drive innovation and create the best outcomes for society

Similar Companies Hiring

Scotch Thumbnail
Software • Retail • Payments • Fintech • eCommerce • Artificial Intelligence • Analytics
US
25 Employees
Milestone Systems Thumbnail
Software • Security • Other • Big Data Analytics • Artificial Intelligence • Analytics
Lake Oswego, OR
1500 Employees
Idler Thumbnail
Artificial Intelligence
San Francisco, California
6 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account