Senior Site Reliability Engineer

Posted 20 Days Ago
Easy Apply
Be an Early Applicant
San Francisco, CA
In-Office
140K-190K Annually
Senior level
Artificial Intelligence • Blockchain • Fintech • Cryptocurrency • Web3
Access everything from everywhere with ZetaChain, the First Universal Blockchain.
The Role
The Senior Site Reliability Engineer will manage blockchain infrastructure, ensure high availability, build automation, improve Kubernetes reliability, and collaborate with engineering teams on best practices.
Summary Generated by Built In
About Anuma.aiWe’re building Anuma.ai at ZetaChain: an ambitious privacy AI product designed to solve hard problems and deliver real value. Backed by top-tier investors, our team is pushing the boundaries of what AI can do in the real world. If you’re excited by meaningful challenges and building products from the ground up, this is the place for you.
About the Role

We are looking for a Senior Site Reliability Engineer (SRE) to ensure the reliability, scalability, and security of Anuma.ai's production infrastructure.

This role is highly hands‑on and execution‑focused. You will operate critical blockchain and AI‑adjacent infrastructure, build automation to reduce operational overhead, and partner closely with protocol, platform, and AI teams to design systems that are reliable by default.

What You'll Do
  • Operate and maintain production blockchain infrastructure, including validators, RPC services, indexers, and supporting services
  • Ensure high availability and performance for AI‑enabled developer platforms and internal tooling
  • Build and maintain monitoring, alerting, and dashboards for protocol, infrastructure, and application health
  • Write high‑quality automation and infrastructure code to reduce toil and improve reliability
  • Participate in on‑call rotations, incident response, and post‑incident reviews
  • Partner with engineering teams to embed reliability, scalability, and security best practices into system design
  • Improve Kubernetes reliability across cloud and bare‑metal environments
  • Continuously refine deployment, rollback, and recovery strategies
Minimum Qualifications

Our ideal candidate description is a wish list, not a checklist. We don’t expect every applicant to meet every requirement.

  • 4+ years of experience in Site Reliability Engineering, Infrastructure Engineering, or Platform Engineering
  • Strong software engineering background with production experience in Go and/or Python
  • Deep experience operating Linux systems in production
  • Proven experience running Kubernetes at scale
  • Experience supporting high‑availability distributed systems
  • Comfortable working in fast‑moving startup environments
  • Strong security mindset, especially for infrastructure running on public or adversarial networks
  • Excellent collaboration and communication skills
Our Tech Stack
  • Languages: Go, Python, Bash, Terraform, Ansible
  • Infrastructure: Kubernetes, Docker, Linux
  • Observability: Prometheus, Grafana, Datadog, Loki, incident.io
  • Platforms: AWS, GCP, bare metal
  • Blockchain Stack: Cosmos SDK, Tendermint / CometBFT, Ethereum, Bitcoin
Bonus Points
  • Exposure to AI‑powered infrastructure, observability, or developer tooling
  • Experience operating blockchain nodes or validator infrastructure
  • Familiarity with Cosmos‑based chains or EVM clients
  • Experience with DevOps, DevSecOps, or GitOps methodologies
  • Contributions to open‑source software
In-Office Culture

We believe that collaboration is supercharged when we share space together. Many members of our team work hybrid from our San Francisco office, and we aim for 3 in-office days per week. We know life happens, whether it’s travel, appointments, or family needs and we’re flexible when the schedule needs to shift. But generally, we value showing up, building together, and keeping the energy high. The company is a mix of remote and local team members.

Compensation

Base Salary: $140,000 – $190,000
This range reflects base salaries for roles in the San Francisco market. For candidates in other locations, compensation is adjusted to remain competitive within their local market.

In addition to the base salary, all full-time team members receive an additional 10% to 25% in liquid benefits with upside based on role, experience, and impact. We believe in building together and sharing in the long-term success of the network. Compensation packages are designed to be competitive and aligned with the growth of both the team and the ecosystem.

Top Skills

Ansible
AWS
Bash
Bitcoin
Cosmos Sdk
Datadog
Docker
Ethereum
GCP
Go
Grafana
Kubernetes
Linux
Loki
Prometheus
Python
Tendermint
Terraform
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: San Francisco, CA
35 Employees

What We Do

ZetaChain is building the first fully interoperable, decentralized, and universal blockchain — one universal chain to manage all chains.

For those new to crypto: Think of ZetaChain as the “internet of blockchains.” Just like the internet connects websites and services so they can talk to each other, ZetaChain connects different blockchains — which are normally isolated — so apps and users can move assets and data freely and securely across them. Our network is already live and has been running successfully since early 2024.

Backed by top-tier investors and a global community of developers, ZetaChain is where foundational protocol design meets real-world application. We’re a small, sharp, remote friendly team tackling some of the hardest problems in crypto infrastructure — and we’re hiring for engineering, product, and ecosystem roles.

If you’re excited by deep technical challenges, real impact, and shaping the future of the decentralized internet, you’ll thrive here.

Why Work With Us

At ZetaChain, we unite brilliant minds from all backgrounds in pursuit of a borderless, decentralized future. We're building the first Universal Blockchain that brings true connectivity and freedom to all.

If you're excited about shaping this future with us and ready to take on the challenge together, we'd love to meet you.

Similar Jobs

Braze Logo Braze

Senior Site Reliability Engineer

Marketing Tech • Mobile • Software
Easy Apply
Hybrid
San Francisco, CA, USA
1918 Employees
129K-232K Annually

Anduril Logo Anduril

Senior Site Reliability Engineer

Aerospace • Artificial Intelligence • Hardware • Robotics • Security • Software • Defense
In-Office
Costa Mesa, CA, USA
6000 Employees
166K-220K Annually

Zeta Global Logo Zeta Global

Senior Site Reliability Engineer

AdTech • Artificial Intelligence • Marketing Tech • Software • Analytics
Easy Apply
Remote or Hybrid
United States
2429 Employees
140K-170K Annually

NBCUniversal Logo NBCUniversal

Senior Site Reliability Engineer

AdTech • Cloud • Digital Media • Information Technology • News + Entertainment • App development
Remote or Hybrid
Los Angeles, CA, USA
68000 Employees
130K-160K Annually

Similar Companies Hiring

Scotch Thumbnail
Software • Retail • Payments • Fintech • eCommerce • Artificial Intelligence • Analytics
US
25 Employees
Milestone Systems Thumbnail
Software • Security • Other • Big Data Analytics • Artificial Intelligence • Analytics
Lake Oswego, OR
1500 Employees
Idler Thumbnail
Artificial Intelligence
San Francisco, California
6 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account