Senior Site Reliability Engineer (Copy)

Posted 24 Days Ago
Be an Early Applicant
San Francisco, CA, USA
Hybrid
205K-225K Annually
Senior level
Artificial Intelligence • HR Tech • Professional Services
The Role
Design, build, and operate reliable, scalable cloud infrastructure. Maintain AWS/GCP and Linux systems, manage Kubernetes clusters, implement IaC (Ansible/Puppet/Terraform), automate CI/CD (Jenkins), monitor with Prometheus/ELK, triage alerts, participate in design/reviews, migrate apps to Kubernetes, and improve operational automation.
Summary Generated by Built In
Based in NY/SF or willing to relocate (in-person collaboration is critical). In-person component is important (if in NYC/SF, 3 days a week; if within 30 miles outside the city - 1 day a week)


The Company:

We are a decentralized, public blockchain that gives developers the tools to create experiences that are more like cash than crypto. The network is faster, cheaper, and far more energy-efficient than most blockchain-based systems. It’s designed so Stellar’s ecosystem can make a real-world, lasting impact.

The Role:

As one of the first engineers, you will help build and operate the foundation that powers our engineering teams. You’ll ensure the reliability and scalability of our systems, design and improve the infrastructure behind our production environments, and automate operational work so developers can focus on building great products.


Key Responsibilities:Matching & Scoring Systems
  • Maintain, improve, scale and secure our AWS/GCP infrastructure and Linux systems.
  • Assist our development teams in running, packaging, deploying and troubleshooting applications
  • Work with developers on streamlining deployment processes with Jenkins and other CI/CD tooling.
  • Build, maintain, monitor and improve our Kubernetes clusters.
  • Work with development teams on migrating applications to Kubernetes.
  • Be responsible for maintenance and improvements to multiple internal services, for example Kubernetes, Prometheus, ELK.
  • Monitor, triage and respond to alerts in our high availability environments.
  • Participate in design and code reviews, and ensure that the foundation for our services is best in class.
  • Evaluate new technologies, design and implement as appropriate.
  • Identify automation opportunities and implement by creating custom or by using off the shelf solutions.


Qualifications:
Required Experience
  • 5+ years of experience of working in cloud-based systems operations, as a SRE or DevOps engineer.
  • First-hand experience with configuration management and infrastructure as code (Ansible, Puppet, Terraform).
  • Proficient in utilizing SRE methodologies like capacity planning and disaster recovery testing to ensure the scalability, resilience, and availability of critical services.
  • Production experience building and maintaining Kubernetes clusters.
  • Will need to know how to code

Preferred Attributes:
  • Ability to understand Go, Rust, C++ and TypeScript source code
  • Experience experimenting with AI-driven approaches to operations
  • Comfortable with participating in on-call rotations and conducting thorough root cause analyses to keep systems running smoothly.
  • Experienced in managing production workloads and skilled in using monitoring tools to detect issues early.
  • A strong understanding of computer networking, TCP/UDP, load balancing, distributed computing, web services, and the fundamental protocols used by the internet (HTTP, HTTPS, DNS, etc.).
  • No blockchain needed
  • Experience using AI is a plus

Compensation
The base pay range for this role is $205,000 – $225,000 per year.

Skills Required

  • 5+ years of experience working in cloud-based systems operations as a SRE or DevOps engineer
  • Experience with configuration management and infrastructure as code (Ansible, Puppet, Terraform)
  • Proficient in SRE methodologies such as capacity planning and disaster recovery testing
  • Production experience building and maintaining Kubernetes clusters
  • Ability to code (development experience required)
  • Experience maintaining, improving, scaling and securing AWS and/or GCP infrastructure and Linux systems
  • Experience with CI/CD tooling and streamlining deployments (e.g., Jenkins)
  • Experience building, maintaining, and improving monitoring and logging stacks (Prometheus, ELK)
  • Ability to understand Go, Rust, C++ and TypeScript source code
  • Comfortable participating in on-call rotations and performing root cause analyses
  • Experience experimenting with AI-driven approaches to operations / experience using AI
  • Strong understanding of computer networking, TCP/UDP, load balancing, distributed computing, HTTP/HTTPS, DNS
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company

What We Do

Arx Talent provides AI-powered, asynchronous recruiting services for senior roles at growth-stage companies. They specialize in 'hiring sprints'—fixed-scope, fixed-price engagements that deliver five pre-vetted candidates in a live pipeline within seven days. Their process emphasizes efficiency by replacing traditional recruiter calls with a portal-based system for candidate review and scheduling.

Similar Jobs

Hybrid
Concord, CA, USA
205000 Employees
143K-224K Annually
Hybrid
San Anselmo, CA, USA
205000 Employees
37K-66K Hourly

Wells Fargo Logo Wells Fargo

Relationship Banker East Bay Ridge

Fintech • Financial Services
Hybrid
Orinda, CA, USA
205000 Employees
27K-41K Hourly
Hybrid
Novato, CA, USA
205000 Employees
37K-66K Hourly

Similar Companies Hiring

Legora Thumbnail
Artificial Intelligence • Legal Tech • Software
Chicago, Illinois
700 Employees
Hanover Park Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
42 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account