Senior Site Reliability Engineer

Posted Yesterday
Hiring Remotely in United States
Remote
160K-180K Annually
Senior level
Software
The Role
Own and improve platform performance, reliability, and deployment automation. Manage cloud infrastructure, implement IaC, monitor systems with observability tools, provide operational support for distributed applications, and integrate production learnings into development workflows.
Summary Generated by Built In

Arkestro's Predictive Procurement Platform applies AI, game theory, and behavioral science to enterprise negotiations. It moves teams away from reactive supplier bidding toward data-driven offers that reduce friction and help both buyers and suppliers reach better outcomes faster.
Arkestro helps procurement teams influence more enterprise spend and uncover savings that traditional sourcing approaches often leave behind by applying predictive intelligence to how sourcing events are designed and executed.
We’re a hyper-growth company looking for people who take ownership and are comfortable working through complexity. This is a tremendous opportunity if you want to do meaningful work and help shape how large enterprises run procurement… Arkestro is a place where you can have a big impact.  See Arkestro in action at arkestro.com.

About the Role

Arkestro is hiring for a Senior SRE Engineer to manage our performance and reliability for our software platform and infrastructure. The right candidate will own and develop our infrastructural components.  They will be responsible for managing the deployment of product updates, identifying production issues, and improving infrastructure across the platform.  They will collaborate with software engineers, data scientists, data engineers, as well as product managers and other functions across the company.

Responsibilities:

  • Building and setting up new development tools and infrastructure
  • Working on ways to automate and improve development and release processes
  • Management of existing cloud infrastructure
  • Document and act as subject matter expert for practices and policies involving infrastructure
  • Improve reliability, quality, and time-to-market of our suite of software solutions
  • Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating for continual improvement
  • Provide primary operational support and engineering for distributed software applications
  • Continuously integrate production context back into the development lifecycle, ensuring the system learns from past incidents to prevent recurring issues

An ideal candidate has:

  • 5+ years of experience with various AWS products, like S3, RDS, Elastic Containers, etc
  • 5+ years of experience with Kubernetes or other orchestration products
  • 3+ years of experience with Infrastructure as Code
  • 3+ years of experience with using observability platforms and log management, like Datadog, Splunk, Rigor, etc
  • 6 months+ of experience working with LLMs, prompt engineering, harness engineering, or AIOps tooling
  • 6 months+ of SDLC AI native usage via Claude Code, Claude Cowork or similar
  • Excellent communications skills, the ability to learn on the fly, and a desire for ownership
Pay Range
$160,000$180,000 USD

Arkestro is committed to providing our employees with a benefits package designed to give you the flexibility you need to ensure a healthy life/work balance. Arkestro offers our employees great benefits and perks, including, but not limited to:

  • Competitive salary and startup equity
  • Medical, Dental, Vision insurance premiums covered up to 100% (employee only)
  • 401K discretionary employer matching
  • Unlimited PTO
  • A remote-first team with regular opportunities to get together in person for team building, design sprints, and customer visits
  • Annual budget of $1,000 for learning and professional development
  • Diverse, inclusive, highly collaborative, and vibrant culture
Arkestro is an equal opportunity employer that is committed to diversity and inclusion in the workplace. We prohibit discrimination and harassment of any kind based on race, color, sex, religion, sexual orientation, national origin, disability, genetic information, pregnancy, or any other protected characteristic as outlined by federal, state, or local laws.
 
Disclaimer

Please note this job description may not be inclusive of all assigned duties, responsibilities, or aspects of the job described and that additional tasks may be assigned to the employee from time to time; or the scope of the job may change as necessitated by business demands. Arkestro reserves the right to change duties, responsibilities and activities at any time with or without notice. 

Skills Required

  • 5+ years experience with AWS products (S3, RDS, Elastic Containers, etc.)
  • 5+ years experience with Kubernetes or other orchestration products
  • 3+ years experience with Infrastructure as Code
  • 3+ years experience with observability platforms and log management (Datadog, Splunk, Rigor, etc.)
  • 6+ months experience working with LLMs, prompt engineering, harness engineering, or AIOps tooling
  • 6+ months SDLC AI native usage via Claude Code, Claude Cowork or similar
  • Excellent communication skills, ability to learn on the fly, and desire for ownership
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: San Francisco, CA
64 Employees

What We Do

Arkestro is a Predictive Procurement Orchestration Platform that amplifies the impact of procurement’s influence. How? By pre-embedding preferred outcomes in any system or process. Arkestro enables companies to touch 100% of the spend in a better, more programmatic way: with behavioral science, game theory and predictive machine learning models. Learn more at https://www.arkestro.com

Similar Jobs

Circle (circle.so) Logo Circle (circle.so)

Senior Site Reliability Engineer

Artificial Intelligence • Consumer Web • Digital Media • Information Technology • Social Impact • Software
Easy Apply
Remote
31 Locations
250 Employees
130K-140K Annually

Coinbase Logo Coinbase

Senior Site Reliability Engineer

Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
Easy Apply
Remote
USA
4700 Employees
186K-219K Annually

Coinbase Logo Coinbase

Senior Site Reliability Engineer

Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
Easy Apply
Remote
USA
4700 Employees
186K-219K Annually

Mastercard Logo Mastercard

Senior Site Reliability Engineer

Blockchain • Fintech • Payments • Consulting • Cryptocurrency • Cybersecurity • Quantum Computing
Remote or Hybrid
Salt Lake City, UT, USA
38800 Employees
96K-163K Annually

Similar Companies Hiring

Hanover Park Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
42 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account