Senior DevOps & Site Reliability Engineer

Posted Yesterday
Hiring Remotely in United States
Remote
165K-190K Annually
Senior level
Artificial Intelligence • Information Technology • Software • Automation
The Role
Own US PST coverage for releases and incidents as the first SRE; bridge infrastructure and code by working with Kubernetes, Terraform, and AWS and patching Elixir when needed; lead incident response and post-mortems; define SLOs and observability; author runbooks and support HIPAA-aligned compliance for a regulated medical-device platform.
Summary Generated by Built In

About the role

As our first SRE & first engineer in the US, you will own the platform’s stability and releases, especially during PST hours.

You are the perfect "bridge" profile: part system administrator, part software engineer. You don't just manage infrastructure; you understand the code running on it. You will operate with high autonomy, making critical decisions during incidents and ensuring that our production environment is state-of-the-art, secure, and resilient.

You’ll report to our Lead DevOps Engineer, Pierre, and your main mission will be:

  • Own US coverage for releases and incidents as the first responder during PST hours.

  • Bridge infra and code by working hand-in-hand with our DevOps team on Kubernetes, Terraform, and AWS, while being able to read and patch Elixir code to unblock yourself without waiting for a backend engineer.

  • Drive incident response end-to-end, managing triage, mitigation, and blameless post-mortems with real follow-through.

  • Improve the platform’s operability by defining SLOs, tuning alerts to reduce toil, and pushing observability (metrics, logs, tracing) where it’s lacking.

  • Transfer operational knowledge from France to the US by authoring runbooks and documenting procedures so local teams are empowered to act when something breaks.

  • Support compliance and security in our regulated medical-device environment, maintaining HIPAA-aligned controls and an audit-ready infrastructure.

About the profile

Sonio is a mission-driven company, so interest in our mission is critical. Other requirements are:

  • 4+ years of experience in SRE, DevOps, or Production Engineering, including significant on-call experience on a 24/7 product

  • You possess a hybrid "code-literate" mindset, acting as an infrastructure expert who can also navigate a backend codebase to triage and patch issues independently.

  • You bring strong technical foundations in Kubernetes, Terraform, and AWS, along with the ability to architect and tune your own observability signals.

  • You are highly autonomous and comfortable making technical decisions with limited supervision, which is essential given the timezone difference with France.

  • You maintain operational rigor and stay calm under pressure, with the written English skills necessary to produce high-quality runbooks and handle async handoffs.

Location: where you can cover for PST timezone (not necessarily only in the US)

Salary: $165,000 -190,000 + 10% bonus

Benefits:

⚕️Health Insurance (Medical plan, vision, dental) - up to 30,000$ per year + FSA & HSA

👵 401(k) - up 4% of your salary matched

⛑️ Life Insurance - covering 2 times your salary, up to $200k

🐣 An attractive Parental Policy for primary and secondary caregivers

🏝️ 20 PTO + 1 week offered between Christmas and New Year

🖥️ Offices in Boston (HQ) & New York (incl. free breakfast, drinks & gym)

⏰ Flexible hours & remote policies

🚎 Commuter Benefits

✈️ One offsite per year in France & regular team building with US team

🚀 Ongoing trainings and continuous opportunities for professional growth and development, specifically unlimited access to coaching

We move fast and aspire to be transparent over the process - our objective is that the process from the first chat to an offer is no longer than a month.

Skills Required

  • 4+ years experience in SRE, DevOps, or Production Engineering, including significant 24/7 on-call experience
  • Strong technical experience with Kubernetes
  • Strong technical experience with Terraform
  • Strong technical experience with AWS
  • Ability to read, triage, and patch Elixir backend code
  • Experience defining SLOs, tuning alerts, and implementing observability (metrics, logs, tracing)
  • Operational rigor, autonomy, and strong written English to produce runbooks and async handoffs
  • Experience supporting compliance and security in a regulated medical-device environment (HIPAA-aligned controls)
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
6 Employees
Year Founded: 2023

What We Do

Xpert Development is a US-based software development company specializing in providing businesses worldwide with custom technology solutions. The company focuses on delivering AI and automation tools designed to help small business owners reclaim their time by automating repetitive workflows, such as customer support, scheduling, and invoicing, ensuring real outcomes and scalable growth.

Similar Jobs

Stellar Cyber Logo Stellar Cyber

Senior Devops Engineer

Software • Cybersecurity
Remote
United States
93 Employees
165K-215K Annually

BRINC Drones Logo BRINC Drones

Senior Site Reliability Engineer

3D Printing • Aerospace • Hardware • Robotics • Software
Remote or Hybrid
2 Locations
95 Employees
154K-199K Annually
Remote
United States
84 Employees

Similar Companies Hiring

Golden Pet Brands Thumbnail
Digital Media • eCommerce • Information Technology • Marketing Tech • Pet • Retail • Social Media
El Segundo, California
178 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account