Xpert Development LLC

Senior DevOps & Site Reliability Engineer

Reposted 16 Days Ago

Hiring Remotely in United States

Remote

165K-190K Annually

Senior level

Artificial Intelligence • Information Technology • Software • Automation

The Role

Own US PST coverage for releases and incidents as the first SRE; bridge infrastructure and code by working with Kubernetes, Terraform, and AWS and patching Elixir when needed; lead incident response and post-mortems; define SLOs and observability; author runbooks and support HIPAA-aligned compliance for a regulated medical-device platform.

Summary Generated by Built In

About the role

As our first SRE & first engineer in the US, you will own the platform’s stability and releases, especially during PST hours.

You are the perfect "bridge" profile: part system administrator, part software engineer. You don't just manage infrastructure; you understand the code running on it. You will operate with high autonomy, making critical decisions during incidents and ensuring that our production environment is state-of-the-art, secure, and resilient.

You’ll report to our Lead DevOps Engineer, Pierre, and your main mission will be:

Own US coverage for releases and incidents as the first responder during PST hours.
Bridge infra and code by working hand-in-hand with our DevOps team on Kubernetes, Terraform, and AWS, while being able to read and patch Elixir code to unblock yourself without waiting for a backend engineer.
Drive incident response end-to-end, managing triage, mitigation, and blameless post-mortems with real follow-through.
Improve the platform’s operability by defining SLOs, tuning alerts to reduce toil, and pushing observability (metrics, logs, tracing) where it’s lacking.
Transfer operational knowledge from France to the US by authoring runbooks and documenting procedures so local teams are empowered to act when something breaks.
Support compliance and security in our regulated medical-device environment, maintaining HIPAA-aligned controls and an audit-ready infrastructure.

About the profile

Sonio is a mission-driven company, so interest in our mission is critical. Other requirements are:

4+ years of experience in SRE, DevOps, or Production Engineering, including significant on-call experience on a 24/7 product
You possess a hybrid "code-literate" mindset, acting as an infrastructure expert who can also navigate a backend codebase to triage and patch issues independently.
You bring strong technical foundations in Kubernetes, Terraform, and AWS, along with the ability to architect and tune your own observability signals.
You are highly autonomous and comfortable making technical decisions with limited supervision, which is essential given the timezone difference with France.
You maintain operational rigor and stay calm under pressure, with the written English skills necessary to produce high-quality runbooks and handle async handoffs.

Location: where you can cover for PST timezone (not necessarily only in the US)

Salary: $165,000 -190,000 + 10% bonus

Benefits:

⚕️Health Insurance (Medical plan, vision, dental) - up to 30,000$ per year + FSA & HSA

👵 401(k) - up 4% of your salary matched

⛑️ Life Insurance - covering 2 times your salary, up to $200k

🐣 An attractive Parental Policy for primary and secondary caregivers

🏝️ 20 PTO + 1 week offered between Christmas and New Year

🖥️ Offices in Boston (HQ) & New York (incl. free breakfast, drinks & gym)

⏰ Flexible hours & remote policies

🚎 Commuter Benefits

✈️ One offsite per year in France & regular team building with US team

🚀 Ongoing trainings and continuous opportunities for professional growth and development, specifically unlimited access to coaching

We move fast and aspire to be transparent over the process - our objective is that the process from the first chat to an offer is no longer than a month.

Skills Required

4+ years experience in SRE, DevOps, or Production Engineering, including significant 24/7 on-call experience
Strong technical experience with Kubernetes
Strong technical experience with Terraform
Strong technical experience with AWS
Ability to read, triage, and patch Elixir backend code
Experience defining SLOs, tuning alerts, and implementing observability (metrics, logs, tracing)
Operational rigor, autonomy, and strong written English to produce runbooks and async handoffs
Experience supporting compliance and security in a regulated medical-device environment (HIPAA-aligned controls)

View all jobs at Xpert Development LLC

View Xpert Development LLC Profile

Report Job

Am I A Good Fit?

beta

Get Personalized Job Insights.

Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company

6 Employees

Year Founded: 2023

What We Do

Xpert Development is a US-based software development company specializing in providing businesses worldwide with custom technology solutions. The company focuses on delivering AI and automation tools designed to help small business owners reclaim their time by automating repetitive workflows, such as customer support, scheduling, and invoicing, ensuring real outcomes and scalable growth.