Airbyte

Senior Site Reliability Engineer

Reposted 24 Days Ago

Be an Early Applicant

San Francisco, CA, USA

Hybrid

196K-255K Annually

Senior level

Artificial Intelligence • Big Data • Software

Airbyte, The Open Data Movement Platform.

The Role

You will manage the infrastructure for the Data Replication team, focusing on Kubernetes, reliability standards, and integrating product features with infrastructure. You'll enhance observability and tooling using AI, ensuring engineers can effectively manage their stack.

Summary Generated by Built In

Airbyte is the data and action layer for AI agents. We give agents fast, accurate, authenticated access to business data across hundreds of sources, so they can discover the entities that matter, reason over real-time context, and take action in the systems they read from, not just observe them.

We started as the open-source standard for data movement and proved the economics of data integration at scale: hundreds of connectors, thousands of companies, and, since 2020, have raised $181M from leading investors including Benchmark, Accel, Altimeter, Coatue, and Y Combinator. As our CEO Michel Tricot puts it, "the last ten years were all about structured data. The future is all about context." We're now building that context infrastructure for production-grade agents on the same open foundation, as agents become the primary consumers of enterprise data.

Our mission is unchanged: make data available and actionable to everyone, everywhere. That everyone now includes AI agents.

The Role:

You'll be the infrastructure and reliability engineer on the Data Replication team - a full-stack product team running over 3 million sync jobs a week powering thousands of data use cases across multiple regions and clouds. You’ll build and maintain the infrastructure, set reliability standards, drive down incidents, and make it easier and safer for engineers to ship through tooling. You're equally comfortable in a Terraform file, a Kubernetes cluster, and a postmortem doc.

We expect engineers here to actively use AI as a force multiplier - agentic tools to automate toil, augment incident response, and build smarter internal tooling. If you're not already doing this, you should be excited to start. We care as much about how you work as what you build. Trust, directness, and craftsmanship matter here.

What You’ll Do:

Own the infrastructure underpinning the Data Replication platform - Kubernetes clusters, CI/CD pipelines, secrets management, networking, and cloud resource configuration across AWS and GCP.
Partner with product engineers to reliably integrate product features with infrastructure.
Maintain and enhance observability, alerting, and anomaly detection with an eye towards LLM automation.
Maintain and enhance AI-augmented release and internal tooling: canary deployments, progressive rollouts, automated release qualification, and rollback automation - with an eye towards LLM automation.
Set the infrastructure bar for the team - build self-serve tooling, write runbooks, and coach engineers to own more of their stack.

What You’ll Need:

7+ years in infrastructure, platform engineering, SRE, or DevOps.
Hands-on ownership of Kubernetes, Helm, and Terraform in production environments.
Deep experience with observability stacks (Prometheus, Grafana, Datadog) and on-call operations.
Experience with CI/CD pipeline ownership and developer tooling.
Ability & willingness to read backend code to understand how systems break and instrument them correctly.
Fluency with AI tools - LLMs and agentic frameworks to automate, debug faster, and reduce toil.
A startup-ready mindset: comfortable with ambiguity, moving fast, and owning problems end-to-end.

Nice To Have:

Data pipelines, replication systems, or ETL/ELT platforms.
Control plane / data plane architectures or internal developer platforms.
Experience with Airbyte, CDKs, or connector-based architectures.

Location:

Onsite 4 days/week in San Francisco, CA

Why You'll Love Working at Airbyte:

At Airbyte, we believe great work happens when people feel supported, trusted, and empowered to grow. Our market-leading Total Rewards package is designed to help you thrive professionally and personally. Our benefits and perks include:

Flexible PTO with a culture that encourages at least 25 days off annually
16 weeks fully paid parental leave for all parents
Comprehensive medical, dental, and vision coverage for employees and dependents
401(k) retirement plan
Professional development budget, conference sponsorship, and book reimbursement
Commuter benefits and monthly internet reimbursement
Breakfast and lunch in our San Francisco office
A collaborative, in-person culture focused on learning, growth, and impact

If you find this role exciting, we encourage you to apply even if you think you don’t meet all of the requirements!

Airbyte is an equal opportunity employer that does not discriminate on the basis of actual or perceived race, creed, color, religion, national origin, ancestry, age, physical or mental disability, pregnancy, genetic information, sex, sexual orientation, gender identity or expression, marital status, familial status, domestic violence victim status, veteran or military status, or any other legally recognized protected basis under federal, state or local laws. Pursuant to the San Francisco Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.

Airbyte is committed to providing reasonable accommodations for qualified individuals with disabilities in our job application procedures. Please let us know if you need assistance or accommodations due to a disability.

Skills Required

7+ years in infrastructure, platform engineering, SRE, or DevOps
Hands-on ownership of Kubernetes, Helm, and Terraform in production environments
Deep experience with observability stacks (Prometheus, Grafana, Datadog) and on-call operations
Experience with CI/CD pipeline ownership and developer tooling
Ability & willingness to read backend code to understand how systems break
Fluency with AI tools - LLMs and agentic frameworks for automation

View all jobs at Airbyte

View Airbyte Profile

Report Job

Am I A Good Fit?

beta

Get Personalized Job Insights.

Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company

HQ: San Francisco, CA

120 Employees

Year Founded: 2020

What We Do

Airbyte specializes in open-source data integration, designed to centralize data from diverse sources into storage solutions like data warehouses and lakes. Supporting over 400 connectors and a self-serve, extensible framework, Airbyte enables organizations to move both structured and unstructured data seamlessly for uses like AI, analytics, and business intelligence. Airbyte’s flexibility in deployment—whether cloud, hybrid, or on-premises—prioritizes data security, compliance, and governance, making it ideal for complex, scalable data needs across industries.

Why Work With Us

Airbyte is extremely transparent both internally and externally. Our company handbook, culture & values, strategy, and roadmap are open to all. https://handbook.airbyte.com/