Founding Reliability Engineer

Posted 11 Days Ago
Be an Early Applicant
San Francisco, CA
In-Office
150K-300K Annually
Mid level
Artificial Intelligence • Information Technology • Robotics • Software
The Role
Sieve is seeking a Founding Reliability Engineer to build and maintain infrastructure for petabyte-scale video workloads, focusing on reliability and security. Responsibilities include incident response, cloud security, and observability systems management.
Summary Generated by Built In
About Us

Sieve is the only AI research lab exclusively focused on video data. We combine exabyte-scale video infrastructure, novel video understanding techniques, and dozens of data sources to develop datasets that push the frontier of video modeling. Video makes up 80% of internet traffic and has become the enabling digital medium powering creativity, communication, gaming, AR/VR, and robotics. Sieve exists to solve the biggest bottleneck in the growth of these applications: high-quality training data.


We’ve partnered with top AI labs and did $XXM last quarter alone, as a team of just 12 people. We also raised our Series A earlier this year from Tier 1 firms such as Matrix Partners, Swift Ventures, Y Combinator, and AI Grant.


About the Role

We process petabytes of video across thousands of nodes and multiple cloud environments. As we scale, reliability, observability, and security become existential.


We’re hiring our first engineer fully dedicated to the infrastructure foundation of Sieve. This is a high-ownership role for someone who thinks deeply about:

  • throughput and system stability

  • monitoring and incident response

  • security and least-privilege design

  • reducing operational burden for the entire engineering team


You’ll work directly with our CTO and our founding engineers to build the core tooling that powers all of engineering.


This role is for someone who spends their time thinking deeply about reliability, throughput, observability, and security. You’re the kind of engineer who is always anticipating failure modes, eliminating operational risk, and designing systems that don’t break.


If something goes down, you take it personally, and you thrive in that level of responsibility.


What You’ll Do
  • Work with engineering to design and validate the infrastructure powering PB-scale workloads

  • Build and maintain Terraform-managed multi-cloud deployments

  • Improve cloud and data security (SSO, IAM, least privilege, auditability)

  • Own incident response and harden systems against failure

  • Develop CI/CD systems that minimize user error and maximize safety

  • Build monitoring + alerting platforms (Prometheus, OpenTelemetry, VictoriaMetrics)

  • Wrap internal reliability tooling with simple UIs for engineers


Requirements
  • 3+ years building internal infrastructure at scale

  • Experience on-call for Sev 0 / Sev 1 production incidents (L3 preferred)

  • Strong cloud experience (GCP, AWS, Oracle, Cloudflare, etc.)

  • Deep Infrastructure-as-Code experience (Terraform preferred)

  • Familiarity with Argo, Helm, Kustomize, or similar deployment tools

  • Experience operating observability systems (Prometheus, OTel, VictoriaMetrics)

  • Backend fundamentals in Python, Go, Rust, or C++

  • Strong networking + security intuition, including SSO implementation

  • High ownership mindset over critical systems


Bonus
  • Experience building lightweight internal tooling (APIs, dashboards, Svelte)

  • Familiarity with object storage systems (“buckets”)

  • Active GitHub or portfolio projects


Location

In-person at our SF HQ.

Top Skills

AWS
C++
Cloudflare
GCP
Go
Opentelemetry
Oracle
Prometheus
Python
Rust
Terraform
Victoriametrics
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: San Francisco, CA
16 Employees
Year Founded: 2022

What We Do

Sieve is the only AI research lab exclusively focused on video data.

Video already makes up 80% of internet traffic and has become the dominant medium driving creativity, communication, gaming, AR/VR, and robotics. Unlocking the ability to truly model video is the key to breakthroughs across all of these domains but progress has been bottlenecked by one thing: high-quality training data. That’s where Sieve comes in.

We bring together exabyte-scale video infrastructure, novel video understanding techniques, and dozens of diverse data sources to create datasets that push the frontier of video modeling. This unique combination allows us to deliver data with unmatched precision, quality, and speed which has earned the trust of frontier AI labs, Fortune 100 companies, and fast-growing generative AI startups.

Similar Jobs

Assort Health Logo Assort Health

Site Reliability Engineer

Artificial Intelligence • Healthtech • Other • Productivity • Telehealth • Conversational AI • Generative AI
In-Office
San Francisco, CA, USA
59 Employees
160K-225K Annually
In-Office
San Francisco, CA, USA
24 Employees

NBCUniversal Logo NBCUniversal

Product Manager

AdTech • Cloud • Digital Media • Information Technology • News + Entertainment • App development
Remote or Hybrid
Hollywood Beach, CA, USA
68000 Employees
110K-140K Annually

Comcast Advertising Logo Comcast Advertising

Account Manager

AdTech • Digital Media • Marketing Tech
Hybrid
Los Angeles, CA, USA
5000 Employees
48K-133K Annually

Similar Companies Hiring

Idler Thumbnail
Artificial Intelligence
San Francisco, California
6 Employees
Fairly Even Thumbnail
Software • Sales • Robotics • Other • Hospitality • Hardware
New York, NY
Bellagent Thumbnail
Artificial Intelligence • Machine Learning • Business Intelligence • Generative AI
Chicago, IL
20 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account