Infrastructure Engineer - Observability (APAC)

Posted 10 Days Ago
Be an Early Applicant
3 Locations
Remote
Senior level
Database
The Role
Operate and scale observability infrastructure: own Kubernetes for logging, tracing, and metrics; implement org-wide telemetry standards; run and scale VictoriaMetrics, OpenTelemetry Collector, and Vector; ensure high availability and reliability for internal and external stakeholders.
Summary Generated by Built In

We are seeking a seasoned infrastructure operations expert that has experience with orchestrating high-throughput data services.
Experience that showcases high availability and systems reliability skillsets in high volume data pipeline environments are a big plus.


What you'll own
  • Collaborate deeply with our infrastructure and product teams to enforce org-wide practices for emitting and collecting telemetry across a wide range of services, both internal and external-facing. This includes contributing to org-wide documentation, advocacy of best practices and helping to enforce standards org-wide.

  • Own and operate the Kubernetes infrastructure of the observability team. You will help in defining the documentation, operational flows, and engineering standards to ensure high uptime across our logging, tracing, and metrics systems that are used by internal and external stakeholders

  • Work within the Observability team to ensure industry-standard deployment and reliability practices are used, and to develop industry-leading reliability software to ensure that our observability systems never go down for our customers.

  • Orchestrate and scale systems such as VictoriaMetrics, OpenTelemetry Collector, and Vector.

What you bring
  • 5+ years of experience in a Site Reliability Engineering role

  • Experience operating and supporting clustered applications in production environments

  • Hands-on experience deploying and managing applications in Kubernetes (k8s) environments

  • Working knowledge of PostgreSQL, including administration, performance tuning, and troubleshooting

  • Proficiency with at least one Infrastructure as Code (IaC) tool (e.g., Terraform, Pulumi, OpenTofu, or equivalent)

  • Experience with telemetry tooling such as OpenTelemetry, VictoriaMetrics, Grafana, Prometheus.

  • Experience with AWS services is a plus

  • Strong documentation and communication skills is a plus

What We Offer
  • Fully Remote

    We hire globally. We believe you can do your best work from anywhere. There are no Supabase offices, but we provide a WeWork membership or co-working allowance you can use anywhere in the world.

  • ESOP

    Every team member receives ESOP (equity ownership) in the company. We want everyone to share in the upside of what we’re building together.

  • Tech Allowance

    Use this budget to set up your ideal work environment—laptop, monitor, headphones, or whatever helps you do your best work.

  • Health Benefits

    Supabase covers 100% of health insurance for employees and 80% for dependents, wherever you are. Your wellbeing and your family’s health are important to us.

  • Annual Off-Sites

    Once a year, the entire company gathers in a new city for a week of connection, collaboration, and fun. It’s a highlight of our year.

  • Flexible Work

    We operate asynchronously and trust you to manage your own time. You know what needs to be done and when.

  • Professional Development

    Every team member receives an annual education allowance to spend on learning—courses, books, conferences, or anything that supports your growth.

About the Team

Supabase was born-remote and open-source-first. We believe our globally distributed team is our secret weapon in building tools developers love.

  • 280+ team members

  • 55+ countries

  • 20+ languages spoken

  • $500M raised

  • 500,000+ community members

We move fast, build in public, and use what we ship. If it’s in your project, we probably use it in ours too. We believe deeply in the open-source ecosystem and strive to support—not replace—existing tools and communities.

Hiring Process

We keep things simple, async-friendly, and respectful of your time:

  1. Apply – Our team will review your application.

  2. Intro Call – A short video chat to get to know each other.

  3. Interviews – Up to four calls with:

    • Team Leads

    • Future teammates

    • Someone cross-functional from product, growth, or engineering (depending on the role)

    • Someone from our leadership/founding team

  4. Decision – We may follow up with a final question or go straight to offer.

All communication is remote and we aim to move fast.

Skills Required

  • 5+ years of experience in a Site Reliability Engineering role
  • Experience operating and supporting clustered applications in production
  • Hands-on experience deploying and managing applications in Kubernetes (k8s) environments
  • Working knowledge of PostgreSQL including administration, performance tuning, and troubleshooting
  • Proficiency with at least one Infrastructure as Code (IaC) tool (Terraform, Pulumi, OpenTofu, or equivalent)
  • Experience with telemetry tooling such as OpenTelemetry, VictoriaMetrics, Grafana, Prometheus
  • Experience showcasing high availability and systems reliability in high-volume data pipeline environments
  • Experience with AWS services
  • Strong documentation and communication skills
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
49 Employees
Year Founded: 2020

What We Do

Build in a weekend. Scale to millions. Supabase is an open source Firebase alternative. Start your project with a Postgres Database, Authentication, instant APIs, and realtime subscriptions.

Similar Jobs

Acquia Logo Acquia

Solutions Engineer

AdTech • Cloud • Marketing Tech • Productivity • Software • Analytics • Automation
Easy Apply
Remote or Hybrid
Japan
1100 Employees

Acquia Logo Acquia

Account Manager

AdTech • Cloud • Marketing Tech • Productivity • Software • Analytics • Automation
Easy Apply
Remote or Hybrid
Japan
1100 Employees

Micron Technology Logo Micron Technology

Production DX Engineer

Artificial Intelligence • Hardware • Information Technology • Machine Learning
Remote
Hiroshima, JPN
45000 Employees

Micron Technology Logo Micron Technology

F15 HVM CVD Equipment Engineer

Artificial Intelligence • Hardware • Information Technology • Machine Learning
Remote
Hiroshima, JPN
45000 Employees

Similar Companies Hiring

Apollo.io Thumbnail
Software • Sales • Productivity • Information Technology • Enterprise Web • Database • Artificial Intelligence
US
850 Employees
Perchwell Thumbnail
Mobile • Real Estate • Software • Database • Analytics
New York City, NY
60 Employees
Jellyfish Thumbnail
Big Data • Cloud • Productivity • Software • Database • Analytics • Automation
Boston, MA
225 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account