Senior SRE/DevOps Engineer

Posted 4 Days Ago
Hiring Remotely in United States
Remote
Senior level
Big Data
The Role
As a Senior SRE/DevOps Engineer, you will own the application stack and AWS infrastructure, debug runtime issues, develop internal tooling for managed Metabase installations, and improve automated deployments and testing.

Metabase is the easiest way for people to get insights from their data, from tiny startups who get up and running quickly to major corporations with tens of thousands of users. That's why people love us.


We bring data tools with the elegance and simplicity of consumer products to the crufty world of enterprise business intelligence. We provide an opinionated open source starting point for how companies should measure, analyze and share their data, which is used by tens of thousands of companies.


Tens of thousands of companies use Metabase every day to answer questions about their data. While we seek to become the de facto self-managed open source analytics software for organizations everywhere, many customers want to use Metabase without worrying about the operational details of self-hosting. That’s why we recently launched our Metabase Cloud product. We’re looking for operations engineers to help build out and run our new and quickly growing ‘Metabase Cloud’ hosted product.

You will:

  • Own and operate our application stack and AWS infrastructure to orchestrate and manage our hosted customer instances of Metabase
  • Debug runtime issues across every level of our application and hosting stacks
  • Develop and build our internal tooling and automation to manage the lifecycle of a hosted Metabase installation, from purchase to deployment, zero-downtime upgrades, and general operational health
  • Continuously improve our automated deployments and testing

We're looking for someone who:

  • Is thoughtful and careful
  • Compulsively automates everything and documents it
  • Is able to make solid technical judgements and back them up articulately
  • Has at least 5 years of experience building and operating production infrastructure, ideally on public cloud
  • Has strong Kubernetes and AWS experience
  • Has strong experience with IaC and Terraform
  • Can write high-quality, readable code in a modern language (e.g., Python or Go)
  • Has experience with modern monitoring stacks (e.g., Prometheus, Grafana, Datadog)

Projects you could work on:

  • Multi-region hosting
  • Automate EKS cluster provisioning
  • Extend our CRDs and Operators
  • Improve the RDS sharding strategy for our multi-tenant platform
  • Unify and improve our CI/CD platforms
  • Collaborate with core application developers on changes to improve our application metrics, deployment speeds, and CI integration
  • Maintain our SOC2 compliance and security posture



We're a global team (50% outside the US), fully distributed (from Thailand to California), who get things done asynchronously, with plenty of uninterrupted time, supporting each other to do the best work of our careers. We offer flexibility (define your own schedule and work from wherever you want), autonomy, and an environment that fosters growth, learning, and development. We're relentlessly user-focused and believe in building long-term value, not short-term hacks. And we raised a $30M Series B to take our approach to the next level for years to come.


For U.S. applicants: Metabase participates in the federal E-Verify program, which confirms employment authorization of newly hired U.S. based employees. E-Verify is not used as a tool to pre-screen candidates and is only initiated upon hire.


E-Verify Participation Notice (English/Spanish)

Right to Work Notice (English/Spanish)

Top Skills

Go
Python
The Company
HQ: San Francisco, CA
84 Employees
Remote Workplace
Year Founded: 2014

What We Do

Metabase is bringing data tools with the elegance and simplicity of consumer products to the crufty world of enterprise business intelligence. We provide an opinionated open source starting point for how companies should measure, analyze and share their data as well as a suite of tools to deal with the complexity that arises as they grow.

We're hiring and would love for you to join us.
