Site Reliability Engineer (SRE)

Sorry, this job was removed at 08:20 a.m. (CST) on Thursday, Jul 24, 2025
Be an Early Applicant
Tokyo
Hybrid
Information Technology
The Role
Our Mission:
Driving technology always feels old. Not by a little bit. We believe vehicles can be a thousand times smarter, safer, and more connected to the world around us, and our mission is to see it happen. In 2019, we joined forces with Honda as their first startup acquisition, and now we’re expanding our vision into building the future of electric vehicles (BEV) for millions of people around the world.

Why Drivemode: 
Join Drivemode for an exciting startup environment and a vibrant culture that combines impactful work, competitive compensation, and excellent benefits. By becoming a part of our team, you'll contribute to a crucial mission that revolutionizes the way people engage with vehicles, addressing both business needs and the world's environmental challenges. This presents an exceptional opportunity to be at the forefront of innovation and drive Honda's success in the EV market.

About the Role:
We’re seeking an experienced Site Reliability Engineer to own the reliability, performance, and day-to-day operations of our Kotlin/Swift mobile applications and Kotlin backend services on AWS. You will partner with product engineers and platform engineers to design SLIs/SLOs, automate operations, lead incident response, and drive a “code-driven reliability” culture across time zones.
You will be part of a production-support model where: Level 2 / Level 3 are shared by SREs and feature teams. SREs provide the tooling, coaching, and leadership that make developers excellent on call.

Why Join?
Green-field influence: define SRE culture, tooling, and error-budget policy from day one.
Career trajectory: opportunity to grow into Staff SRE / Reliability Lead as we scale to multiple regions and product lines.
Impact at scale: your work spans globally across multiple regions and product lines.
Engineering-driven org: close collaboration with product, platform, and security teams who value operational excellence.
Competitive salary, flexible remote policy, and an allowance for certifications, conferences, and home lab gear.

What You Will Do:

  • Service Reliability: Define and track SLIs/SLOs & error budgets for backend APIs and mobile release health. Hold teams accountable to reliability goals.
  • Incident Management: Lead the on-call rotations, coordinate incident response, run post-mortems, and eradicate root causes.
  • Observability & Tooling: Own Datadog dashboards, log pipelines, crash analytics (Firebase / Sentry), and feature-flag metrics (LaunchDarkly / ConfigCat).
  • Automation & Elimination of Toil: Write tools and self-healing runbooks in Kotlin, Rust, Go, or Python for rollbacks, DB failovers, chaos tests, and config drift detection.
  • Capacity & Performance: Forecast load, run stress / load tests, tune JVM & Graal settings for Kotlin services, and advise on RDS & Redis scaling.
  • Disaster Recovery & Chaos Engineering: Design BCP/DR playbooks; run game days to validate recovery objectives.
  • Cost & FinOps: Instrument cost metrics and collaborate with Finance to keep AWS spend within agreed “cost budgets.”
  • Security & Compliance Support: Monitor GuardDuty / CSPM alerts, be prepared and participate in security incident response.
  • Developer Partnership: Pair with mobile & backend engineers on instrumentation, release gates, and staged roll-outs; mentor teams in SLO thinking via brown-bag sessions.

What We Are Looking For:

  • 3 + years in SRE, DevOps, or backend engineering for high-traffic services
  • Proficient in at least one of Kotlin / Java, Rust, Go, or Python
  • Deep Linux & networking fundamentals and hands-on AWS (ECS, ALB/NLB, RDS, S3, IAM, CloudWatch)
  • Production experience with Datadog (or Prometheus / OpenTelemetry) for metrics, traces, and logs
  • Incident response expertise: runbooks, RCA, post-mortems, and blameless culture
  • Practical knowledge of relational DB (PostgreSQL/RDS) and Redis operations
  • Familiarity with Kubernetes (EKS) concepts, Helm/OPA, container networking, and rolling releases
  • Excellent communication skills; able to coach developers and influence process improvements.

Nice to have:

  • AWS, CKA certifications
  • Experience with feature-flag systems, chaos-engineering tools
  • Prior work in regulated or enterprise-integrated environments (e.g., automotive, fintech)

EEOC Statement: Drivemode is proud of a very diverse team with employees coming from 5 continents/20 countries as of today. Diversity in our workplace has played an important part in our success; we recognize each employee’s unique background, knowledge, experiences, ideas, and viewpoints which are all critical in developing a product that has the greatest impacts on drivers all over the world. Drivemode provides equal opportunities to all employees and applicants for employment without regard to race, religion, color, age, gender, national origin, sexual orientation, gender identity, disability, or any other characteristics that make you unique. 

Similar Jobs

Autify Logo Autify

Engineering Manager

Artificial Intelligence • Software • Automation
In-Office or Remote
Tokyo, JPN
74 Employees
10M-13M Annually
Easy Apply
In-Office
Tokyo, JPN
402 Employees
Easy Apply
In-Office
Tokyo, JPN
10000 Employees

Zeals Logo Zeals

Site Reliability Engineer

Artificial Intelligence • eCommerce • Social Media
Easy Apply
In-Office
Tokyo, JPN
211 Employees
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Redwood City, CA
30 Employees
Year Founded: 2014

What We Do

Drivemode enables smarter, safer, connected driving in any vehicle.

Drivemode was founded in 2014 by entrepreneurs from Zipcar and Tesla Motors who set out to fundamentally change the way consumers use technology in the car. Drivemode offers a mobile-based connected car platform through a consumer-facing Android app, driver assistance and analytics for fleet managers, and a bring-your-own-device connected car solution for automakers. The Drivemode app transforms a user’s phone into a car’s central computing device allowing voice-to-text messaging, music player overlay on navigation, “Do Not Disturb” mode, message auto-reply, and personalized travel recommendations. The Drivemode app has an automotive-grade interface designed and developed to adhere to National Highway Traffic Safety Administration safety guidelines for driving apps. Drivemode has raised $9.2M from industry leaders. Learn more at https://drivemode.com or download Drivemode at bit.ly/getdrivemode.

Similar Companies Hiring

Axle Health Thumbnail
Logistics • Information Technology • Healthtech • Artificial Intelligence
Santa Monica, CA
17 Employees
Scrunch AI Thumbnail
Software • SEO • Marketing Tech • Information Technology • Artificial Intelligence
Salt Lake City, Utah
Standard Template Labs Thumbnail
Software • Information Technology • Artificial Intelligence
New York, NY
15 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account