Software Engineer, Site Reliability

Posted 8 Days Ago
Be an Early Applicant
Tokyo
In-Office
6M-12M Annually
Mid level
Sales • Software • Automation
Building a world where challenges goes beyond existing boundaries
The Role
As a Site Reliability Engineer, you'll manage and enhance AWS and Kubernetes platforms, improve reliability, automate operations, and partner with engineers to ensure high standards in performance and security.
Summary Generated by Built In

Contract Type: Full-time Employee

Salary Range: 6,000,000 ~ 12,000,000 JPY

About Us

At Sales Marker, our mission is to create a world where all people and companies can challenge themselves beyond existing boundaries. We are one of the fastest-growing startups in Japan—scaling at more than twice the pace of typical unicorns. Our flagship product, Sales Marker, empowers sales teams to achieve 3 times greater efficiency. In just two years since launch, we’ve achieved 2,000% growth, and today our year-over-year business growth stands at 270%—and this is only the beginning.

Backed by strong financial growth, we’re expanding into a bold portfolio of new products, empowering businesses to grow from every angle: Sales, Marketing, Recruiting, AI agents and so on.
More about us: https://corp.sales-marker.jp/

The Team

Our co-founders come from leading global companies and were recognized on the Forbes 30 Under 30 Asia List (2023).

Our Product & Engineering team is proudly global, with members from 24+ countries and backgrounds at top tech companies such as Google, Microsoft, Indeed, Mercari, LINE, Yahoo, and SmartNews etc.

At Sales Marker, you’ll join a global, ambitious, and fast-moving team where your ideas truly shape the future. We’re building an engineering culture around:

  • Customer Obsession – solving real problems and exceeding expectations

  • Ownership – taking responsibility end to end, across roles and functions

  • 10x – aiming for bold impact, moving fast, and disrupting old standards

The Role

The Common Foundation team helps engineering teams move faster by providing scalable, reliable, and reusable systems that serve as the platform for product development. The Platform Foundation side focuses on reliability, performance, security, and developer productivity across our cloud infrastructure and Kubernetes platform. We build paved roads, automate operations, and ensure that application teams can ship safely at speed.

We're looking for a Site Reliability Engineer who can own the health and evolution of our platform. You’ll design and operate our AWS and Kubernetes environments, lead reliability initiatives, and partner with product engineers to embed best practices in availability, observability, and performance. You’ll turn complex infrastructure into simple, well-documented, self-service building blocks.

Responsibilities
  • Operate and improve our Kubernetes platform (EKS), including cluster lifecycle, upgrades, scaling, networking, and multi-tenant isolation.

  • Design, provision, and manage AWS infrastructure (VPC, RDS/Aurora, OpenSearch, S3, SQS, Lambda, API Gateway, Batch, Glue) with a strong focus on security, reliability, and developer experience.

  • Build infrastructure as code using Terraform and AWS CDK. Establish standards for modules, environments, and change management via GitOps.

  • Drive observability end to end: metrics, logs, traces, SLOs, error budgets, and actionable dashboards and alerts in Datadog.

  • Partner with backend engineers to improve service reliability, performance, and cost efficiency. Champion best practices in testing, rollout strategies, and production readiness.

  • Automate operations and repetitive work with tooling and pipelines. Reduce MTTR with improved runbooks, diagnostics, and incident tooling.

  • Lead incident response and post-incident reviews. Raise the operational bar through blameless retros, remediation plans, and reliability roadmaps.

  • Strengthen platform security through identity and access control, secrets management, network policies, patching, and vulnerability management.

  • Support data workloads and pipelines with robust, scalable infrastructure and monitoring.

  • Contribute to platform documentation, paved paths, and self-service developer workflows to accelerate delivery.

What We're Looking For

Required

  • 3+ years in SRE, Platform, or Infrastructure Engineering with production ownership of cloud-native systems.

  • Strong experience running Kubernetes in production, including upgrades, scaling, and workload reliability.

  • Deep hands-on expertise with AWS services (networking, compute, storage, databases, messaging) and secure-by-default architectures.

  • Proficiency with IaC (Terraform and/or AWS CDK), modularization, and environment management.

  • Solid observability fundamentals: metrics, logging, tracing, SLOs/error budgets, actionable alerting.

  • Proven track record improving reliability, performance, and developer experience in partnership with application teams.

  • Experience running incident response and driving post-incident improvements.

  • Experience with algorithms, data structures, complexity analysis, and software design.

  • Experience in development using one or more of the following languages, C, C++, Java, Python, Go.

Nice to Haves

  • Experience with identity and access management patterns, Cognito, JWT, and service-to-service auth.

  • Background in multi-tenant architectures, capacity planning, and cost optimization.

  • History of handling major incidents at scale and building tooling to reduce MTTR/MTTD.

  • Contributions to internal developer platforms, golden paths, or shared libraries.

  • Fluency in English or Japanese.

Our Tech Stack

[Front-end]

- TypeScript, React, NextJS;

- Testing: Storybook, jest, playwright;

- Hosting: Amplify;

- Feature flag: Unleash;

[Server Side/Back-End]

- Infrastructure: AWS, EKS, ElasticBeanstalk;

- DB: Aurora, ElasticSearch, Redis;

- Languages: Go, Typescript;

- Analysis environment: Athena, Superset;

- Monitoring: DataDog;

- Others: AWS Lambda, AWS Batch, AWS API Gateway, AWS Glue, AWS S3;

Why Us?
  • One of the fastest growing Saas startup in Japan with strong financial growth.

  • Innovative new product development and opportunity to build things from scratch.

  • Plenty of leadership and career development opportunities.

  • Hybrid work environment & full flexible work schedules.

  • Global team and English speaking environment.

  • Great benefits & perks packages such as Resort Worx, Purchasing Books, Free Weekly Lunch, Offsites, etc.

Working Style
  1. Hybrid Work
    We follow a hybrid work style, combining both office and remote work. Recommended in-office days vary by role. Even when working remotely, we maintain smooth collaboration and communication through tools like Zoom, Google Meet, and Gather.

  2. Flex Work
    You can customize your working hours to suit your day. For business and client-facing teams, schedules are often arranged around client meetings.

  3. Global Environment
    With team members from over 20 countries, we bring together diverse perspectives and ideas, driving projects forward across languages and cultures in an environment where English and Japanese blend naturally into daily communication.

Read More
  • Career Page: https://sales-marker.jp/corporate/en/

  • Culture Book: https://speakerdeck.com/salesmarker/sales-marker-culturebook-en

  • YouTube: https://www.youtube.com/watch?v=Ob8Ds06zwo0

DEV-005

Top Skills

AWS
Aws Cdk
C
C++
Datadog
Go
Java
Kubernetes
Python
Terraform
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Shibuya-Ku, Tokyo
211 Employees
Year Founded: 2021

What We Do

Sales Marker, a leading BtoB sales intelligence platform, combines a database of 5 million corporate entities with intent data for precise targeting of companies with immediate needs.

Implementing the latest methodology known as intent-based sales, Sales Marker identifies companies requiring immediate attention by analyzing web search behavior. Targeting these high-conversion prospects, AI-automated actions are initiated through the most effective channels for seamless engagement.

Similar Jobs

CrowdStrike Logo CrowdStrike

Sales Development Representative

Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
Hybrid
Tokyo, JPN
10000 Employees

ServiceNow Logo ServiceNow

Sales Executive

Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Remote or Hybrid
Tokyo, JPN
28000 Employees

ServiceNow Logo ServiceNow

Sales Executive

Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Remote or Hybrid
Tokyo, JPN
28000 Employees

Similar Companies Hiring

Standard Template Labs Thumbnail
Software • Information Technology • Artificial Intelligence
New York, NY
10 Employees
PRIMA Thumbnail
Travel • Software • Marketing Tech • Hospitality • eCommerce
US
15 Employees
Scotch Thumbnail
Software • Retail • Payments • Fintech • eCommerce • Artificial Intelligence • Analytics
US
25 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account