Software Engineer III, Reliability

Posted 10 Hours Ago
Redwood City, CA, USA
Hybrid
165K-207K Annually
Mid level
Cloud • Information Technology • Software
The Role
The Software Engineer III (Reliability) will enhance the performance and scalability of services, analyze metrics, build testing frameworks, drive efficiency, and work across teams to improve the reliability of the platform.
Summary Generated by Built In

Box (NYSE:BOX) is the leader in Intelligent Content Management. Our platform enables organizations to fuel collaboration, manage the entire content lifecycle, secure critical content, and transform business workflows with enterprise AI. We help companies thrive in the new AI-first era of business. Founded in 2005, Box simplifies work for leading global organizations, including JLL, Morgan Stanley, and Nationwide. Box is headquartered in Redwood City, CA, with offices across the United States, Europe, and Asia.

By joining Box, you will have the unique opportunity to continue driving our platform forward. Content powers how we work. It’s the billions of files and information flowing across teams, departments, and key business processes every single day: contracts, invoices, employee records, financials, product specs, marketing assets, and more. Our mission is to bring intelligence to the world of content management and empower our customers to completely transform workflows across their organizations. With the combination of AI and enterprise content, the opportunity to transform how the world works together has never been greater, and at Box you will be on the front lines of this massive shift.

WHY BOX NEEDS YOU

The Reliability Engineering team at Box ensures our platform delivers world-class performance, scalability, and reliability as we continue to serve millions of users worldwide. As our business grows, so does the complexity of operating distributed systems at scale. Our mission is to proactively identify and solve the hardest reliability and performance challenges across Box’s infrastructure, working closely with product and platform teams to build resilient, scalable, and highly performant services.

As a Software Engineer III (SWE 3) on the Reliability Engineering team, you’ll have a direct impact on the performance and scalability of our most critical services. You’ll partner with engineering teams across the company to analyze complex system behaviors under load, design scalable solutions, and build testing frameworks that validate service reliability before issues arise in production. This role provides a unique opportunity to work broadly across our technical stack and make meaningful contributions to Box’s long-term scalability and customer experience.

WHAT YOU’LL DO

  • Partner with product and platform engineering teams to assess service designs for scalability and performance risks, ensuring systems are built for long-term growth.
  • Analyze production workloads, system metrics, and load test results to identify bottlenecks, resource inefficiencies, and architectural scaling limits.
  • Design and build frameworks for load testing, capacity modeling, and performance validation that enable teams to proactively address scale concerns.
  • Drive improvements in backend service efficiency, API response times, and resource utilization across Box’s globally distributed platform.
  • Collaborate with SRE, infrastructure, and platform teams to optimize scaling strategies, auto-scaling policies, and resource allocation.
  • Build automation and tooling that integrate performance validation into CI/CD pipelines, enabling early detection of regressions.
  • Participate in root cause analysis of performance-related incidents, identify systemic issues, and drive cross-team remediation efforts.
  • Contribute to the evolution of observability standards (SLIs, SLOs, latency/error budgets) that measure and guide service health.

WHO YOU ARE

  • 3+ years of experience in software engineering, performance engineering, or site reliability engineering, with a focus on backend systems and scalability.
  • Proficient in one or more programming languages such as Go or Java, with an emphasis on building performant services.
  • Strong understanding of distributed systems, concurrency, resource contention, and efficient system design.
  • Hands-on experience analyzing and improving application and system performance across compute, storage, database, and networking layers.
  • Familiarity with load testing and performance benchmarking tools (e.g., Locust, JMeter, Gatling, or custom frameworks).
  • Experience working with cloud infrastructure (AWS, GCP) and container orchestration (Kubernetes).
  • Proficient with observability tools and telemetry systems (e.g., Prometheus, Chronosphere, Grafana, Datadog, ELK).
  • Excellent problem-solving and analytical skills, with a data-driven approach to diagnosing complex system behaviors.
  • Strong collaboration and communication skills; comfortable partnering across engineering teams to drive reliability improvements.

PREFERRED QUALIFICATIONS

  • Experience with service mesh technologies (Istio, Envoy) and cloud-native networking performance optimization.
  • Exposure to capacity planning, cost optimization, and long-term resource forecasting in cloud environments.
  • Familiarity with incident response processes, post-incident reviews, and reliability improvement practices.
  • Experience contributing to internal platforms, developer tooling, or performance automation frameworks.

METHODOLOGY

  • Agile management - Scrum
  • Issue tracking tool - Jira
  • Knowledge repository - Confluence
  • Code reviews - GitHub Enterprise
  • Version control system - Git

Box is committed to fair and equitable compensation practices. Actual base salary (or OTE, for a commissionable role) depends on factors such as knowledge, skill level, experience, and work location. This role is also eligible for equity and benefits. For more information on benefits, check out our healthcare benefits and additional Box Benefits + Perks.
 
In accordance with OFCCP compliance, here is the Pay Transparency Provision. 
United States Pay Range
$165,000 - $206,500 USD

Top Skills

AWS
Chronosphere
Datadog
ELK
Gatling
GCP
Go
Grafana
Java
JMeter
Kubernetes
Locust
Prometheus

The Company
HQ: Redwood City, CA
2,500 Employees
Year Founded: 2005

What We Do

Box (NYSE:BOX) is the leading Content Cloud, a single platform that empowers organizations to manage the entire content lifecycle, work securely from anywhere, and integrate across best-of-breed apps. Founded in 2005, Box simplifies work for leading global organizations, including AstraZeneca, JLL, and Nationwide. Box is headquartered in Redwood City, CA, with offices across the United States, Europe, and Asia. Visit box.com to learn more, and visit box.org to learn how Box empowers nonprofits to fulfill their missions.

Why Work With Us

We have an inclusive culture that is based on development and growth. We value our people as individuals and know that they can make an impact when properly empowered. We fill 30% of all of our open positions with internal people. Everyone is an owner and we are candid with each other in order to learn.
