Staff Site Reliability Engineer

Sorry, this job was removed at 06:22 p.m. (CST) on Tuesday, Dec 23, 2025
Be an Early Applicant
Bengaluru, Bengaluru Urban, Karnataka, IND
In-Office
Big Data • Software
The Role

About Aerospike

Aerospike is the real-time database for mission-critical use cases and workloads, including machine learning, generative, and agentic AI. Aerospike powers millions of transactions per second with millisecond latency, at a fraction of the total cost of ownership compared to other databases.

Global leaders, including Adobe, Airtel, Barclays, Criteo, DBS Bank, Experian, Grab, HDFC Bank, PayPal, Sony Interactive Entertainment, The Trade Desk, and Wayfair, rely on Aerospike for customer 360, fraud detection, real-time bidding, profile stores, recommendation engines, and other use cases

Headquartered in Mountain View, California, Aerospike has a global presence with offices in London, Bangalore, and Tel Aviv.

This is a hybrid role.Employees are expected to work from the Bangalore office 2-3 times a week.

Site Reliability Engineer

As a Staff Site Reliability Engineer (SRE) for Aerospike, you will be instrumental in architecting, building, and optimizing enterprise-scale, highly resilient cloud platform infrastructure and services. You will focus on establishing reliability, performance, and automation standards to ensure seamless delivery and operation across our cloud platform ecosystem. Your responsibilities will include driving robust infrastructure initiatives across multiple teams, implementing organization-wide monitoring and observability practices, and leading strategic improvement initiatives that enhance system efficiency, scalability, and overall platform stability at enterprise scale.

Key Responsibilities

  • Architecting, deploying, and optimizing enterprise-scale Aerospike cloud platform infrastructure and services across multiple environments
  • Driving the development and standardization of automation, tooling, and infrastructure solutions across multiple engineering teams to improve efficiency at scale
  • Building and establishing monitoring, alerting, and observability standards and implementations across the organization with cutting-edge solutions and best practices
  • Leading complex incident response activities across multiple teams, conducting detailed root cause analysis, and driving systematic improvements
  • Establishing and implementing security best practices and standards for cloud platform infrastructure and services impacting multiple teams
  • Collaborating with development teams and engineering leadership to ensure reliable service delivery and alignment with enterprise-scale SRE best practices
  • Serving as escalation point for critical production incidents, coordinating cross-team mitigation strategies
  • Establishing documentation standards, runbooks, and knowledge sharing practices for operational excellence
  • Leading capacity planning and performance optimization efforts at enterprise scale
  • Mentoring engineers across teams and sharing knowledge to build technical capabilities
Required Experience
  • 8+ years of experience in Site Reliability Engineering (SRE), DevOps, or related fields, with a focus on architecting scalable, resilient, and automated enterprise-scale systems
  • Experience leading complex infrastructure projects, driving measurable improvements in system reliability and performance
  • Deep knowledge of multiple public cloud providers (AWS, Google Cloud, Azure), including advanced cloud-native services and architectures
  • Advanced proficiency in automation, tooling, and infrastructure solutions to enable enterprise-scale automated and reproducible infrastructure
  • Extensive experience in CI/CD pipeline design and implementation, enabling seamless, automated software delivery and infrastructure updates at scale
  • Deep understanding of Linux/Unix systems, advanced networking concepts, and distributed system architectures
  • Comprehensive proficiency in scripting and software development using Python, Bash, Go, or similar languages to build sophisticated automation, tooling, and infrastructure solutions
  • Extensive experience with containerization and orchestration technologies such as Docker and Kubernetes for enterprise-scale service deployment and scaling
  • In-depth experience with monitoring, logging, and observability tools and methodologies to drive data-driven system improvements across multiple teams
  • Advanced problem-solving skills with an engineering-first mindset for improving system reliability, scalability, and performance at enterprise scale
  • Extensive experience implementing security best practices for cloud infrastructure, access control, and data protection across multiple teams
  • Excellent communication and influence skills to collaborate effectively across multiple teams and drive technical decisions

Preferred Skills and Qualifications

  • Extensive experience managing and optimizing database deployments and services in production environments at enterprise scale, ensuring high availability and performance
  • Deep expertise with Aerospike or other distributed NoSQL databases, including advanced features and enterprise-scale deployment optimization
  • Comprehensive understanding of security principles and implementation in complex cloud environments across multiple teams
  • Advanced industry certifications, such as AWS Solutions Architect Professional, Google Professional Cloud Architect, Azure Solutions Architect Expert, or equivalent
  • Advanced Kubernetes certifications (CKA, CKD, CKS) with extensive experience managing Kubernetes at enterprise scale
  • Advanced proficiency with configuration management and automation tools in complex, multi-team environments
  • Experience leading technical initiatives, mentoring, and driving best practices across multiple engineering teams.

 Aerospike is an Equal Opportunity Employer. We are committed to providing an environment free from discrimination on the basis of race, religion, color, sex, gender identity, sexual orientation, age, non-disqualifying physical or mental disability, national origin, veteran status, or any other basis covered by appropriate law.

Aerospike Compensation & Benefits Highlights

The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about Aerospike and has not been reviewed or approved by Aerospike.

  • Fair & Transparent Compensation Compensation is characterized as very competitive across roles, with total compensation (cash, equity, and benefits) seen favorably. Market-aligned engineering and sales packages contribute to overall pay satisfaction.
  • Healthcare Strength Health coverage is described as comprehensive with strong medical plans and FSA/HSA options. This breadth supports a positive view of the overall rewards package.
  • Leave & Time Off Breadth Flexible Time Off alongside company holidays is emphasized. Remote-friendly norms and the ability to take time as needed reinforce time-off flexibility.

Aerospike Insights

Similar Jobs

OneTrust Logo OneTrust

Site Reliability Engineer

Artificial Intelligence • Cloud • Information Technology • Security • Social Impact • Software • Cybersecurity
Hybrid
Bengaluru, Bengaluru Urban, Karnataka, IND
2000 Employees

AlphaSense Logo AlphaSense

Site Reliability Engineer

Artificial Intelligence • Fintech • Machine Learning • Natural Language Processing • Business Intelligence
Remote or Hybrid
India
2000 Employees

Jobs for Humanity Logo Jobs for Humanity

Site Reliability Engineer

Artificial Intelligence • HR Tech • Information Technology • Social Impact
In-Office
Bengaluru, Bengaluru Urban, Karnataka, IND
100 Employees
In-Office
Bengaluru, Bengaluru Urban, Karnataka, IND
13042 Employees
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Mountain View, CA
191 Employees
Year Founded: 2009

What We Do

The Aerospike Real-time Data Platform enables organizations to act instantly across billions of transactions while reducing server footprint up to 80%. The Aerospike multi-cloud platform powers real-time applications with predictable sub-millisecond performance up to petabyte scale with five-nines uptime with globally distributed, strongly consistent data. Applications built on the Aerospike Real-time Data Platform fight fraud, provide recommendations that dramatically increase shopping cart size, enable global digital payments, and deliver hyper-personalized user experiences to tens of millions of customers. Customers such as Airtel, Experian, European Central Bank, Nielsen, PayPal, Snap, Verizon Media and Wayfair rely on Aerospike as their data foundation for the future.

Similar Companies Hiring

Fairly Even Thumbnail
Hardware • Other • Robotics • Sales • Software • Hospitality
New York, NY
30 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account