Senior Site Reliability Engineer

Sorry, this job was removed at 04:02 a.m. (CST) on Thursday, Jan 15, 2026
Be an Early Applicant
Hiring Remotely in Australia
Remote
Big Data • Software
The Role

Aerospike is the real-time database for mission-critical use cases and workloads, including machine learning, generative, and agentic AI. Aerospike powers millions of transactions per second with millisecond latency, at a fraction of the total cost of ownership compared to other databases.

Global leaders, including Adobe, Airtel, Barclays, Criteo, DBS Bank, Experian, Grab, HDFC Bank, PayPal, Sony Interactive Entertainment, The Trade Desk, and Wayfair, rely on Aerospike for customer 360, fraud detection, real-time bidding, profile stores, recommendation engines, and other use cases. 

 At Aerospike, we dream big and deliver even bigger. Our mission is to unleash the power of the world’s real-time data with a database built for infinite scale, speed, and sustainability.

If you're ready to shape the future of data, join us.

Senior Site Reliability Engineer

As a Senior Site Reliability Engineer (SRE) for Aerospike, you will be instrumental in designing, building, and optimizing a scalable, highly resilient cloud platform. You will focus on improving reliability, performance, and automation to ensure seamless delivery and operation of our cloud platform services. Your responsibilities will include developing robust infrastructure, implementing intelligent monitoring systems, and driving continuous improvement initiatives that enhance system efficiency, scalability, and overall platform stability.

Key Responsibilities

  • Designing, deploying, and optimizing large-scale Aerospike cloud platform infrastructure and services across multiple environments
  • Leading the development and enhancement of automation and infrastructure-as-code solutions to improve operational efficiency
  • Building and maintaining monitoring, alerting, and observability implementations to proactively detect and resolve system issues
  • Leading incident response activities, conducting post-mortems, and driving continuous improvement initiatives
  • Designing and enforcing security best practices for cloud infrastructure and access control
  • Collaborating with development teams to ensure reliable service delivery and alignment with SRE best practices
  • Participating in on-call rotation, responding to critical incidents and minimizing downtime through proactive mitigation strategies
  • Establishing documentation standards, runbooks, and system configurations for team knowledge sharing
  • Leading capacity planning and performance optimization efforts
  • Mentoring junior engineers and sharing knowledge to build team capabilities
Required Experience
  • 6+ years of experience in Site Reliability Engineering (SRE), DevOps, or related fields, with a focus on building scalable, resilient, and automated cloud-based systems
  • Hands-on experience designing, deploying, and optimizing production-grade, business-critical systems in cloud environments
  • Expertise with at least one major public cloud provider (AWS, Google Cloud, or Azure), including cloud-native services and architectures
  • Strong proficiency in infrastructure-as-code (IaC) tools such as Terraform to enable automated and reproducible infrastructure
  • Experience in CI/CD pipeline design and implementation, enabling seamless, automated software delivery and infrastructure updates
  • Deep understanding of Linux/Unix systems, networking fundamentals, and distributed system architectures
  • Proficiency in scripting and software development using Python, Bash, or Go to build automation, tooling, and infrastructure enhancements
  • Experience with containerization and orchestration technologies such as Docker and Kubernetes for efficient service deployment and scaling
  • Hands-on experience with monitoring, logging, and observability tools (e.g., Prometheus, Grafana, Datadog, Elasticsearch, Kibana) to drive data-driven system improvements
  • Strong problem-solving skills with an engineering-first mindset for improving system reliability, scalability, and performance
  • Experience implementing security best practices for cloud infrastructure, access control, and data protection
  • Excellent English communication skills (verbal and written) to collaborate effectively across teams and document key processes

Preferred Skills and Qualifications

  • Hands-on experience managing and optimizing database deployments and services in production environments, ensuring high availability and performance
  • Familiarity with Aerospike or other distributed NoSQL databases
  • Advanced understanding of security practices and implementation in cloud environments
  • Relevant industry certifications, such as AWS Certified DevOps Engineer, AWS Certified Solutions Architect, Google Professional Cloud DevOps Engineer, or equivalent
  • Kubernetes certifications such as Certified Kubernetes Administrator (CKA), Certified Kubernetes Application Developer (CKAD), or Certified Kubernetes Security Specialist (CKS)
  • Proficiency with configuration management tools (Ansible, Terraform, or similar) in complex environments
  • Experience leading collaborative development practices and advanced version control workflows

Aerospike is an Equal Opportunity Employer. We are committed to providing an environment free from discrimination on the basis of race, religion, color, sex, gender identity, sexual orientation, age, non-disqualifying physical or mental disability, national origin, veteran status, or any other basis covered by appropriate law.




Aerospike Compensation & Benefits Highlights

The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about Aerospike and has not been reviewed or approved by Aerospike.

  • Fair & Transparent Compensation Compensation is characterized as very competitive across roles, with total compensation (cash, equity, and benefits) seen favorably. Market-aligned engineering and sales packages contribute to overall pay satisfaction.
  • Healthcare Strength Health coverage is described as comprehensive with strong medical plans and FSA/HSA options. This breadth supports a positive view of the overall rewards package.
  • Leave & Time Off Breadth Flexible Time Off alongside company holidays is emphasized. Remote-friendly norms and the ability to take time as needed reinforce time-off flexibility.

Aerospike Insights

Similar Jobs

In-Office or Remote
4 Locations
340 Employees
Remote
Australia
91 Employees
80K-150K Annually

Dynatrace Logo Dynatrace

Senior Site Reliability Engineer

Artificial Intelligence • Big Data • Cloud • Information Technology • Software • Big Data Analytics • Automation
Remote or Hybrid
Sydney, New South Wales, AUS
5200 Employees

Algolia Logo Algolia

Senior Site Reliability Engineer

Natural Language Processing • Software
Remote
Australia
700 Employees
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Mountain View, CA
191 Employees
Year Founded: 2009

What We Do

The Aerospike Real-time Data Platform enables organizations to act instantly across billions of transactions while reducing server footprint up to 80%. The Aerospike multi-cloud platform powers real-time applications with predictable sub-millisecond performance up to petabyte scale with five-nines uptime with globally distributed, strongly consistent data. Applications built on the Aerospike Real-time Data Platform fight fraud, provide recommendations that dramatically increase shopping cart size, enable global digital payments, and deliver hyper-personalized user experiences to tens of millions of customers. Customers such as Airtel, Experian, European Central Bank, Nielsen, PayPal, Snap, Verizon Media and Wayfair rely on Aerospike as their data foundation for the future.

Similar Companies Hiring

Fairly Even Thumbnail
Hardware • Other • Robotics • Sales • Software • Hospitality
New York, NY
30 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
Onshore Thumbnail
Software
US
100 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account