Senior Site Reliability Engineer, AI Research

Reposted 2 Days Ago
Easy Apply
Be an Early Applicant
Hiring Remotely in Australia
Remote
Senior level
Natural Language Processing • Software
The Role
The Senior Site Reliability Engineer will support the AI Research team by ensuring infrastructure stability, overseeing production services on GCP using Kubernetes, and improving CI/CD pipelines while collaborating closely with researchers and engineers.
Summary Generated by Built In

At Algolia, we’re proud to be a pioneer and market leader in AI Search, empowering 17,000+ businesses to deliver blazing-fast, predictive search and browse experiences at internet scale. Every week, we power over 30 billion search requests — four times more than Microsoft Bing, Yahoo, Baidu, Yandex, and DuckDuckGo combined.

In 2021, we raised $150 million in Series D funding, quadrupling our valuation to $2.25 billion. This strong foundation enables us to keep investing in our market-leading platform and serving incredible customers like Under Armour, PetSmart, Stripe, Gymshark, and Walgreens.

About the AI Research Team

The AI Research team at Algolia combines fundamental research with product engineering to deliver customer-facing AI-powered features.

The team is highly cross-functional, made up of PhD researchers, full-stack engineers, and infrastructure specialists working together to explore new ideas, validate impact, and bring successful research outcomes into production. While the work is research-driven, the output is real, customer-facing systems.

The Opportunity

We are looking for an embedded Senior Site Reliability Engineer to join the AI Research team as a full member of the group. In this role, you will support both the research and product-engineering aspects of the team by ensuring the stability, scalability, and operability of the infrastructure that enables this work.

This is a classic SRE role focused on cloud-first, service-oriented architectures running on Google Cloud Platform. While the team builds AI-powered systems, AI or ML experience is not required for this role. Our priority is strong SRE fundamentals, experience operating production services, and comfort working in an environment with ambiguity and high ownership.

You will play an important role in day-to-day execution as well as in longer-term (12-month) planning, helping shape how the team builds and operates its platforms over time.

What You’ll Work OnPlatform Reliability & Enablement
  • Support and evolve the reliability of platforms used by the AI Research team. Examples of our infrastructure work to date include:
    • A production inference service (embedding model serving API)
    • AI data feature store
    • Internal tools used for novel research and experimentation
    • Infrastructure that combines the above to enable offline testing of customer deployments to agentically discover configuration improvements.
  • Ensure production services meet expectations for availability, latency, and operational readiness, particularly for systems that sit on customer-critical paths
  • Design infrastructure and operational patterns that prioritize iteration speed while maintaining appropriate safeguards for production systems
Embedded Collaboration
  • Work closely with researchers and engineers in a cross-functional setting, acting as an advisor on infrastructure, reliability, and operational concerns
  • Participate directly in team planning and execution, from early exploration through production rollout
  • Help researchers self-serve infrastructure safely and effectively, without becoming a bottleneck
Cloud Infrastructure & Operations
  • Build and maintain Kubernetes-based services on GCP using infrastructure-as-code and GitOps (Terraform, ArgoCD)
  • Own and improve CI/CD pipelines for services written primarily in Go, with some Python-based services
  • Design and operate observability systems using tools such as Datadog
  • Participate in an on-call rotation (relatively light), responding to incidents and helping improve systems over time
What We’re Looking ForRequired Experience
  • Strong experience operating cloud-first infrastructure
  • Hands-on experience running production services on Kubernetes
  • Proficiency with infrastructure-as-code (Terraform) and CI/CD systems
  • Experience supporting production services written in Go (Python experience is a plus)
  • Solid grounding in service reliability, incident response, and operational best practices
  • Comfort working in environments with ambiguity, where problems are not always well-defined upfront
Nice to Have
  • Experience supporting mission-critical internal platforms
  • Exposure to research or experimentation-heavy environments
  • Familiarity working alongside researchers or highly specialized domain experts
Explicitly Not Required
  • AI, ML, or deep learning experience
  • Model training, tuning, or ML framework expertise (e.g. PyTorch, JAX)
Ways This Role May Not Be a Fit

This role may not be a good match if:

  • You are only interested in maintaining existing infrastructure without contributing to what is being built
  • You want to work exclusively on customer-facing product features
  • You are looking to avoid on-call or production systems entirely
  • You are seeking narrowly defined work with low ambiguity and limited ownership
  • You want to build or train AI models yourself rather than enable the systems around them
Why Join the AI Research Team
  • High Impact: Your work directly enables new AI-powered capabilities that reach customers
  • High Agency: You’ll help shape what gets built, how it’s built, and whether it’s worth building
  • Strong Peers: Collaborate with experienced SREs, engineers, and PhD researchers
  • Growth: Build expertise in research-adjacent infrastructure and platform reliability
  • Flexibility: Australia-based role with remote-friendly culture; occasional off-hours collaboration may be required

FLEXIBLE WORKPLACE STRATEGY:

Algolia’s flexible workplace model is designed to empower all Algolians to fulfill our mission to power search and discovery with ease. We place an emphasis on an individual’s impact, contribution, and output, over their physical location. Algolia is a high-trust environment and many of our team members have the autonomy to choose where they want to work and when. 

We have a global presence with offices in Paris, NYC, London, Sydney and Bucharest, however we also offer many of our team members the option to work remotely either as fully remote or hybrid-remote employees. Positions listed as "Remote" are only available for remote work within the specified country. Positions listed within a specific city are only available in that location - depending on the role it may be available with either a hybrid-remote or in-office schedule.

WE’RE LOOKING FOR SOMEONE WHO CAN LIVE OUR VALUES:

  • GRIT - Problem-solving and perseverance capability in an ever-changing and growing environment.
  • TRUST - Willingness to trust our co-workers and to take ownership.
  • CANDOR - Ability to receive and give constructive feedback.
  • CARE - Genuine care about other team members, our clients and the decisions we make in the company.
  • HUMILITY - Aptitude for learning from others, putting ego aside.

We’re looking for talented, passionate people to help build the world’s best search and discovery technology. We value autonomy, diversity, and collaboration. We’re committed to creating an inclusive workplace where everyone is respected and supported—regardless of race, age, ancestry, religion, sex, gender identity, sexual orientation, marital status, color, veteran status, disability, or socioeconomic background.

IMPORTANT NOTICE FOR CANDIDATES - Recruitment Fraud Notice

We’ve recently seen an increase in recruitment scams targeting job seekers. To help protect yourself, please keep the following in mind:

  • Our open positions may appear on third-party job boards, but the best way to apply safely is directly through our careers page.
  • All genuine communication from Algolia will come from an @algolia.com email address. If you receive an email from someone claiming to work at Algolia who does not have an @algolia.com email address, please do not respond or share any personal information.
  • We’ll never ask for payments, purchases, or financial details during the hiring process.

READY TO APPLY?

If you share our values and our enthusiasm for building the world’s best search & discovery technology, we’d love to review your application!

Top Skills

Ci/Cd
Datadog
Go
Google Cloud Platform
Kubernetes
Python
Terraform
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Palo Alto, CA
700 Employees
Year Founded: 2012

What We Do

Algolia is the search-as-a-service platform that enables companies of all sizes to deliver fast and relevant digital experiences that drive real results. With Algolia, consumers are able to find and discover what they want easily across web, mobile, and voice. Algolia allows developers and business teams to build and optimize engaging search experiences to increase online engagement, conversion and revenue. Founded in 2012, we're backed by over $175M in funding from Accel Partners, Alven Capital, Point Nine Capital, Storm Ventures, Salesforce Ventures and others. Our most recent fundraising round was our Series D in 2021 in which we raised $150 million. The team is headquartered in San Francisco with offices in Paris, London, New York City, Bucharest and Sydney. To learn more, visit www.algolia.com.

Similar Jobs

monday.com Logo monday.com

Account Executive

Artificial Intelligence • Productivity • Sales • Software
Remote or Hybrid
Melbourne, Victoria, AUS
3049 Employees

Atlassian Logo Atlassian

Associate Product Management Intern, Summer 2026 Australia

Cloud • Information Technology • Productivity • Security • Software • App development • Automation
In-Office or Remote
Sydney, New South Wales, AUS
11000 Employees

Citadel Securities Logo Citadel Securities

Quantitative Developer/Research Engineer

Information Technology • Software • Financial Services • Quantitative Trading
In-Office or Remote
7 Locations
1900 Employees
250K-350K Annually

Enverus Logo Enverus

Business Development Representative

Big Data • Information Technology • Software • Analytics • Energy
In-Office or Remote
Melbourne, Victoria, AUS
1800 Employees

Similar Companies Hiring

Milestone Systems Thumbnail
Software • Security • Other • Big Data Analytics • Artificial Intelligence • Analytics
Lake Oswego, OR
1500 Employees
Fairly Even Thumbnail
Software • Sales • Robotics • Other • Hospitality • Hardware
New York, NY
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account