Site Reliability Engineer

Reposted 13 Hours Ago
Be an Early Applicant
Hiring Remotely in Costa Rica
Remote
Junior
Fintech • Internet of Things • Payments • Software
Our mission is to power the world’s best companies to win in the Subscription Economy.
The Role
The Site Reliability Engineer at Zuora is responsible for maintaining and enhancing system reliability, scalability, and performance, while leveraging AI/ML for optimized operations.
Summary Generated by Built In

Costa Rica

Company Overview

At Zuora, we do Modern Business. We’re helping people subscribe to new ways of doing business that are better for people, companies and ultimately the planet. It’s an approach resulting from the shift to the Subscription Economy that puts customers first by building recurring relationships instead of one-time product sales and focuses on sustainable growth. Through our leading expertise and multi-product suite, we are transforming all industries and working with the world’s most innovative companies to monetize new business models, nurture subscriber relationships and optimize their digital experiences.


The Team & Role

Zuora’s Cloud Engineering teams are responsible for Cloud infrastructures, monitoring performance and uptime, managing internal and external shared services, infrastructure services and more -for Zuora’s customer facing SaaS products and platforms. Our technologists sit across US, Beijing, India, Costa Rica and remotely, using a follow-the-sun model to provide 24x7x365 coverage for critical functions and partner closely with our Engineering, Customer Support, Security, Global Services and Sales teams on a daily basis to keep our customers front and center.

We are seeking a highly skilled and motivated Site Reliability Engineer (SRE) to join our infrastructure team. The ideal candidate will be focused on maximizing system uptime, efficiency, and reliability while building the tools and automation necessary to scale our services. This role requires a strong balance of operational experience and development skills, with deep expertise in cloud environments and modern CI/CD practices.


This is a location specific position that requires you to come into the office regularly to be most effective. 


Our Tech Stack: AWS, Microservices, Kafka, Kubernetes, Terraform, Jenkins, Puppet 


What you’ll do

  • Reliability & Performance: Maintain and improve the reliability, scalability, and performance of our production systems, targeting a high-availability environment.
  • Automation: Design, implement, and maintain automation solutions for infrastructure provisioning, deployment, configuration management, and monitoring using Terraform and Jenkins.
  • Infrastructure Management: Administer, manage, and optimize our cloud infrastructure primarily hosted on AWS, focusing on cost efficiency and secure operations.
  • Configuration Management: Develop and maintain infrastructure-as-code using Puppet and/or Ansible to ensure consistent and reproducible environments.
  • Incident Response: Participate in on-call rotation, troubleshoot and resolve critical production incidents, and conduct comprehensive post-mortems to prevent recurrence.
  • System Hardening: Apply strong Linux administration skills to manage, patch, and secure operating systems and underlying infrastructure.
  • Messaging & Data Streams: Manage and optimize distributed messaging systems, specifically Kafka, ensuring high throughput and data integrity.

Your experience

  • 6-8 years of relevant experience on SRE/DevOps
  • Cloud Computing (AWS): Proven hands-on working experience with core AWS services (e.g., EC2, VPC, S3, RDS, IAM, CloudWatch, EKS/ECS).
  • Infrastructure Automation: Deep expertise in infrastructure-as-code principles using Terraform for provisioning and state management.
  • Configuration Management: Expert-level knowledge and practical experience with configuration management tools such as Puppet and/or Ansible.
  • CI/CD Pipeline: Strong experience setting up, maintaining, and enhancing Continuous Integration/Continuous Deployment pipelines using Jenkins.
  • Scripting & Programming: Proficiency in scripting languages, particularly Python and/or Shell scripting, for developing automation tools and performing system administration tasks.
  • Linux Administration: Advanced knowledge of Linux operating systems, including performance tuning, troubleshooting, security, and networking fundamentals.
  • Distributed Systems: Working knowledge and operational experience with distributed messaging queues, specifically Kafka.

Nice to haves

  • Experience with containerization technologies like Docker and Kubernetes (EKS).
  • Familiarity with logging and monitoring tools (e.g., Prometheus, Grafana, ELK stack).
  • Knowledge of networking (TCP/IP, Load Balancing, DNS).
  • Previous experience in a 24/7 high-availability production environment.

#ZEOLife at Zuora

As an industry pioneer, our work is constantly evolving and challenging us in new ways that require us to think differently, iterate often and learn constantly—it’s exciting. Our people, whom we refer to as “ZEOs" are empowered to take on a mindset of ownership and make a bigger impact here. Our teams collaborate deeply, exchange different ideas openly and together we’re making what’s next possible for our customers, community and the world.

As part of our commitment to building an inclusive, high-performance culture where ZEOs feel inspired, connected and valued, we support ZEOs with:

  • Competitive compensation, variable bonus and performance reward opportunities, and retirement programs
  • Medical, dental and vision insurance
  • Generous, flexible time off
  • Paid holidays, “wellness” days and company wide end of year break
  • 6 months fully paid parental leave
  • Learning & Development stipend
  • Opportunities to volunteer and give back, including charitable donation match
  • Free resources and support for your mental wellbeing

Specific benefits offerings may vary by country and can be viewed in more detail during your interview process.

Location & Work Arrangements

Organizations and teams at Zuora are empowered to design efficient and flexible ways of working, being intentional about scheduling, communication, and collaboration strategies that help us achieve our best results. In our dynamic, globally distributed company, this means balancing flexibility and responsibility — flexibility to live our lives to the fullest, and responsibility to each other, to our customers, and to our shareholders. For most roles, we offer the flexibility to work both remotely and at Zuora offices.

Our Commitment to an Inclusive Workplace

Think, be and do you! At Zuora, different perspectives, experiences and contributions matter. Everyone counts. Zuora is proud to be an Equal Opportunity Employer committed to creating an inclusive environment for all.

Zuora does not discriminate on the basis of, and considers individuals seeking employment with Zuora without regards to, race, religion, color, national origin, sex (including pregnancy, childbirth, reproductive health decisions, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, genetic information, political views or activity, or other applicable legally protected characteristics.

We encourage candidates from all backgrounds to apply. Applicants in need of special assistance or accommodation during the interview process or in accessing our website may contact us by sending an email to [email protected].

Top Skills

Activemq
Ansible
AWS
Debezium
Docker
Gitops
Grafana
Jenkins
Kafka
Kubernetes
Linux Administration
Load Balancers
MySQL
Open Telemetry
Oracle
Prometheus
Puppet
Python
Redis
Terraform
Tomcat
Waf
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Redwood City, CA
1,500 Employees
Year Founded: 2007

What We Do

At Zuora, we do Modern Business. We’re helping people subscribe to new ways of doing business that are better for customers, companies and ultimately the planet. It’s an approach resulting from the shift to the Subscription Economy that puts customers first (building ongoing relationships instead of one-time product sales) and focuses on sustainable growth. Through our leading expertise and multi-product suite, we are transforming all industries and working with the world’s most innovative companies to monetize new business models, nurture subscriber relationships and optimize their digital experiences.

Why Work With Us

As an industry pioneer, our work is constantly evolving and challenging us in new ways that require us to think differently, iterate often and learn constantly. Our people, whom we call “ZEOs" are empowered to take on a mindset of ownership and work together in collaboration to make what’s next possible for our customers, community and the world.

Gallery

Gallery

Similar Jobs

Zuora Logo Zuora

Site Reliability Engineer

Fintech • Internet of Things • Payments • Software
Remote
Costa Rica
1500 Employees
In-Office or Remote
11 Locations
75 Employees

ServiceNow Logo ServiceNow

Sr.Program Manager

Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Remote or Hybrid
Heredia, San Francisco, CRI
28000 Employees

Cargill Logo Cargill

Software Engineer

Food • Greentech • Logistics • Sharing Economy • Transportation • Agriculture • Industrial
Remote
Costa Rica
155000 Employees

Similar Companies Hiring

Scotch Thumbnail
Software • Retail • Payments • Fintech • eCommerce • Artificial Intelligence • Analytics
US
25 Employees
Milestone Systems Thumbnail
Software • Security • Other • Big Data Analytics • Artificial Intelligence • Analytics
Lake Oswego, OR
1500 Employees
Fairly Even Thumbnail
Software • Sales • Robotics • Other • Hospitality • Hardware
New York, NY

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account