Get the job you really want.

Top Site Reliability Engineer Jobs

7 Hours Ago
Boston, MA, USA
Hybrid
2,000 Employees
192K-288K Annually
Expert/Leader
2,000 Employees
192K-288K Annually
Expert/Leader
Consumer Web • eCommerce • Marketing Tech • Retail • Software • Analytics • Generative AI
As a Lead Site Reliability Engineer at Klaviyo, you will oversee foundational Klaviyo services and drive productivity in product engineering teams. Responsibilities include designing scalable systems, eliminating bottlenecks, ensuring high availability, and collaborating with product-facing engineers. You will participate in on-call duties, conduct quantitative analyses, and promote best practices in Site Reliability Engineering.
Top Benefits:
401-K
401-K Matching
Adoption Assistance
+65 More
7 Hours Ago
Boston, MA, USA
Hybrid
2,000 Employees
192K-288K Annually
Senior level
2,000 Employees
192K-288K Annually
Senior level
Consumer Web • eCommerce • Marketing Tech • Retail • Software • Analytics • Generative AI
As a Lead Site Reliability Engineer, you will ensure uninterrupted service while enhancing the productivity of product teams. Key responsibilities include designing scalable systems, developing foundational services, identifying bottlenecks, collaborating with teams, and advocating best practices. You will also engage in root cause analysis during outages and implement architectural improvements.
Top Benefits:
401-K
401-K Matching
Adoption Assistance
+65 More
11 Hours Ago
Fort Worth, TX, USA
Hybrid
289,097 Employees
Senior level
289,097 Employees
Senior level
Financial Services
The Senior Lead Site Reliability Engineer at JPMorgan Chase will define non-functional requirements, ensure those are implemented, mentor engineers, and contribute to the site reliability community by evolving and debugging components of applications.
Top Benefits:
401-K
401-K Matching
Commuter Benefits
+19 More
Yesterday
Plano, TX, USA
Hybrid
55,000 Employees
Senior level
55,000 Employees
Senior level
Fintech • Machine Learning • Payments • Software • Financial Services
The Lead Platform Engineer will work closely with product owners to understand application capabilities and testing scenarios. They will improve software engineering practices, lead a team in creating scalable and resilient solutions, and stay updated with emerging technologies. Key responsibilities include using DevOps tools, mentoring peers, and driving technology transformation within the company.
Top Benefits:
401-K
401-K Matching
Adoption Assistance
+52 More
11 Hours Ago
Fort Worth, TX, USA
Hybrid
289,097 Employees
Senior level
289,097 Employees
Senior level
Financial Services
As a Lead Site Reliability Engineer at JPMorgan Chase, you'll define and ensure non-functional requirements and availability targets for services. Responsibilities include designing and implementing observability for systems, mentoring engineers, debugging applications, and contributing to the site reliability community.
Top Benefits:
401-K
401-K Matching
Commuter Benefits
+19 More
Yesterday
New York, NY, USA
892 Employees
Senior level
892 Employees
Senior level
Fintech • Information Technology • Financial Services
As a Lead Site Reliability Engineer, you will guide a team in implementing SRE best practices within a next-gen trading platform. Your role includes automating manual tasks, integrating observability products, providing technical leadership on containerization and DevOps, conducting incident analytics, developing monitoring metrics, and fostering an agile development culture. Collaborating across teams to promote the SRE culture and mentoring others in the engineering community are also key responsibilities.
Top Benefits:
401-K
401-K Matching
Child Care Benefits
+59 More
Yesterday
San Francisco, CA, USA
Remote
51 Employees
140K-180K Annually
Senior level
51 Employees
140K-180K Annually
Senior level
Artificial Intelligence • Cloud • Hardware • Machine Learning • Other • Software • Infrastructure as a Service (IaaS)
As a Site Reliability Engineer, you will design, build, and maintain Voltage Park's core infrastructure, focusing on bare metal provisioning, telemetry, storage, and container orchestration, collaborating across teams to support internal and customer use cases, while participating in SRE on-call rotations.
Top Benefits:
401-K
401-K Matching
Company Equity
+10 More
2 Days Ago
Jersey City, NJ, USA
Hybrid
289,097 Employees
Mid level
289,097 Employees
Mid level
Financial Services
As a Site Reliability Engineer III, you'll solve complex business problems using code and cloud infrastructure. You'll maintain and optimize applications, contribute to team knowledge on operations, and implement best practices in site reliability engineering. Responsibilities include collaborating with engineers on deployment approaches and ensuring application availability.
Top Benefits:
401-K
401-K Matching
Commuter Benefits
+19 More

Featured Jobs

2 Days Ago
3 Locations
1,100 Employees
Senior level
1,100 Employees
Senior level
Cloud • Software
As a Principal Site Reliability Engineer, you will drive operational excellence for the platform's mission critical datastores, ensuring their reliability, availability, and performance. This role involves innovating solutions, collaborating across teams, and mentoring engineers. Responsibilities include designing scalable systems, writing high-quality code, and utilizing cloud-managed services and IaC tools.
Top Benefits:
401-K
401-K Matching
Adoption Assistance
+52 More
Yesterday
Chicago, IL, USA
Hybrid
1,782 Employees
Junior
1,782 Employees
Junior
Fintech • Information Technology • Machine Learning • Software • Analytics • Financial Services
The Site Reliability Engineer (SRE) will enhance the reliability of our consumer business by implementing automated tools and resolving complex technical problems. The role involves collaboration with IT and software engineering teams, monitoring service levels, and promoting best practices for reliability and efficiency.
Top Benefits:
401-K
401-K Matching
Child Care Benefits
+66 More
2 Days Ago
Jersey City, NJ, USA
Hybrid
289,097 Employees
Junior
289,097 Employees
Junior
Financial Services
As a Site Reliability Engineer II, you will ensure system reliability by executing projects independently while collaborating with teams. Responsibilities include coding to solve business problems, resolving incidents, and enhancing monitoring and alerting systems. Familiarity with cloud infrastructure and observability tools is crucial for the role.
Top Benefits:
401-K
401-K Matching
Commuter Benefits
+19 More
2 Days Ago
2 Locations
Hybrid
200 Employees
Senior level
200 Employees
Senior level
Blockchain • Information Technology • Software • Cryptocurrency • Web3
As a Site Reliability Engineer, you will enhance developer productivity and ensure product reliability by designing and improving infrastructure. Responsibilities include setting reliability standards, managing production infrastructure, mentoring teams, and implementing best practices for coding and deployment.
Top Benefits:
401-K
Commuter Benefits
Company Equity
+28 More
3 Days Ago
New York, NY, USA
4,000 Employees
Entry level
4,000 Employees
Entry level
Information Technology • Software • Financial Services • Big Data Analytics
Site Reliability Engineers at Citadel are responsible for enhancing system reliability, availability, and performance while automating tasks and resolving complex issues. They collaborate with teams to implement efficient engineering solutions and promote best practices within the organization.
Top Benefits:
401-K
401-K Matching
Adoption Assistance
+44 More
7 Hours Ago
San Francisco, CA, USA
Hybrid
450 Employees
180K-225K Annually
Senior level
450 Employees
180K-225K Annually
Senior level
Cloud • Greentech • Other • Energy
As a Senior Site Reliability Engineer at Crusoe, you will ensure the reliability and performance of the AI-first cloud infrastructure. Your role involves analyzing system performance, automating processes, advising teams on resilient code, and engaging in root cause analysis to improve service levels while focusing on customer satisfaction.
Top Benefits:
401-K
401-K Matching
Commuter Benefits
+34 More
3 Days Ago
San Francisco, CA, USA
Remote
11,000 Employees
Junior
11,000 Employees
Junior
Cloud • Information Technology • Productivity • Security • Software • App development • Automation
The Site Reliability Engineer will scale cloud services, manage caching infrastructure, and improve service reliability and performance. Responsibilities include building monitoring into the code, defining alerts, and automating tasks. Programming expertise, particularly in backend languages, and strong communication skills are essential as the role involves collaborating with both technical and non-technical audiences.
Top Benefits:
401-K
401-K Matching
Adoption Assistance
+69 More
11 Hours Ago
Plano, TX, USA
Hybrid
55,000 Employees
256K-292K Annually
Senior level
55,000 Employees
256K-292K Annually
Senior level
Fintech • Machine Learning • Payments • Software • Financial Services
As a Distinguished Engineer, you will lead technical contributions for Capital One's Loyalty platform, focusing on building resilient features, driving engineering excellence, mentoring, and optimizing technology solutions. You will address complex problems and promote a culture of innovation while ensuring high performance and scalability of systems.
Top Benefits:
401-K
401-K Matching
Adoption Assistance
+52 More
11 Hours Ago
Plano, TX, USA
Hybrid
55,000 Employees
Mid level
55,000 Employees
Mid level
Fintech • Machine Learning • Payments • Software • Financial Services
As a Senior Software Engineer in DevOps, you'll collaborate with Agile teams to design and implement technical solutions, enhancing the reliability and observability of the Loyalty Platform while driving powerful cloud-based experiences. You'll stay updated with tech trends, mentor peers, and leverage various programming languages and tools, contributing to significant transformations at Capital One.
Top Benefits:
401-K
401-K Matching
Adoption Assistance
+52 More
2 Days Ago
United States
Remote
3,000 Employees
95K-153K Annually
Entry level
3,000 Employees
95K-153K Annually
Entry level
Hardware • Information Technology • Security • Software • Cybersecurity • Conversational AI
The Site Reliability Engineer will maintain reliability, performance, and scalability of production systems, collaborating with security, engineering, and operations teams to ensure service availability. Responsibilities include implementing monitoring systems, managing infrastructures, participating in on-call rotations, and encouraging automation.
Top Benefits:
401-K
401-K Matching
Adoption Assistance
+87 More
3 Days Ago
Tampa, FL, USA
73 Employees
Mid level
73 Employees
Mid level
Artificial Intelligence • Big Data • Computer Vision • Machine Learning • Analytics • Defense
As a Site Reliability Engineer, you will be responsible for maintaining Striveworks' software deployments on-premises and in cloud environments. Your role includes automation of infrastructure-as-code, working on software deployments, incident response, and engagement with customers in both cloud and air-gapped environments.
Top Benefits:
401-K
Company Equity
Company Outings
+24 More
Yesterday
Plano, TX, USA
Hybrid
55,000 Employees
Senior level
55,000 Employees
Senior level
Fintech • Machine Learning • Payments • Software • Financial Services
The Senior Lead Software Engineer, DevOps/SRE will lead initiatives to improve developer productivity by leveraging AI tools, manage the development of cloud-native CI/CD pipelines, and enhance the resilience and observability of cloud infrastructure. They will mentor talent and drive strategy to optimize software delivery processes.
Top Benefits:
401-K
401-K Matching
Adoption Assistance
+52 More
Yesterday
New York, NY, USA
1,000 Employees
Senior level
1,000 Employees
Senior level
Artificial Intelligence • Fintech • Other • Automation
The Senior IT Site Reliability Engineer will manage and automate technical infrastructure, ensuring system availability and reliability, while developing monitoring solutions and documenting best practices. Ideal candidates will have strong skills in Linux, Python, and IaC technologies, as well as project management capabilities.
Top Benefits:
401-K
401-K Matching
Adoption Assistance
+32 More
Yesterday
Costa Mesa, CA, USA
1,400 Employees
124K-186K Annually
Senior level
1,400 Employees
124K-186K Annually
Senior level
Aerospace • Artificial Intelligence • Hardware • Robotics • Security • Software • Defense
As a Site Reliability Engineer at Anduril Industries, you will build and deliver solutions to support deployment engineers, collaborate on integration strategies, and enhance operational capabilities through analysis and tooling. Your role involves ensuring scalable system delivery and leading projects that directly impact warfighter capabilities.
Top Benefits:
401-K
Adoption Assistance
Child Care Benefits
+56 More
Yesterday
Richmond, VA, USA
Hybrid
55,000 Employees
Senior level
55,000 Employees
Senior level
Fintech • Machine Learning • Payments • Software • Financial Services
The Senior Lead Software Engineer, DevOps/SRE will enhance developer productivity by improving CI/CD processes and mentor internal talent. The role involves leading critical IT initiatives, collaborating on GenAI applications, and utilizing cloud technologies to create resilient infrastructure.
Top Benefits:
401-K
401-K Matching
Adoption Assistance
+52 More
Yesterday
McLean, VA, USA
Hybrid
55,000 Employees
Senior level
55,000 Employees
Senior level
Fintech • Machine Learning • Payments • Software • Financial Services
The Senior Lead Software Engineer, DevOps/SRE will drive strategy to enhance developer productivity, improve CI/CD processes, and mentor engineering talent. Responsibilities include leveraging AWS for Infrastructure, troubleshooting CI/CD failures, and creating self-service Developer tools based on GenAI applications.
Top Benefits:
401-K
401-K Matching
Adoption Assistance
+52 More
4 Days Ago
Wilmington, DE, USA
Hybrid
289,097 Employees
Mid level
289,097 Employees
Mid level
Financial Services
As a Site Reliability Engineer III, you will optimize and maintain applications and infrastructure, collaborate to design automated deployment approaches, and ensure application reliability and scalability. You will guide peers, resolve complex issues, and adopt best site reliability practices within your team.
Top Benefits:
401-K
401-K Matching
Commuter Benefits
+19 More
All Filters
Date Posted
Job Category
Experience
Industry
Company Name
Company Size