Top Site Reliability Engineer Jobs
As a Lead Site Reliability Engineer, you will provide technical leadership, design and build the Embedded SRE program, mentor engineers, and be proficient in various technologies.
Lead Site Reliability Engineer at JPMorgan Chase within Corporate Technology Team, responsible for championing site reliability practices, leading initiatives to improve reliability, and demonstrating technical expertise across multiple domains.
Lead Site Reliability Engineering focusing on system operations as a software engineering problem to ensure uninterrupted service for customers. Responsibilities include designing, developing, and implementing highly available and scalable systems using various technologies like Python, AWS, Django, Kubernetes, Bash, Terraform, MySQL, Redis, Cassandra, Postgresql. Collaborate with teams, perform on-call duties, quantitative analysis, and advocate for best practices.
As a Site Reliability Engineer, you will contribute to the development and maintenance of iManage's SaaS platform. You will work with a global SRE team to ensure the scalability and reliability of the platform, drive innovation and platform evolution, and adhere to security best practices. Responsibilities include participating in agile sprints, scaling cloud infrastructure, writing automation and monitoring tools, conducting incident management, and collaborating cross-functionally.
The Site Reliability Engineer will be responsible for building out internal telemetry systems and maintaining infrastructure stability. They will play a pivotal role in bringing infrastructure online across multiple data centers.
As a Principal Site Reliability Engineer (SRE) at Discover, you will focus on improving reliability and performance issues by developing and running SRE tooling and observability through automation. You will also work on CI/CD projects, enhance data monitoring, and collaborate with internal product groups to define the SRE practice within the Fraud value stream.
As a Site Reliability Engineer III at JPMorgan Chase within the Infrastructure Platforms Engineering team, you will solve complex and broad business problems with simple and straightforward solutions. Responsibilities include collaborating on deployment approaches, designing and implementing solutions, and implementing infrastructure and network as code.
Lead site reliability engineering efforts to improve anomaly detection, platform stability, and resilience. Implement scalable, reliable, secure SRE and observability platforms. Collaborate with engineering teams to achieve reliability and scalability goals. Participate in on-call engineering duty for production support.
Featured Jobs
As a Site Reliability Engineer III at JPMorgan Chase, you will lead initiatives to improve the reliability and stability of web Hosting platforms, utilizing your expertise in site reliability practices and principles. You will collaborate with team members to set service level objectives, resolve complex problems, and share knowledge within the organization. Required qualifications include formal training in site reliability engineering concepts, 3+ years of experience, AWS exposure, Terraform experience, and fluency in programming languages such as Python and Java.
The Site Reliability Engineer will support the full stack of software applications and architecture, collaborate with a team to develop automation tools, and enhance monitoring solutions. Responsibilities include building tools and automation, designing user interfaces, maintaining high availability, writing code, and optimizing systems. The role requires a Bachelor's degree in Computer Science or related field, 3+ years of coding experience, proficiency in .NET languages, familiarity with ASP.NET, and experience with monitoring platforms and CI/CD pipelines.
Lead Site Reliability Engineer responsible for improving anomaly detection, platform stability, and incident response. Implement scalable and secure SRE and Observability platform. Collaborate with engineering teams to enhance reliability and scalability. Monitor service health and participate in on-call duty.
As a Senior/Staff Site Reliability Engineer, you will be responsible for building fast, highly available infrastructure at scale. You will contribute to the architecture and design of new and current systems, with a focus on reliability and scaling. This role requires a Bachelor's Degree in Computer Science or related field, along with 5+ years of professional SRE experience. Strong programming skills in Python, Go, or similar languages, as well as experience with modern infrastructure tools and CI/CD practices, are essential. Excellent communication skills and a deep understanding of information security best practices are also required.
Join Celonis as a Site Reliability Engineer to build and operate resilient, reliable, and scalable systems, ensuring product health and peak performance. Take ownership of complex issues, provide technical leadership on reliability, and drive continuous improvement.
Lead Site Reliability Engineer responsible for designing and developing systems and processes to enable highly available, scalable, and secure systems at Klaviyo. Responsibilities include championing best practice security initiatives, improving automation around vulnerability detection, participating in on-call duties, and collaborating with product-facing engineers and SREs.
As a Site Reliability Engineer, you will help solve the unique challenges of blockchain oracle architecture and be responsible for the off-chain part of the Chainlink ecosystem.
Site Reliability Engineer II responsible for service stability and reliability through defining monitoring strategies, capacity planning, production readiness, and collaboration with product teams. Must have a Bachelor's degree and 3 years of experience, with expertise in orchestration tools, system configuration, building highly available applications, virtualization, automation, and Linux System Administration.
The Site Reliability Engineer will work on developing and integrating observability platforms to provide insights into system performance. Responsibilities include improving performance, security, and scalability of observability services, developing dashboards and alerts, and analyzing gathered data to gain meaningful insights.
Lead Site Reliability Engineer at JPMorgan Chase within the Enterprise technology, network operations team. Responsibilities include championing site reliability practices, improving application stability, technical leadership, incident management, and knowledge sharing within the organization.
The Site Reliability Engineer at ServiceNow is responsible for maintaining and developing the reliability, scalability, and performance of the infrastructure. They combine expertise in software development, networking, and systems engineering to drive technical resolutions and improve operability.
The Site Reliability Engineer at ServiceNow is responsible for maintaining and developing the reliability, scalability, and performance of the infrastructure. They combine software development, networking, and systems engineering expertise to improve platform operability and reduce incidents.
The Site Reliability Engineer will be responsible for managing and maintaining the systems powering the Direct-to-Consumer platforms, with a focus on automation, databases, testing, observability, and resiliency.
Join IonQ as a Staff Site Reliability Engineer to help build the world's largest quantum computing platform. Responsibilities include creating, supporting, and managing infrastructure, maintaining monitoring systems, and mentoring junior engineers. Requires BS in Computer Science, 5+ years of site reliability engineering experience, and expertise in Kubernetes and virtualized environments.
Join ServiceNow as a Site Reliability Engineer for their Federal SRE Team providing 24x7 production support for Government Community Cloud infrastructure. Responsibilities include driving technical resolutions and improving platform operability. Requires expertise in DevOps, Automation and Scripting, Linux systems, software development, Observability and Monitoring, and Cloud technologies.
The Principal Site Reliability Engineer will focus on innovating and providing strong technical vision for the datastores team. They will collaborate with other teams to design and build reliable, scalable, and highly available datastores on a multi-region scale platform.
The Site Reliability Engineer at ServiceNow is responsible for maintaining and developing the reliability, scalability, and performance of the ServiceNow infrastructure. The role involves a combination of software development, networking, and systems engineering to improve services for customers.
Top Companies Hiring Site Reliability Engineers
See AllAll Filters
No Results
No Results