Job Title, Company or Keyword

Maximum of 25 job preferences reached.

Top Site Reliability Engineer Jobs

Mastercard

Lead Site Reliability Engineer

Reposted 6 Days AgoSaved

Hybrid

O'Fallon, MO, USA

Mid level

Blockchain • Fintech • Payments • Consulting • Cryptocurrency • Cybersecurity • Quantum Computing

The Lead Site Reliability Engineer will ensure reliability, scalability, and performance of Mastercard's applications, enhancing operational practices and developer collaboration in a proactive environment.

Top Skills: Ci/CdDevOpsGoJavaPythonSpring Framework

Zscaler

Sr. Staff Site Reliability Engineer-Federal, Security Clearance

Reposted 6 Days AgoSaved

Easy Apply

Remote or Hybrid

Crystal City, VA, USA

Easy Apply

140K-200K Annually

Senior level

140K-200K Annually

Senior level

Cloud • Information Technology • Security • Software • Cybersecurity

Responsible for managing operations within classified environments, overseeing cloud infrastructure, automating tasks, and ensuring system stability in a high-security setting.

Top Skills: AnsibleAws EcsKubernetesLinuxPythonTerraform

NBCUniversal

Staff Software Engineer (SAP BTP SRE Lead)

Reposted 6 Days AgoSaved

Remote or Hybrid

New York, NY, USA

130K-170K Annually

Senior level

130K-170K Annually

Senior level

AdTech • Cloud • Digital Media • Information Technology • News + Entertainment • App development

Oversee operational support of SAP BTP CPI applications, manage incidents, lead support specialists, and collaborate on architecture and governance for finance processes.

Top Skills: Abap ProxiesAemCapmCloud ConnectorCloud FoundryEdge Integration CellIdocJSONMessage QueuesOauthOdataRestSAMLSap BtpSfapiSftpSoapXML

Waystar

Site Reliability Engineer II

Reposted 6 Days AgoSaved

In-Office

3 Locations

Senior level

Healthtech • Payments • Software

The Senior Site Reliability Engineer II manages infrastructure for Waystar products, enhancing system reliability, observability, and performance while collaborating with engineering teams and mentoring juniors.

Top Skills: Apache AirflowAWSAzureCloudFormationGCPGrafanaKafkaKubernetesPowershellPrometheusPythonSparkSplunkTerraform

NBCUniversal

Site Reliability Engineer

7 Days AgoSaved

Remote or Hybrid

Centennial, CO, USA

110K-145K Annually

Mid level

110K-145K Annually

Mid level

AdTech • Cloud • Digital Media • Information Technology • News + Entertainment • App development

Build and maintain automation and reliability for live video distribution across on-prem and cloud. Deploy and manage systems, develop monitoring and automated recovery, troubleshoot complex incidents, coordinate with vendors, document SOPs, support live broadcast components, and participate in L2 on-call rotation.

Top Skills: AacAc3AnsibleAtscAvcAWSBashChefCloudFormationCmafDockerEksGitHevcHlsJavaScriptJSONKubernetesLinuxMicrosoft Graph ApiMpeg Transport StreamsPythonRistScte104Scte224Scte35SrtSsaiSt2022-7St2110StatmuxTerraformUnixXMLYmlZixi

Palantir Technologies

Site Reliability Engineer - US Government

Reposted 7 Days AgoSaved

Hybrid

Washington, DC, USA

125K-185K Annually

Mid level

125K-185K Annually

Mid level

Artificial Intelligence • Software

The Site Reliability Engineer will maintain high-performance cloud and on-premises services, automate tasks, troubleshoot production issues, and collaborate with product teams.

Top Skills: AWSAzureBashDockerGCPGoJavaJavaScriptKubernetesLinuxOpenshiftPodmanPrometheusPython

ServiceNow

Sr Staff Site Reliability Engineer - Veza

Reposted 7 Days AgoSaved

Remote or Hybrid

Santa Clara, CA, USA

166K-290K Annually

Senior level

166K-290K Annually

Senior level

Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation

The Sr Staff Site Reliability Engineer will lead infrastructure projects, design scalable solutions, and collaborate across teams while providing technical support and mentorship.

Top Skills: AWSBashDatadogGitopsGoGrafanaHelmKubernetesLinuxPrometheusPythonTerraform

Superhuman

Site Reliability Engineer

Reposted 7 Days AgoSaved

Hybrid

San Francisco, CA, USA

214K-260K Annually

Senior level

214K-260K Annually

Senior level

Artificial Intelligence • Information Technology • Machine Learning • Natural Language Processing • Productivity • Software • Generative AI

The SRE will ensure the reliability of backend systems, scale Kubernetes-based control planes, and improve automation mechanisms while managing incident processes.

Top Skills: AWSAzureDockerGCPJavaKubernetesLinuxTerraform

New York Life Insurance Company

Senior Associate - Application Remediation SRE

8 Days AgoSaved

Hybrid

New York, NY, USA

112K-159K Annually

Senior level

112K-159K Annually

Senior level

Artificial Intelligence • Cloud • Fintech • Information Technology • Insurance • Financial Services • Big Data Analytics

Lead application readiness and remediation coordination for AWS, EOL, and vulnerability patches. Validate impacts, define smoke and regression tests, drive automation, resolve dependencies, escalate blockers, and secure production sign-off to ensure audit-ready closure.

Top Skills: AmiApi TestingAWSCertificatesCi/Cd PipelinesContainerizationDastDatabasesDockerEc2EksLibrariesMiddlewareNew RelicNew Relic MonitorsObservability ToolingRegression AutomationRuntimesSastScaService DashboardsSmoke TestingSyntheticsTerraform

New York Life Insurance Company

Senior Associate - Endpoint Patching SRE

8 Days AgoSaved

Hybrid

New York, NY, USA

112K-159K Annually

Senior level

112K-159K Annually

Senior level

Artificial Intelligence • Cloud • Fintech • Information Technology • Insurance • Financial Services • Big Data Analytics

Own and operate endpoint patch deployment and remediation for workstations and user devices: manage pilot rings and rollback groups, monitor patch success and endpoint health, validate post-patch functionality, coordinate incident triage with cross-functional teams, and improve automation, reporting, and evidence capture for vulnerability remediation.

Top Skills: DashboardingEdrEndpoint Management PlatformsItsmMecmMicrosoft IntunePowershellSccmTaniumVpn

Mastercard

Vice President, Site Reliability Engineer

Reposted 8 Days AgoSaved

Hybrid

O'Fallon, MO, USA

200K-330K Annually

Senior level

200K-330K Annually

Senior level

Blockchain • Fintech • Payments • Consulting • Cryptocurrency • Cybersecurity • Quantum Computing

As Vice President of BizOps, you will lead a team to ensure operational excellence, focusing on automation, platform availability, and risk management while collaborating with various stakeholders.

Top Skills: Application AutomationCapacity PlanningCi/CdDevOpsDistributed SystemsItsmOperational DesignSite Reliability Engineering

Mastercard

Site Reliability Engineer II

Reposted 8 Days AgoSaved

Hybrid

O'Fallon, MO, USA

76K-127K Annually

Mid level

76K-127K Annually

Mid level

Blockchain • Fintech • Payments • Consulting • Cryptocurrency • Cybersecurity • Quantum Computing

The BizOps Engineer II manages the Crypto Services platform, ensuring production readiness, risk management, and collaboration across teams, focusing on automation and operational excellence.

Top Skills: ArtifactoryAWSAzureBitbucketCC++ChefCi/CdGitGoGCPJavaJenkinsMavenPerlPythonRuby

New

Cut your apply time in half.

Use ourAI Assistantto automatically fill your job applications.

Use For Free

Mastercard

Site Reliability Engineer II

Reposted 8 Days AgoSaved

Hybrid

O'Fallon, MO, USA

76K-127K Annually

Mid level

76K-127K Annually

Mid level

Blockchain • Fintech • Payments • Consulting • Cryptocurrency • Cybersecurity • Quantum Computing

The BizOps Engineer II is responsible for enhancing system reliability, automating workflows, and ensuring operational excellence across technology services at Mastercard, including application monitoring and CI/CD processes.

Top Skills: ArtifactoryBitbucketChefGitJavaJenkinsLinuxMainframeMaven

Mastercard

Director, Site Reliability Engineer

Reposted 8 Days AgoSaved

Hybrid

O'Fallon, MO, USA

152K-258K Annually

Senior level

152K-258K Annually

Senior level

Blockchain • Fintech • Payments • Consulting • Cryptocurrency • Cybersecurity • Quantum Computing

The Director of BizOps leads efforts to ensure the reliability and performance of Mastercard applications, fostering developer ownership and operational standards, while managing risks and aligning product priorities with operational needs.

Top Skills: AWSAzureDatadogDynatraceGCPGoGrafanaJavaPrometheusPythonSplunkSpring Framework

Mastercard

Lead Site Reliability Engineer

Reposted 8 Days AgoSaved

Hybrid

O'Fallon, MO, USA

122K-207K Annually

Senior level

122K-207K Annually

Senior level

Blockchain • Fintech • Payments • Consulting • Cryptocurrency • Cybersecurity • Quantum Computing

As a Lead Site Reliability Engineer, you will drive SRE practices, influence system architecture, and ensure operational excellence for critical platforms, emphasizing automation and risk mitigation.

Top Skills: AnsibleBashChefDynatraceGitGoJenkinsPythonSplunk

Sprinter Health

Senior, Site Reliability Engineer (SRE)

Reposted 8 Days AgoSaved

Remote or Hybrid

2 Locations

160K-235K Annually

Senior level

160K-235K Annually

Senior level

Artificial Intelligence • Healthtech • Logistics • Social Impact • Software • Telehealth

The Senior Site Reliability Engineer will enhance the reliability and security of infrastructure for in-home healthcare services, using cloud technology and automation to improve systems and processes.

Top Skills: AWSBashGCPPythonTerraformTypescript

Attain

Sr/Staff Site Reliability Engineer, Consumer Apps

Reposted 8 Days AgoSaved

Easy Apply

In-Office

2 Locations

Easy Apply

Mid level

AdTech

As a Site Reliability Engineer, you'll maintain the infrastructure for systems, ensure efficiency, automate processes, monitor databases, and participate in architecture discussions.

Top Skills: Amazon KinesisAws LambdaAws SnsBigQueryDockerGcp (Google Cloud Platform)GitlabGoogle Cloud FunctionsGoogle Cloud RunGoogle Pub/SubGrafanaIstioKafkaKubernetesMySQLPrometheusSpannerSQLTerraform

Caterpillar

Engineering Manager, IAM Platform (Ops, SRE & AI Enablement)

Reposted 8 Days AgoSaved

Hybrid

Irving, TX, USA

148K-222K Annually

Senior level

148K-222K Annually

Senior level

Artificial Intelligence • Cloud • Internet of Things • Software • Cybersecurity • Industrial

The Engineering Manager leads the IAM Platform team, focusing on operational leadership, SRE implementation, automation, AI integration, and team development to ensure stability and innovation in identity management.

Top Skills: AIAutomationGoIamPythonSre

Optum

Site Reliability Engineer - Remote

9 Days AgoSaved

In-Office or Remote

Eden Prairie, MN, USA

73K-130K Annually

Mid level

73K-130K Annually

Mid level

Artificial Intelligence • Big Data • Healthtech • Information Technology • Machine Learning • Software • Analytics

Architect, build, and operate AWS commercial and government cloud infrastructure and platform services. Implement IaC, Kubernetes (EKS/AKS) management, observability, automation, incident response, and compliance (FedRAMP/NIST). Participate in on-call rotations and support production resiliency, performance, and security.

Top Skills: AksArgocdAws VpcAzure DevopsCloudFormationCloudwatchDynatraceEc2EcsEksEksElbFluxGitGitlabGrafanaHelmKmsKubernetesLambdaOwaspPkiPrometheusRdsRedshiftRestful ServicesRoute53S3SplunkTerraformVpc Flow Logs

NBCUniversal

Staff Site Reliability Engineer (Collaboration Engineering)

Reposted 9 Days AgoSaved

Remote or Hybrid

Orlando, FL, USA

Expert/Leader

AdTech • Cloud • Digital Media • Information Technology • News + Entertainment • App development

The Staff Site Reliability Engineer is responsible for ensuring the reliability, performance, and security of workplace collaboration services, focusing on automation, incident management, and operational excellence while providing technical leadership and mentoring to engineers.

Top Skills: Ai EngineeringAzure Virtual DesktopDefender For Office 365Exchange OnlineGraph ApiIntuneJamf ProMicrosoft 365Microsoft Entra IdMicrosoft PurviewOnedrivePowershellSharepoint OnlineTeams

New York Life Insurance Company

Senior Associate - Infrastructure Patching SRE

10 Days AgoSaved

Hybrid

New York, NY, USA

112K-159K Annually

Senior level

112K-159K Annually

Senior level

Artificial Intelligence • Cloud • Fintech • Information Technology • Insurance • Financial Services • Big Data Analytics

Manage and execute infrastructure patching across servers, OS, middleware, databases, virtualization, and cloud. Coordinate patch waves, validate reboots and service health, handle failures and rollbacks, capture evidence, automate workflows, and partner with cross-functional teams to meet remediation SLAs and audit requirements.

Top Skills: AnsibleAWSAws Ec2Aws Systems Manager Patch ManagerCi/CdDatabaseEksHardened AmisLinuxMiddlewarePuppetQualys Patch ManagementSatelliteSccm/MecmTaniumTerraformVirtualizationWindows

Zscaler

Staff Site Reliability Engineer (Production Engineer)

Reposted 10 Days AgoSaved

Easy Apply

Remote or Hybrid

9 Locations

Easy Apply

119K-170K Annually

Senior level

119K-170K Annually

Senior level

Cloud • Information Technology • Security • Software • Cybersecurity

As a Staff Site Reliability Engineer, you'll oversee Zscaler production data center services, optimize code, and ensure cloud service availability and performance. Collaborate with cross-functional teams to improve processes and resolve escalated issues.

Top Skills: BashDnsFirewallsGrafanaHTTPIcmpLoad BalancingNagiosOsi ModelPrometheusPythonTcp/Ip

Citadel

Site Reliability Engineer

Reposted 10 Days AgoSaved

In-Office or Remote

4 Locations

105K-300K Annually

Entry level

105K-300K Annually

Entry level

Information Technology • Software • Financial Services • Big Data Analytics

SREs at Citadel focus on optimizing and maintaining system reliability, performance, and automation for investment applications, collaborating closely with teams.

Top Skills: Ci/CdCSSJavaScriptPythonReactSQL

Alloy

Lead Site Reliability Engineer

Reposted 10 Days AgoSaved

Easy Apply

Hybrid

New York City, NY, USA

Easy Apply

151K-191K Annually

Senior level

151K-191K Annually

Senior level

Fintech • Information Technology • Software • Financial Services

The role involves designing and automating infrastructure management, improving reliability, building internal tools, and contributing to architectural decisions. Responsibilities include working with Kubernetes and managing large-scale infrastructure, while participating in on-call rotations to prevent incidents.

Top Skills: CloudwatchDatadogDockerEfkElkGoJavaScriptKubernetesPythonTerraform

Sage

Senior/Staff Site Reliability Engineer

Reposted 10 Days AgoSaved

Hybrid

New York, NY, USA

175K-230K Annually

Senior level

175K-230K Annually

Senior level

Hardware • Healthtech • Software • Analytics

The Site Reliability Engineer will ensure high availability of Sage's platform, lead incident response, design reliable systems, and improve operational workflows.

Top Skills: Amazon Web ServicesDatadogGoGoogle Cloud PlatformGrafanaJavaKubernetesMySQLPostgresPrometheusPulumiPythonTerraform