Top Site Reliability Engineer Jobs

Reposted 6 Days Ago
Hybrid
O'Fallon, MO, USA
Mid level
Blockchain • Fintech • Payments • Consulting • Cryptocurrency • Cybersecurity • Quantum Computing
The Lead Site Reliability Engineer will ensure reliability, scalability, and performance of Mastercard's applications, enhancing operational practices and developer collaboration in a proactive environment.
Top Skills: CI/CD, DevOps, Go, Java, Python, Spring Framework
Reposted 6 Days Ago
Easy Apply
Remote or Hybrid
Crystal City, VA, USA
140K-200K Annually
Senior level
Cloud • Information Technology • Security • Software • Cybersecurity
Responsible for managing operations within classified environments, overseeing cloud infrastructure, automating tasks, and ensuring system stability in a high-security setting.
Top Skills: Ansible, AWS ECS, Kubernetes, Linux, Python, Terraform
Reposted 6 Days Ago
Remote or Hybrid
New York, NY, USA
130K-170K Annually
Senior level
AdTech • Cloud • Digital Media • Information Technology • News + Entertainment • App development
Oversee operational support of SAP BTP CPI applications, manage incidents, lead support specialists, and collaborate on architecture and governance for finance processes.
Top Skills: ABAP Proxies, AEM, CAPM, Cloud Connector, Cloud Foundry, Edge Integration Cell, IDoc, JSON, Message Queues, OAuth, OData, REST, SAML, SAP BTP, SFAPI, SFTP, SOAP, XML
Reposted 6 Days Ago
In-Office
3 Locations
Senior level
Healthtech • Payments • Software
The Senior Site Reliability Engineer II manages infrastructure for Waystar products, enhancing system reliability, observability, and performance while collaborating with engineering teams and mentoring juniors.
Top Skills: Apache Airflow, AWS, Azure, CloudFormation, GCP, Grafana, Kafka, Kubernetes, PowerShell, Prometheus, Python, Spark, Splunk, Terraform
7 Days Ago
Remote or Hybrid
Centennial, CO, USA
110K-145K Annually
Mid level
AdTech • Cloud • Digital Media • Information Technology • News + Entertainment • App development
Build and maintain automation and reliability for live video distribution across on-prem and cloud. Deploy and manage systems, develop monitoring and automated recovery, troubleshoot complex incidents, coordinate with vendors, document SOPs, support live broadcast components, and participate in L2 on-call rotation.
Top Skills: AAC, AC3, Ansible, ATSC, AVC, AWS, Bash, Chef, CloudFormation, CMAF, Docker, EKS, Git, HEVC, HLS, JavaScript, JSON, Kubernetes, Linux, Microsoft Graph API, MPEG Transport Streams, Python, RIST, SCTE104, SCTE224, SCTE35, SRT, SSAI, ST2022-7, ST2110, Statmux, Terraform, Unix, XML, YML, Zixi
Reposted 7 Days Ago
Hybrid
Washington, DC, USA
125K-185K Annually
Mid level
Artificial Intelligence • Software
The Site Reliability Engineer will maintain high-performance cloud and on-premises services, automate tasks, troubleshoot production issues, and collaborate with product teams.
Top Skills: AWS, Azure, Bash, Docker, GCP, Go, Java, JavaScript, Kubernetes, Linux, OpenShift, Podman, Prometheus, Python
Reposted 7 Days Ago
Remote or Hybrid
Santa Clara, CA, USA
166K-290K Annually
Senior level
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
The Sr Staff Site Reliability Engineer will lead infrastructure projects, design scalable solutions, and collaborate across teams while providing technical support and mentorship.
Top Skills: AWS, Bash, Datadog, GitOps, Go, Grafana, Helm, Kubernetes, Linux, Prometheus, Python, Terraform
Reposted 7 Days Ago
Hybrid
San Francisco, CA, USA
214K-260K Annually
Senior level
Artificial Intelligence • Information Technology • Machine Learning • Natural Language Processing • Productivity • Software • Generative AI
The SRE will ensure the reliability of backend systems, scale Kubernetes-based control planes, and improve automation mechanisms while managing incident processes.
Top Skills: AWS, Azure, Docker, GCP, Java, Kubernetes, Linux, Terraform
8 Days Ago
Hybrid
New York, NY, USA
112K-159K Annually
Senior level
Artificial Intelligence • Cloud • Fintech • Information Technology • Insurance • Financial Services • Big Data Analytics
Lead application readiness and remediation coordination for AWS, EOL, and vulnerability patches. Validate impacts, define smoke and regression tests, drive automation, resolve dependencies, escalate blockers, and secure production sign-off to ensure audit-ready closure.
Top Skills: AMI, API Testing, AWS, Certificates, CI/CD Pipelines, Containerization, DAST, Databases, Docker, EC2, EKS, Libraries, Middleware, New Relic, New Relic Monitors, Observability Tooling, Regression Automation, Runtimes, SAST, SCA, Service Dashboards, Smoke Testing, Synthetics, Terraform
8 Days Ago
Hybrid
New York, NY, USA
112K-159K Annually
Senior level
Artificial Intelligence • Cloud • Fintech • Information Technology • Insurance • Financial Services • Big Data Analytics
Own and operate endpoint patch deployment and remediation for workstations and user devices: manage pilot rings and rollback groups, monitor patch success and endpoint health, validate post-patch functionality, coordinate incident triage with cross-functional teams, and improve automation, reporting, and evidence capture for vulnerability remediation.
Top Skills: Dashboarding, EDR, Endpoint Management Platforms, ITSM, MECM, Microsoft Intune, PowerShell, SCCM, Tanium, VPN
Reposted 8 Days Ago
Hybrid
O'Fallon, MO, USA
200K-330K Annually
Senior level
Blockchain • Fintech • Payments • Consulting • Cryptocurrency • Cybersecurity • Quantum Computing
As Vice President of BizOps, you will lead a team to ensure operational excellence, focusing on automation, platform availability, and risk management while collaborating with various stakeholders.
Top Skills: Application Automation, Capacity Planning, CI/CD, DevOps, Distributed Systems, ITSM, Operational Design, Site Reliability Engineering
Reposted 8 Days Ago
Hybrid
O'Fallon, MO, USA
76K-127K Annually
Mid level
Blockchain • Fintech • Payments • Consulting • Cryptocurrency • Cybersecurity • Quantum Computing
The BizOps Engineer II manages the Crypto Services platform, ensuring production readiness, risk management, and collaboration across teams, focusing on automation and operational excellence.
Top Skills: Artifactory, AWS, Azure, Bitbucket, C, C++, Chef, CI/CD, Git, Go, GCP, Java, Jenkins, Maven, Perl, Python, Ruby
Reposted 8 Days Ago
Hybrid
O'Fallon, MO, USA
76K-127K Annually
Mid level
Blockchain • Fintech • Payments • Consulting • Cryptocurrency • Cybersecurity • Quantum Computing
The BizOps Engineer II is responsible for enhancing system reliability, automating workflows, and ensuring operational excellence across technology services at Mastercard, including application monitoring and CI/CD processes.
Top Skills: Artifactory, Bitbucket, Chef, Git, Java, Jenkins, Linux, Mainframe, Maven
Reposted 8 Days Ago
Hybrid
O'Fallon, MO, USA
152K-258K Annually
Senior level
Blockchain • Fintech • Payments • Consulting • Cryptocurrency • Cybersecurity • Quantum Computing
The Director of BizOps leads efforts to ensure the reliability and performance of Mastercard applications, fostering developer ownership and operational standards, while managing risks and aligning product priorities with operational needs.
Top Skills: AWS, Azure, Datadog, Dynatrace, GCP, Go, Grafana, Java, Prometheus, Python, Splunk, Spring Framework
Reposted 8 Days Ago
Hybrid
O'Fallon, MO, USA
122K-207K Annually
Senior level
Blockchain • Fintech • Payments • Consulting • Cryptocurrency • Cybersecurity • Quantum Computing
As a Lead Site Reliability Engineer, you will drive SRE practices, influence system architecture, and ensure operational excellence for critical platforms, emphasizing automation and risk mitigation.
Top Skills: Ansible, Bash, Chef, Dynatrace, Git, Go, Jenkins, Python, Splunk
Reposted 8 Days Ago
Remote or Hybrid
2 Locations
160K-235K Annually
Senior level
Artificial Intelligence • Healthtech • Logistics • Social Impact • Software • Telehealth
The Senior Site Reliability Engineer will enhance the reliability and security of infrastructure for in-home healthcare services, using cloud technology and automation to improve systems and processes.
Top Skills: AWS, Bash, GCP, Python, Terraform, TypeScript
Reposted 8 Days Ago
Easy Apply
In-Office
2 Locations
Mid level
AdTech
As a Site Reliability Engineer, you'll maintain the infrastructure for systems, ensure efficiency, automate processes, monitor databases, and participate in architecture discussions.
Top Skills: Amazon Kinesis, AWS Lambda, AWS SNS, BigQuery, Docker, GCP (Google Cloud Platform), GitLab, Google Cloud Functions, Google Cloud Run, Google Pub/Sub, Grafana, Istio, Kafka, Kubernetes, MySQL, Prometheus, Spanner, SQL, Terraform
Reposted 8 Days Ago
Hybrid
Irving, TX, USA
148K-222K Annually
Senior level
Artificial Intelligence • Cloud • Internet of Things • Software • Cybersecurity • Industrial
The Engineering Manager leads the IAM Platform team, focusing on operational leadership, SRE implementation, automation, AI integration, and team development to ensure stability and innovation in identity management.
Top Skills: AI, Automation, Go, IAM, Python, SRE
9 Days Ago
In-Office or Remote
Eden Prairie, MN, USA
73K-130K Annually
Mid level
Artificial Intelligence • Big Data • Healthtech • Information Technology • Machine Learning • Software • Analytics
Architect, build, and operate AWS commercial and government cloud infrastructure and platform services. Implement IaC, Kubernetes (EKS/AKS) management, observability, automation, incident response, and compliance (FedRAMP/NIST). Participate in on-call rotations and support production resiliency, performance, and security.
Top Skills: AKS, ArgoCD, AWS VPC, Azure DevOps, CloudFormation, CloudWatch, Dynatrace, EC2, ECS, EKS, ELB, Flux, Git, GitLab, Grafana, Helm, KMS, Kubernetes, Lambda, OWASP, PKI, Prometheus, RDS, Redshift, RESTful Services, Route53, S3, Splunk, Terraform, VPC Flow Logs
Reposted 9 Days Ago
Remote or Hybrid
Orlando, FL, USA
Expert/Leader
AdTech • Cloud • Digital Media • Information Technology • News + Entertainment • App development
The Staff Site Reliability Engineer is responsible for ensuring the reliability, performance, and security of workplace collaboration services, focusing on automation, incident management, and operational excellence while providing technical leadership and mentoring to engineers.
Top Skills: AI Engineering, Azure Virtual Desktop, Defender for Office 365, Exchange Online, Graph API, Intune, Jamf Pro, Microsoft 365, Microsoft Entra ID, Microsoft Purview, OneDrive, PowerShell, SharePoint Online, Teams
10 Days Ago
Hybrid
New York, NY, USA
112K-159K Annually
Senior level
Artificial Intelligence • Cloud • Fintech • Information Technology • Insurance • Financial Services • Big Data Analytics
Manage and execute infrastructure patching across servers, OS, middleware, databases, virtualization, and cloud. Coordinate patch waves, validate reboots and service health, handle failures and rollbacks, capture evidence, automate workflows, and partner with cross-functional teams to meet remediation SLAs and audit requirements.
Top Skills: Ansible, AWS, AWS EC2, AWS Systems Manager Patch Manager, CI/CD, Database, EKS, Hardened AMIs, Linux, Middleware, Puppet, Qualys Patch Management, Satellite, SCCM/MECM, Tanium, Terraform, Virtualization, Windows
Reposted 10 Days Ago
Easy Apply
Remote or Hybrid
9 Locations
119K-170K Annually
Senior level
Cloud • Information Technology • Security • Software • Cybersecurity
As a Staff Site Reliability Engineer, you'll oversee Zscaler production data center services, optimize code, and ensure cloud service availability and performance. Collaborate with cross-functional teams to improve processes and resolve escalated issues.
Top Skills: Bash, DNS, Firewalls, Grafana, HTTP, ICMP, Load Balancing, Nagios, OSI Model, Prometheus, Python, TCP/IP
Reposted 10 Days Ago
In-Office or Remote
4 Locations
105K-300K Annually
Entry level
Information Technology • Software • Financial Services • Big Data Analytics
SREs at Citadel focus on optimizing and maintaining system reliability, performance, and automation for investment applications, collaborating closely with teams.
Top Skills: CI/CD, CSS, JavaScript, Python, React, SQL
Reposted 10 Days Ago
Easy Apply
Hybrid
New York City, NY, USA
151K-191K Annually
Senior level
Fintech • Information Technology • Software • Financial Services
The role involves designing and automating infrastructure management, improving reliability, building internal tools, and contributing to architectural decisions. Responsibilities include working with Kubernetes and managing large-scale infrastructure, while participating in on-call rotations to prevent incidents.
Top Skills: CloudWatch, Datadog, Docker, EFK, ELK, Go, JavaScript, Kubernetes, Python, Terraform
Reposted 10 Days Ago
Hybrid
New York, NY, USA
175K-230K Annually
Senior level
Hardware • Healthtech • Software • Analytics
The Site Reliability Engineer will ensure high availability of Sage's platform, lead incident response, design reliable systems, and improve operational workflows.
Top Skills: Amazon Web Services, Datadog, Go, Google Cloud Platform, Grafana, Java, Kubernetes, MySQL, Postgres, Prometheus, Pulumi, Python, Terraform