Top Site Reliability Engineer Jobs

3 Days AgoSaved
Remote
United States
120K-160K Annually
Senior level
120K-160K Annually
Senior level
Healthtech • Other • Software
The role involves managing PostgreSQL services, ensuring high availability and performance, driving incident response, automating tasks, and improving observability for a 24x7 SaaS platform.
Top Skills: AnsibleBashDatadogGrafanaHaproxyNew RelicPgbackrestPgbouncerPostgresPowershellPrometheusPythonRepmgrTerraform
Reposted 8 Days AgoSaved
Easy Apply
Hybrid
New York City, NY, USA
Easy Apply
130K-232K Annually
Senior level
130K-232K Annually
Senior level
Marketing Tech • Mobile • Software
As a Senior Site Reliability Engineer, you'll maintain and improve the data export system, focusing on observability, reliability, and scalability while guiding junior engineers and adhering to best practices.
Top Skills: BuildkiteDocker SwarmGitGitlabJavaJenkinsKafkaKotlinKubernetesMongoDBPostgresRubySidekiqSnsSqs
Reposted 8 Days AgoSaved
Easy Apply
Hybrid
Austin, TX, USA
Easy Apply
130K-232K Annually
Senior level
130K-232K Annually
Senior level
Marketing Tech • Mobile • Software
The Senior Site Reliability Engineer will maintain the Currents data export system, solve reliability issues, mentor junior engineers, and improve system performance and scalability.
Top Skills: BuildkiteDatadogDockerGitGitlabJavaJenkinsKafkaKotlinKubernetesMongoDBPagerdutyPostgresRubySentrySidekiqSnsSqs
Reposted 8 Days AgoSaved
Easy Apply
Hybrid
San Francisco, CA, USA
Easy Apply
130K-232K Annually
Senior level
130K-232K Annually
Senior level
Marketing Tech • Mobile • Software
As a Senior Site Reliability Engineer, you'll maintain and enhance the Currents data export system, focusing on observability, scalability, and reliability, while mentoring junior engineers and solving performance issues.
Top Skills: BuildkiteDatadogDocker SwarmGitGitlabJavaJenkinsKafkaKotlinKubernetesMongoDBPagerdutyPostgresRubySentrySidekiqSnsSqs
Reposted 8 Days AgoSaved
Hybrid
O'Fallon, MO, USA
96K-163K Annually
Senior level
96K-163K Annually
Senior level
Blockchain • Fintech • Payments • Consulting • Cryptocurrency • Cybersecurity • Quantum Computing
The Senior BizOps Engineer is responsible for ensuring platform stability and resilience, guiding teams in product development, and facilitating operational excellence throughout the software lifecycle.
Top Skills: ArtifactoryBitbucketCC++ChefDynatraceGitGoJavaJenkinsMavenOraclePerlPl/SqlPostgresPythonRubySplunkSQL
Reposted 8 Days AgoSaved
Remote or Hybrid
3 Locations
147K-278K Annually
Senior level
147K-278K Annually
Senior level
Cloud • Software
Responsible for maintaining FedRAMP compliant services, designing infrastructure, monitoring systems, and ensuring security for federal regions, while driving automation and collaboration with development teams.
Top Skills: AWSFedrampGoKubernetesPuppetPythonTerraformUnix/Linux
Reposted 3 Days AgoSaved
In-Office
McLean, VA, USA
62K-141K Annually
Junior
62K-141K Annually
Junior
Information Technology
As a Site Reliability Engineer, you'll build resilient systems by implementing automation, managing infrastructure in cloud environments, and enhancing deployment processes.
Top Skills: AnsibleAWSBashCi/CdCloudFormationDockerGitGitlabGroovyJSONKubernetesOpenshiftPowershellPythonRestRubyTerraformXML
3 Days AgoSaved
In-Office
Atlanta, GA, USA
112K-200K Annually
Senior level
112K-200K Annually
Senior level
Cloud • Fintech • HR Tech
This role involves managing AWS resources using IaC, building self-service platforms for developers, and maintaining CI/CD pipelines, along with ensuring system reliability and performance.
Top Skills: Argo CdAWSCloudFormationCloudwatchDockerElkJenkinsKubernetesPrometheusTeamcityTerraform
Reposted 3 Days AgoSaved
Remote
USA
Mid level
Mid level
Software • Analytics
The role involves automating and managing AWS infrastructure, ensuring reliability and scalability of stateful systems, and optimizing deployment processes. You'll also handle incident responses and improve operational tooling.
Top Skills: AWSKubernetesTerraformTerragrunt
Reposted 3 Days AgoSaved
In-Office
Aliso Viejo, CA, USA
146K-219K Annually
Senior level
146K-219K Annually
Senior level
Gaming
The role involves ensuring production quality, owning system reliability, and participating in decision-making. Responsibilities include incident response and lifecycle management in cloud gaming technologies.
Top Skills: BashC++ElasticsearchGoIstioJavaKafkaKong Api GatewayKubernetesKumaLinkerdMongoDBMySQLPostgresPythonRedisRust
Reposted 3 Days AgoSaved
In-Office
San Francisco, CA, USA
Senior level
Senior level
Artificial Intelligence • Software
As a Site Reliability Engineer at Mercor, you will ensure production reliability, develop SRE function, and collaborate with engineering teams to maintain system performance.
Top Skills: AWSKubernetesSpaceliftTerraform
Reposted 3 Days AgoSaved
Remote
US
136K-177K Annually
Senior level
136K-177K Annually
Senior level
Big Data • Machine Learning • Software • Analytics
As a Lead Site Reliability Engineer, you will drive the reliability strategy, improve system health, lead incident management, and mentor engineers for a multi-region SaaS platform.
Top Skills: ArgocdC++Ci/CdCloud PlatformsDatadogGitopsGrafanaInfrastructure As CodeJavaJavaScriptKubernetesPython
New

Cut your apply time in half.

Use ourAI Assistantto automatically fill your job applications.

Use For Free
Application Tracker Preview
Reposted 3 Days AgoSaved
Remote
2 Locations
Junior
Junior
Computer Vision • Information Technology • Machine Learning • Natural Language Processing • Real Estate • Software
The SRE will maintain infrastructure for SaaS products on AWS, support developers, manage platform components, and handle IT tasks.
Top Skills: AWSComputer VisionIacLarge Language ModelsNlpTerraform
Reposted 3 Days AgoSaved
In-Office
St. Louis, MO, USA
Senior level
Senior level
Fintech • Analytics
The Site Reliability Engineer will support and automate critical Real Time applications, ensuring service availability and quality across cloud and on-premise deployments, while also collaborating with various teams on operational documentation and incident management.
Top Skills: AWSAzureDatadogDockerGitKubernetesPythonUnix/Linux
Reposted 3 Days AgoSaved
Remote
United States
205K-270K Annually
Senior level
205K-270K Annually
Senior level
Artificial Intelligence • Other • Sales • Software
The role involves designing and advancing infrastructure for the engineering team, ensuring the reliability of Kubernetes clusters, automating operations, and building machine learning infrastructure.
Top Skills: ArgoAWSAzureCloudFormationFluxGithub ActionsGoGCPKubernetesPostgresPythonTerraform
4 Days AgoSaved
In-Office
Arlington, VA, USA
Senior level
Senior level
Artificial Intelligence • Information Technology • Cybersecurity • Defense
As a Site Reliability Engineer, you'll ensure system reliability in a government environment, manage incidents, and collaborate with engineering teams on operational tasks and improvements while maintaining security compliance.
Top Skills: AWSBashDockerDocker ComposeGrafanaLinux/UnixLokiMimirPrometheusPythonTerraform
4 Days AgoSaved
Remote
United States
66K-88K Annually
Mid level
66K-88K Annually
Mid level
Cloud • Information Technology
The Site Reliability Engineer I is responsible for supporting Backblaze’s infrastructure stability by addressing customer issues, monitoring system health, and improving operational processes through documentation and automation.
Top Skills: AnsibleLinuxZabbix
Reposted 4 Days AgoSaved
In-Office
Jefferson Park, NJ, USA
170K-230K Annually
Mid level
170K-230K Annually
Mid level
Fintech • Financial Services
The role involves developing and delivering software solutions, collaborating cross-functionally, ensuring secure coding practices, managing multi-faceted projects, and mentoring team members.
Top Skills: FrameworksProgramming LanguagesTools
4 Days AgoSaved
In-Office
Houston, TX, USA
Mid level
Mid level
Other • Energy
The Site Reliability Engineer will build and maintain reliable systems on Google Cloud Platform, automate operations, and improve system performance and reliability.
Top Skills: AirflowBigQueryCloud MonitoringDataflowDatastreamDockerGithub ActionsGitlab CiGoGoogle Cloud PlatformGrafanaIamJavaKubernetesPrometheusPythonTerraform
4 Days AgoSaved
Remote or Hybrid
United States
165K-190K Annually
Mid level
165K-190K Annually
Mid level
Artificial Intelligence • Healthtech • Information Technology • Software
As the first Site Reliability Engineer in the US, you'll ensure platform stability and oversee incident responses during PST hours, bridging infrastructure and code, while improving operability and compliance in a medical-device environment.
Top Skills: AWSElixirKubernetesTerraform
4 Days AgoSaved
Hybrid
2 Locations
Mid level
Mid level
AdTech • Big Data • Marketing Tech • Software
Responsible for owning and optimizing the Internal Developer Platform, improving reliability, scalability, and usability while supporting engineering teams and standardizing operational processes through automation and best practices.
Top Skills: ArmAWSAzureBashCloudFormationConsulDockerGithub ActionsHashicorpJenkinsKubernetesLinuxNomadPowershellPythonSplunkSumo LogicTerraformVaultWindows
4 Days AgoSaved
Remote
6 Locations
320K-489K Annually
Expert/Leader
320K-489K Annually
Expert/Leader
Artificial Intelligence • Computer Vision • Hardware • Robotics • Metaverse
Lead the design and operation of large scale Kubernetes clusters, ensuring high availability and performance while supporting system lifecycle and reliability improvements.
Top Skills: ContainersGoKubernetesLinuxNetworkingOpenstackPerlPythonRuby
Reposted 4 Days AgoSaved
Remote or Hybrid
7 Locations
Senior level
Senior level
Artificial Intelligence • Information Technology • Software
The role involves defining and evolving technical foundations for AI evaluation, optimizing performance, designing resilient systems, and collaborating with various teams for infrastructure improvements.
Top Skills: Node.jsPostgresServerless EnvironmentsTypescript
Reposted 4 Days AgoSaved
In-Office
Miami, FL, USA
Senior level
Senior level
Healthtech
The Senior Software Engineer will enhance system reliability, manage Kubernetes and AWS environments, oversee incident responses, and implement observability measures.
Top Skills: AWSCloudwatchElbGithub ActionsKubernetesObservability ToolingTerraformVpc
Reposted 4 Days AgoSaved
Hybrid
Atlanta, GA, USA
Mid level
Mid level
Fintech • Payments • Financial Services
Build, operate, and scale AWS-based infrastructure using IaC (Terraform), manage EKS and serverless environments, create CI/CD pipelines, implement observability (OpenTelemetry/Prometheus/New Relic), support Postgres/RDS (Aurora), lead incident response and define SRE practices (SLIs/SLOs/error budgets).
Top Skills: AuroraAWSAws RdsAzureCloudFormationEcsEksGithub ActionsGitlabGoGCPJavaKubernetesNew RelicOpentelemetryOpentofuPostgresPrometheusPythonRubyServerlessTerraformTerragrunt
All Filters
JobType
New Jobs
Job Category
Experience
Industry
Company Name
Company Size

Sign up now Access later

Create Free Account