Get the job you really want.

Top Site Reliability Engineer Jobs

Reposted 2 Days Ago
Remote
15 Locations
Senior level
Big Data • Analytics
As a Senior SRE, you will maintain and scale the infrastructure, manage containerized applications, and ensure system performance and reliability.
Top Skills: AWS, Docker, GCE, Go, Helm, Kubernetes, Linode, Python
18 Days Ago
Remote
4 Locations
180K-230K
Senior level
Artificial Intelligence • Information Technology • Security • Software
The Staff Site Reliability Engineer will design, build, and maintain fault-tolerant systems, enhance observability, and standardize reliability practices across engineering teams.
Top Skills: AWS, Bash, CI/CD, Datadog, ELK, Go, Kubernetes, Linux, New Relic, Prometheus, Python
Junior
Artificial Intelligence • Big Data • Machine Learning • Software • Analytics
Design, deploy, and maintain cloud infrastructure for Dataiku's SaaS offerings on Azure and AWS while automating technical operations and troubleshooting issues.
Top Skills: Argo CD, AWS, Azure, Docker, Helm, Kubernetes, Terraform
Reposted 19 Days Ago
Remote
9 Locations
Mid level
Sports
The Site Reliability Engineer will manage AWS infrastructure, deploy across regions, monitor releases, and handle Kubernetes clusters while improving the tech stack.
Top Skills: AWS, Docker, Grafana, Kubernetes, Prometheus, Python
Reposted 5 Days Ago
Remote
USA
181K-212K Annually
Senior level
Cloud • Fintech • Cryptocurrency • NFT • Web3
The Senior Site Reliability Engineer will enhance system reliability, improve observability, build automation, and optimize cloud deployments while mentoring engineers and ensuring process improvements.
Top Skills: AWS, Azure, Datadog, Docker, EC2, GCP, Go, Kibana, Kubernetes, Ruby, Terraform
19 Days Ago
Park, MI, USA
100K-150K
Senior level
Fintech • Financial Services
The Cloud SRE Associate will build automation, improve cloud operations, and support Terraform on GCP, driving stability and collaboration across teams.
Top Skills: Bash, Go, Google Cloud Platform, New Relic, Python, Terraform
Yesterday
Remote
United States
Senior level
Software
As a Senior Site Reliability Engineer, you will enhance the developer platform, improve reliability, and collaborate with internal and external teams on complex infrastructure solutions.
Top Skills: AWS, Azure, Celery, CircleCI, GCP, GitHub Actions, GitLab CI, Go, Grafana, Helm, Kubernetes, LaunchDarkly, Machinery, Prometheus, Splunk, Temporal, Terraform
19 Days Ago
Remote
Hybrid
3 Locations
200K-200K
Senior level
Artificial Intelligence • Biotech
The Senior Infrastructure Engineer will maintain and grow multi-cloud infrastructure for ML and drug discovery, focusing on automation and orchestration.
Top Skills: AWS, Bash, GCP, Kubernetes, Python, Terraform
2 Days Ago
3 Locations
154K-287K Annually
Senior level
Artificial Intelligence • Digital Media • Marketing Tech • Software
The role involves enhancing Adobe's AI Platform by increasing reliability, scalability, and security, while managing distributed systems and automating operational practices.
Top Skills: Ansible, Elastic Stack, Go, Hugging Face, InfluxDB, Kubernetes, NVIDIA TensorRT, OpenAI Triton, Prometheus, Python, PyTorch, SageMaker, Terraform
19 Days Ago
The Center, IN, USA
175K-175K
Senior level
eCommerce • Retail • Software
As Director of Site Reliability Engineering, you'll lead the SRE team, focusing on cloud deployment, scalability, and automation while collaborating with various teams to enhance operational excellence.
Top Skills: Ansible, Apache, Chef, Docker, GitHub Actions, Go, Grafana, Jenkins, Kubernetes, MongoDB, MySQL, New Relic, Nginx, Prometheus, Python, Terraform
19 Days Ago
Remote
US
Senior level
Blockchain • Software
The Senior Engineer, SRE/DevOps will ensure the reliability and security of blockchain infrastructure by automating processes and collaborating with teams.
Top Skills: Ansible, AWS, Bash, CloudTrail, CloudWatch, Cosmos, Docker, Elasticsearch, ELK Stack, Ethereum, GCP, K8s, MySQL, Opsgenie, PagerDuty, Pingdom, Python, Terraform
19 Days Ago
Atlanta, GA, USA
50K-120K
Senior level
Fintech • Information Technology • Payments • Software
Lead the High Availability Disaster Recovery team, manage engineers, improve product reliability with automation, align system capabilities with business needs, and mentor staff to ensure success.
Top Skills: AKS, Ansible, Argo CD, Artifactory, Azure, Bash, BigQuery, Bigtable, Cassandra, Chef, CI/CD, Cloud SQL, Cosmos DB, Flux CD, GCP, GitHub Actions, GKE, Hadoop, Helm, Jenkins, Kafka, Kubernetes, Microsoft SQL Server, Postgres, PowerShell, Terraform, Yarn, ZooKeeper
19 Days Ago
Atlanta, GA, USA
Mid level
Fintech • Information Technology • Payments • Software
Develop high-quality software solutions as part of an Agile team, focusing on the NCR Retail Loyalty Program. Responsibilities include software development, system maintenance, ensuring coding standards, and improving performance.
Top Skills: Ansible, Argo CD, Azure, Flux CD, GCP, GitHub Actions, Grafana, Helm, Java, Jenkins, Kubernetes, Postgres, PowerShell, Prometheus, Python, REST API, Shell, Spring Boot, Spring Framework, Spring Security, SQL, Terraform
19 Days Ago
Fort Worth, TX, USA
Internship
Information Technology • Software • Travel
As an SRE intern, assist in developing observability tools, collaborate with teams to resolve issues, and help create dashboards for system monitoring.
Top Skills: Datadog, ELK, Observability Tools, Python, Splunk, SQL
19 Days Ago
Fort Worth, TX, USA
Internship
Information Technology • Software • Travel
As a Site Reliability Engineer intern, you'll assist with system monitoring, automate tasks, collaborate on development, and document processes.
Top Skills: Ansible, Docker, Google Cloud Platform, Java, Kubernetes, Linux, Non-SQL Database, Oracle, Perl, Python, Shell, Terraform
19 Days Ago
Fort Worth, TX, USA
Internship
Information Technology • Software • Travel
As a Site Reliability Engineer Intern, you'll monitor systems, automate tasks, collaborate on feature deployment, and document system processes at Sabre.
Top Skills: Ansible, Docker, Google Cloud Platform, Kubernetes, Linux Operating System, Non-SQL Database, Oracle, Perl, Python, Shell, Terraform
Reposted 5 Days Ago
Remote
8 Locations
2K-2K
Senior level
Cloud • Software
The Senior Site Reliability Engineer will automate infrastructure using Python, manage cloud environments, and ensure operational excellence across services and applications.
Top Skills: Cloud, DevOps, Kubernetes, Linux, OpenStack, Python
19 Days Ago
Gateway Trailer Park, Jacksonville, FL, USA
78K-112K Annually
Senior level
Fintech • Financial Services
The role involves building cloud solutions with an emphasis on automation, collaborating with teams, and implementing best practices for operational efficiency in cloud infrastructure.
Top Skills: Bash, Git, Go, Google Cloud Platform, Linux, Python, Terraform
19 Days Ago
Palo Alto, CA, USA
103K-174K Annually
Entry level
Gaming • Software • Metaverse
The role involves researching industry solutions, analyzing customers’ media business structures, and developing sales support materials for Tencent’s audio and video products in various markets.
Top Skills: AI, Cloud, Network Security
19 Days Ago
Palo Alto, CA, USA
103K-174K Annually
Entry level
Gaming • Software • Metaverse
The role involves researching and developing customer technology solutions, providing analysis of media business structures, and assisting in sales support material creation.
Top Skills: AI, Cloud, Network Security
Reposted 25 Days Ago
Mt Washington, Baltimore, MD, USA
100K-170K
Mid level
Internet of Things • Security
The Site Reliability Engineer ensures uptime and reliability of Armis's services, manages deployments and monitoring, and collaborates on process improvements.
Top Skills: Alertmanager, AWS, Bash, Git, Grafana, Helm, Kubernetes, Prometheus, Python
20 Days Ago
Remote
USA
120K-202K
Senior level
Information Technology • Cryptocurrency
The Head of SRE will lead the SRE team, defining strategy, ensuring system reliability, and driving operational excellence while mentoring staff.
Top Skills: Argo CD, Bash, ELK Stack, GCP, Go, Grafana, Helm, Kubernetes, Prometheus, Python, Terraform
20 Days Ago
30005, Alpharetta, GA, USA
Internship
Fintech • Consulting
As a Site Reliability Engineer Intern, you'll design and develop micro-services, validate technology capabilities, and collaborate on product development and feature launches.
Top Skills: Agile, Java, Microservices, Python, Spring Boot
2 Days Ago
Denver, CO, USA
136K-160K
Senior level
Big Data • Internet of Things • Machine Learning
The Senior Site Reliability Engineer will manage production operations, provision Kubernetes infrastructure, monitor systems, and drive improvements.
Top Skills: AWS, Go, Grafana, Icinga, Java, Kubernetes, Linux, Nagios, New Relic, Perl, PHP, Prometheus, Python, Terraform
Reposted 20 Days Ago
Santa Clara, CA, USA
215K-364K
Senior level
Automotive
Lead the design and implementation of cloud-native AI infrastructure solutions to support autonomous driving technologies, optimizing performance and reliability.
Top Skills: Ali Cloud, AWS, Azure, Docker, Go, Kubernetes, Python, PyTorch, TensorFlow

Top Companies Hiring Site Reliability Engineers

Coinbase
Web3 • NFT • Fintech • Cryptocurrency • Cloud
Fully Remote
3700 Employees
Adobe
Software • Marketing Tech • Digital Media • Artificial Intelligence
12 Offices
21000 Employees
Canonical
Software • Cloud
3 Offices
880 Employees
Plume Design, Inc
Machine Learning • Internet of Things • Big Data
Palo Alto, CA
611 Employees
Macrometa
Big Data • Analytics
San Mateo, CA
88 Employees
Regrello
Software
San Francisco, California
44 Employees