Get the job you really want.
Maximum of 25 job preferences reached.
Top Site Reliability Engineer Jobs
Artificial Intelligence • Software
As a Principal Site Reliability Engineer, you will design hybrid infrastructure, integrate edge devices and cloud resources, optimize performance and costs, and collaborate with cross-functional teams to ensure robust systems.
Top Skills:
AWSGoKubernetesLinuxPythonTerraformTerragrunt
Artificial Intelligence • Computer Vision • Hardware • Robotics • Metaverse
Design and maintain large scale Kubernetes clusters, ensuring reliability through monitoring, automation and incident response.
Top Skills:
DockerGoKubernetesLinuxNetworkingOpenstackPerlPythonRuby
Information Technology • Security • Cybersecurity
The Sr. Site Reliability Engineer will ensure the smooth operation of infrastructure, oversee incident management, and improve operational efficiency while leading a small team.
Top Skills:
AWSBazelCloudGitopsGrafanaHelmKubernetesLinuxPrometheusSaaSTerraform
Artificial Intelligence • Cloud • Software
The Senior Site Reliability Engineer will automate operations, optimize workflows for teams, manage secure infrastructure, and participate in on-call duties.
Top Skills:
AristaAWSBashCephChefCifsCiscoDnsDockerElk StackFortinetHpHTTPIcmpIscsiJenkinsKubernetesLinux/Debian Family/UbuntuMesosphereNfsNode.jsPivotal GreenplumPostgresPythonRabbitMQRubyS3ScyllaSshSslSupermicroTcpTls
Artificial Intelligence • Software • Generative AI
This role involves designing and maintaining cloud infrastructure, automating provisioning, and enhancing system reliability through monitoring, collaboration, and mentorship.
Top Skills:
AWSAzureDockerElk StackGCPGoGrafanaJavaKubernetesPrometheusPythonTerraform
Cybersecurity
As a Principal Site Reliability Engineer, you will build and maintain scalable cloud infrastructure, ensuring reliability and security while automating deployments and orchestrating monitoring solutions.
Top Skills:
BashDockerFirehydrantGCPGitlab Ci/CdGitopsGoGrafanaJavaKubernetesLokiMySQLNode.jsPagerdutyPrometheusPythonTerraform
Information Technology • Security • Software
The Principal Site Reliability Engineer ensures system availability and performance, automates processes, and supports a multi-tenant microservices application suite with a focus on incident response and system reliability.
Top Skills:
AWSBashCloudFormationDatadogElk StackGoGrafanaJavaKubernetesOpentelemetryPostgresPrometheusPythonTerraform
Information Technology • Software
Build and maintain Verisign's Kubernetes platform, enforce security practices, monitor performance, and provide tier 3 support. Requires extensive experience with Kubernetes and related technologies.
Top Skills:
GitJIRAKubernetesLinuxPythonTerraformUnix
Reposted 23 Days AgoSaved
Fintech • Information Technology • Payments
The Staff Site Reliability Engineer maintains and optimizes Hadoop and Kafka clusters on cloud platforms, driving innovation by ensuring system availability and performance while collaborating with teams and developing monitoring tools.
Top Skills:
AnsibleAWSAzureBig DataGCPGrafanaHadoopHdfsJavaKafkaLinuxMapreduceOperaPythonSparkSplunk
Software
The Site Reliability Engineer will manage deployment pipelines, implement observability solutions, enhance infrastructure security, and troubleshoot issues to ensure system reliability and performance.
Top Skills:
AWSCloudFormationDatadogDockerGitlabGrafanaKubernetesMongoDBMySQLPostgresPrometheusTerraform
Information Technology
As a Site Reliability Engineer, you'll design and operate scalable storage systems and optimize performance for AI research data management.
Top Skills:
GoKubernetesPulumiRust
Information Technology
As a Site Reliability Engineer at xAI, you will ensure the reliability and performance of AI data center infrastructure, automate operations, and collaborate across teams.
Top Skills:
ArgocdBuildkiteC++Ci/CdGoGrafanaKubernetesPrometheusPulumiRustTerraform
New
Cut your apply time in half.
Use ourAI Assistantto automatically fill your job applications.
Use For Free
Fintech • Insurance • Financial Services
The SRE Release Engineer develops, tests, and maintains software systems while implementing SRE best practices for performance optimization and release management.
Top Skills:
C++CobolJavaVisual Basic
Cloud • Information Technology • Internet of Things • Software • Consulting • Infrastructure as a Service (IaaS) • Automation
As a Senior Site Reliability Engineer at Red Hat, you will develop and operate OpenShift managed cloud services and enhance system reliability through automation. Responsibilities include coding, troubleshooting, supporting peers, and participating in on-call schedules.
Top Skills:
AnsibleAWSAzureCC++DockerGCPGoJavaKubernetesOpenshiftPrometheusPython
Blockchain • Fintech • Internet of Things • Cryptocurrency • Web3
As a Senior Site Reliability Engineer, you'll design and operate cloud infrastructure, manage Kubernetes environments, implement Infrastructure as Code, and automate processes to ensure reliability and performance.
Top Skills:
Amazon RdsAuroraAWSBashCdkCrossplaneGoKubernetesPythonTerraform
Artificial Intelligence • Robotics • Business Intelligence
The Engineering Quality Lead will manage system effectiveness for product infrastructure, enhance customer satisfaction, drive proactive quality improvements, and collaborate with teams to ensure service excellence.
Top Skills:
AWSElasticGCPGrafanaKubernetesPrometheusPythonTerraform
Information Technology • Other • Payments • Software
The Senior Site Reliability Engineer will design, implement, and maintain scalable systems, improve automation, and enhance reliability through multi-language programming, guiding junior engineers and participating in incident response activities.
Top Skills:
AWSAzureC#CoralogixDatadogDockerGoJavaJavaScriptKubernetesPrometheusPythonTerraformTypescript
Software
The Senior Site Reliability Engineer will design, implement, and maintain scalable systems using coding expertise to enhance performance and reliability. Responsibilities include collaborating with developers, managing CI/CD pipelines, implementing observability solutions, and mentoring junior engineers.
Top Skills:
AWSAzureC#CoralogixDatadogDockerGoJavaJavaScriptKubernetesPrometheusPythonTerraformTypescript
Security • Software
The role involves developing and managing Tenable's cloud products, ensuring reliability and availability, automating systems, and collaborating on cloud technologies while meeting FedRAMP compliance.
Top Skills:
AWSAzureDockerGCPGradleHelmKubernetesNode.jsPythonTerraform
Artificial Intelligence • Fintech • Software • Financial Services
Seeking a seasoned SRE to lead reliability for a cloud-native platform, overseeing infrastructure, CI/CD pipelines, observability, and mentoring engineers.
Top Skills:
AWSClickhouseGoJavaKafkaKubernetesPulumiTerraform
eCommerce • Retail • Software
The Director of Site Reliability Engineering will lead cloud deployment strategies, enhance automation and scalability, and mentor the engineering team.
Top Skills:
AnsibleApacheChefDockerGithub ActionsJenkinsKubernetesMongoDBMySQLNginxTerraform
Software
The Senior Site Reliability Engineer will lead automation, solve technical issues, ensure security, maintain cloud environments, and improve operations and deployment efficiency.
Top Skills:
AzureCi/CdDockerGitKubernetesPackerTerraform
Automotive • Hardware • Logistics
The Site Reliability Engineer III improves system reliability through automation, supports cloud transformations, and partners with development teams to enhance service performance.
Top Skills:
APIsAzure DevopsDynatraceGoogle Cloud PlatformGrafanaKubernetesMicroservice ArchitecturePrometheusTerraform
Other • Real Estate • PropTech
As a Senior Site Reliability Engineer, you will design and manage scalable infrastructure, automate processes, collaborate with teams, and ensure system reliability.
Top Skills:
GoInfrastructure As CodeJavaPython
Fintech • Analytics • Financial Services
As a Staff Site Reliability Engineer, you'll design scalable Azure cloud infrastructure, automate CI/CD processes, leverage AI for solutions, and lead incident management while collaborating across teams.
Top Skills:
AzureAzure DevopsAzure MonitorBashC#/.NetCi/CdDockerGithub ActionsLog AnalyticsPowershellPythonTerraform
Top Companies Hiring Site Reliability Engineers
See AllPopular Job Searches
All Software Engineer Jobs
.NET Developer Jobs
Aerospace Thermal Engineering Jobs
AI Engineer Jobs
Android Developer Jobs
Automation Engineer Jobs
Backend Developer Jobs
Blockchain Developer Jobs
C# Jobs
C++ Jobs
Cloud Architect Jobs
Cloud Engineer Jobs
Design Engineer Jobs
DevOps Engineer Jobs
Director Of Engineering Jobs
Electrical Engineering Jobs
Embedded Software Engineer Jobs
Engineering Jobs
Engineering Manager Jobs
Environmental Engineering Jobs
Field Engineer Jobs
Front End Developer Jobs
Full Stack Developer Jobs
Game Developer Jobs
Golang Jobs
Hardware Engineer Jobs
Industrial Engineering Jobs
iOS Developer Jobs
Java Developer Jobs
Javascript Developer Jobs
Linux Jobs
Manufacturing Engineer Jobs
Mechanical Engineering Jobs
Network Engineer Jobs
PHP Developer Jobs
Process Engineer Jobs
Project Engineer Jobs
Prompt Engineering Jobs
Python Jobs
QA Jobs
Robotics Engineer Jobs
Ruby on Rails Jobs
Salesforce Administrator Jobs
Salesforce Developer Jobs
Scala Jobs
Sharepoint Developer Jobs
Site Reliability Engineer Jobs
Software Engineering Manager Jobs
Solutions Architect Jobs
SQL Developer Jobs
Structural Engineer Jobs
System Engineer Jobs
Test Engineer Jobs
Web Developer Jobs
All Filters
Total selected ()
No Results
No Results

































