Maximum of 25 job preferences reached.
Top Site Reliability Engineer Jobs
Information Technology • Mobile • Software
As a Site Reliability Engineer, you'll ensure system reliability and scalability, automate processes, optimize performance, and collaborate on system design.
Top Skills:
AWSAzureBashCloudFormationDatadogDockerElkGoGoogle Cloud PlatformGrafanaHelmKubernetesNew RelicPrometheusPulumiPythonTerraform
Other
As a Platform Engineer/Dev Ops, you will expand cloud infrastructure, implement monitoring systems, manage databases, and leverage CI/CD tools, working collaboratively with various teams.
Top Skills:
AWSAzureBashDatadogElk StackKubernetesOpentofuPrometheusPythonTerraform
Artificial Intelligence • Software
As a Senior Staff SRE Tech Lead, you'll oversee reliability and scalability, mentor engineers, optimize systems, and enhance data infrastructure.
Top Skills:
ClickhouseGoPostgresPythonTypescript
Cloud
The Site Reliability Engineer will manage Kubernetes platforms, optimize AWS cloud infrastructure, ensure high availability, and automate deployment while handling troubleshooting and security compliance.
Top Skills:
AWSBashCi/CdCloudwatchElk StackGoGrafanaHelmIstioKubernetesPrometheusPythonTerraform
Cloud
The Senior Site Reliability Engineer will enhance the Splunk ecosystem and develop an Observability Platform by automating infrastructure and managing complex distributed systems, while optimizing log collection and incident response.
Top Skills:
AWSGCPGoKubernetesLinuxOpentelemetryPythonRubySplunkTerraform
Real Estate • Travel • PropTech
The Engineering Manager for Storage SRE will lead a team to ensure reliable database operations, improve developer experience, and expand tooling and operational models, focusing on mission-critical systems.
Top Skills:
Cloud InfrastructureDatabasesSite Reliability EngineeringStorage Systems
Big Data • Cloud • Marketing Tech • Social Impact • Software
The Senior Staff Site Reliability Engineer at LiveRamp will define the SRE strategy, oversee critical automation, and lead operational excellence in a global infrastructure, influencing architectural decisions and mentoring teams.
Top Skills:
Aws)CassandraCircleCICloud Security (GcpDynamoDBGoJenkinsKubernetesPythonScylladbSinglestoreTerraform
eCommerce • Fintech • Information Technology • Payments • Financial Services
Design, deploy, and operate highly available, scalable cloud infrastructure on AWS. Manage Kubernetes clusters, build CI/CD with GitHub Actions, automate via Terraform, optimize data layers (RDBMS/document stores), implement observability, and lead design reviews while mentoring teams on SRE practices.
Top Skills:
AWSBashDatadogDnsDockerDocument StorageDynatraceGitGithub ActionsKubernetesKubernetes OperatorsLinux/UnixLoad BalancingNew RelicNode.jsPythonRdbmsRuby On RailsTerraformVirtual Networking
Artificial Intelligence • Fintech • Machine Learning • Natural Language Processing • Payments • Software • Financial Services
Lead SRE responsible for reliability strategy, architecting resilient AWS/Kubernetes infrastructure, building observability, driving incident response and postmortems, improving deployment safety and automation, mentoring engineers, and partnering across product and engineering to scale platform reliability.
Top Skills:
AWSCi/CdDockerEvent-Driven ArchitecturesFastapiInfrastructure As CodeKubernetes (Eks)LogsObservability (MetricsPostgresql (Rds)PythonTraces)TypescriptVue
News + Entertainment
As a Senior Machine Learning Engineer, you will develop advanced machine learning and deep learning models and platforms for optimizing advertising performance and conduct complex experiments.
Top Skills:
AIControl SystemsDeep LearningMachine LearningReinforcement LearningStatistical Techniques
News + Entertainment
Design, operate, and scale cloud-native ML infrastructure across GCP and AWS (GPU/TPU), build CI/CD for models, maintain low-latency real-time inference systems, define observability and monitoring for ML models, participate in on-call incident response, and partner with data scientists to improve MLOps and platform usability.
Top Skills:
AerospikeApache AirflowApache FlinkSparkAWSChrononDatadogEksGCPGitlab RunnerGkeGpuGrafanaJavaJenkinsKafkaKubernetesKv StoreMlflowPrometheusPythonRayScalaTerraformTpuVector Database
Information Technology
Design, build, and maintain resilient cloud infrastructure for the Intelligence Community. Implement redundancy, monitoring, automation, patching, and hardening. Reduce toil via scripting and self-repair, and support security posture improvements using cloud and container tooling.
Top Skills:
AWSConfluenceDockerGitJenkinsJIRAKubernetesLinuxNessusPackerRhel
New
Track Smarter, Apply Better.
Ditch the spreadsheets. Organize your job search with our freeApplication Tracker.
Use For Free
Aerospace • Other
Design, deploy, and scale on‑premise compute and core infrastructure for Starlink. Develop automation, manage databases/monitoring/distributed storage, collaborate with software teams, troubleshoot end-to-end, and improve deployment and developer velocity.
Top Skills:
AnsibleBashCC++DatabasesDistributed StorageDockerGoHypervisor TechnologiesKubernetesLinuxMonitoringPythonTcp/IpTerraformVirtualization
Aerospace • Other
Design, deploy, and scale on-prem Kubernetes clusters and core infrastructure for Starlink. Build automation, manage databases, monitoring, and distributed storage. Collaborate with engineers to improve service lifecycle, availability, and performance; troubleshoot across the Starlink stack and drive reliability improvements.
Top Skills:
AnsibleBashBazelC++GoKubernetesLinuxMakefilesOci ContainersPythonTcp/IpTerraform
Cloud • Fintech • HR Tech
Design, automate, patch, and monitor infrastructure and services to keep production and non-production environments running. Create and maintain scripts and automation (CI/CD), manage containerized deployments (Docker/Kubernetes), provision baremetal and cloud infrastructure, and improve monitoring, alerting, and tracing to meet SLAs.
Top Skills:
AnsibleBaremetalCi/CdDockerGitGradleJenkinsKafkaKubernetesMavenMonitoringPackerPrivate CloudPublic CloudPythonShell ScriptingTerraformTracing
Other
The Sr. Site Reliability Engineer will maintain and administer enterprise systems, troubleshoot operational issues, and develop scripts. This role requires collaboration across teams and participation in project planning and execution.
Top Skills:
AnsibleApacheAzureC#ChefIisJavaJbossPerlPowershellPuppetPythonRubyTomcat
Edtech
The Senior Site Reliability Engineer ensures system reliability and performance, develops monitoring solutions, identifies problems, and partners with engineering teams for scalable solutions.
Top Skills:
AWSBashCC++DockerGCPJavaKubernetesPerlPython
Reposted 2 Days AgoSaved
Financial Services
The Senior Site Reliability Engineer will own the operational reliability of developer tooling ecosystems and improve developer productivity through efficient processes and automation.
Top Skills:
.NetBashPowershellPython
Artificial Intelligence • Blockchain • Information Technology • Consulting
Lead design and build of production-grade Azure infrastructure using Terraform, ensuring scalable, secure, and repeatable deployments. Provide technical leadership, platform enhancements, observability and incident response improvements, and Tier 2 infrastructure support while collaborating with engineering, security, and product teams to meet enterprise readiness and feature parity goals.
Top Skills:
ArgoAzureGoGrafanaKubernetesPrometheusPythonSpaceliftTerraform
Healthtech • Pharmaceutical
The Senior Site Reliability Engineer will ensure systems operate smoothly, improve performance, automate tasks, and coordinate with teams in a hybrid environment.
Top Skills:
AWSAzureAzure DevopsBambooDockerElixirGCPGithub ActionsGoJenkinsKubernetesPython
Software
The Senior SRE will ensure reliability of production systems, design monitoring processes, and build automation tools while collaborating in a regulated environment. The role blends operational tasks and coding responsibilities.
Top Skills:
AWSCdkCloudfrontEcsGithub ActionsHoneycombLambdaNixOpentelemetryPythonRdsSQLTerraformTypescript
Big Data • Analytics • Business Intelligence • Big Data Analytics
Seeking a Site Reliability Engineer to manage AI platform reliability, automate tasks, optimize ML pipelines, and lead incident response in a hybrid engineering role.
Top Skills:
ArgocdBigQueryCloud BuildDockerDvcGithub ActionsGoGrafanaKubeflowKubernetesMlflowPrometheusPub/SubPythonTerraformVertex Ai
Fintech
Responsible for enhancing application infrastructure, ensuring reliability and scalability, automating processes, implementing observability, and collaborating with software development teams.
Top Skills:
AWSDockerGitGoJavaJavaScriptKubernetesLinuxPythonRubySwarm
Digital Media • Software • Sports
Seeking a Senior Site Reliability Engineer to enhance system reliability, performance, and scalability. Focus on automation, observability, and improving CI/CD practices while collaborating with engineering teams for better incident response and metrics improvement.
Top Skills:
AWSAzureC++Ci/CdDatadogDockerElkGCPGoGrafanaJavaKubernetesLinuxPrometheusPythonTerraform
3 Days AgoSaved
Robotics • Software
Own reliability across vehicle and cloud stacks for AUV operations: onboard Jetson/ROS2 compute, topside systems, cloud ingestion/processing and customer platform. Build automation, observability, runbooks, and self-recovery to reduce on-call toil; manage AWS infrastructure, IaC, container orchestration, and reliability targets. Participate in shared 12-hour on-call shifts and field deployments, mentor team on operational excellence.
Top Skills:
AWSBashContainerizationDockerGoGrafanaIamJetsonKubernetesLinuxPrometheusPythonRosRos 2Terraform
Let Your Resume Do The Work
Upload your resume to be matched with jobs you're a great fit for.
Success! We'll use this to further personalize your experience.
Top Companies Hiring Site Reliability Engineers
See AllPopular Job Searches
All Software Engineer Jobs
.NET Developer Jobs
Aerospace Thermal Engineering Jobs
AI Engineer Jobs
Android Developer Jobs
Automation Engineer Jobs
Backend Developer Jobs
Blockchain Developer Jobs
C# Jobs
C++ Jobs
Cloud Architect Jobs
Cloud Engineer Jobs
Design Engineer Jobs
DevOps Engineer Jobs
Director Of Engineering Jobs
Electrical Engineering Jobs
Embedded Software Engineer Jobs
Engineering Jobs
Engineering Manager Jobs
Environmental Engineering Jobs
Field Engineer Jobs
Front End Developer Jobs
Full Stack Developer Jobs
Game Developer Jobs
Golang Jobs
Hardware Engineer Jobs
Industrial Engineering Jobs
iOS Developer Jobs
Java Developer Jobs
Javascript Developer Jobs
Linux Jobs
Manufacturing Engineer Jobs
Mechanical Engineering Jobs
Network Engineer Jobs
PHP Developer Jobs
Process Engineer Jobs
Project Engineer Jobs
Prompt Engineering Jobs
Python Jobs
QA Jobs
Robotics Engineer Jobs
Ruby on Rails Jobs
Salesforce Administrator Jobs
Salesforce Developer Jobs
Scala Jobs
Sharepoint Developer Jobs
Site Reliability Engineer Jobs
Software Engineering Manager Jobs
Solutions Architect Jobs
SQL Developer Jobs
Structural Engineer Jobs
System Engineer Jobs
Test Engineer Jobs
Web Developer Jobs
All Filters
Total selected ()
No Results
No Results




_1.png)



























