Top Site Reliability Engineer Jobs
Cloud • Fintech • HR Tech
The role involves managing and automating Kubernetes infrastructure, ensuring high availability, maintaining security compliance, and collaborating with development teams. A focus on problem-solving and documentation is essential.
Top Skills:
Argo CD, AWS, CI/CD, Kubernetes, Terraform
Cloud • Fintech • HR Tech
The role involves maintaining and automating a Kubernetes platform, ensuring high availability, scalability, and security while collaborating with development teams for deployment and compliance.
Top Skills:
Argo CD, AWS, Kubernetes, Terraform
Software
As a Senior Site Reliability Engineer, you will enhance platform reliability, own SLIs/SLOs, design observability systems, and lead incident response and improvements.
Top Skills:
Argo CD, AWS, Bash, Go, Grafana, Kubernetes, OpenTelemetry, Prometheus, Python, Terraform
AdTech
The Senior Site Reliability Engineer will manage database platforms like MySQL, Amazon Redshift, and Snowflake, focusing on scalability and reliability. Responsibilities include database design, performance optimization, operations, monitoring, and team collaboration to ensure effective database management and compliance with security policies.
Top Skills:
Amazon Redshift, Apache Airflow, AWS, AWS Glue, MySQL, Snowflake, SQL
Hardware • Semiconductor • Manufacturing
The Site Reliability Engineer will design, implement, and manage reliable infrastructure and services, ensuring operational excellence and uptime.
Top Skills:
AWS, Bash, Docker, Grafana, Kubernetes, Linux, Azure, OpenShift, Prometheus, Proxmox, Python, VMware vSphere
Healthtech • Software
As a Site Reliability Engineer, you'll ensure system availability and performance, respond to incidents, and improve operational processes. Collaborate with teams to enhance reliability and reduce risks through automation and monitoring.
Top Skills:
AWS, CloudFormation, Docker, ECS, Git, MySQL, Node.js, Python, Terraform
Artificial Intelligence • Automotive • Internet of Things • Software
The Site Reliability Engineer will ensure application reliability, performance, and availability, emphasizing incident response and collaboration with development teams.
Top Skills:
ActiveMQ, Ansible, AppDynamics, AWS Lambda, CloudFormation, CloudWatch, EKS, Git, Java, JavaScript, Jenkins, jQuery, Kafka, Kubernetes, MSK, MySQL, Postgres, Python, RabbitMQ, REST APIs, Signals, Spinnaker, SQL, Terraform, Vue
Financial Services
As a Site Reliability Engineer, you will enhance and monitor production systems, automate workflows, and respond to incidents to maintain system reliability.
Top Skills:
Airflow, Bazel, Git, Go, Grafana, gRPC, Jenkins, Kubernetes, Linux, Pandas, Postgres, Prometheus, Python, R, Relational Databases, SQL
Healthtech • Insurance
The Senior Software Engineer will lead technical projects in cloud infrastructure, mentoring teams, improving DevOps practices, and ensuring system resilience.
Top Skills:
AWS, CI/CD, GCP, GitHub Actions, Grafana, IAM, Istio, Kubernetes, Prometheus, Terraform, VPC Peering
Healthtech • Insurance
The Senior Software Engineer will lead complex projects, mentor engineers, and ensure cloud infrastructure is resilient and automated. Responsibilities include developing software, managing production environments, and enforcing coding standards.
Top Skills:
Argo CD, AWS, GCP, GitHub Actions, Grafana, Istio, Kubernetes, Prometheus, Terraform
Healthtech • Insurance
The Senior Software Engineer will lead cloud infrastructure projects, mentor junior engineers, ensure system reliability, and drive technical roadmaps.
Top Skills:
AWS, CI/CD, GCP, GitHub Actions, Grafana, Istio, Kubernetes, Prometheus, Terraform
Healthtech • Insurance
The Senior Software Engineer will lead technical projects, mentor engineers, and build resilient cloud infrastructures focusing on SRE best practices.
Top Skills:
AWS, CI/CD, GCP, GitHub Actions, Grafana, Kubernetes, Prometheus, Terraform
Cloud • Digital Media • Information Technology
Operate and improve Kubernetes-based production systems, manage cluster lifecycle and networking, build CI/CD and GitOps pipelines, define SLOs and incident response, automate resolution with AI, implement monitoring/alerting, and drive reliability through automation and chaos engineering.
Top Skills:
Kubernetes, Terraform, Ansible, CNI Plugins, VXLAN, BGP, DNS, FluxCD, Argo CD, Python, Go, Bash, Prometheus, Grafana, Loki, Thanos, VictoriaMetrics, Datadog, eBPF, XDP, Falco, Coroot, SIEM, Calico, Cilium, MetalLB, Ceph, Longhorn
Healthtech • Professional Services • Software
Lead design, architecture, and reliability of scalable systems; own incident response, monitoring, and CI/CD automation. Mentor engineers, drive tooling and AI adoption, and collaborate across teams to meet business needs and maintain high system availability.
Top Skills:
Argo CD, Azure DevOps Pipelines, CI/CD, ELK Stack, GCP, Grafana, Istio, Kubernetes, New Relic, OpenTelemetry, Terraform
Consumer Web • Information Technology • Mobile • Other • Software • App development
Build, maintain, and automate onX infrastructure and deployment pipelines using Terraform and GCP. Improve performance, availability, and cost; extend Terraform codebase; integrate observability tooling; drive incident response and participate in on-call rotation for core infrastructure.
Top Skills:
Airflow, BigQuery, Bigtable, Checkly, Claude Code, Cloud Run, Cloud SQL, CockroachDB, GIS Mapping, GKE, Google Cloud Monitoring, Google Cloud Platform, Google Cloud Storage, Google Composer, IAM, Infrastructure-as-Code, Kubernetes, OpenTelemetry, OpenTofu, Prometheus, Pub/Sub, Rootly, Terraform
Cloud • Software • Database
Lead design, build, and operate the YugabyteDB DBaaS infrastructure. Drive architecture, automate lifecycle and maintenance, manage incidents and on-call rotations, implement security/encryption processes, and optimize reliability using SRE principles and observability.
Top Skills:
Kubernetes, GKE, EKS, AKS, Java, Bash, Shell, Python, Terraform, Ansible, Docker, Prometheus, Git, GitHub Actions, Linux, PostgreSQL, AWS, GCP, Azure
Insurance
Lead reliability strategy and architecture for critical systems, drive incident management and root-cause analysis, build automation and SRE tooling, influence release/change practices and compliance, and mentor junior engineers to improve operational reliability.
Top Skills:
Angular, AWS, CI/CD, CloudFormation, Containerization, Java, JavaScript, Netty, Next.js, Node.js, Non-Relational Databases, Observability (Metrics, Logs, Tracing), Orchestration, ORM, React, Relational Databases, ServiceNow, Spring, Spring Boot, Tomcat
Financial Services
The Site Reliability Engineer will manage AWS infrastructure, implement CI/CD, ensure application resilience, and collaborate with teams to improve development practices.
Top Skills:
AWS, Azure, CI/CD, Django, Docker, EC2, ECS, Elasticsearch, GitLab, Kafka, MySQL, NumPy, Pandas, PL/SQL, Python, Redis, SQL Server, Tableau
Automotive • Hardware • Logistics
The Site Reliability Engineer III enhances system reliability by building automation and supporting large-scale systems, ensuring critical platforms function optimally.
Top Skills:
APIs, Azure DevOps, Dynatrace, Google Cloud Platform, Grafana, HTTP, Java, Kubernetes, Microservices, Prometheus, Terraform
Healthtech • Payments • Software
The Site Reliability Engineer will enhance system reliability and observability, manage incident responses, and collaborate with teams to improve performance in data-intensive environments.
Top Skills:
Apache Airflow, AWS, Azure, CloudFormation, GCP, Grafana, Kafka, Kubernetes, PowerShell, Prometheus, Python, Spark, Splunk, Terraform
Fintech
Lead reliability efforts by defining SLOs, building recoverability and observability, automating infrastructure-as-code, operating and improving large-scale cloud services, mentoring engineers, and participating in incident management and root cause analysis.
Top Skills:
Amazon RDS, Ansible, AWS, Chef, Docker, Elasticsearch, Excel, GCP, GCP Cloud SQL, Go, Java, Kotlin, Kubernetes, Lucene/Solr, MS SQL Server, MySQL, Outlook, Postgres, Puppet, Python, Redis, Terraform, Word
Aerospace • Artificial Intelligence • Logistics • Machine Learning • Software • Transportation • Defense
Lead efforts to deliver the Flyways AI Platform, deploying and maintaining secure cloud services, coding software solutions, and collaborating with teams.
Top Skills:
AWS, Docker, Grafana, Helm, K8s, Postgres, Python, Terraform
Artificial Intelligence
Lead reliability, scalability, security, and automation efforts for business-critical services. Build infrastructure-as-code, implement compliance (FedRAMP/IL5), plan roadmaps, optimize cost, and collaborate with security and architects.
Top Skills:
AI Tools, Ansible, AWS, Azure, CMMC, DoD Impact Level 5, FedRAMP, GCP, Go, IL5, Java, NIST 800-53, Pulumi, Python, Ruby, Terraform
Healthtech • Software
As a Senior DevOps Engineer, you will build and maintain scalable infrastructure, manage application monitoring systems, and provide operational support to engineering teams. You will also design and implement CI/CD pipelines and troubleshoot complex issues across environments.
Top Skills:
Ansible, AWS, Azure, Bash, Chef, Docker, GCP, GitHub Actions, Jenkins, Postgres, Puppet, Python, Terraform
AdTech • Marketing Tech
Design, build, and maintain scalable, resilient infrastructure and tooling across cloud and on-prem data centers. Improve observability, define SLOs/SLIs, troubleshoot full-stack issues, and provide on-call support for shared services in a 24x7 production environment.
Top Skills:
AWS, CI/CD, Containerization, Elasticsearch, Go, Google Compute Platform, InfluxDB, Java, Kafka, Kubernetes, Linux, Memcached, MySQL, Python, Redis, Terraform