Get the job you really want.
Maximum of 25 job preferences reached.
Top Site Reliability Engineer Jobs
Marketing Tech • Mobile • Software
As a Senior Site Reliability Engineer, you will ensure the reliability of internal services, improving automation and infrastructure, and collaborating with engineering teams to resolve issues and enhance product performance.
Top Skills:
DockerGoKafkaKubernetesMongoDBPostgresRedisRubyTerraform
Marketing Tech • Mobile • Software
As a Senior Site Reliability Engineer at Braze, you'll ensure uptime for internal services, improve automation, and develop infrastructure tools, collaborating across teams to enhance reliability and scalability.
Top Skills:
ChefDockerKafkaKubernetesMongoDBRedisRuby On RailsTerraform
Cloud • Software
In this role, you'll support large-scale applications, improve observability, mentor team members, and ensure reliability by collaborating on deployments and writing automation scripts while providing 24/7 support.
Top Skills:
AnsibleAWSBashConfluenceDockerElk StackGCPGitlab CicdGrafanaJenkinsJIRAKubernetesLinuxMongoDBMySQLNagiosOciPerlPostgresPrometheusPuppetPythonTerraform
Cloud • Information Technology
Operate and maintain high-performance, mission-critical visualization services; implement observability and instrumentation; optimize hardware usage; lead incident diagnosis and root-cause analysis; and develop deployment automation for on-prem and production integrations.
Top Skills:
Ci/CdDockerGrafanaHelmKubernetesNatsOpenshiftPrometheusPythonRhelTempoUnreal Engine
Energy
The Site Reliability Engineer will design and implement systems, drive automation, coordinate between teams, support deployed systems, and ensure scalability for rapid growth.
Top Skills:
Active DirectoryAnsibleAWSAzureChefJSONLinuxPuppetPythonRestVMwareWindows ServerYaml
Software
Lead SRE to define SRE strategy, architecture, and roadmap; design and operate containerized, compliant cloud environments; build observability, incident management, automation, and developer platform capabilities; mentor SRE team and collaborate with security, compliance, and product teams to ensure reliability at scale.
Top Skills:
AWSAws MarketplaceAzureAzure MarketplaceGCPGoogle Cloud MarketplaceGrafanaKubernetesPrometheusTerraform
Reposted 4 Days AgoSaved
Easy Apply
Easy Apply
Analytics
Operate and maintain high-performance visualization and mission systems; implement observability and metrics; optimize hardware usage; lead problem diagnosis and root-cause analysis; develop deployment automation and integrate visualization software with development and production infrastructures.
Top Skills:
Ci/CdDockerGrafanaHelmKubernetesNatsOpenshiftPrometheusPythonRhelTempoUnreal Engine
Artificial Intelligence • Software
As a Site Reliability Engineer at Mercor, you will ensure production reliability, develop SRE function, and collaborate with engineering teams to maintain system performance.
Top Skills:
AWSKubernetesSpaceliftTerraform
Computer Vision • Information Technology • Machine Learning • Natural Language Processing • Real Estate • Software
The SRE will maintain infrastructure for SaaS products on AWS, support developers, manage platform components, and handle IT tasks.
Top Skills:
AWSComputer VisionIacLarge Language ModelsNlpTerraform
Healthtech
Build and harden AWS cloud environments and CI/CD pipelines, manage IaC and container platforms, own observability and incident response, enforce security and HA/DR, and automate operational tasks to support a regulated medical-imaging platform.
Top Skills:
Apache AirflowAWSAws Cloudwatch InsightsBashCdkCloudFormationDicomDirect ConnectDockerEcsEksGitGrafanaHl7IamKmsKubernetesPrivatelinkPrometheusPythonSbomTerraformVpcVpn
Fintech • Payments • Financial Services
Build, operate, and scale AWS-based infrastructure using IaC (Terraform), manage EKS and serverless environments, create CI/CD pipelines, implement observability (OpenTelemetry/Prometheus/New Relic), support Postgres/RDS (Aurora), lead incident response and define SRE practices (SLIs/SLOs/error budgets).
Top Skills:
AuroraAWSAws RdsAzureCloudFormationEcsEksGithub ActionsGitlabGoGCPJavaKubernetesNew RelicOpentelemetryOpentofuPostgresPrometheusPythonRubyServerlessTerraformTerragrunt
Consumer Web • eCommerce • Fashion • Retail
Seeking a Staff Software Engineer for the SRE team to enhance CI/CD systems, optimize infrastructure, and improve developer productivity. Responsibilities include architecting solutions, mentoring engineers, and driving technical initiatives to elevate operational excellence.
Top Skills:
AnsibleBashGithub ActionsGoHelmJenkinsKubernetesPythonRubySpinnakerTerraform
New
Cut your apply time in half.
Use ourAI Assistantto automatically fill your job applications.
Use For Free
Software • Cybersecurity
The Engineering Intern will support backend, platform, or SRE tasks, learning to design reliable cloud infrastructure and automate processes using scripting languages. Responsibilities include monitoring and improving system reliability, assisting in incident management, and collaborating with engineering teams.
Top Skills:
AWSAzureDockerGCPGoKubernetesPython
Big Data • Real Estate • Software
Seeking a Senior Site Reliability Engineer to enhance platform reliability, observability, and operational excellence using AWS, Kubernetes, and various monitoring tools in a collaborative environment.
Top Skills:
Argo CdAWSCircleCICloudFormationDatadogEksFargateGoGrafanaJavaJenkinsKubernetesNewrelicPrometheusPythonSplunkTerraform
Aerospace • Artificial Intelligence • Hardware • Robotics • Security • Software • Defense
As a Senior Site Reliability Engineer, you'll deploy and maintain Anduril's system hardware and software, ensuring mission-critical capabilities and troubleshooting complex issues while working collaboratively with engineering teams and customers.
Top Skills:
C++GoPythonRust
Cloud • Information Technology • Security • Software
As a DevOps Architect, you'll lead automation efforts for SaaS services, mentor junior team members, and set strategic directions for CI/CD and monitoring solutions.
Top Skills:
AIAWSAzureFluxGCPGoGrafanaJavaJenkinsKubernetesOciProgramming Languages: C/C++PrometheusPython
On-Demand • Security • Software
The Site Reliability Engineer will maintain server and network health, manage hardware lifecycle, assist with network cabling, coordinate with vendors, and collaborate with IT and Engineering teams.
Top Skills:
Asset Tracking SoftwareMonitoring ToolsNetwork InfrastructureServer Infrastructure
Artificial Intelligence • Information Technology • Machine Learning • Software • Cybersecurity • Generative AI • Data Privacy
Lead global SRE and infrastructure teams to ensure reliability, scalability, and cost-efficiency of production and developer platforms. Define cloud and Kubernetes architecture, IaC, CI/CD, SLOs/SLIs, incident management, and cloud cost optimization while partnering with Security, Product, Finance, and Engineering.
Top Skills:
AIAutomationAWSCi/CdCloud-Native SystemsGCPInfrastructure As CodeKubernetesTerraform
Healthtech • Financial Services
As a Site Reliability Engineer I, you will support critical web applications, solve technical puzzles, collaborate across teams, and manage escalations while ensuring high service standards.
Top Skills:
AWSAzureC#JavaPostgresPythonSQL
Insurance • Cybersecurity
Lead AI enablement across engineering by designing and developing tools for AI-assisted development, driving tooling adoption, and ensuring infrastructure reliability in production environments.
Top Skills:
Ai-Assisted Development ToolsAWSDatadogEcsGithub ActionsGoKubernetesPythonTerraform
Information Technology
Design, develop, and secure container platforms and CI/CD pipelines; automate builds, testing, and deployments; troubleshoot pipeline and cloud issues; recommend container adoption strategies; support cloud-native SDLC and APM monitoring; collaborate to solve complex platform problems.
Top Skills:
AnsibleAWSAzureBash (Linux Shell Script)Ci/CdContainersDatadogDockerDynatraceGithub ActionsGitlabJenkinsKubernetesNew RelicOpenidPythonSAMLSingle-Sign-OnTerraform
Information Technology
As a Site Reliability Engineer, you will enhance system resilience, manage cloud infrastructures, automate tasks, and document technical procedures for the Intelligence Community.
Top Skills:
AWSC#CiscoCitrixDevOpsJavaJavaScriptJenkinsLinuxOracle CloudPythonVMwareWindows
Fintech • Financial Services
The SRE will enhance software and systems reliability, automate operations, and improve customer experiences while managing scalability and performance.
Top Skills:
.NetAIAmazon S3AngularAnsibleApp DynamicsAWSDockerHazelcastJavaKafkaKubernetesLinuxAzureOraclePythonSplunkSQLSybaseTerraform
Computer Vision • Machine Learning • Software
As a Site Reliability Engineer, ensure the reliability, performance, and scalability of Ditto's cloud infrastructure by developing observability solutions, leading incident management, and collaborating with product engineering teams.
Top Skills:
AWSAzureCDatadogGCPGoGrafanaHelmJavaKubernetesPrometheusRustTerraform
Information Technology
The Lead Site Reliability Engineer will ensure platform reliability and performance, guiding SRE principles, managing incidents, and fostering collaboration across teams while leveraging cloud technologies and automation.
Top Skills:
AWSAzureAzure DevopsBashBicepCloudFormationGithub ActionsGoJenkinsPowershellPythonTerraform
Top Companies Hiring Site Reliability Engineers
See AllPopular Job Searches
All Software Engineer Jobs
.NET Developer Jobs
Aerospace Thermal Engineering Jobs
AI Engineer Jobs
Android Developer Jobs
Automation Engineer Jobs
Backend Developer Jobs
Blockchain Developer Jobs
C# Jobs
C++ Jobs
Cloud Architect Jobs
Cloud Engineer Jobs
Design Engineer Jobs
DevOps Engineer Jobs
Director Of Engineering Jobs
Electrical Engineering Jobs
Embedded Software Engineer Jobs
Engineering Jobs
Engineering Manager Jobs
Environmental Engineering Jobs
Field Engineer Jobs
Front End Developer Jobs
Full Stack Developer Jobs
Game Developer Jobs
Golang Jobs
Hardware Engineer Jobs
Industrial Engineering Jobs
iOS Developer Jobs
Java Developer Jobs
Javascript Developer Jobs
Linux Jobs
Manufacturing Engineer Jobs
Mechanical Engineering Jobs
Network Engineer Jobs
PHP Developer Jobs
Process Engineer Jobs
Project Engineer Jobs
Prompt Engineering Jobs
Python Jobs
QA Jobs
Robotics Engineer Jobs
Ruby on Rails Jobs
Salesforce Administrator Jobs
Salesforce Developer Jobs
Scala Jobs
Sharepoint Developer Jobs
Site Reliability Engineer Jobs
Software Engineering Manager Jobs
Solutions Architect Jobs
SQL Developer Jobs
Structural Engineer Jobs
System Engineer Jobs
Test Engineer Jobs
Web Developer Jobs
All Filters
Total selected ()
No Results
No Results
.jpg)































