Top Site Reliability Engineer Jobs
Fintech • Information Technology • Payments
The Staff Site Reliability Engineer will be responsible for DevOps on payment systems, production development, debugging code, data pipeline monitoring, and ensuring data integrity while guiding junior team members.
Top Skills:
Amazon Redshift, AWS, Azure, Cassandra, GCP, Google BigQuery, Hadoop, Java, Kafka, MongoDB, MySQL, Postgres, Python, Scala, Snowflake, Spark, SQL
Artificial Intelligence • Information Technology • Software
The Senior Site Reliability Engineer at BentoML will manage infrastructure for AI services, focusing on Kubernetes, Terraform, GPU clusters, and observability tools, while mentoring and driving SRE best practices.
Top Skills:
AMD GPU, AWS, Azure, CI/CD, GitOps, GCP, Grafana, Kubernetes, NVIDIA GPU, Oracle Cloud, Prometheus, Pulumi, Terraform
Financial Services
The Staff Engineer will support and optimize messaging platforms, design solutions to improve operational efficiency, and collaborate with teams on business-focused solutions.
Top Skills:
AMPS, AWS, EKS, FIX, Java, Kafka, Kubernetes, Linux, MQ, Spring, SQL
Software • Database
The Senior Site Reliability Engineer will manage AWS infrastructures, improve CI/CD pipelines, and assist teams with scaling solutions. Responsibilities include overseeing logging, monitoring, and high-quality software development with strong security and reliability considerations.
Top Skills:
Ansible, AWS, Chef, CloudFormation, Datadog, Docker, Dynamo, Elasticsearch, GitHub Actions, MySQL, OpenSearch, Postgres, Puppet, Python, Redis, S3, Terraform
Artificial Intelligence • Legal Tech • Professional Services • Software
As a Staff Software Engineer in Site Reliability, you'll manage infrastructure for reliability and scalability, lead incident management, and automate operational tasks.
Top Skills:
AWS, Azure, Bash, CloudFormation, Datadog, GCP, Go, incident.io, PagerDuty, Pulumi, Python, Sentry, Terraform
Artificial Intelligence • Legal Tech • Professional Services • Software
As a Software Engineer in Site Reliability, you will ensure the reliability and performance of our AI platform through automation and strategic infrastructure management.
Top Skills:
AWS, Azure, Bash, CloudFormation, Datadog, GCP, Go, Kubernetes, PagerDuty, Python, Sentry, Terraform
Artificial Intelligence • Cloud • Information Technology • Mobile • Software • Consulting
The role involves designing and implementing observability solutions using OpenTelemetry, managing platform engineering tasks, and ensuring site reliability through various engineering practices.
Top Skills:
AWS, Azure, CI/CD, CloudFormation, Docker, GCP, Go, Java, Kubernetes, Node.js, OpenTelemetry, Pulumi, Python, Rust, Terraform
Software • Cybersecurity
As a Staff SRE/DevOps Engineer, you'll lead cloud rearchitecture initiatives, drive modernization focusing on availability, and mentor teams on DevOps practices.
Top Skills:
Argo CD, AWS, Bash, CloudFormation, Docker, ELK, GCP, GitHub Actions, Go, Grafana, Kubernetes, Prometheus, Python, Terraform
Energy
The Site Reliability Engineer will design and implement systems, drive automation, coordinate between teams, support deployed systems, and ensure scalability for rapid growth.
Top Skills:
Active Directory, Ansible, AWS, Azure, Chef, JSON, Linux, Puppet, Python, REST, VMware, Windows Server, YAML
News + Entertainment
Design and maintain scalable infrastructure, collaborate with teams for reliability, handle incident response, and promote reliability culture.
Top Skills:
AWS, Azure, GCP, Go, Java, Kubernetes, Python, Terraform
Computer Vision • Information Technology • Software
The Site Reliability Engineer will enhance system stability, optimize performance, automate deployments, and monitor production systems in both on-premise and cloud environments.
Top Skills:
Ansible, Azure, Azure Application Insights, Bash, Bicep, ELK Stack, Grafana, PowerShell, Python, Terraform
Blockchain • Fintech • Financial Services • Cryptocurrency
The VP, Site Reliability Engineer will architect and maintain AWS infrastructure, optimize container workloads, and drive automation and reliability initiatives. Responsibilities include migration from VMs to containers, incident response, and cross-team collaboration.
Top Skills:
AWS, Datadog, EKS, Kubernetes, OpenTelemetry, Terraform
Fintech • Information Technology • Payments
As a Staff Site Reliability Engineer, you'll maintain and support Hadoop, Kafka, and Cloud platforms, ensuring their performance and reliability while driving innovation globally. You'll manage clusters, develop monitoring tools, collaborate on solutions, analyze production incidents, and create procedural documentation.
Top Skills:
Ansible, AWS, Azure, GCP, Grafana, Hadoop, Java, Kafka, Linux, Python, Spark, Splunk
Real Estate • PropTech
The role involves enhancing Redfin's reliability through better tools and processes, guiding teams in effective production system operations, and leading educational efforts in reliability engineering.
Top Skills:
AWS, C++, Datadog, Java, Kubernetes, Python, Terraform
Artificial Intelligence • Computer Vision • Hardware • Robotics • Metaverse
The Network Site Reliability Engineer will ensure high availability of network infrastructure, work on incident management, implement automation, and drive improvements for operational excellence.
Top Skills:
Alert Manager, Ansible, BGP, BigPanda, Data Center Network Technologies, Firewalls, Go, Grafana, IPv4, IPv6, IS-IS, ITIL, JIRA, L2 Switching, Linux, Load Balancers, Nautobot, NetBox, Prometheus, Python, Salt, ServiceNow, TCP, UDP, VPN, Wireless
Cloud
The Software Engineer will enhance, optimize, and validate the MinIO cloud-native storage platform while collaborating with customers and the engineering team.
Top Skills:
C, C++, Containers, Go, Kubernetes, Microservices, Rust
eCommerce • Fintech • Information Technology • Payments • Software
The Site Reliability Engineer Specialist will enhance software reliability, manage infrastructure processes, and mentor junior engineers while ensuring system performance and compliance.
Top Skills:
Akamai Global Traffic Management, Amazon AWS, Harness, Jenkins, Azure, REST, Site Reliability Engineering, XML
Fitness
The Site Reliability Engineer will ensure system reliability and performance, design scalable architectures, improve CI/CD pipelines, maintain infrastructures, and lead incident response efforts.
Top Skills:
Argo CD, AWS, Datadog, Docker, GitHub Actions, Go, JavaScript, Kubernetes, Prometheus, Python, Terraform
Artificial Intelligence
As an Applied AI Engineer, you will onboard customers, deploy AI solutions, work on complex projects, and provide technical guidance. You'll contribute to open-source projects and communicate effectively with stakeholders.
Top Skills:
Ansible, AWS, Azure, Docker, GCP, Kubernetes, Python, Terraform
Artificial Intelligence • Computer Vision • Hardware • Robotics • Metaverse
As a Site Reliability Engineer, you'll troubleshoot HPC environments, enhance automation, ensure system reliability, and collaborate to improve chip development processes.
Top Skills:
CentOS, RHEL, Docker, Python, Bash, Ansible
Digital Media • Social Media • Software • Sports
Lead the technical architecture and execution of migration to AWS, drive developer enablement, and automate infrastructure using code-first principles.
Top Skills:
AWS EKS, Datadog, GitHub Actions, Go, Istio, k6, Kubernetes, Node.js, Terraform
Fintech
As a Staff Site Reliability Engineer, you will shape reliability practices, optimize AWS infrastructure, lead incident response, and mentor engineers.
Top Skills:
AWS, Datadog, GitOps, Terraform
Financial Services
As a Site Reliability Engineer, you'll optimize and manage cloud infrastructure, implement automation, and maintain system reliability for a global financial platform.
Top Skills:
AWS, GCP, Go, Helm, Kubernetes, Linux, Python, Terraform
Healthtech • Software
Monitor application health, respond to incidents, implement Infrastructure as Code, and collaborate with teams to maintain service reliability and performance.
Top Skills:
AWS, Docker, Ember, Git, MySQL, NestJS, Node.js, React
Fintech • Software • Financial Services
As a Site Reliability Engineer at Luma, you'll manage AWS infrastructure, Kubernetes clusters, and CI/CD pipelines, ensuring platform reliability and security. You'll also automate processes and lead incident response efforts.
Top Skills:
AWS, Bash, CI/CD, Go, Java, Kubernetes, Python, Terraform