Staff Site Reliability Engineer

Reposted 8 Days Ago
Be an Early Applicant
4 Locations
In-Office
Expert/Leader
Logistics • Transportation
The Role
The Staff Site Reliability Engineer will architect and optimize cloud and infrastructure solutions, implement CI/CD pipelines, and lead engineering projects while collaborating across teams to enhance system architecture.
Summary Generated by Built In

Schedule: FT
Job Type: Hybrid
Salary Type: Salary
Req #993

About the Role

The ideal candidate for this role will possess deep expertise in architecting and optimizing solutions across both infrastructure and platform layers, including OpenStack, Kubernetes, and public cloud environments. They will be responsible for designing scalable, high-performance systems, implementing CI/CD pipelines, and developing automation and orchestration tools. Strong communication skills and the ability to collaborate across teams are essential. 

  • Design, validate, and deploy advanced technical solutions that drive the evolution of Uber Freight’s network and systems. 
  • Focus on modernizing infrastructure, cloud networking, and DevOps integration. 
  • Implement Infrastructure-as-Code, Kubernetes, and Zero Trust architecture. 
  • Apply Site Reliability Engineering to create resilient and scalable platforms. 
  • Collaborate with cross-functional teams to optimize performance and enhance system architecture. 
  • Enable and evolve Uber Freight’s foundational cloud infrastructure layer, including observability, connectivity, resilience, and availability. 

What the Candidate Will Do

The ideal candidate will have a deep understanding of system parameters and configurations, and will proactively identify system weaknesses and find solutions for improvement. They will lead engineering projects, create complex network designs, and present technical strategies to executive teams. Innovation, technical writing, and leadership are key aspects of this role. 

  • Lead cloud-based solution design, deployment, and management using IaC tools like Terraform, CloudFormation, or Ansible. 
  • Develop and implement cloud architecture best practices for performance, cost, and security. 
  • Collaborate with software teams to integrate cloud services and support CI/CD pipelines. 
  • Plan, coordinate, and execute Windows and Linux infrastructure and automation projects, including scoping, budgeting, scheduling, resourcing, risk management, and stakeholder communication. 
  • Conduct audits using tools like Splunk, Prometheus, and Grafana. 
  • Mentor engineers and lead architectural discussions. 
  • Design and implement automation solutions using PowerShell, Ansible, Terraform, Jenkins, Python, or other tools to streamline and optimize operational processes such as provisioning, configuration, patching, backup, recovery, monitoring, and auditing. 
  • Evaluate and adopt new cloud technologies and tools. 
  • Lead incident response and troubleshooting for cloud infrastructure. 
  • Communicate technical solutions clearly to stakeholders. 
  • Own vision, roadmap, and execution for cloud infrastructure. 
  • Establish and enforce Windows and Linux infrastructure automation standards, guidelines, and procedures, and conduct regular reviews and audits to ensure compliance. 
  • Deploy, manage, and maintain virtualization platforms (VMware and Proxmox). 
  • Improve operability and usability of Uber Freight’s cloud systems. 
  • Build and deliver training for engineering teams. 
  • Integrate Google Cloud services with core systems. 
  • Scale multi-region systems with focus on reliability and cost-efficiency. 
  • Collaborate with developers to improve deployment safety. 
  • Work within Agile teams to design, develop, test, and support full-stack solutions. 
  • Encourage innovation, inclusion, and continuous learning. 
  • Architect reference applications for public cloud patterns. 
  • Enable fast, compliant infrastructure provisioning and decommissioning. 
  • Create ephemeral environments for testing and demos. 
  • Implement cost management solutions for engineering teams. 
     

Basic Qualifications

The ideal candidate will have proven experience in cloud infrastructure design and management, with strong proficiency in cloud platforms such as AWS, Azure, or Google Cloud Platform. They will possess excellent problem-solving skills, strong communication abilities, and the ability to work effectively in a fast-paced, collaborative environment. Expertise in containerization technologies and orchestration tools is essential. 

  • Bilingual fluency in Spanish and English. 
  • 9+ years of experience in cloud infrastructure roles. 
  • Proficiency in AWS, Azure, or GCP. 
  • Experience with Ansible, Terraform, CloudFormation. 
  • Strong CI/CD knowledge and debugging skills. 
  • Expertise in Docker, Kubernetes, and orchestration. 
  • Familiarity with ELK stack, Prometheus, Grafana. 
  • Strong scripting in Python, Go, Bash. 
  • Excellent communication and documentation skills. 
  • Experience mentoring and collaborating across teams. 
  • Agile/Scrum experience. 
  • Experience with GitHub Actions, Terraform Enterprise, Sentinel, and OPA. 
  • Strong analytical, strategic, and conceptual thinking. 
  • Ability to influence and inspire others. 
  • Focused, driven, and results-oriented. 

Preferred Qualifications

The ideal candidate will possess expert-level industry certifications in cloud environments and have practical experience in cloud development, management, and operations. Strong written and verbal communication skills are essential, along with a proven track record of delivering scalable, reliable, and secure cloud solutions. Experience at top-tier SaaS/cloud infra companies is preferred. 

  • Expert certifications in AWS, Azure, or GCP. 
  • Bachelor’s degree in Computer Science or related field. 
  • Experience with Atlantis, Spacelift, Terraform Cloud. 
  • Experience with distributed systems in public cloud. 
  • Technical documentation and codelab creation. 
  • Prior experience at FAANG or top-tier SaaS/cloud infra companies. 
  • Passion for automation, learning, and high-performance teams. 
  • 2 years Linux systems administration. 
  • 2 years Windows systems administration. 
  • 3 years hands-on experience managing Kubernetes clusters and other virtualization systems (VMware, Proxmox, Xen, or Hyper-V). 
  • Knowledge of basic Windows technologies (AD, DNS, etc.). 
  • Experience incorporating PCI, SOX, SOC 2 controls related to OS hardening. 
  • Experience working on Data Center Migration Projects. 
  • Knowledge of basic Network technology as it relates to virtualization. 
  • Knowledge of basic Storage technology as it relates to virtualization. 
  • Experience in ITIL-based service management and agile methodologies. 
  • Experience with development team management software including JIRA, GitHub, Confluence. 
  • Strong problem-solving, troubleshooting, and analytical skills, and attention to detail and quality. 

About Uber Freight

Uber Freight is a market-leading enterprise technology company powering intelligent logistics. With a suite of end-to-end logistics applications, managed services and an expansive carrier network, Uber Freight advances supply chains and moves the world’s goods. Today, the company manages over $20 billion of freight and one of the largest networks of carriers. It is backed by best-in-class investors and provides services for 1 in 3 Fortune 500 companies, including Del Monte Foods, Nestle, Anheuser-Busch InBev, and more. For more, visit www.uberfreight.com.

Candidate Privacy Notice

EEOC

Uber Freight is proud to be an Equal Opportunity/Affirmative Action employer. All qualified applicants will receive consideration for employment without regards to sex, gender identity, sexual orientation, race, color, religion, national origin, disability, protected Veteran status, age, or any other characteristic protected by law. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements.

Top Skills

Ansible
AWS
Azure
Bash
Ci/Cd
CloudFormation
Docker
Elk Stack
Go
Google Cloud Platform
Grafana
Kubernetes
Openstack
Prometheus
Proxmox
Python
Terraform
VMware
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
Chicago, , Illinois
5,622 Employees

What We Do

Powering Intelligent Logistics

Similar Jobs

EchoStar Logo EchoStar

Site Reliability Engineer

Aerospace • Cloud • Digital Media • Information Technology • Mobile • News + Entertainment • Retail
In-Office
Plano, TX, USA
110K-157K Annually
Hybrid
Fort Worth, TX, USA
Hybrid
Fort Worth, TX, USA

Visa Inc, Logo Visa Inc,

Site Reliability Engineer

Fintech • Information Technology • Payments
In-Office
Austin, TX, USA
125K-181K

Similar Companies Hiring

Air Space Intelligence Thumbnail
Transportation • Software • Machine Learning • Logistics • Defense • Artificial Intelligence • Aerospace
Boston , Massachusetts
110 Employees
HERE Technologies Thumbnail
Software • Logistics • Internet of Things • Information Technology • Computer Vision • Automotive • Artificial Intelligence
Amsterdam, NL
6000 Employees
Axle Health Thumbnail
Logistics • Information Technology • Healthtech • Artificial Intelligence
Santa Monica, CA
17 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account