Site Reliability Engineer

Posted Yesterday
Hiring Remotely in USA
Remote
130K-160K Annually
Senior level
Other
The Role
Design, build, and maintain highly available cloud-native systems. Improve reliability through automation, CI/CD, Kubernetes, observability, and incident management. Collaborate with developers, security, and product teams to define SLOs, implement self-healing, debug production issues, and ensure secure deployments.
Summary Generated by Built In
We do Consulting Differently

Second Sight Solutions, a subsidiary of Berkeley Research Group (BRG), is a health technology company, and our innovative technology reimagines how drug discount data is exchanged, establishing new connections and improving transparency for drug manufacturers and their customers. Our customers and partners trust us to deliver reliable, first-to-market solutions and safeguard the data we receive. We trust our employees, and our culture gives them the freedom to create, collaborate, and grow. Our leaders are industry experts, creative, unafraid to challenge the status quo, and the pioneers of market-changing solutions.

We are seeking a Site Reliability Engineer to design, build, and maintain highly available systems and infrastructure.  The SRE will work closely with software developers and operations teams to improve system reliability, automate processes, and minimize downtime.

Responsibilities

  • Design, implement, and maintain scalable and reliable systems in cloud environments such as Azure Cloud Services.

  • Experience with CI/CD Platforms (GitHub Actions, GitLab CI)

  • Provide operational support for full-stack software applications.

  • Increase system resilience with expert-level coding, bulletproof release, and change management skills.

  • Develop service-level indicators and objectives to automate release validation. 

  • Improve automation and increase the system’s self-healing capability.

  • Collect operating system data and report performance metrics to stakeholders.

  • Ensure security best practices are followed in cloud infrastructure and application deployments.

  • Manage cloud and database system maintenance, debugging production issues as they arise.

  • Improve reliability, quality, and time-to-market of our suite of software solutions.

  • Partner with security and product teams to define and publish policies, processes, and playbooks to facilitate rapid and effective handling of alerts and incidents.

  • Lead incident management processes; respond to outages and service disruptions promptly.

Qualifications:

  • Bachelor’s degree in computer science or similar field.

  • Five years’ experience as a site reliability engineer or similar role.

  • Strong programming skills (Golang, Ruby, Python, or similar)

  • Proven ability to diagnose and monitor performance and reliability issues across the stack.

  • Expertise in Kubernetes.

  • Relevant industry certifications, such as through the Site Reliability Engineering (SRE) Foundation.

  • Proven experience working with cloud-native infrastructure (Azure Cloud Services, AWS, or GCP).

  • Experience working with observability and incident management tools (Datadog, OpsGenie, PagerDuty).

  • Experience scripting operating system tasks with Infrastructure as Code.

  • Impeccable communication skills.

  • Ability to problem-solve in a fast-paced, high-stakes environment.

Candidate must be able to submit verification of his/her legal right to work in the United States, without company sponsorship.

Salary: $130,000 - $160,000
 

About BRG

BRG combines world-leading academic credentials with world-tested business expertise and purpose-built emerging technologies. Our culture centers on agility and connectivity which sets us apart and gets you ahead.  


At BRG, our professionals include specialist consultants, industry experts, renowned academics, and leading-edge data scientists. Together, they bring a diversity of real-world experience, data, and human and artificial intelligence, to economics, disputes, and investigations; corporate finance; and performance improvement services that address the most complex challenges facing organizations across the globe.


Our unique structure nurtures the interdisciplinary relationships that give us the edge, laying the groundwork for more informed insights and more original, incisive thinking.  When paired with our global reach and resources, our diverse perspectives and technical capabilities make us uniquely capable to address our clients’ challenges. We get results because we know how to apply our thinking to your world.


At BRG, we don’t just show you what’s possible. We’re built to help you make it happen. 

BRG is proud to be an Equal Opportunity Employer. Our hiring practices provide equal opportunity for employment without regard to race, religion, color, sex, gender, national origin, age, United States military veteran status, ancestry, sexual orientation, marital status, family structure, medical condition including genetic characteristics or information, veteran status, or mental or physical disability so long as the essential functions of the job can be performed with or without reasonable accommodation, or any other protected category under federal, state, or local law.

Skills Required

  • Bachelor's degree in computer science or similar field.
  • Five years' experience as a site reliability engineer or similar role.
  • Strong programming skills (Golang, Ruby, Python, or similar).
  • Expertise in Kubernetes.
  • Experience with CI/CD platforms (GitHub Actions, GitLab CI).
  • Proven experience working with cloud-native infrastructure (Azure Cloud Services, AWS, or GCP).
  • Experience working with observability and incident management tools (Datadog, OpsGenie, PagerDuty).
  • Experience scripting operating system tasks with Infrastructure as Code.
  • Relevant industry certifications (Site Reliability Engineering Foundation).
  • Impeccable communication skills.
  • Ability to problem-solve in a fast-paced, high-stakes environment.
  • Ability to verify legal right to work in the United States without company sponsorship.
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Emeryville, CA
1,629 Employees
Year Founded: 2010

What We Do

Berkeley Research Group, LLC (BRG) is a global consulting firm that helps leading organizations advance in three key areas: disputes and investigations, corporate finance, and performance improvement and advisory. Headquartered in California with offices around the world, we are an integrated group of experts, industry leaders, academics, data scientists, and professionals working beyond borders and disciplines. We harness our collective expertise to deliver the inspired insights and practical strategies our clients need to stay ahead of what's next. We have in-depth experience across a wide range of industries and markets, from construction and energy to technology and healthcare. No matter what sector your business is in, we have experienced professionals who understand the challenges you face—making us better equipped to help solve them.

Similar Jobs

NBCUniversal Logo NBCUniversal

Site Reliability Engineer

AdTech • Cloud • Digital Media • Information Technology • News + Entertainment • App development
Remote or Hybrid
Centennial, CO, USA
68000 Employees
110K-145K Annually

Optum Logo Optum

Site Reliability Engineer

Artificial Intelligence • Big Data • Healthtech • Information Technology • Machine Learning • Software • Analytics
In-Office or Remote
Eden Prairie, MN, USA
160000 Employees
73K-130K Annually

Optum Logo Optum

Site Reliability Engineer

Artificial Intelligence • Big Data • Healthtech • Information Technology • Machine Learning • Software • Analytics
In-Office or Remote
Minnetonka, MN, USA
160000 Employees

MongoDB Logo MongoDB

Site Reliability Engineer

Big Data • Cloud • Software • Database
Easy Apply
Remote or Hybrid
7 Locations
5550 Employees
127K-249K Annually

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account