Senior Site Reliability Engineer (SRE)

Sorry, this job was removed at 12:08 p.m. (CST) on Wednesday, Jun 25, 2025
Be an Early Applicant
Israel
Artificial Intelligence • Healthtech • Software
Viz.ai improves access to life-saving treatments.
The Role

About Viz.ai

Viz.ai is the pioneer in the use of AI algorithms and machine learning to increase the speed of diagnosis and care across 1,800+ hospitals and health systems in the U.S. and Europe. The AI-powered Viz.ai OneTM is an intelligent care coordination solution that identifies more patients with a suspected disease, informs critical decisions at the point of care, and optimizes care pathways and helps improve outcomes. Backed by real-world clinical evidence, Viz.ai One delivers significant value to patients, providers, and pharmaceutical and medical device companies. For more information visit Viz.ai.

About the role:

We are seeking a skilled Site Reliability Engineer (SRE) to join our team and help build, maintain, and improve the reliability, scalability, and performance of our systems. As an SRE, you will be responsible for owning observability tools, driving incident management processes, and implementing automation to enhance our infrastructure. This role involves collaborating across teams to ensure a robust and efficient technology stack supporting mission-critical systems.

You will:

  • Proactively enhance system reliability, scalability, and performance through automation, monitoring, and capacity planning.
  • Develop and maintain observability systems, including distributed tracing, logging, and metrics platforms.
  • Establish and maintain organizational standards for monitoring, leveraging tools like Prometheus, Grafana, and OpenTelemetry.
  • Drive incident management, root cause analysis, and continuous improvement initiatives.
  • Partner with development teams to integrate reliability best practices into the software development lifecycle.
  • Manage infrastructure at scale in cloud services (AWS advantage) and  platforms  like Kubernetes or ECS.
  • Optimize resource utilization to reduce costs while maintaining service quality.
  • Treat AI as a core part of your workflow, using tools like ChatGPT to enhance productivity and output.

What success looks like: 

  • You will have reduced the frequency and impact of production incidents by building resilient systems and improving incident response processes.
  • You will have improved observability: Key metrics, logs, and traces are available and actionable for all critical services, empowering teams to quickly detect and resolve issues.
  • You will be actively engaged in proactive problem solving: You identify and resolve systemic issues before they impact customers, and continuously refine SLOs/SLIs to reflect evolving business needs.
  • Leadership & Mentorship: You are seen as a reliable thought leader within the organization, mentoring others and helping shape the future of our SRE practices.

We are looking for:  

  • At least 5 years of experience as a SRE.
  • Strong experience with Observability Tools: Proficiency with OpenTelemetry, Grafana, Prometheus, and ELK stack (Elasticsearch, Logstash, Kibana).
  • Experience with Cloud Platforms: In-depth knowledge of AWS services, including EC2, S3, RDS, and CloudFormation/Terraform for infrastructure-as-code.
  • Proficiency in scripting and/or development languages like Bash or Python.
  • Thorough understanding of CI/CD pipelines and automation tools.
  • Understanding of Infrastructure as Code, and strong experience with automation tools like Terraform and/or Ansible.
  • Solid troubleshooting and debugging skills.
  • A team player with a strong can-do mentality.

Why should you join us? 

  • If you are looking to make an impact, join our mission to develop life-saving products.
  • If you want to be part of an amazing team, our people are at the heart of everything we do.
  • If you are a self-starter and naturally motivated.
  • You have a passion for innovative technologies in the healthcare sector, this may be the place for you!.

Location: 

We are located in San Francisco, Tel Aviv,  This position is based in Tel Aviv.

Our office in Tel-Aviv is located in Menachem Begin 150, within walking distance of Arlozorov and Ha'Shalom train stations.

Similar Jobs

Akamai Technologies Logo Akamai Technologies

Senior Site Reliability Engineer

Cloud • Security • Software • Cybersecurity
In-Office or Remote
2 Locations
10285 Employees

NVIDIA Logo NVIDIA

Site Reliability Engineer

Artificial Intelligence • Computer Vision • Hardware • Robotics • Metaverse
In-Office
Yokneam, ISR
21960 Employees

NVIDIA Logo NVIDIA

Site Reliability Engineer

Artificial Intelligence • Computer Vision • Hardware • Robotics • Metaverse
In-Office
2 Locations
21960 Employees

Akamai Technologies Logo Akamai Technologies

Senior Site Reliability Engineer

Cloud • Security • Software • Cybersecurity
Remote or Hybrid
2 Locations
10285 Employees
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: San Francisco, CA
405 Employees
Year Founded: 2016

What We Do

Viz.ai is a leader in applied artificial intelligence in healthcare. Our mission is to fundamentally improve how healthcare is delivered globally through intelligent software that promises to reduce time to treatment and improve access to care. Our flagship product, Viz LVO, leverages advanced deep learning to communicate time-sensitive information about stroke patients straight to a specialist who can intervene and treat. In February 2018, the U.S. Food and Drug Administration (FDA) granted a De Novo clearance for Viz LVO, the first-ever computer-aided triage and notification platform. In 2020, Viz LVO became the first AI software to receive approval from CMS. We are a distributed team with offices in San Francisco, Tel Aviv, and Heerenveen. We are backed by leading Silicon Valley investors, including Kleiner Perkins, Google Ventures, Green Oaks, CRV, and Threshold Ventures.

Why Work With Us

We are a global organization where our values (Patients First, Time is Brain, Quality Squared, Kindness Wins, and I am Accountable) are demonstrated daily, and our product saves lives! At Viz.ai, we provide professional development opportunities, promote from within, and value teamwork. Join our team to make an impact in saving people's lives.

Gallery

Gallery

Similar Companies Hiring

Fairly Even Thumbnail
Software • Sales • Robotics • Other • Hospitality • Hardware
New York, NY
Bellagent Thumbnail
Artificial Intelligence • Machine Learning • Business Intelligence • Generative AI
Chicago, IL
20 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account