IMPORTANT: Please be aware, scammers may try to impersonate Zello by reaching out regarding job opportunities. We will never ask you for bank account information, checks, or other sensitive information as part of our hiring process. All correspondence will come from the zello.com email domain. If you’re unsure, please email [email protected] with questions.
About ZelloZello is a voice-first communication platform, powered by our industry-leading push-to-talk technology, to improve collaboration and productivity for desk-less workers. With over 175+ million users, we’re the #1 rated push-to-talk app in the world, delivering 9 billion (yes, with a B) messages a month.
At Zello, our company values are at the heart of what we do everyday. We’re proud to serve the frontline, we’re privileged to connect people in times of crisis across the globe, and we’re honored to support first responders.
And this is where you come in.
We’re looking for a Site Reliability Engineer to help us make our systems more observable, performant, and resilient. You’ll work closely with our platform and application teams to build the tooling, practices, and insights that keep Zello reliable as we scale.
After a successful first year, you will haveImplemented end-to-end observability tooling for application and infrastructure metrics, traces, and logs.
Delivered profiling and tracing systems that surface performance bottlenecks before they impact users.
Defined and tuned alerting to ensure only high-signal, actionable incidents reach engineers.
Helped evolve Zello’s incident response and postmortem processes, ensuring consistent learning and improvement.
Provided developers with clear visibility into application performance and release impact, driving data-informed engineering.
Build and maintain monitoring, tracing, and profiling systems that empower teams to measure and improve performance.
Partner with cross-organization teams to define SLIs, SLOs, and SLAs that reflect real user experience.
Lead efforts to optimize observability, from instrumentation standards to dashboard design.
Participate in and help coordinate our on-call rotation, incident response, and post-incident reviews.
Continuously evaluate and recommend tools or process improvements to strengthen reliability and reduce alert fatigue.
Collaborate on platform improvements that enhance system resilience and developer velocity.
BSc in Computer Science or equivalent experience.
6+ years of experience in site reliability, DevOps, or software engineering roles.
Deep understanding of monitoring, alerting, and observability platforms (e.g., Prometheus, Grafana, Loki, OpenTelemetry).
Experience implementing tracing, logging, and profiling for distributed systems.
Strong background in incident management, postmortem practices, and reliability metrics.
Familiarity with Linux, Kubernetes, Terraform, and GCP (preferred) or other major clouds.
Proficiency in a scripting or backend language (e.g., Python, Go, Bash).
Excellent problem-solving, communication, and collaboration skills.
Passionate about eliminating toil and driving continuous improvement in system health.
We hire for potential, passion for our mission, and a knack for solving difficult problems over checking every qualification box. We have competitive pay, equity with significant upside, and intentionally design our benefits to encourage healthy and well-balanced employees, flexible schedules and time off. We even offer a sabbatical after every five years of service so you’re able to pursue and enjoy what matters most to you. And of course, we wouldn’t be a technology company in Austin without a ping-pong table and free snacks in our break room. Join us!
Zello provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws.
All Zello personnel are required to comply with defined security, privacy, and compliance requirements applicable to their role along with requirements that are applicable to all Zello personnel.
Top Skills
What We Do
We started as a company that turned phones into walkie-talkies. Today, we modernize instant voice communication with our industry-leading push-to-talk technology to help mobile workers meet quickly changing, urgent, real-world challenges.
We have the highest-rated walkie-talkie app, with over 8 billion messages sent per month and 170 million users in industries such as transportation, retail, construction, hospitality, healthcare, and more.
We’re proud to serve frontline workers, we’re privileged to connect people in times of crisis across the globe, and we’re honored to support first responders. As demand for our app continues to rise, we’ve evolved from a startup to a scale-up — and we’re still growing rapidly, which is where you come in.
Why Work With Us
If you strive to work on technology with purpose, technology that actually changes how people communicate and work, then come talk to us. We like people who take pride in their work, and deliver with consistency and quality. We're collaborative, sometimes serious, sometimes not, but we're all in 110%.
Gallery
Zello Offices
Hybrid Workspace
Employees engage in a combination of remote and on-site work.
Zello is a hybrid workplace, where Austin employees typically work in the office on Tuesdays, Wednesdays, and Thursdays.


