Site Reliability Engineer

Sorry, this job was removed at 08:28 p.m. (CST) on Monday, May 19, 2025
Hiring Remotely in USA
Remote
Information Technology • Consulting
The Role

As a Site Reliability Engineer (SRE) at Circonus, you will be responsible for keeping Circonus SaaS and on-premise customers up and running as well as improving the automation, scalability, and performance of systems.  This is an unparalleled opportunity to grow on a small, collaborative, and friendly team with established leadership in the field of SRE. 

 

A successful candidate will be able to effectively communicate across multiple departments and customers, can shift gears at a moment’s notice, and enjoys the challenges of supporting enterprise clients.  This is a client facing role where presentation skills are important.  Also, a successful candidate will be working in a support rotation capacity. 

 

This position is 100% remote.  

Job Responsibilities

  •         Install, upgrade and manage systems powering customer infrastructure running Circonus software
  •         Troubleshoot availability and performance issues
  •         Diagnose production issues and perform front-line remediation
  •         Communicate with management and customers regarding aberrant system’s behavior
  •         Influence software and architecture design based on system and architecture observations related to performance and reliability
  •         Participate in an on-call schedule

Job Requirements

  •         Linux (RHEL, CentOS, Ubuntu)
  •         Experience working with cloud service providers such as AWS, Azure, or GCP
  •         Ansible, Chef or similar configuration system
  •         HAProxy, PostgreSQL, Apache or similar technologies
  •         Strong networking knowledge: firewalls, TCP & UDP, DNS, SSL/TLS
  •         Strong understanding of monitoring principles
  •         Familiarity leveraging REST and REST-like APIs for operations tasks
  •         UNIX troubleshooting skills: tcpdump, strace, bpftrace, etc
  •         Fluency in one or more of the Git, Subversion or Mercurial version control systems

Preferred Experience

  •         7+ years’ experience in the technology industry
  •         Experience and/or senior technical knowledge of monitoring and analytics solutions
  •         Experience with Docker, Kubernetes and containers
  •         Terraform, Chef and Ansible experience
  •         Open search experience
  •         The right person will be highly technical and analytical much like the company itself

Circonus offers a powerful telemetry intelligence platform to handle the world's most demanding use cases.  From mission-critical IT infrastructure to data-intensive IoT applications, Circonus works with any tech and at any scale. Circonus uses advanced data science and patented technology to ingest and analyze telemetry data to deliver unmatched clarity, insights, and performance.  From real-time alerts and fault detection to ML-based predictive analytics, Circonus helps companies optimize operations and deliver exceptional user experiences with confidence.

 

We recently raised a $10M Series B round led by Baird Capital with participation from our existing investors NewSpring Capital, Osage Venture Partners, and Bull City Venture Partners. This new funding is earmarked to further accelerate our growth, scale product innovation, and build upon the company’s record-setting performance in 2021.

 

Culturally, we operate like a startup. Small, agile teams with quick decisions and short, iterative cycle times. We relish our core values of respect, integrity, value, and growth, among others.

 

All of our positions include a discretionary PTO policy, generous employer health, and dental insurance, employer-matched 401(k) Plan, and more.

Similar Jobs

Dropbox Logo Dropbox

Site Reliability Engineer

Artificial Intelligence • Cloud • Consumer Web • Productivity • Software • App development • Data Privacy
Remote
United States
2500 Employees
223K-302K Annually

NBCUniversal Logo NBCUniversal

Site Reliability Engineer

AdTech • Cloud • Digital Media • Information Technology • News + Entertainment • App development
Remote or Hybrid
Orlando, FL, USA
68000 Employees

ServiceNow Logo ServiceNow

Site Reliability Engineer

Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Remote or Hybrid
Santa Clara, CA, USA
29000 Employees
166K-290K Annually

Sprinter Health Logo Sprinter Health

Site Reliability Engineer

Artificial Intelligence • Healthtech • Logistics • Social Impact • Software • Telehealth
Remote or Hybrid
2 Locations
500 Employees
160K-255K Annually
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Malvern, PA
40 Employees
Year Founded: 2010

What We Do

Circonus is the monitoring and analytics platform built for the modern-day enterprise. Circonus delivers crystal-clear, real-time visibility of the behavior, health, trends, and performance of traditional infrastructure and cloud-based technologies in one powerful, unified platform. Led by experts in large-scale distributed systems and data science, Circonus is pioneering the way that telemetry data at scale is leveraged throughout the enterprise to drive smarter operations, deploy faster, make better decisions, and deliver mission-critical services with confidence.

Similar Companies Hiring

Amplify Platform Thumbnail
Fintech • Financial Services • Consulting • Cloud • Business Intelligence • Big Data Analytics
Scottsdale, AZ
62 Employees
Standard Template Labs Thumbnail
Artificial Intelligence • Information Technology • Software
New York, NY
25 Employees
Golden Pet Brands Thumbnail
Digital Media • eCommerce • Information Technology • Marketing Tech • Pet • Retail • Social Media
El Segundo, California
178 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account