Senior Software Engineer (Alerting & Observability)

Posted 8 Days Ago
Be an Early Applicant
Hiring Remotely in United States
Remote
160K-210K
Senior level
Software
Cribl, the Data Engine for IT and Security, empowers organizations to transform their data strategy.
The Role
In this role, you'll design and build alerting systems, develop query-based alerts, collaborate on requirements, and mentor others in observability best practices.
Summary Generated by Built In

Cribl does differently. 

What does that mean? It means we are a serious company that doesn’t take itself too seriously; and we’re looking for people who love to get stuff done, and laugh a bit along the way. We’re growing rapidly - looking for collaborative, curious, and motivated team members who are passionate about putting customers first. As a remote-first company we believe in empowering our employees to do their best work, wherever they are. 

As the data engine for IT and Security many of the biggest names in the most demanding industries trust Cribl to solve their most pressing data needs. Ready to do the best work of your career? Join the herd and unlock your opportunity.

Why You’ll Love This Role

In this role, you will work closely with Product, Operations, and other business functions while collaborating with your direct team to own and deliver end-to-end features and functionality for our alerting and observability platform. As a Senior Software Engineer specializing in alerting systems and metrics analysis, you will bring your experience and expertise to help your team build intelligent, responsive alerting capabilities. You will have the opportunity to tackle complex observability challenges, owning the design, implementation, and rollout of alerting infrastructure with the support of your team.


As An Active Member Of Our Team, You Will…

  • Design and build sophisticated alerting systems that enable proactive monitoring and incident detection across distributed systems
  • Develop query-based alert rules and expressions using PromQL, SQL, and other query languages to surface meaningful insights
  • Create intelligent alert routing, deduplication, and correlation mechanisms to reduce noise and improve signal quality
  • Build scalable backend services for alert evaluation, notification delivery, and alert management workflows
  • Optimize time-series data storage and query performance for high-volume metrics and telemetry data
  • Develop intuitive interfaces for alert configuration, visualization, and management using React and modern frontend technologies
  • Collaborate with cross-functional teams to understand monitoring requirements and deliver comprehensive alerting solutions
  • Mentor and guide engineers on best practices for observability and alerting architecture
  • This position will require stand-by, on-call, or off-hours duties


If You’ve Got It - We Want It

  • Strong proficiency in TypeScript/Node.js with a proven track record of building production-grade services
  • Experience with query languages for metrics and monitoring (PromQL, SQL, or similar) and ability to write complex queries for data analysis
  • Hands-on experience building or maintaining alerting systems, including rule evaluation engines and notification pipelines
  • Experience with time-series databases and columnar storage systems (ClickHouse experience is a plus)
  • Frontend development skills with React and modern JavaScript frameworks for building data visualization and management interfaces
  • Strong understanding of distributed systems, data structures, and algorithms
  • Experience with observability concepts including metrics, logs, traces, and their correlation
  • Ability to work independently with minimal supervision and a track record of learning quickly
  • Dedication to writing clean, maintainable, and well-tested code
  • Experience Prometheus ecosystem, including AlertManager
  • Background in building rule engines or expression evaluation systems
  • Experience with notification systems and integrations (PagerDuty, Slack, webhooks, etc.)
  • Familiarity with observability tools like Grafana, ELK stack, or similar solutions
  • Experience with CI/CD pipelines such as BitBucket, Jenkins, CircleCI, etc.
  • Understanding of alert fatigue mitigation strategies and intelligent alerting patterns
  • Experience with high cardinality data and performance optimization
  • Willingness to speak your mind and share ideas
  • Appreciation for humor and a love for goats
  • Comfort working remotely


Salary Range ($160,000 - $210,000)

The salary for this role is dependent on geographic location. The salary offered within the range described will be based on the individual candidate’s job-related knowledge, skills, and experience.  In addition to a competitive salary, Cribl also offers a generous benefits package which includes health, dental, vision, short-term disability, and life insurance, paid holidays and paid time off, a fertility treatment benefit, 401(k), equity, and eligibility for a discretionary company-wide bonus.

#LI-AM1
#LI-Remote

Bring Your Whole Self
Diversity drives innovation, enables better decisions to support our customers, and inspires change for the better. We’re building a culture where differences are valued and welcomed, and we work together to bring out the best in each other. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, or any other applicable legally protected characteristics in the location in which the candidate is applying.

Interested in joining the Cribl herd? Learn more about the smartest, funniest, most passionate goats you’ll ever meet at cribl.io/about-us

Top Skills

Ci/Cd
Clickhouse
Elk Stack
Grafana
Node.js
Pagerduty
Promql
React
Slack
SQL
Typescript
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: San Francisco, CA
600 Employees
Year Founded: 2018

What We Do

Cribl, the Data Engine for IT and Security, empowers organizations to transform their data strategy. Customers use Cribl’s vendor-agnostic solutions to analyze, collect, process, and route all IT and security data from any source or in any destination, delivering the choice, control, and flexibility required to adapt to their ever-changing needs. Cribl’s product suite, which is used by Fortune 1000 companies globally, is purpose-built for IT and Security, including Cribl Stream, the industry’s leading observability pipeline, Cribl Edge, an intelligent vendor-neutral agent, and Cribl Search, the industry’s first search-in-place solution. Founded in 2018, Cribl is a remote-first workforce with an office in San Francisco, CA.

Why Work With Us

We are building the company that will become the industry leader in IT and Security data. But, doing that doesn’t mean we’re always serious. We approach our work fearlessly, learn quickly, improve constantly, and celebrate our wins at every turn. And more importantly, we laugh a lot.

Gallery

Gallery

Similar Jobs

Bounteous Logo Bounteous

Architect

Agency • Digital Media • eCommerce • Professional Services • Software • Analytics • Consulting
Remote
United States
136K-170K Annually

BlackLine Logo BlackLine

Consultant

Cloud • Fintech • Information Technology • Machine Learning • Software • App development • Generative AI
Remote or Hybrid
New York, NY, USA
92K-116K Annually

BlackLine Logo BlackLine

Enterprise Account Manager

Cloud • Fintech • Information Technology • Machine Learning • Software • App development • Generative AI
Remote or Hybrid
United States
119K-140K Annually

Wipfli Logo Wipfli

Operations Supervisor

Cloud • Fintech • Software • Business Intelligence • Consulting • Financial Services
Remote or Hybrid
United States
60K-81K Annually

Similar Companies Hiring

Credal.ai Thumbnail
Software • Security • Productivity • Machine Learning • Artificial Intelligence
Brooklyn, NY
Standard Template Labs Thumbnail
Software • Information Technology • Artificial Intelligence
New York, NY
10 Employees
PRIMA Thumbnail
Travel • Software • Marketing Tech • Hospitality • eCommerce
US
15 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account