Staff Site Reliability Engineer, Platform

Posted 20 Days Ago
Hiring Remotely in United States
Remote
165K-200K Annually
Expert/Leader
Cloud • Information Technology
The Role
As a Staff Platform Engineer, you'll develop and maintain infrastructure components using Go and Node.js, improve service reliability, mentor juniors, and manage data ecosystems.
Who we are

Kentik is the network intelligence platform for modern infrastructure teams. Unlike traditional monitoring and observability tools, we demystify complex network operations, enabling organizations to deliver applications and innovation at scale. Built by network experts to make critical insight accessible to every engineer, Kentik is the real-time source of truth that understands every network in context, from data center to cloud to the internet. This single platform unifies and correlates cloud, device, flow, and synthetic data to turn telemetry into action. Market leaders like Akamai, Booking.com, Dropbox, and Zoom rely on Kentik to run, manage, and optimize their networks.

What we do
Kentik is looking for an experienced software engineer to join our Infrastructure team. This team is in charge of the software stack that powers Kentik - from configuration management and orchestration, to datastores and data pipelines, developer experience and internal observability. We are an international group of collaborative, experienced developers and operations practitioners, with broad and deep knowledge of networks, systems and applications.
If you're a senior engineer looking to move to a staff+ role, this will be a great opportunity! You will get to work with the rest of engineering, as well as product management and field engineers.
What you'll do

You will work on a very broad and diverse set of problems and technologies critical to the smooth operation of Kentik, the productivity and happiness of other engineers and the growth of our Engineering organization.

  • Build self-service, declarative and API-driven infrastructure components in Go and Node.js
  • Contribute to our internal deployment tooling (mostly Python CLI tools) and service orchestration platform based on Envoy, Nomad and other HashiCorp components
  • Help formulate and execute our strategy for datastores such as Postgres, Kafka, and Redis (reliability, performance, overhead, capacity planning, …)
  • Improve the reliability of our services, with code and testing improvements as well as internal advocacy and education
  • Mentor junior team members
  • Create and update technical documentation for infrastructure
  • Be on the on-call escalation path for services owned by the team
What you'll bring

Studies have shown that some candidates tend to apply to jobs only if they meet 100% of the qualifications. We encourage you to apply if you meet most of the criteria - even if you don’t match all of the qualifications, your skills and experience could be valuable in this role!

  • 8+ years of relevant experience
  • Passion for building and providing amazing tools and platforms to other engineers
  • Strong coding skills in Go or Python (alternatively: server-side JavaScript, Ruby, Java …)
  • Significant experience with data ecosystems and tools, cloud or on-prem
  • An SRE mindset and the intent to build reliable, easy-to-operate systems

Nice to haves:

  • Familiarity with Temporal (or similar workflow engines) for managing workflow execution, and experience with durable execution
  • Most of our systems run on Linux bare-metal hosts managed with Puppet, so any experience with that is a plus
Our tech stack
  • Our core data engine and platform are primarily written in Go
  • We use Node.js + Express for application serving, and React as our primary UI framework
  • We also use some JS and Python for tooling/scripting
  • In addition to our own database, we use Postgres, Kafka, MySQL, and Redis
  • Internal and public APIs expose both REST/JSON and gRPC endpoints
  • HAProxy and Envoy for API traffic routing and balancing
  • GitHub for source control, PRs, issues
  • Jenkins for automated builds
What we offer

Kentik is a fully remote company that operates globally. We seek professionals who will help us thrive as an organization, and in turn we want to broaden and enhance your career. We’re very thorough in the interview process to understand your skills and how they will relate to your successful growth here at Kentik. Our compensation philosophy encompasses a fair program for all in order to attract, engage and retain talented individuals who will drive our business and wow our customers.

The compensation range for this position is: $165,000 - $200,000. This range reflects the low and high end of the U.S. compensation range Kentik reasonably and generally expects to pay the hired candidate in this role. The actual compensation offered may be lower or higher than the stated range depending on various factors, including but not limited to:

  • Experience with the skill sets required for success
  • Demonstrated competencies and potential 
  • A geographic market-based approach

In addition to a great career opportunity, Kentik offers stellar benefits for our employees, which include:

  • 100% of premiums are paid by the company for health, vision and dental coverage for you and your dependents
  • Additionally, an annual Health Reimbursement Account (HRA) of $3,000 for an individual or $4,500 for a family
  • Paid family & medical leave 
  • Open PTO, a quarterly Wellness Day, and a minimum of 10 paid holidays
  • 401(k) retirement account
  • Home office reimbursement 
  • Stock options

Note: Benefits are as listed for all US full-time employees. For compensation, international applicants will be treated equitably in relation to the laws applicable within the countries in which we operate.

 

Come work with us

The true meaning of Kentik is visibility. We’re committed to making sure everyone feels empowered to use their voice, has a sense of belonging, and is represented at Kentik. 

We don’t look for individuals who fit the culture, but those who will continue to add to the culture.
We encourage everyone to apply, especially those individuals who are underrepresented in the industry: people of color, LGBTQI+ community, women, individuals with disabilities (both seen and unseen), veterans, and people of any age or family status. 

Kentik is committed to creating an inclusive interview process. If you require a reasonable accommodation during the application or interview process, please reach out to [email protected].

Come as you are!
You will be working at a fast-growing, well-funded startup alongside industry thought leaders and network aficionados as we build the future of observability and set a high bar for how network operations and digital businesses should run. With a competitive salary and amazing benefits on top of the meaningful and challenging projects you’ll take on, we’re sure you’ll enjoy joining the Kentik team.


Top Skills

Envoy
Express
Go
Jenkins
Kafka
MySQL
Node.js
Postgres
Puppet
Python
React
Redis

The Company
HQ: San Francisco, CA
155 Employees
Year Founded: 2014

What We Do

Kentik is the network observability company. Our platform is a must-have for the network front line, whether digital business, corporate IT, or service provider. Network professionals turn to the Kentik Network Observability Cloud to plan, run, and fix any network, relying on our infinite granularity, AI-driven insights, and insanely fast search. Kentik makes sense of network, cloud, host, and container flow, internet routing, performance tests, and network metrics. We show network pros what they need to know about their network performance, health, and security to make their business-critical services shine. Networks power the world’s most valuable companies, and those companies trust Kentik.
