Principal SRE (Networking) - Platform Control Plane

Hiring Remotely in United States
Remote
111K-176K Annually
Mid level
Cloud • Security • Software • Generative AI
Elastic, the Search AI Company, helps everyone find the answers they need in real time, using all their data, at scale.
The Role
The role involves designing and developing tooling for the Elastic Stack, managing production services, and supporting internal Elastic Stack usage for development and analytics.
Summary Generated by Built In

Elastic, the Search AI Company, enables everyone to find the answers they need in real time, using all their data, at scale — unleashing the potential of businesses and people. The Elastic Search AI Platform, used by more than 50% of the Fortune 500, brings together the precision of search and the intelligence of AI to enable everyone to accelerate the results that matter. By taking advantage of all structured and unstructured data — securing and protecting private information more effectively — Elastic’s complete, cloud-based solutions for search, security, and observability help organizations deliver on the promise of AI.

What is The Role:

As part of the Platform Engineering department, the Network Infrastructure team is crafting, building, and improving the multi-cloud platform at scale for Elastic Cloud Hosted and Serverless. We grow and mature our large-scale distributed network infrastructure, which spans multiple cloud service providers, to support our cloud services. We are built on Kubernetes, Go, and custom orchestration architectures. In your daily life with us, you will participate in coding, innovating on technical designs, crafting solutions, improving resilience, and prioritizing security, bug fixes, and features. For example, debugging Azure networking for Elastic Cloud Serverless is part of our efforts, and we want your experience to contribute to a truly exceptional customer experience!

What You Will Be Doing:
  • Taking an engineering approach to leading technical initiatives for designing, building, and automating network infrastructure and services to guarantee the reliability of the global Elastic network infrastructure, focusing on Layers 2/3/4 of the TCP/IP stack (Ethernet and/or IP encapsulation, routing, firewalling, load balancing).
  • Growing our global Platform network infrastructure to meet increasing scaling demands by developing and maintaining software, codebases, tooling, and automations that serve our Network Infrastructure as Code principle.
  • Collaborating in an environment with an inclusive approach, and focusing on operational excellence that uplifts others.
  • Preventing repeated customer impact in response to major incidents and prioritized problem management. Our on-call rotation is spread well, and we address complex customer concerns too.
What You Bring:
  • Excellent networking skills, with knowledge of protocols such as IP/IPv6, TCP/UDP, BGP, DNS.
  • Strong technical depth for building and automating networks (Terraform, Ansible) in collaboration with other engineers as an authority in identifying, implementing and delivering solutions.
  • Good knowledge of public CSP network components (load balancers, VPC peering/Transit gateways, VPN connectivity, Direct Connects).
  • Successes and lessons learned from striving for 'progress not perfection' in the name of Platform reliability. We want to hear about your customer-first approach to solving operational problems for both today and the future.
  • Passion for developing solutions that involve inclusive communication methods to grow and strengthen partner and team relationships. Examples of working in distributed teams or working remotely are desirable.
  • Site Reliability Engineering experience. We tackle problems with code, but fundamentally we keep things working and have proven success in operational excellence. This includes responding to and preventing repeated customer impact in response to major incidents and prioritized problem management. Our on-call rotation uses a follow-the-sun model in which everyone participates, (mostly) during their working hours.
Bonus Points:
  • You have operated a SaaS product in a public cloud ideally built using Infrastructure-as-Code tooling such as Crossplane or Terraform.
  • You have designed and/or operated large network topologies in which dynamic routing is based on BGP.
  • You have operated network topologies based on software routers.
  • You have experience in IP address management (IPAM) and you have used relevant tools for automated IP allocations.
  • You have designed and/or operated overlay networks using encapsulation protocols such as IPSec, GRE, and VXLAN.
  • You have built or operated a Kubernetes-at-scale infrastructure, ideally across multiple cloud providers, with knowledge of the Cilium CNI.
  • You have written non-trivial programs in Golang or other programming languages.
  • You have worked with containerized services (such as Docker).
  • You have proven experience in leading and improving alerting and major incident management standard processes, and in using metrics systems (e.g. Elastic Stack, Graphite, Prometheus, Influx) to diagnose issues and quantify impacts to present to others at varying levels of the organization.
  • You have experience in system and network administration with professional skills in Linux on distributed systems at scale.
  • You have diagnosed or designed, implemented and created solutions with the Elastic Stack.
  • You are experienced in thriving and sharing in a self-organizing, globally distributed team environment.
  • You strengthen team members by bringing out the best in each other and uplifting others with coaching and mentoring.



Compensation for this role is in the form of base salary.  This role does not have a variable compensation component.  

The typical starting salary range for new hires in this role is listed below.  In select locations (including Seattle WA, Los Angeles CA, the San Francisco Bay Area CA, and the New York City Metro Area), an alternate range may apply as specified below. 

These ranges represent the lowest to highest salary we reasonably and in good faith believe we would pay for this role at the time of this posting.  We may ultimately pay more or less than the posted range, and the ranges may be modified in the future.  

An employee's position within the salary range will be based on several factors including, but not limited to, relevant education, qualifications, certifications, experience, skills, geographic location, performance, and business or organizational needs.

Elastic believes that employees should have the opportunity to share in the value that we create together for our shareholders. Therefore, in addition to cash compensation, this role is currently eligible to participate in Elastic's stock program.  Our total rewards package also includes a company-matched 401k with dollar-for-dollar matching up to 6% of eligible earnings, along with a range of other benefits offered with a holistic emphasis on employee well-being.

The typical starting salary range for this role is:
$110,900 - $175,500 USD
The typical starting salary range for this role in the select locations listed above is:
$133,200 - $210,700 USD
Additional Information - We Take Care of Our People

As a distributed company, diversity drives our identity. Whether you’re looking to launch a new career or grow an existing one, Elastic is the type of company where you can balance great work with great life. Your age is only a number. It doesn’t matter if you’re just out of college or your children are; we need you for what you can do.

We strive to have parity of benefits across regions and while regulations differ from place to place, we believe taking care of our people is the right thing to do.

  • Competitive pay based on the work you do here and not your previous salary
  • Health coverage for you and your family in many locations
  • Ability to craft your calendar with flexible locations and schedules for many roles
  • Generous number of vacation days each year
  • Increase your impact - We match up to $2000 (or local currency equivalent) for financial donations and service
  • Up to 40 hours each year to use toward volunteer projects you love
  • Embracing parenthood with minimum of 16 weeks of parental leave

Different people approach problems differently. We need that. Elastic is an equal opportunity employer and is committed to creating an inclusive culture that celebrates different perspectives, experiences, and backgrounds. Qualified applicants will receive consideration for employment without regard to race, ethnicity, color, religion, sex, pregnancy, sexual orientation, gender perception or identity, national origin, age, marital status, protected veteran status, disability status, or any other basis protected by federal, state or local law, ordinance or regulation.

We welcome individuals with disabilities and strive to create an accessible and inclusive experience for all individuals. To request an accommodation during the application or the recruiting process, please email [email protected]. We will reply to your request within 24 business hours of submission.

Applicants have rights under Federal Employment Laws, view posters linked below: Family and Medical Leave Act (FMLA) Poster; Pay Transparency Nondiscrimination Provision Poster; Employee Polygraph Protection Act (EPPA) Poster and Know Your Rights (Poster)

Elasticsearch develops and distributes technology and information that is subject to U.S. and other countries’ export controls and licensing requirements for individuals who are located in or are nationals of the following sanctioned countries and regions: Belarus, Cuba, Iran, North Korea, Syria, or Russia, including the Ukrainian territories annexed by Russia (The Crimea region of Ukraine, The Donetsk People's Republic (DNR), The Luhansk People's Republic (LNR), Kherson or Zaporizhzhia). If you are located in or are a national of one of the listed countries or regions, an export license may be required as a condition of your employment in this role. Please note that national origin and/or nationality do not affect eligibility for employment with Elastic.

Please see here for our Privacy Statement.

Skills Required

  • Deep proficiency in at least one programming language
  • Experience in a Site Reliability Engineering role
  • Experience administering Linux systems at scale
  • Experience with infrastructure-as-code practices

The Company
HQ: San Francisco, CA
3,222 Employees
Year Founded: 2012

Why Work With Us

Free and open isn’t just how we build our products, it’s how we build our culture. We value creativity and mobility, so you can grow how (and where) you want to… and be happier at work.
