Site Reliability Engineer (SRE)

Posted 4 Hours Ago
Be an Early Applicant
Burlingame, CA, USA
In-Office
170K-197K Annually
Mid level
Aerospace • Artificial Intelligence
Launching the next-generation of location infrastructure.
The Role
The Site Reliability Engineer will architect and manage ground infrastructure for satellite systems, ensuring high availability, automating deployments, and optimizing data management systems.
Summary Generated by Built In

Xona is the navigational intelligence company bringing real-time, centimeter-level certainty to any device, anywhere on Earth.

With Pulsar – the world’s most advanced PNT satellite infrastructure in Low Earth Orbit – Xona will offer a future-proof, backwards-compatible global positioning system optimized for absolute precision, superior power, and robust protection.

We are seeking a Site Reliability Engineer (SRE) to architect and manage the critical ground infrastructure for our satellite constellation. This role is responsible for the "last mile" of mission success: ensuring that the software controlling our orbital assets is highly available, scalable, and seamlessly integrated with Mission Operations.

You will own the lifecycle of our production environments, from automating deployments via Infrastructure as Code (IaC) to managing the core data systems that track constellation health and user activity.

Required Qualifications
  • Infrastructure as Code (IaC): Design and maintain scalable, repeatable cloud infrastructure (AWS) using tools like Terraform or CloudFormation.

  • Mission Ops Integration: Build and optimize the interfaces between core data management systems and Mission Operations software, ensuring reliable telemetry and command flows.

  • User & Data Management: Architect and maintain high-availability identity providers (IdP) and distributed databases to support global user access and real-time data processing.

  • Automated Deployment Pipelines: Create and manage robust CI/CD pipelines to deploy containerized applications into production with a focus on zero-downtime and rollback capabilities.

  • Observability & Reliability: Implement comprehensive monitoring, alerting, and logging (e.g., Prometheus, Grafana, ELK) to ensure 99.99% uptime for ground segment services.

  • Scalability Engineering: Perform capacity planning and performance tuning to handle the high-throughput data requirements of a growing satellite constellation.

Technical Qualifications
  • Cloud Operations: 4+ years of experience managing production-grade environments in AWS, GCP, or Azure.

  • Orchestration: Expert-level proficiency with Kubernetes (EKS), including networking, ingress controllers, and service mesh management.

  • Automation: Strong experience with configuration management and IaC (e.g., Terraform, Ansible, Helm).

  • Data Systems: Deep knowledge of SQL and NoSQL database administration, focusing on replication, backup, and disaster recovery.

  • Programming: Proficiency in Python and C++ for developing internal tooling and automating complex operational workflows.

  • Systems Internals: Strong understanding of Linux networking, storage, and kernel tuning.

Preferred Qualifications
  • Prior experience in Aerospace, Defense, or high-reliability sectors.

  • Familiarity with CCSDS standards or satellite ground station software.

  • Experience with secure, air-gapped, or hybrid-cloud deployments.

For U.S. Roles: To comply with U.S. Government space technology export regulations, applicant must be a U.S. citizen, lawful permanent resident of the United States (i.e. Green Card holder), or other protected individual as defined by 8 U.S.C. 1324b(a)(3).

For U.K. Roles: To comply with U.K. regulations, this role requires Baseline Personnel Security Standard (BPSS) checks, and successful candidates must be eligible to obtain UK Security Clearance (SC).

For Canada Roles: Successful candidates must obtain and hold a security clearance at the reliability status level, and pass security assessment for the Canadian Controlled Goods Program (CGP) and ITAR.

We celebrate diversity and are committed to creating an inclusive environment for all employees. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or veteran status.

Top Skills

Ansible
AWS
Azure
C++
CloudFormation
Eks
Elk
GCP
Grafana
Helm
Kubernetes
Prometheus
Python
Terraform
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: San Mateo, CA
83 Employees
Year Founded: 2019

What We Do

Xona is developing the most accurate and secure real-time PNT service on the planet.

Similar Jobs

BAE Systems, Inc. Logo BAE Systems, Inc.

Site Reliability Engineer

Aerospace • Hardware • Information Technology • Security • Software • Cybersecurity • Defense
Hybrid
San Diego, CA, USA
40000 Employees
133K-226K Annually

Crexi Logo Crexi

Senior Site Reliability Engineer

Real Estate • Sales • Software • PropTech
Easy Apply
Hybrid
Los Angeles, CA, USA
400 Employees
160K-214K Annually

MongoDB Logo MongoDB

Site Reliability Engineer

Big Data • Cloud • Software • Database
Easy Apply
Remote or Hybrid
4 Locations
5550 Employees
127K-249K Annually
Hybrid
3 Locations
1100 Employees
147K-278K Annually

Similar Companies Hiring

Idler Thumbnail
Artificial Intelligence
San Francisco, California
6 Employees
Bellagent Thumbnail
Artificial Intelligence • Machine Learning • Business Intelligence • Generative AI
Chicago, IL
20 Employees
Outpost Space Thumbnail
Aerospace • Defense
US
24 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account