What you will be doing:
- Design, deploy, and maintain highly scalable, highly available software systems in AWS
- Architect and manage containerized applications on Amazon EKS with a focus on reliability and performance
- Build and maintain Infrastructure as Code using Terraform for AWS cloud resources
- Develop and optimize CI/CD pipelines for automated testing, deployment, and rollback
- Implement comprehensive monitoring, alerting, and observability solutions using CloudWatch, Prometheus, and Grafana
- Ensure system reliability through SLI/SLO definition, error budgets, and incident response procedures
- Collaborate directly with engineering teams to optimize application deployment and operations
- Manage deployments and scaling strategies to support mission-critical operations
- Automate and enforce cloud security, governance, and compliance controls
- Participate in the on-call rotation and lead incident response for production systems
What you bring to this role:
- 5+ years of experience in SRE, DevOps, or Platform Engineering roles
- Proven experience designing and operating mission-critical, highly available systems within AWS
- Advanced proficiency in Infrastructure as Code using Terraform or OpenTofu
- Deep experience with Kubernetes, EKS, Helm, and container orchestration
- Strong CI/CD pipeline development and management experience (Bitbucket preferred)
- Proficiency in Python and Bash scripting for automation
- Experience with monitoring and observability tools (Prometheus, Grafana, ELK Stack)
- Knowledge of capacity planning and performance optimization
- Experience with database operations and scaling (RDS, Aurora, or similar)
Extra bonus points for the following:
- Certification as an AWS Certified Solutions Architect - Professional or Certified Kubernetes Administrator (CKA), or equivalent expertise
- Experience with incident management and post-mortem processes
- Experience with GitOps workflows and tools (ArgoCD, Flux)
- Knowledge of service mesh technologies (Istio, Linkerd)
- Experience with chaos engineering and disaster recovery planning
- Experience with Zero Trust Network Access (ZTNA) or VPN solutions
- Background in aerospace, defense, or other mission-critical industries
- Strong intellectual curiosity and commitment to continuous learning
- Exceptional attention to detail and an ownership mentality
What We Do
E-Space is a global space company focused on bridging Earth and space with the most sustainable low Earth orbit (LEO) network. The network is expected to grow to more than one hundred thousand multi-application communication satellites, helping businesses and governments securely and affordably access the power of space to solve problems on Earth.
Founded by industry pioneer Greg Wyler, E-Space is focused on democratizing space and transforming industries by bringing down the cost of space-based communications, raising the level of satellite system resiliency, and setting a new standard in sustainable space infrastructure that minimizes space debris and destruction while preserving access to space for future generations.







