Staff Site Reliability Engineer

Posted 21 Days Ago
Hiring Remotely in India
Remote
5-7 Years Experience
Information Technology • Software • Consulting
The Role
Design, build, and maintain scalable and secure AWS and Kubernetes infrastructure, monitor system performance, collaborate with teams, participate in on-call rotation, implement security best practices, and identify vulnerabilities. Mentor engineering team and troubleshoot technical issues in production environments.

About the company

 

Everbridge (NASDAQ: EVBG) empowers enterprises and government organizations to anticipate, mitigate, respond to, and recover stronger from critical events. In today’s unpredictable world, resilient organizations minimize impact to people and operations, absorb stress, and return to productivity faster when deploying critical event management (CEM) technology. Everbridge digitizes organizational resilience by combining intelligent automation with the industry’s most comprehensive risk data to Keep People Safe and Organizations Running™. For more information, visit www.everbridge.com, read the company blog, and follow on Twitter. Everbridge… Empowering Resilience

What you'll do

  • Design, build, and maintain scalable, reliable and secure AWS and Kubernetes infrastructure to support our applications and services. 
  • Manage infrastructure configuration using a modern IaC tool such as Terraform.
  • Monitor system performance and reliability metrics, troubleshoot issues, and implement solutions to minimize downtime and performance degradation. 
  • Collaborate with cross-functional teams to design and develop reliable, fault-tolerant systems. 
  • Participate in on-call rotation and respond to production incidents in a timely manner. 
  • Participate in post-incident reviews and implement preventive measures to mitigate future incidents. 
  • Implement and maintain security best practices throughout the infrastructure stack, ensuring compliance with industry standards and regulations. 
  • Monitor, identify, and mitigate security vulnerabilities in the infrastructure and container runtime.

What you'll bring

  • 5+ years of experience building and managing production-grade infrastructure in AWS, Kubernetes and/or EKS.
  • In-depth knowledge of AWS services, including but not limited to EC2, S3, VPC, IAM, ECR, Route53, and API Gateway. 
  • Familiarity with security best practices in cloud environments, including identity and access management (IAM), encryption, and compliance standards (e.g., GDPR). 
  • Deep understanding of Kubernetes architecture, components, and ecosystem, including Docker, etcd, kube-proxy, and kube-controller-manager.
  • Proficiency in container orchestration concepts and IaC, with hands-on experience in tools such as Helm and Terraform.
  • Ability to mentor and guide engineering teams as they adapt to infrastructure-related changes within their services.
  • Familiarity with monitoring and logging tools such as Datadog, Sumo Logic, Prometheus, and Grafana, and experience setting up monitoring/alerting systems for large-scale production environments.
  • Solid understanding of Linux/Unix system administration, including shell scripting and system troubleshooting. 
  • Strong understanding of networking concepts, including TCP/IP, DNS, DHCP, VPN, and CDN tools like Cloudflare and/or AWS CloudFront. 
  • Strong troubleshooting and problem-solving skills, with the ability to quickly diagnose and resolve complex technical issues in production environments. 
  • Experience with CI/CD pipelines and tools such as Jenkins, GitLab CI, Argo CD or Spinnaker. 


Top Skills

AWS
Docker
Kubernetes
Linux
Terraform
The Company
Burlington, MA
1,437 Employees
On-site Workplace

What We Do

Keeping People Safe and Businesses Running. Faster.

Everbridge, Inc. (NASDAQ: EVBG) is a global software company that provides enterprise software applications that automate and accelerate organizations’ operational response to critical events in order to Keep People Safe and Businesses Running™. During public safety threats such as active shooter situations, terrorist attacks or severe weather conditions, as well as critical business events including IT outages, cyber-attacks or other incidents such as product recalls or supply-chain interruptions, over 5,300 global customers rely on the company’s Critical Event Management Platform to quickly and reliably aggregate and assess threat data, locate people at risk and responders able to assist, automate the execution of pre-defined communications processes through the secure delivery to over 100 different communication devices, and track progress on executing response plans.

