Site Reliability Engineer

Sorry, this job was removed at 08:11 p.m. (CST) on Wednesday, Jun 11, 2025
Atlanta, GA
In-Office
Security • Cybersecurity
The Role
As a Site Reliability Engineer at DefenseStorm, you will play a crucial role in ensuring the reliability, scalability, and performance of our cloud-based services. GRID is a high-throughput, data-intensive application that currently handles 250k events/sec. You will drive best practices and contribute to both the design and implementation of robust cloud infrastructure that can scale rapidly to support DefenseStorm's growing customer base.
Location
Atlanta, GA
Remote

Job Duties and Responsibilities 

  • Lead the migration of EC2 workloads to ECS and develop DevOps tooling to empower development teams to build and manage containerized applications. 
  • Advance zero trust security initiatives by implementing a service mesh architecture with technologies such as Istio. 
  • Enhance the security, scalability, and reliability of AWS cloud-native infrastructure through continuous improvement and innovation. 
  • Design and implement proactive monitoring and alerting solutions using tools like Prometheus, Grafana, and OpsGenie, leveraging data-driven insights to optimize uptime and mitigate operational risks. 
  • Uphold SLAs and SLOs by applying SRE best practices, including incident response, post-mortem analysis, and the creation of operational playbooks. 
  • Build, manage, and scale cloud infrastructure using Infrastructure as Code (IaC) tools such as Terraform. 
  • Support SOC 2 and ISO compliance efforts by championing security best practices, streamlining evidence collection, and introducing automation to improve audit processes. 
  • Other duties as assigned by management.

Required Education and Experience 

  • Hands-on experience building and maintaining CI/CD pipelines using tools such as GitHub Actions. 
  • Strong understanding of networking principles and their application in cloud and containerized environments.
  • Proven experience designing, building, and managing cloud infrastructure in AWS. 
  • Expertise with Infrastructure as Code (IaC) and deployment automation tools to streamline environment provisioning and management. 
  • Experience running and supporting containerized workloads in production environments. 
  • Familiarity with observability, monitoring, logging, and tracing tools to ensure system performance, reliability, and visibility. 
  • Experience using AWS, ECS, Elasticsearch, PostgreSQL, Prometheus, Grafana, GitHub Actions, and Terraform.

Preferred Education and Experience 

  • Bachelor's degree in computer science or equivalent work experience 
  • 3-5 years of hands-on experience in the cybersecurity field


The Company
HQ: Alpharetta, Georgia
104 Employees
Year Founded: 2014

What We Do

DefenseStorm provides an integrated platform of cyber risk assessment, governance, security, and fraud solutions that ensure financial institutions achieve and maintain cyber risk readiness. The only system specifically built for banking, it accounts for all the daunting challenges, regulations, and technology requirements financial institutions face. Their intelligent data engine, GRID ACTIVE, ensures real-time access, analysis, and action on all critical threat data. The Cyber Threat Surveillance Operations (CTS Ops) team offers access to managed resources 24x7x365, providing the help and expertise needed by financial institutions.

Office locations:
Alpharetta, Georgia - Corporate Headquarters
Wilmington, North Carolina - Business Development Center
