Site Reliability Engineer

Posted 2 Days Ago
Hiring Remotely in Norfolk, VA, USA
In-Office or Remote
Mid level
Information Technology
The Role
The Site Reliability Engineer will implement reliability engineering practices, develop automation, maintain CI/CD pipelines, and ensure system health through monitoring.

Arctiq is a global, intelligence-driven technology services company delivering professional and managed services across Hybrid Cloud Infrastructure, Networking & Connected Experiences, Cybersecurity, Data & AI, Autonomous Operations & Intelligence, and Enterprise Service Management. We help organizations operate, secure, and modernize complex environments by unifying infrastructure, networking, data, security, automation, and observability under a single, integrated operating model. Our work focuses on helping customers reduce operational friction, improve resilience, and make better, faster decisions as their environments evolve. Arctiq builds on decades of industry expertise and a customer-centric ethos to deliver exceptional value to clients across diverse industries.

The Site Reliability Engineer will focus on the execution and maintenance of reliability engineering practices for mission-critical government systems. Following the SRE Implementation Plan, you will bridge the gap between development and operations by applying a software engineering mindset to system administration. You will be responsible for building automation, maintaining CI/CD pipelines, and ensuring system health through robust monitoring.

This is a remote, contract opportunity for a project Arctiq is delivering for a client. Candidates must have or be able to obtain a Secret Clearance.

Key Responsibilities

  • Monitoring & Observability: Implement and maintain dashboards and alerting rules using Prometheus, Grafana, or the ELK Stack. Help identify Service Level Indicators (SLIs).
  • Automation: Develop and maintain Infrastructure as Code (IaC) scripts using Terraform and Ansible to ensure repeatable, error-free deployments.
  • CI/CD Management: Maintain automated deployment pipelines, ensuring security scans and automated tests are integrated into the workflow.
  • Incident Response: Participate in on-call rotations and assist in troubleshooting system outages. Contribute to blameless post-mortem reports to drive continuous improvement.
  • Toil Reduction: Identify repetitive manual tasks and develop automation to reduce "toil," allowing the team to focus on high-value engineering.

Required Qualifications

  • 3–5 years of experience in SRE, DevOps, or Systems Engineering roles.
  • Proficiency in scripting languages (Python, Go, or Bash).
  • Hands-on experience with containerization (Docker, Kubernetes) and cloud platforms (AWS, Azure, or GCP).
  • Familiarity with NIST SP 800-53 security controls.
  • Education: Bachelor’s degree in Computer Science or a related technical field.


The Company
HQ: Irvine, California
377 Employees

What We Do

Arctiq is a leader in professional IT services and managed services across three core Centers of Excellence: Enterprise Security, Modern Infrastructure and Platform Engineering. Renowned for our ability to architect intelligence, we connect, protect, and transform organizations, empowering them to thrive in today's digital landscape.
