Site Reliability Engineer - Remote

Hiring Remotely in Reston, VA
In-Office or Remote
108K-184K Annually
Senior level
Consulting
Come solve with us
The Role
The Site Reliability Engineer will improve application reliability, define SLIs and SLOs, optimize CI/CD pipelines, conduct root cause analysis, and enhance operational responses in a regulated healthcare software environment.

ICF is a mission-driven company filled with people who care deeply about improving the lives of others and making the world a better place. Our core values include Embracing Difference; we seek candidates who are passionate about building a culture that encourages, embraces, and hires dimensions of difference.  

Our Health Engineering Solutions (HES) team works side by side with customers to articulate a vision for success, and then make it happen. We know success doesn't happen by accident. It takes the right team of people, working together on the right solutions for the customer. We are looking for a seasoned SRE to establish a culture of improvement in observability and reliability.  

You will work closely with software engineering teams to ensure that applications, databases, pipelines, and APIs run reliably. You will be expected to create, set, and exceed service level objectives as key indicators of application health. You will be working on a mission-critical software program whose goal is to support the ecosystem of the Centers for Medicare & Medicaid Services (CMS).

Our core work hours are 10am - 4pm Eastern Time with the option to start earlier or work later depending on your time zone. 

Key Responsibilities: 

  • Define and maintain SLIs, SLOs, and SLAs for the Internet-based Quality Improvement and Evaluation System (iQIES) application. 

  • Tune performance by modeling load scenarios, forecasting capacity, and optimizing scaling strategies

  • Design and optimize the observability stack through New Relic, CloudWatch, and Jenkins CI/CD pipelines 

  • Participate in root cause analysis for operational issues and improve the incident response process

  • Participate in creating, monitoring, and optimizing actionable alerts to respond to issues in a timely manner 

  • Develop tools and scripts  

  • Develop and maintain Jenkins CI/CD pipelines, using declarative Jenkinsfiles and foundational Groovy for pipeline logic and enhancements 

  • Deploy services to Fargate, EKS, Lambda, Airflow, and databases

  • Manage security groups and access controls. Thoroughly understand fundamentals such as security groups, IAM, and RDS management

  • Apply patch management and hardening practices 

  • Coordinate with DevOps and Technical Leads to stay aligned with the overall strategy

  • Actively participate in releases and product launches, with the expectation of being online during release windows

Required Qualifications 

  • 5+ years of experience in a software development environment and a Bachelor’s degree; OR 3+ years of experience in a software development environment and a Master’s degree

  • 5+ years supporting a high-availability production environment (cloud or on-prem)

  • 3+ years working in an SRE role in a large-scale cloud environment, implementing high availability and scalability

  • 3+ years of experience focused on SRE, DevOps, or Platform Engineering 

  • Must be able to obtain and maintain a public trust clearance 

  • Candidate must reside in the US, be authorized to work in the US, and work must be performed in the US 

  • Must have lived in the US 3 full years out of the last 5 years 

Preferred Qualifications 

  • Previous work in a regulated healthcare or federal agency environment 

  • Full stack web development experience 

  • Expertise in deployment techniques that minimize downtime, such as blue-green, canary, and A/B testing approaches, and zero-downtime deployments

  • Understanding of security groups and access controls  

  • Experience with Atlassian tooling such as Jira and Confluence 

Professional Skills and Tools: 

  • Cloud platform experience with AWS 

  • Observability: CloudWatch, New Relic or similar 

  • Infrastructure: Kubernetes, Docker 

  • IaC: Terraform 

  • CI/CD: Git, Jenkins or GitHub Actions 

  • Database: SQL relational database 

  • Docker: Thorough understanding of Docker and Docker Compose, including best practices, caching, volume mounts, etc.

  • Highly effective analytical, problem-solving, and decision-making capabilities. 

  • Strong written and verbal communication skills  

  • Ability to clearly articulate and communicate complex technical ideas to non-SRE colleagues.  

  • Ability to understand project requirements and be innovative in finding solutions in highly regulated government environments.  

  • Flexibility and the ability to accept a change in priorities as necessary.  

  • Demonstrated time management skills.  

  • Strong organizational skills with attention to detail.  

Job Location:  

This position requires that the job be performed in the United States.  If you accept this position, you should note that ICF does monitor employee work locations and blocks access from foreign locations/foreign IP addresses, and also prohibits personal VPN connections. 

Working at ICF

ICF is a global advisory and technology services provider, but we’re not your typical consultants. We combine unmatched expertise with cutting-edge technology to help clients solve their most complex challenges, navigate change, and shape the future.

We can only solve the world's toughest challenges by building a workplace that allows everyone to thrive. We are an equal opportunity employer. Together, our employees are empowered to share their expertise and collaborate with others to achieve personal and professional goals. For more information, please read our EEO policy.

We will consider for employment qualified applicants with arrest and conviction records.

 

Reasonable Accommodations are available, including, but not limited to, for disabled veterans, individuals with disabilities, and individuals with sincerely held religious beliefs, in all phases of the application and employment process. To request an accommodation, please email [email protected] and we will be happy to assist. All information you provide will be kept confidential and will be used only to the extent required to provide needed reasonable accommodations.  

Read more about workplace discrimination rights or our benefit offerings which are included in the Transparency in (Benefits) Coverage Act. 

 

Candidate AI Usage Policy

At ICF, we are committed to ensuring a fair interview process for all candidates based on their own skills and knowledge. As part of this commitment, the use of artificial intelligence (AI) tools to generate or assist with responses during interviews (whether in-person or virtual) is not permitted. This policy is in place to maintain the integrity and authenticity of the interview process.  

However, we understand that some candidates may require accommodation that involves the use of AI. If such an accommodation is needed, candidates are instructed to contact us in advance at [email protected]. We are dedicated to providing the necessary support to ensure that all candidates have an equal opportunity to succeed.


 

Pay Range - There are multiple factors that are considered in determining final pay for a position, including, but not limited to, relevant work experience, skills, certifications and competencies that align to the specified role, geographic location, education and certifications as well as contract provisions regarding labor categories that are specific to the position.

The pay range for this position based on full-time employment is:

$108,476.00 - $184,409.00

Nationwide Remote Office (US99)

Top Skills

AWS
Cloudwatch
Docker
Git
Github Actions
Jenkins
Kubernetes
New Relic
Terraform

The Company
HQ: Reston, VA
9,000 Employees
Year Founded: 1969

What We Do

ICF is a leading global solutions and technology provider with approximately 9,000 employees in industries across the public and private sectors. We combine unmatched expertise with cutting-edge technology to help clients solve their most complex challenges, navigate change, and shape the future.

Why Work With Us

At ICF, we’ve built a culture rooted in expertise, innovation, and purpose—enabling us to build a more prosperous and resilient world for all. When our employees grow, our solutions thrive—regardless of your role, team, or location, you’ll have opportunities to make connections, build community, and take action to develop your career.
