Site Reliability Engineer

Sorry, this job was removed at 06:35 p.m. (CST) on Monday, Aug 04, 2025
Washington, DC
Hybrid
Agency • Artificial Intelligence • Fintech • Healthtech • Information Technology • Professional Services • App development
10Pearls helps their clients grow, transform, and scale through custom digital product creation.
The Role

We are seeking a well-rounded Site Reliability Engineer / DevOps Engineer who has practical experience in infrastructure operations and DevOps practices. This role is ideal for a jack-of-all-trades who enjoys solving complex technical challenges and implementing reliable, scalable solutions. You will work closely with the Head of SRE to enhance our infrastructure, automate processes, and improve observability and disaster recovery strategies.

Key Responsibilities:

  • Enhance disaster recovery and multi-region capabilities to improve system resilience

  • Improve monitoring, alerting, and observability through tools like Grafana, Prometheus, Sentry, and OpenSearch

  • Support on-call processes by enhancing alerting strategies and automating responses where possible

  • Collaborate with development and operations teams to address reliability challenges and enhance performance

  • Automate routine infrastructure tasks using scripting and Infrastructure as Code tools

  • Contribute to postmortems and continuous improvement initiatives to improve system stability

  • Maintain accurate, up-to-date documentation of infrastructure, processes, and disaster recovery workflows

Qualifications:

  • 2–4 years of experience in Site Reliability Engineering, DevOps, or a related infrastructure role

  • Hands-on experience with AWS services including EC2, Lambda, S3, and Route 53

  • Proficiency with Kubernetes and Docker for container orchestration and deployment

  • Experience using Infrastructure as Code tools such as Terraform

  • Familiarity with monitoring and alerting systems like Prometheus, Grafana, OpenSearch, and Sentry

  • Experience with CI/CD pipelines, preferably GitLab CI or similar tools

  • Scripting skills in Python, Bash, or similar languages

  • Working knowledge of networking concepts, DNS, and load balancing

  • Strong troubleshooting skills with the ability to respond effectively to incidents

  • Ability to work collaboratively across teams and contribute to cross-functional initiatives

  • Clear and concise technical writing and documentation skills

Nice to Have:

  • Experience with multi-region AWS architectures and disaster recovery planning

  • Advanced knowledge of Grafana dashboards and alerting configurations

  • Familiarity with compliance frameworks such as HIPAA or HITRUST

About 10Pearls: 

10Pearls is a global, purpose-driven AI-powered digital engineering partner helping businesses re-imagine, ‎digitalize, and accelerate. As an end-to-end digital technology partner, 10Pearls helps businesses create future-proof, ‎transformative ‎digital products that leverage emerging technologies. ‎10Pearls' clients ‎include Global 2000 enterprises, high growth mid-size ‎businesses, and some of the most exciting ‎start-ups from industries like healthcare, fintech, ‎energy, education, ‎real estate, retail, and hi-tech. ‎Headquartered in the Washington DC metro area, 10Pearls has product engineering and ‎software development centers in North America, Latin America, Europe, and South Asia. To learn more, visit https://10pearls.com.   

10Pearls is an Equal Opportunity Employer and is committed to maintaining a diverse workplace. 

Similar Jobs

Milestone Systems Logo Milestone Systems

Site Reliability Engineer

Artificial Intelligence • Other • Security • Software • Analytics • Big Data Analytics
Remote or Hybrid
2 Locations
1500 Employees
160K-180K Annually

MongoDB Logo MongoDB

Senior Site Reliability Engineer

Big Data • Cloud • Software • Database
Easy Apply
Remote or Hybrid
7 Locations
5550 Employees
127K-249K Annually

DFIN Logo DFIN

Site Reliability Engineer

Fintech • Software
Remote or Hybrid
United States
1750 Employees

Zeta Global Logo Zeta Global

Senior Site Reliability Engineer

AdTech • Artificial Intelligence • Marketing Tech • Software • Analytics
Easy Apply
Remote or Hybrid
United States
2429 Employees
140K-170K Annually
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Vienna, VA
1,400 Employees
Year Founded: 2004

What We Do

10Pearls is a global, purpose-driven digital technology partner helping our clients re-imagine, digitalize, and accelerate their businesses. As an end-to-end digital partner, 10Pearls helps businesses create transformative ‎digital products incorporating emerging technologies and utilizing our broad expertise in ‎product management, UI/UX, cloud architecture, software development, data science, cybersecurity, and quality assurance. 10Pearls' clients include Global 2000 enterprises, high-growth mid-size ‎businesses, and exciting start-ups across several industries, including healthcare, financial services, ‎energy, education, real estate, and retail. ‎ 10Pearls has a far-reaching global presence with delivery centers in North America, Latin America, Europe, and South Asia.  

Inspirant Group was acquired by 10Pearls in April 2023

Why Work With Us

10Pearls is a double-bottom-line company that balances profits with our responsibility to our communities. We leverage the passions and intelligence of our people to ensure we deliver solutions ‎that meet and exceed our clients’ needs. 

Gallery

Gallery

Similar Companies Hiring

Scotch Thumbnail
Software • Retail • Payments • Fintech • eCommerce • Artificial Intelligence • Analytics
US
25 Employees
Milestone Systems Thumbnail
Software • Security • Other • Big Data Analytics • Artificial Intelligence • Analytics
Lake Oswego, OR
1500 Employees
Idler Thumbnail
Artificial Intelligence
San Francisco, California
6 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account