Site Reliability Engineer (SRE)

Posted 17 Days Ago
2 Locations
Remote
8-9
Senior level
Information Technology • Software
The Role
Seeking a proactive Site Reliability Engineer to ensure the reliability and performance of cloud-based systems. Responsibilities include designing AWS infrastructure, conducting performance tests, managing CI/CD pipelines, and improving system observability.
Summary Generated by Built In

Join us at Sparksoft, where we're not just another tech company—we're a catalyst for change. Our mission isn't just to offer IT solutions; it's to revolutionize the way you work. Here, passion isn't just a buzzword; it's the fuel behind groundbreaking ideas and transformative technologies. We serve a wide range of government clients, delivering impact that's felt across the nation.

Our true strength lies in our people. They're the problem-solvers and innovators consistently delivering extraordinary outcomes. With Sparksoft, you're not stepping into a routine job; you're joining a team committed to innovation and excellence. Our innovation extends beyond just delivering projects. Through our specialized Innovation Centers, we continuously refine our methods, ensuring we remain industry leaders.

We are Sparksoft!

ROLE & RESPONSIBILITIES:

We are seeking a skilled and proactive Site Reliability Engineer (SRE) with strong expertise in AWS infrastructure and performance testing. The ideal candidate will be responsible for ensuring the reliability, scalability, and performance of our cloud-based systems and applications. This role involves close collaboration with development, operations, and QA teams to build robust systems and improve service uptime

  • Design, implement, and maintain scalable and reliable infrastructure on AWS.
  • Monitor system performance and availability using tools like New Relic and Splunk and AWS CloudWatch.
  • Conduct performance testing using tools such as JMeter, Performance Center to identify bottlenecks and optimize system performance.
  • Develop and maintain CI/CD pipelines to support rapid and reliable software delivery.
  • Implement and manage incident response processes, including root cause analysis and postmortems.
  • Created dashboards and configured alerts in Splunk, New Relic, and AWS CloudWatch to monitor system performance, availability, and application health.
  • Collaborate with development teams to improve system reliability and performance.
  • Ensure security and compliance standards are met across infrastructure and applications.
  • Continuously improve observability and alerting systems.

REQUIRED EXPERIENCE: 

  • 8-9+ years of experience as an SRE, DevOps Engineer, Performance engineer or similar role.
  • Strong hands-on experience with AWS services (EC2, EBS, Lambda, RDS, S3, ECS, EKS etc.).
  • Proficiency in performance testing tools like JMeter, Performance Center and methodologies.
  • Experience with containerization and orchestration (Docker, Kubernetes).
  • Experience with monitoring and logging tools like CloudWatch, New Relic and Splunk.
  • Strong scripting skills (Bash, Groovy, Java Script, Python).
  • Experience with performance diagnostics, tuning, and JVM/JBoss EAP on Linux and on AWS EKS.
  • Working knowledge of Agile development practices.
  • Excellent problem-solving and communication skills.
  • Candidates must be able to obtain and maintain a Public Trust clearance
  • Candidates must have lived in the United States 3 out of the past 5 years

PREFERRED EXPERIENCE:

  • Experience with NoSQL databases.
  • Exposure to other cloud platforms like Azure, Google Cloud, or VMware.
  • Understanding of high availability and disaster recovery architectures.
  • Knowledge of application networking, firewalls, and load balancing.
  • Experience with both Linux and Windows operating systems.

EDUCATION & CERTIFICATIONS:

  • Bachelor’s degree in computer science, Information Technology or equivalent
  • AWS Certifications are preferred

If you need accommodation seeking employment with Sparksoft Corporation, please email [email protected] or call 410-424-7700. Accommodations are made on a case-by-case basis.

At Sparksoft Corporation, we take security and protection of personal information very seriously. We will never ask you to send private personal information over email. Accordingly, we ask you to immediately contact our security team via email at [email protected] upon receiving a suspicious request.

Top Skills

AWS
Aws Cloudwatch
Bash
Docker
Groovy
Java Script
Jmeter
Kubernetes
New Relic
Performance Center
Python
Splunk
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Catonsville, MD
162 Employees
Year Founded: 2004

What We Do

Sparksoft helps the clients achieve their business objectives by providing Innovative, best-of-breed software products and technology solutions at substantial cost savings. Sparksoft Team has considerable industry experience with wide range of leading companies.

Similar Jobs

Zapier Logo Zapier

Site Reliability Engineer

Artificial Intelligence • Productivity • Software • Automation
Remote
USA
89K-133K

Nexthink Logo Nexthink

Site Reliability Engineer

Artificial Intelligence • Big Data • Information Technology • Software
Remote or Hybrid
Phoenix, AZ, USA
174K-272K

Zapier Logo Zapier

Site Reliability Engineer

Artificial Intelligence • Productivity • Software • Automation
In-Office or Remote
2 Locations
141K-212K

NBCUniversal Logo NBCUniversal

Staff Software Engineer

AdTech • Cloud • Digital Media • Information Technology • News + Entertainment • App development
Remote or Hybrid
New York, NY, USA
130K-180K Annually

Similar Companies Hiring

Credal.ai Thumbnail
Software • Security • Productivity • Machine Learning • Artificial Intelligence
Brooklyn, NY
Standard Template Labs Thumbnail
Software • Information Technology • Artificial Intelligence
New York, NY
10 Employees
PRIMA Thumbnail
Travel • Software • Marketing Tech • Hospitality • eCommerce
US
15 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account