Site Reliability Engineer

Posted 2 Days Ago
Be an Early Applicant
Hiring Remotely in México
Remote
Senior level
Software • Consulting
The Role
The Site Reliability Engineer ensures platform reliability and performance on AWS and Kubernetes, resolving Tier 3 issues and improving operational readiness.
Summary Generated by Built In

Important Information:

  • Years of Experience: 5+ years

  • Job Mode: Full-time

  • Work Mode: Remote within Mexico

Job Summary:
We are seeking a Site Reliability Engineer (19324) to ensure the reliability, scalability, and performance of custom platforms running on AWS infrastructure and Kubernetes. This role focuses on Tier 3 issue resolution, operational readiness for new releases, and proactive improvements to platform stability and customer experience through SRE best practices.

Responsibilities and Duties:
Troubleshoot and resolve Tier 3 platform issues for AWS-based custom applications. Collaborate closely with engineering teams to prepare Operations for new releases and feature enhancements. Identify recurring issues and implement automation, tooling, or process improvements to prevent reoccurrence. Design and implement strategies to improve platform reliability, scalability, and performance. Monitor system health and proactively identify risks or degradation. Participate in incident response, root cause analysis, and post-mortem reviews. Contribute to operational documentation, runbooks, and readiness plans. Partner with internal stakeholders to continuously enhance customer experience and platform robustness.

Qualifications and Skills:
Hands-on experience supporting and operating AWS cloud environments. Strong knowledge of Kubernetes and container orchestration concepts. Proficiency in Python or Go for automation and scripting. Experience with platform support, troubleshooting, and performance optimization. Familiarity with CI/CD pipelines, monitoring, and observability tools. Strong problem-solving abilities with an engineering-focused mindset.

Role-specific Requirements:
Ability to handle complex production incidents and drive them to resolution. Experience working closely with development teams on operational readiness. Proven ability to identify systemic issues and implement long-term solutions. Understanding of SRE principles, incident management, and reliability metrics.

Technologies:
AWS, Kubernetes, Docker, Python, Go, CI/CD pipelines, Monitoring and Observability tools, Terraform or CloudFormation (preferred)

Skillset Competencies:
Cloud Infrastructure Management, Container Orchestration, Automation and Scripting, Incident Response, Root Cause Analysis, Reliability Engineering, Cross-team Collaboration, Documentation and Operational Excellence

About Encora:
Encora is the preferred digital engineering and modernization partner of some of the world’s leading enterprises and digital native companies. With over 9,000 experts in 47+ offices and innovation labs worldwide, Encora’s technology practices include Product Engineering & Development, Cloud Services, Quality Engineering, DevSecOps, Data & Analytics, Digital Experience, Cybersecurity, and AI & LLM Engineering.
At Encora, we hire professionals based solely on their skills and qualifications, and do not discriminate based on age, disability, religion, gender, sexual orientation, socioeconomic status, or nationality.

Top Skills

AWS
Ci/Cd Pipelines
CloudFormation
Docker
Go
Kubernetes
Monitoring And Observability Tools
Python
Terraform
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
Chennai
7,456 Employees
Year Founded: 1980

What We Do

Headquartered in Santa Clara, California, and backed by renowned private equity firms Advent International and Warburg Pincus, Encora is the preferred technology modernization and innovation partner to some of the world’s leading enterprise companies. It provides award-winning digital engineering services including Product Engineering & Development, Cloud Services, Quality Engineering, DevSecOps, Data & Analytics, Digital Experience, Cybersecurity, and AI & LLM Engineering. Encora's deep cluster vertical capabilities extend across diverse industries, including HiTech, Healthcare & Life Sciences, Retail & CPG, Energy & Utilities, Banking Financial Services & Insurance, Travel, Hospitality & Logistics, Telecom & Media, Automotive, and other specialized industries.
With over 9,000 associates in 47+ offices and delivery centers across the U.S., Canada, Latin America, Europe, India, and Southeast Asia, Encora delivers nearshore agility to clients anywhere in the world, coupled with expertise at scale in India. Encora’s Cloud-first, Data-first, AI-first approach enables clients to create differentiated enterprise value through technology

Similar Jobs

Easy Apply
Remote
Guadalajara, Jalisco, MEX
1213 Employees
Remote
Querétaro, MEX
981 Employees
Remote
México
1000 Employees
Remote
2 Locations
13042 Employees

Similar Companies Hiring

Scotch Thumbnail
Software • Retail • Payments • Fintech • eCommerce • Artificial Intelligence • Analytics
US
25 Employees
Milestone Systems Thumbnail
Software • Security • Other • Big Data Analytics • Artificial Intelligence • Analytics
Lake Oswego, OR
1500 Employees
Fairly Even Thumbnail
Software • Sales • Robotics • Other • Hospitality • Hardware
New York, NY

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account