Site Reliability Engineer (SRE) / Azure Monitoring Engineer

Job Posted 16 Hours Ago Posted 16 Hours Ago
Be an Early Applicant
Chennai, Tamil Nadu
Mid level
Mobile • Software
The Role
As an Azure Edge Cloud Engineer, design and implement cloud solutions leveraging Azure, optimize edge environments, and collaborate with teams for system reliability and security.
Summary Generated by Built In

Description

At Mindera, We are seeking an experienced and highly skilled Site Reliability Engineer (SRE) / Azure Monitoring Engineer with 6 to 12+ years of hands-on experience to join our dynamic team in Chennai (Hybrid). The ideal candidate will have a strong background in managing cloud infrastructure, automation, and monitoring, with a focus on Azure cloud services and modern DevOps practices. This role is crucial to ensuring the reliability, scalability, and performance of our systems using Azure monitoring tools and practices like GitOps.

Requirements

Key Responsibilities:

Azure Infrastructure Monitoring & Optimization:

  • Develop and implement Azure monitoring solutions using tools like Azure Monitor, Application Insights, and Log Analytics to ensure the health and performance of cloud-based resources.
    Monitor and analyze system logs, metrics, and alerts from Azure services to detect and resolve issues proactively.
  • Proficiency in logging, monitoring & alerting setups of Azure 
  • Experience in on call support and usage of on call support tools.

Kubernetes & Docker Management:

  • Manage Kubernetes clusters and containerized applications using Azure Kubernetes Service (AKS).
  • Implement and maintain containerization best practices with Docker, ensuring optimal performance of containerized workloads.

Incident Management & Troubleshooting:

  • Lead and manage incident response for performance, availability, and security issues within cloud infrastructure.
  • Troubleshoot and resolve issues related to Azure services, Kubernetes, containers, and VMs, ensuring rapid resolution and minimal downtime.

Automation & GitOps Implementation:

  • Implement GitOps practices using GitHub Actions to automate deployment, monitoring, and incident management processes.
  • Automate routine monitoring and infrastructure management tasks with PowerShell and other scripting languages.

Reliability Engineering:

  • Define and manage Service Level Objectives (SLOs), Service Level Indicators (SLIs), and Service Level Agreements (SLAs) to ensure the reliability and uptime of services.
  • Continuously enhance the resilience of cloud infrastructure, applications, and services to meet high standards of performance and reliability.

Collaboration and Documentation:

  • Collaborate closely with cross-functional teams, including DevOps, development, and infrastructure teams, to ensure integrated monitoring and automation.
  • Document monitoring setups, procedures, troubleshooting guides, and incident response strategies.

Security & Compliance:

  • Ensure monitoring and automation practices are in compliance with internal security policies and best practices.
  • Perform audits and implement security controls to safeguard cloud infrastructure and sensitive data.

Required Skills and Experience:

Azure Cloud Services:

  • Extensive experience with Azure services, including Azure Monitor, Application Insights, Log Analytics, and other monitoring solutions.

Kubernetes & Docker:

  • Strong experience with Kubernetes and managing containerized applications in Azure Kubernetes Service (AKS).
  • Proficient in Docker for containerization and managing containerized environments.

Operating Systems:

  • Experience with Windows and Linux administration in cloud environments, including deployment, configuration, and troubleshooting.

Scripting & Automation:

  • Expertise in PowerShell and other scripting languages to automate monitoring and cloud infrastructure management tasks.

GitOps & CI/CD:

  • Hands-on experience with GitOps workflows and GitHub Actions for automating deployment pipelines and operational processes.

Incident Management & Troubleshooting:

  • Proven experience in incident response, troubleshooting, and resolving cloud-related infrastructure issues, ensuring rapid recovery and minimal service disruption.

Reliability Engineering & Monitoring:

  • Experience setting and managing SLOs, SLIs, and SLAs to measure and ensure system reliability and availability.

Problem-Solving & Analytical Skills:

  • Strong analytical and problem-solving skills to identify performance issues, their root causes, and to implement improvements.

Experience in Scaled applications

  • The candidate should have worked previously in scaled programmes/services that cater to high volumes of concurrent users and also demands high availability

Communication skills

  • Demonstrated experience in collaborating with clients across Europe and globally, comprehending various business requirements, and providing solutions that adhere to international standards.
  • Engagements with multiple vendors -> able to manage interactions with different parties.

Preferred Qualifications:

  • Azure Certifications:
  • Azure certifications (e.g., Azure Administrator, Azure Solutions Architect, Azure DevOps Engineer) are highly preferred.

Containerization & Orchestration Tools:

  • Familiarity with Helm for managing Kubernetes applications and other container orchestration tools.
Benefits
We Offer
    • Flexible working hours (self-managed)
    • Competitive salary
    • Annual bonus, subject to company performance
    • Access to Udemy online training and opportunities to learn and grow within the role

About Mindera

At Mindera we use technology to build products we are proud of, with people we love.

Software Engineering Applications, including Web and Mobile, are at the core of what we do at Mindera.

We partner with our clients, to understand their product and deliver high performance, resilient and scalable software systems that create an impact in their users and businesses across the world.

You get to work with a bunch of great people, where the whole team owns the project together.

Our culture reflects our lean and self-organisation attitude. We encourage our colleagues to take risks, make decisions, work in a collaborative way and talk to everyone to enhance communication.

We are proud of our work and we love to learn all and everything while navigating through an Agile, Lean and collaborative environment.

Follow our Linkedln page -

Check ot our Blog: and our Handbook:

Our offices are located: Aveiro, Portugal | Porto, Portugal | Leicester, UK | San Diego, USA | San Francisco, USA | Chennai, India | Bengaluru, India

Top Skills

Arm Templates
Azure
Docker
Edge Essentials
Github Actions
Kubernetes
Powershell
Terraform
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: San Diego, CA
490 Employees
On-site Workplace
Year Founded: 2014

What We Do

At Mindera we craft software with people we love.
Software Engineering Applications, including Web and Mobile, are at the core of what we do at Mindera.

You get to work with a bunch of great people, where the whole team owns the project together. Our culture reflects our lean and self-organization attitude. We encourage our colleagues to take risks, make decisions, work in a collaborative way and talk to everyone to enhance communication.

We partner with our clients, to understand their product and deliver high performance, resilient and scalable software systems that create an impact in their users and businesses across the world

Our offices are located in: Portugal | UK | USA | India | Romania | Brazil

Similar Jobs

Bounteous Logo Bounteous

Senior Braze Developer

Agency • Digital Media • eCommerce • Professional Services • Software • Analytics • Consulting
Hybrid
Chennai, Tamil Nadu, IND
4000 Employees

Bounteous Logo Bounteous

Senior Technical Architect, E-Commerce

Agency • Digital Media • eCommerce • Professional Services • Software • Analytics • Consulting
Chennai, Tamil Nadu, IND
4000 Employees
100K-160K

TransUnion Logo TransUnion

Lead Engineer, Java development

Big Data • Fintech • Information Technology • Business Intelligence • Financial Services • Cybersecurity • Big Data Analytics
Hybrid
Chennai, Tamil Nadu, IND
13000 Employees

Intelsat Logo Intelsat

Principal Network Reliability Engineer

Aerospace • Digital Media • Information Technology • Internet of Things • Mobile • Software
Hybrid
Chennai, Tamil Nadu, IND
2100 Employees

Similar Companies Hiring

True Anomaly Thumbnail
Software • Machine Learning • Hardware • Defense • Artificial Intelligence • Aerospace
Colorado Springs, CO
131 Employees
Caliola Engineering Thumbnail
Software • Machine Learning • Hardware • Defense • Data Privacy • App development • Aerospace
Colorado Springs, CO
53 Employees
Red 6 Thumbnail
Virtual Reality • Software • Hardware • Defense • Aerospace
Orlando, Florida
113 Employees
Not Eligible
Save
By clicking Apply you agree to share your profile information with the hiring company.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account