Senior Software Engineer - SRE

Hyderabad, Telangana, IND
In-Office
Senior level
Information Technology
The Role
The Senior SRE Engineer ensures reliability and performance of production services, manages infrastructure, and automates operational tasks while optimizing system performance and security compliance.

Who We Are

Solera is a global leader in data and software services that strives to transform every touchpoint of the vehicle lifecycle into a connected digital experience. In addition, we provide products and services to protect life’s other most important assets: our homes and digital identities. Today, Solera processes over 300 million digital transactions annually for approximately 235,000 partners and customers in more than 90 countries. Our 6,500 team members foster an uncommon, innovative culture and are dedicated to successfully bringing the future to bear today through cognitive answers, insights, algorithms, and automation.  For more information, please visit solera.com.

Job Summary:

The Senior SRE Engineer is responsible for ensuring the reliability, availability, performance, and security of on-prem infrastructure and .NET-based fleet management applications. This role blends operational excellence with strong automation, observability, and incident-response capabilities across high-scale telemetry and real-time data systems. As a core member of the development team, you will build and maintain robust, reliable infrastructure and automate operational tasks to reduce toil and improve efficiency. We’re seeking an experienced SRE to deliver insights from massive-scale data in real time.

 

Essential Responsibilities And Duties:

Ensure high availability, scalability, and resilience of production services, including APIs, .NET applications, telemetry ingestion pipelines, and on-prem infrastructure.

Run and maintain production environments by continuously monitoring system health, availability, error rates, resource saturation, and end-to-end performance.

Define, implement, and monitor SLIs/SLOs/SLAs for uptime, latency, throughput, error budgets, and system reliability.

Build and maintain software systems to automate the management of platform infrastructure, deployments, and application operations.

Measure, analyse, and optimise system performance, proactively identifying bottlenecks and driving architectural improvements.

Own incident management, including detection, triaging, mitigation, communication, root cause analysis (RCA), and post-mortems.

Design and maintain monitoring, logging, and observability frameworks (Prometheus, Grafana, Datadog, ELK, APM tools) for distributed services, microservices, telemetry workloads, and on-prem infrastructure.

Develop and enhance automation and CI/CD pipelines to reduce manual toil and improve deployment reliability.

Ensure reliability, performance, and best practices are integrated into the SDLC.

Manage and operate on-prem infrastructure, including Rancher, OpenShift, Kubernetes, virtualisation, storage, networking, and security controls.

Provision, configure, and maintain infrastructure resources using IaC tooling, automation scripts, and configuration management tools.

Implement security and compliance best practices, especially around fleet data, driver information, telemetry, GPS, and regulatory requirements.

Perform capacity planning and performance tuning for backend services, telemetry systems, and high-load ingestion pipelines.

Provide primary operational support for large-scale distributed .NET applications and fleet-critical systems.

Maintain detailed documentation on architecture, operational processes, incident playbooks, and system runbooks.

Fleet/Telematics-Specific Responsibilities:

Support real-time data ingestion pipelines (vehicle telemetry, IoT/edge devices, GPS/GNSS streams), ensuring low-latency and reliable data delivery.

Optimise backend systems for load spikes typical in fleet operations (e.g., start-of-day vehicle activations, peak trip windows).

Monitor the health of vehicle-facing and driver-facing data flows, including connectivity, message delivery, and ingestion reliability.

Enhance observability for mobile/embedded systems, considering intermittent connectivity, offline sync, and edge constraints.

Qualifications:

EDUCATION:  Bachelor’s degree in Computer Science or equivalent

EXPERIENCE: 6–8 years of relevant experience in DevOps or Site Reliability Engineering, with hands-on expertise in operating production systems, CI/CD pipelines, and distributed application platforms.

 

Knowledge/Skills/Abilities:

Strong expertise in on-prem infrastructure & container orchestration — Rancher, Kubernetes/OpenShift, Docker, virtualisation, networking, storage, IP routing, firewalls, and security controls.

Deep observability and monitoring skills using Prometheus, Grafana, Datadog, ELK, APMs, log pipelines, distributed tracing, and alerting systems like PagerDuty, with the ability to build end-to-end monitoring for APIs, .NET and Java apps, and telemetry pipelines.

Advanced reliability engineering capabilities — defining/operationalising SLIs, SLOs, SLAs, error budgets, availability models, and capacity/performance planning for large-scale distributed systems.

Strong automation and CI/CD experience with GitHub, Octopus, Jenkins/Azure DevOps, IaC (Terraform/Helm/Kustomize), and scripting (PowerShell, Bash, Python) to reduce manual toil and improve deployment reliability.

Production operations mastery — incident management (detection → triage → mitigation → RCA/post-mortem), system health monitoring, performance analysis, scalability improvements, and maintaining high uptime SLAs.

Backend performance & systems engineering skills — thread/memory profiling for .NET apps, SQL Server/NoSQL/Redis tuning, telemetry ingestion optimisation, and handling high-load fleet/telematics workloads.

Experience supporting real-time data flows & IoT/telemetry systems, including GPS/GNSS streams, vehicle connectivity, ingestion reliability, offline/edge constraints, and mobility-driven scaling patterns.

Security and compliance knowledge — secrets management, least-privilege access, vulnerability scanning, data protection practices for fleet data, driver information, and regulated telemetry workloads.

Experience with cloud platforms such as AWS (EKS, EC2, RDS, S3, VPC, IAM) is a plus, especially in hybrid on-prem + cloud environments.  

Skills Required

  • 6-8 years of relevant experience in DevOps or Site Reliability Engineering
  • Bachelor's degree in Computer Science or equivalent
  • Expertise in on-prem infrastructure and container orchestration
  • Experience with CI/CD pipelines and scripting tools

The Company
HQ: Westlake, TX
1,689 Employees
Year Founded: 2005

What We Do

Solera is a leading global provider of integrated vehicle lifecycle and fleet management software-as-a-service, data, and services. Through four lines of business – vehicle claims, vehicle repairs, vehicle solutions and fleet solutions – Solera is home to many leading brands in the vehicle lifecycle ecosystem, including Identifix, Audatex, DealerSocket, Omnitracs, eDriving/Mentor, Explore, CAP HPI, Autodata, and others. Solera empowers its customers to succeed in the digital age by providing them with a “one-stop shop” solution that streamlines operations, offers data-driven analytics, and enhances customer engagement, which Solera believes helps customers drive sales, promote customer retention, and improve profit margins. Solera serves over 300,000 global customers and partners in 100+ countries. For more information, visit www.solera.com.
