Senior Site Reliability Engineer

Posted 5 Days Ago
Bangalore, Bengaluru Urban, Karnataka
In-Office
Senior level
Information Technology
The Role
Design, implement, and maintain reliable cloud infrastructure. Lead incident management, optimize performance, and mentor junior SREs. Collaborate with other teams for system reliability.

Why Lytx:
Join our dynamic and passionate team of driven, low-ego engineers who are at the forefront of designing and supporting cutting-edge IoT infrastructure. As we rapidly grow and transition to the cloud, we're diving into the exciting realms of "Operations as Code," "Infrastructure as Code," and innovative infrastructure automation.
 

Our Site Reliability Engineering (SRE) team is pivotal in ensuring the availability, reliability, observability, and resilience of Lytx's services, both on-premises and in the cloud. We're not just keeping the lights on; we're engineering the future of our business's continuity.
If you're energized by crafting transformative solutions and excel at designing robust, detailed cloud infrastructure with a focus on continuous improvement, this could be the perfect role for you!
Responsibilities:
• System Design and Architecture: Design, implement, and maintain scalable and reliable
systems, ensuring they can handle both current and future demands.
• Incident Management: Lead incident response efforts, diagnose root causes, and
implement long-term solutions to prevent recurrence. Ensure effective communication
during outages.
• Monitoring and Observability: Develop and maintain comprehensive monitoring and
alerting systems to proactively identify and address issues before they impact users.
• Automation and Efficiency: Automate repetitive tasks and processes to improve
operational efficiency and reduce manual intervention.
• Performance Tuning: Continuously optimize system performance, including fine-tuning
applications, databases, and infrastructure to meet service level objectives (SLOs).
• Capacity Planning: Forecast future system requirements based on growth trends and
current usage, and plan capacity upgrades to ensure system reliability.
• Collaboration and Mentoring: Work closely with development teams to integrate
reliability into the software development lifecycle. Mentor junior SREs and share best
practices.
• Documentation and Knowledge Sharing: Create and maintain detailed documentation on
system design, incident response procedures, and operational practices to ensure
knowledge is preserved and accessible.
Requirements:
• 5+ years of experience as an SRE within AWS environments at medium to large-scale
organizations.
• 5+ years of hands-on experience implementing and managing observability tools, such
as Prometheus, New Relic, Grafana, or similar.
• Advanced programming skills in Python, Groovy, and Bash.
• Strong understanding of database technologies, including both SQL and NoSQL
systems.
• 3+ years of experience developing and managing infrastructure deployment pipelines
using Git, Terraform, Helm, Jenkins/Jenkins X/ArgoCD, or similar tools.
• Proven expertise in designing, evaluating, and supporting production environments in
AWS, including VPCs, EKS, IAM, AMI, EC2, CloudWatch, CloudTrail, Control Tower,
GuardDuty, MSK, S3, Glacier, Gateways, Direct Connect, Route 53, RDS, ALBs,
Autoscaling, and more.
• Hands-on experience with Linux systems and with protocols and technologies such as
HTTP, REST, TCP/IP, SSL, DNS, SMTP, SSH, NTP, load balancing, SQL/NoSQL, message
brokers, Nginx, Vault, etc.
• Extensive experience with Kubernetes and various container and cloud-native
technologies.
• Significant experience in managing 24/7 on-call rotations, creating runbooks,
establishing support procedures, and proactively monitoring systems across multiple
geographic locations.
• Ability to thrive under pressure and excel in a technically challenging environment.

Innovation Lives Here

You go all in no matter what you do, and so do we. At Lytx, we’re powered by cutting-edge technology and Happy People. You want your work to make a positive impact in the world, and that’s what we do. Join our diverse team of hungry, humble and capable people united to make a difference.

Together, we help save lives on our roadways.

Find out how good it feels to be a part of an inclusive, collaborative team. We’re committed to delivering an environment where everyone feels valued, included and supported to do their best work and share their voices.

Lytx, Inc. is proud to be an equal opportunity/affirmative action employer and maintains a drug-free workplace. We’re committed to attracting, retaining and maximizing the performance of a diverse and inclusive workforce. EOE/M/F/Disabled/Vet.

Top Skills

ArgoCD
AWS
Bash
DNS
Git
Grafana
Groovy
Helm
HTTP
Jenkins
Kubernetes
Linux
Load Balancing
Message Brokers
New Relic
Nginx
NoSQL
NTP
Prometheus
Python
REST
SMTP
SQL
SSH
SSL
TCP/IP
Terraform
Vault
The Company
Framingham, MA
790 Employees
Year Founded: 1998

What We Do

Learn how Lytx video telematics can help you improve safety, efficiency, and DOT compliance in your fleet. Start improving your fleet operations today.
