Senior SRE

Posted 9 Hours Ago
Be an Early Applicant
Bengaluru, Bengaluru Urban, Karnataka
Senior level
Artificial Intelligence • Enterprise Web • Greentech • Machine Learning • Energy
The Role
As a Senior Site Reliability Engineer at Urbint, you will design high-availability systems, guide the development team, maintain system security, manage logging and metrics, and oversee build engineering. Your focus will be on ensuring optimal system uptime and collaborating with teams to implement best practices.
Summary Generated by Built In

Sr. Site Reliability Engineer

At Urbint, our mission is to make communities more resilient. We do this by pairing external data with artificial intelligence to identify areas of high risk and prevent catastrophic loss for utilities across the country. We are a team of close-knit engineers, entrepreneurs, and data geeks who obsess over problem-solving, new technologies and making a positive impact in our communities.

Job Summary

We are seeking a Senior Site Reliability Engineer to take charge of our servers, deployments and overall systems.

You will have a passion for the practical side of managing large, complex systems and services and planning for maximum uptime leveraging modern tools. Urbint has a mix of self-hosted services deployed within Google Cloud with most managed through Google Container Engine (Kubernetes) and a need to support on-premise deployments to address specific security postures of some clients. 

What You'll Do

  • Design High-Availability Systems - ensure that all of the systems that we deploy and depend on are configured to maintain full uptime. Planning out deployment strategies to ensure that uptime is maintained during upgrades and maintenance. Designing and building out an infrastructure-as-code project.
  • Guiding Development Team with Best Practices - working with the Development team to ensure that the software being built will be practical to deploy and maintain.
  • Maintaining System and Network Security - patch management, ensuring that dependencies are kept up to date. Staying informed about zero-day vulnerabilities and any risks that cannot be immediately patched and coming up with alternative methods to mitigate their risk.
  • Logging, Metrics and Alerting - managing and organizing an on-call schedule through Pagerduty, connected to metrics and log events. On-call responsibilities will be shared.
  • Build Engineering - managing build/deployment pipelines and ensuring best practices are followed in this.

Who You Are

  • 5+ years of experience designing and maintaining application systems
  • A friendly person first and a technologist second
  • A deep understanding of operating systems and computer architecture experience with:
    • Linux - at least 5 years
    • GCP or AWS experience - at least 3 years
    • Terraform - at least 2 years
    • Kubernetes experience - at least 2 years
    • Docker - at least 3 years
    • Monitoring systems (Graphite/prometheus/grafana/statsd/DataDog…)
    • Strong shell scripting ability
  • Solid programming abilities - to help build any glue components between service
    • Ideally professional Python dev experience
  • Excellent communication and organizational skills a must

Benefits

  • Mission Driven - Some companies use AI to serve better digital ads and trade stocks, we seek to make our communities safer and more resilient
  • Competitive compensation package
  • Generous Paid Time off, Paid Company Holidays including Mental Health Days 
  • Medical Insurance covering self, spouse, 2 children and parents/in-laws
  • Hybrid work - Monday, Tuesday and Wednesday at office; Thursday and Friday at home

We're an equal opportunity employer. All applicants will be considered for employment without attention to race, color, religion, sex, sexual orientation, gender identity, national origin, veteran or disability status.

Top Skills

Linux
Python
The Company
100 Employees
Remote Workplace
Year Founded: 2015

What We Do

Urbint is an international enterprise software company that enables utilities and infrastructure operators to manage risk and increase resiliency through AI-powered solutions. Our mission is to make communities more resilient.

Why Work With Us

In addition to our stimulating, mission-driven environment, Urbint is, in every sense of the word, diverse. We're a team made up of vastly different beliefs, preferences, backgrounds, upbringings & abilities - and this is a fact that we are immensely proud of. We are stronger, better, and thriving because of everything our team brings to the table.

Gallery

Gallery

Similar Jobs

BlackLine Logo BlackLine

Sr. Site Reliability Engineer

Cloud • Fintech • Information Technology • Machine Learning • Software • App development • Generative AI
Hybrid
Bengaluru, Karnataka, IND
1810 Employees

Atlassian Logo Atlassian

Senior Site Reliability Engineer, Customer Support Technology

Cloud • Information Technology • Productivity • Security • Software • App development • Automation
Remote
Bengaluru, Karnataka, IND
11000 Employees

Exabeam Logo Exabeam

Senior Site Reliability Engineer

Artificial Intelligence • Information Technology • Machine Learning • Security • Software • Cybersecurity • Generative AI
Hybrid
Bangalore, Bengaluru, Karnataka, IND
850 Employees
Bengalurus, Bangalore, Karnataka, IND
6000 Employees

Similar Companies Hiring

InCommodities Thumbnail
Renewable Energy • Machine Learning • Information Technology • Energy • Automation • Analytics
Austin, TX
234 Employees
RunPod Thumbnail
Software • Infrastructure as a Service (IaaS) • Cloud • Artificial Intelligence
Charlotte, North Carolina
53 Employees
Hedra Thumbnail
Software • News + Entertainment • Marketing Tech • Generative AI • Enterprise Web • Digital Media • Consumer Web
San Francisco, CA
14 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account