Sr Site Reliability Engineer
About The Opportunity
We're all about connecting hungry diners with our network of over 300,000 restaurants nationwide. User-friendly platforms and streamlined delivery capabilities set us apart in the world of online food ordering. Grubhub is a place where authentically fun culture meets innovation and teamwork. We believe in empowering people and opening doors for new opportunities. If you're looking for a place that values relationships, embraces diverse ideas-all while having fun together-then Grubhub is the place for you!
More About the Role
GrubHub is looking for an experienced SRE specialized in managing large important data persistence platforms including Cassandra and Elasticsearch on AWS. Grubhub platform supports high-volume applications in a container-based microservice architecture running on multiple AWS regions in fully Active/Active mode. The entire platform is powered by a very large multi-datacenter Cassandra infrastructure for persistence , and Elasticsearch for indexing and scaling search and content experience. You will work with a team of passionate and experienced engineers responsible for automation, scaling, tuning, and troubleshooting of Elasticsearch and Cassandra databases. You will also collaborate and work with a diverse group of engineers across the organization to design and engineer solutions.
The Impact You Will Make
- Manage large critical Cassandra and Elasticsearch clusters supporting Millions of transactions per day
- Build systems to automate all build and maintenance tasks using Ansible and python
- Develop self-service tools to allow engineers to manage and provision resources with GrubHub best practices
- Monitor cluster availability, read/ write latencies, and other important performance metrics to proactively identify SLO misses and help mitigate issues
- Evaluate new technologies and software versions. Test and develop roadmaps
- Tune Cassandra and ES databases for optimizing throughput and read /write latencies
- 24X7 on-call rotation support with rest of team for rapid incident response
- Implement DR strategies, including backups and recovery techniques with minimal downtime.
- Work with other engineers to manage our data persistence integration and performance with the GrubHub platform.
- Monitor and scale Elasticsearch/Cassandra clusters to handle growth in traffic
What You Bring to the Table
- Experience developing backend applications in Python or Java
- Experience managing, working or developing large Elasticsearch clusters in highly available 24x7 production environments
- Experience automating the maintenance of infrastructure using Python and Ansible or similar tools.
- Experience managing automated cloud infrastructures on AWS or other major cloud providers.
- Experience managing large Cassandra clusters in production is a strong plus.
- Experience working with docker is a plus
- Ability to quickly learn new concepts and technologies and adapt to changing needs
Additional Content :
- How Grubhub uses Elasticsearch
- https://www.elastic.co/videos/how-grubhub-turns-data-into-your-delicious-bites
- How Grubhub guarantees critical microservice actions
And Of Course, Perks!
- Flexible PTO . Grubhub employees enjoy a generous amount of time to recharge.
- Health and Wellness. Excellent medical, dental and vision benefits, 401k matching, employee network groups and paid parental leave are just a few of our programs to support your overall well-being.
- Compensation. You'll receive a great compensation package with eligibility for generous incentives, bonuses, commission, or RSUs (role-specific).
- Free Meals . Our employees get a weekly Grubhub credit to enjoy and support local restaurants.
- Social Impact. We believe in giving back through programs like the Grubhub Community Relief Fund, and provide our employees opportunities to support causes that are important to them.
Vaccination Requirement: Grubhub employees are required to be fully vaccinated. You must confirm vaccination status at time of hire, and must provide proof of full-Covid-19 vaccination within 2 weeks of starting employment. Fully vaccinated is defined as: "2 weeks have passed since your second dose in a 2-dose series, such as the Pfizer or Moderna vaccines, or 2 weeks after a single-dose vaccine, such as Johnson & Johnson's vaccine.
Grubhub is an equal opportunity employer. We welcome diversity and encourage a workplace that is just as diverse as the customers we serve. We evaluate qualified applicants without regard to race, color, religion, age, sex, sexual orientation, gender identity, national origin, disability, veteran status, and other legally protected characteristics. If you're applying for a job in the U.S. and need an accommodation for any part of the employment process, please send an email to [email protected] and let us know the nature of your request and contact information.
CA Privacy Notice: If you are a resident of the State of California and would like a copy of our CA privacy notice, please email [email protected].