Senior Site Reliability Engineer
About the job
Job Title: Senior Site Reliability Engineer I
Reports to: SRE Manager
Job Location: Los Angeles, CA
Job Status: Exempt, FT
About SHEIN
SHEIN is a global online fashion and lifestyle retailer, offering SHEIN branded apparel and products from a global network of vendors, all at affordable prices. Headquartered in Singapore, with more than 15,000 employees operating from offices around the world, SHEIN is committed to making the beauty of fashion accessible to all, promoting its industry-leading, on-demand production methodology, for a smarter, future-ready industry.
Position Summary
We are looking for a Senior Site Reliability Engineer (official title is Senior Site Reliability Engineer I) with experience working in large-scale mission-critical environments with zero downtime. The SHEIN SRE team is a mix of DBA/SRE/DBRE oriented folks whose overarching goal is to provide highly available data services at scale. They strive to build an extremely reliable, performant, and secure database infrastructure through the skillful use of automation. This team is responsible for providing new architectures and scalability solutions to ever-growing business and data processing needs.
Job Responsibilities
- Work closely with cross-functional teams to ensure the company has the right set of tools to generate, collect, analyze, visualize and alert on operational data.
- Participate in an on-call rotation to ensure 24/7/365 availability of company's production system. Own & operate critical open-source services like Elasticsearch, Kafka, RabbitMQ, Redis. Build tools and design processes that help improve observability and system resiliency of the platform.
- Triage Site Availability Incidents and proactively work towards reducing MTTR for customer impacting incidents.
- Partner with Service owners to implement Service Level Metrics & Service Level Objectives. Establish design patterns for monitoring, benchmarking and deploying new features for the backend services.
- Develop and maintain technical documentation, network diagrams, runbooks, and procedures. Increase efficiency, respond to production incidents and prevent repeatable issues, improve the reliability and performance of the infrastructure.
Job Requirements
- Bachelor's degree or a foreign equivalent in Computer Science, Information Systems or a related field, plus 3 years of experience in the job offered or as a Computer Systems Engineer, Software Engineer or related job titles.
- Applicable experience must include at least 3 years of experience with: (1) supporting mission-critical, real-time, high-traffic applications in cloud environments; (2) Knowledge of Cloud systems, continuous integration/build systems, Java, SQL and NoSQL databases; (3) observability tools such as Grafana, Prometheus, Zabbix; (4) scripting/programming languages (Python or GoLang); (5) one or more OSS technologies (Elasticsearch, Kafka or Redis); (6) container technology like Docker, Kubernetes, Mesos.
Pay
$107,600-$180,200max annually. Bonus & RSU offered.
Benefits and Perks
- Healthcare (medical, dental, vision, prescription drugs)
- Health Savings Account with Employer Funding
- Flexible Spending Accounts (Healthcare and Dependent care)
- Company-Paid Basic Life/AD&D insurance
- Company-Paid Short-Term and Long-Term Disability
- Voluntary Benefit Offerings (Voluntary Life/AD&D, Hospital Indemnity, Critical Illness, and Accident)
- Employee Assistance Program
- Business Travel Accident Insurance
- 401(k) Savings Plan with discretionary company match and access to a financial advisor
- Vacation, paid holidays, floating holiday and sick days
- Employee discounts
- Free weekly catered lunch
- Dog-friendly office (available at select locations)
- Free gym access (available at select locations)
- Free swag giveaways
- Annual Holiday Party
- Invitations to pop-ups and other company events
- Complimentary daily office snacks and beverages
SHEIN Technology is an equal opportunity employer committed to a diverse workplace environment.