Senior Site Reliability Engineer - Big Data

San Diego, CA, USA
Employer Provided Salary: 107,600-180,200 Annually
Salary data is provided by the employer. Please note this is not a guarantee of compensation.
Sorry, this job was removed at 4:27 p.m. (CST) on Tuesday, May 14, 2024

About the job
Job Title: Senior Site Reliability Engineer I
Reports to: Senior Manager of Site Reliability Engineering
Job Location: San Diego, CA
Job Status: Exempt, FT
About SHEIN
SHEIN is a global online fashion and lifestyle retailer, offering SHEIN branded apparel and products from a global network of vendors, all at affordable prices. Headquartered in Singapore, with more than 15,000 employees operating from offices around the world, SHEIN is committed to making the beauty of fashion accessible to all, promoting its industry-leading, on-demand production methodology, for a smarter, future-ready industry.
Position Summary
We are looking for a Senior Site Reliability Engineer - Big Data (Official Title: Senior Site Reliability Engineer I) for our San Diego, CA-based office hub. Site Reliability Engineers work with the Technical Operations team at SHEIN and are hybrid software/systems engineers, whose overarching goal is to ensure that Production Services are "Always On." They strive to build the most reliable and performant systems on the planet.
SREs work closely with cross-functional teams to ensure we have the right set of tools to generate, collect, analyze, visualize and alert on operational data, so we know exactly what happens across the ecosystem and can see problems before they occur and address them as quickly as possible.
They are also responsible for improving Operational Efficiency, Utilization and System Resiliency of the Platform. They own Critical Open-Source Software that our platform relies on and are core participants in every significant engineering effort underway in the platform.
They are also tasked with improving the operability of the platform to reduce both the number of incidents and MTTR. To accomplish this, the team combines software development, networking and systems engineering expertise, and a strong desire to be challenged by problems of scale and complexity to make our service better for our customers.
Job Responsibilities

  • Participate in an on-call rotation to ensure 24/7/365 availability of SHEIN's production system
  • Supervise capacity & utilization and work closely with cross-functional teams to orchestrate scale-up/down of the services
  • Own & operate critical open-source services like Elasticsearch, Kafka, RabbitMQ, Redis
  • Build tools and design processes that help improve observability and system resiliency of the platform
  • Triage Site Availability Incidents and proactively work towards reducing MTTR for customer impacting incidents
  • Partner with Service owners to implement Service Level Metrics & Service Level Objectives that act as service level health indicators
  • Establish design patterns for monitoring, benchmarking and deploying new features for the backend services
  • Develop and maintain technical documentation, network diagrams, runbooks, and procedures
  • Drive initiatives to evolve our current platform to increase efficiency and keep it in line with current standards and best practices
  • Respond to production incidents and use your experience in software development, systems engineering, and networking to proactively prevent recurring issues
  • Provide relief and sustainable resolution to issues within our infrastructure
  • Drive initiatives with partner teams to improve the reliability and performance of the infrastructure through improved system design
  • Join a culture of intolerance for manual activity, which results in a highly automated environment delivering scalable solutions
  • Drive efficiencies through software improvement and root cause analysis, resulting in improved service delivery, maturity, and scalability


Job Requirements

  • Bachelor's degree in Computer Science, Information Systems, or equivalent technical discipline is preferred
  • Experience with Big Data related component operation and maintenance, including Hadoop, Yarn, HBase, Hive, Spark, etc., is highly preferred
  • Experience with OSS technologies, like Elasticsearch, Kafka, and Redis, is highly preferred
  • Solid understanding of Linux systems is preferred
  • Minimum of 3 years of working experience in an enterprise 24/7 production environment supporting mission-critical, real-time, high-traffic applications, especially in cloud environments, is preferred
  • Systematic problem-solving approach, combined with a sense of ownership and drive
  • Full-stack debugging and performance optimization ability, including knowledge of Cloud systems (load balancing, caching, content distribution, etc.), continuous integration/build systems, Java, SQL and NoSQL databases
  • Track record monitoring and analyzing system performance, isolating issues or bottlenecks that could impact reliability, performance and scalability
  • Strong experience with observability tools such as Grafana, Prometheus, Zabbix, etc.
  • Good experience with at least one scripting/programming language, such as Python or Go
  • Familiarity with container technology, such as Docker, Kubernetes, Mesos, etc.
  • Understanding of and experience with SRE concepts and practices, including advocating for the elimination of toil and driving simple solutions
  • Good verbal and written communication skills, and the ability to work effectively with geographically remote teams


Pay
$107,600.00 min - $180,200.00 max annually, Bonus & RSU offered.
Benefits and Perks
Healthcare (medical, dental, vision, prescription drugs)
Health Savings Account with Employer Funding
Flexible Spending Accounts (Healthcare and Dependent care)
Company-Paid Basic Life/AD&D insurance
Company-Paid Short-Term and Long-Term Disability
Voluntary Benefit Offerings (Voluntary Life/AD&D, Hospital Indemnity, Critical Illness, and Accident)
Employee Assistance Program
Business Travel Accident Insurance
401(k) Savings Plan with discretionary company match and access to a financial advisor
Vacation, paid holidays, floating holiday and sick days
Employee discounts
Free weekly catered lunch
Dog-friendly office (available at select locations)
Free gym access (available at select locations)
Free swag giveaways
Annual Holiday Party
Invitations to pop-ups and other company events
Complimentary daily office snacks and beverages
SHEIN Technology LLC is an equal opportunity employer committed to a diverse workplace environment.


Technology we use

  • Engineering
  • Product
  • People Operations
    • Languages: C#, C++, Java, JavaScript, Perl, Python, SQL, Shell
    • Libraries: jQuery, jQuery UI
    • Frameworks: Hadoop, Node.js, Spark
    • Databases: HBase, Hive, Microsoft SQL Server, MySQL
    • Analytics: Google Analytics, Tableau
    • Design: Axure, Canva, Illustrator, Photoshop
    • Management: Confluence, Google Drive, Google Docs, JIRA, Microsoft Project
    • Collaboration: Microsoft Teams, Zoom
    • Project Management: Oracle Fusion

An Insider's view of SHEIN Technology LLC

How does the company support your career growth?

Working at a start-up like SHEIN presents many opportunities for growth. As the company grows, we can and must learn and grow together. The company empowers employees in many ways, from formal courses and certifications to on-the-job learning experiences. Exceptional contributors are recognized and rewarded accordingly.

Danny Chi

Head of Governance Risk and Compliance

What are SHEIN Technology LLC Perks + Benefits

SHEIN Technology LLC Benefits Overview

• Healthcare (medical, dental, vision, prescription drugs)
• Health Savings Account with Employer Funding
• Flexible Spending Accounts (Healthcare and Dependent care)
• Company-Paid Basic Life/AD&D insurance
• Company-Paid Short-Term and Long-Term Disability
• Voluntary Benefit Offerings (Voluntary Life/AD&D, Hospital Indemnity, Critical Illness, and Accident)
• Employee Assistance Program
• Business Travel Accident Insurance
• 401(k) savings plan with discretionary company match and access to a financial advisor to meet retirement planning goals.
• Vacation-Paid time off
• Paid Holidays and Sick Days
• Employee Discounts
• Perks (HQ Location)
• Free weekly catered lunch at HQ
• Dog-Friendly office
• Free Gym Access at HQ
• Free Swag Giveaways
• Annual Holiday Party
• Invitations to pop-ups and other company events
• Complimentary daily office snacks and beverages
• Free Shuttle Service from HQ to LA Union Station

Culture
• Partners with nonprofits
• Open door policy
• OKR operational model
• Open office floor plan
• Quarterly engagement surveys

Health Insurance + Wellness
• Flexible Spending Account (FSA)
• Disability insurance
• Dental insurance
• Vision insurance
• Health insurance
• Life insurance

Financial & Retirement
• 401(K) matching

Child Care & Parental Leave
• Family medical leave
• Company sponsored family events

Vacation + Time Off
• Paid holidays
• Paid sick days

Office Perks
• Company-sponsored outings
• Free snacks and drinks
• Some meals provided
• Catered lunch weekly
• Onsite office parking
• Pet friendly
• Relocation assistance
• Onsite gym

Professional Development
• Promote from within
• Mentorship program
