Principal Data Engineer

Posted 17 Days Ago
Culver City, CA
5-7 Years Experience
News + Entertainment • Payments • Software
The Role
As a Principal Data Engineer at Spotter, you will be responsible for designing, building, and optimizing scalable data infrastructure. You will develop and maintain data pipelines, work with large-scale datasets, and collaborate with internal teams to make data-informed decisions. This role requires 6+ years of software engineering experience, 5+ years of data engineering experience with Apache Spark or Apache Flink, and proficiency in working with DataFrames and SQL.
Summary Generated by Built In

Overview: 

Spotter is a platform for Creators, providing services and software designed to accelerate growth for the world’s best Creators and brands. Creators working with Spotter can access the capital, knowledge, community, and personalized AI software products they need to succeed. With unique knowledge of how Creators work, the resources they need to grow, and the challenges they face, Spotter is empowering top YouTube Creators to succeed.

Spotter has already deployed over $940 million to YouTube Creators to reinvest in themselves and accelerate their growth, with plans to reach $1 billion in investment by 2024. With a premium catalog that spans over 725,000 videos, Spotter generates more than 88 billion monthly watch-time minutes, delivering a unique scaled media solution to Advertisers and Ad Agencies that is transparent, efficient, and 100% brand safe. For more information about Spotter, please visit https://spotter.com.

OVERVIEW

The successful candidate will be responsible for processing huge data sets (billions of records) using distributed data processing frameworks such as Apache Spark.

Must have:

  • Extensive experience working with very large data sets, creating performant & scalable ETL pipelines
  • In-depth understanding of performance bottlenecks in large-scale data processing

What You’ll Do:

Are you ready to help lead the charge in shaping the data-driven future of Spotter? We're in search of an exceptional Principal Data Engineer who will play a pivotal role in designing, building, and optimizing scalable data infrastructure. You will help us build data pipelines for acquiring and transforming large datasets, and optimize the storage and querying of varied data to support a wide range of use cases, from Analytics to Creator Products to Operations, using both traditional and ML-focused access patterns. You will be a key player in empowering us to make data-informed decisions that will fuel our innovation and growth.

  • Develop and maintain scalable data pipelines:
    • Build ETL pipelines as both single-node and multi-node solutions
    • Build data quality assurance steps for new and existing pipelines
    • Create derived datasets with augmented properties
    • Work on analytics-ready datasets to power internal and creator-facing tools
    • Troubleshoot issues when they arise, working directly with internal data consumers
    • Automate pipeline runs with scheduling and orchestration tools
  • Work with large-scale datasets
  • Work with/use various external APIs to enhance data
  • Set up database tables for analytics users to consume the data collected by the Data Engineering team
  • Work with big data technologies to improve data availability and data quality in the cloud (AWS)
  • Lead development of projects involving other team members and act as a mentor
  • Actively participate in team discussions about technology, architecture, and solutions for new projects and for improving existing code and pipelines

Who You Are: 

  • Bachelor’s degree, preferably in Computer Science or Computer Information Systems
  • 6+ years of software engineering experience
  • 5+ years of data engineering experience with Apache Spark or Apache Flink
  • 4+ years of experience running software and services in the cloud
  • Proficiency in working with DataFrame APIs (Pandas and Spark) for parallel and single-node processing
  • Proficiency using advanced techniques in languages such as Python and Scala, with modern data-optimized file formats such as Parquet and Avro
  • Proficiency with SQL on RDBMS and data warehouse solutions like Redshift
  • Hands-on experience with Data Lake technologies like Delta Lake and Iceberg
  • Experience with large-scale, parallel data acquisition from external APIs
  • Experience supporting ML/AI projects: deployed pipelines for computing features, using models for inference on large datasets

Additional Valued Skills: 

  • Experience with YouTube APIs
  • Experience with AWS Glue metastore
  • Experience with Data-Mesh approaches
  • Experience with data cataloging, data lineage and data governance tools and approaches
  • Experience with vector databases

Why Spotter:

  • Medical and vision insurance covered up to 100%
  • Dental insurance
  • 401(k) matching
  • Stock options
  • Complimentary gym access
  • Autonomy and upward mobility
  • Diverse, equitable, and inclusive culture, where your voice matters

In compliance with local law, we are disclosing the compensation, or a range thereof, for roles that will be performed in Culver City. Actual salaries will vary and may be above or below the range based on various factors including but not limited to skill sets; experience and training; licensure and certifications; and other business and organizational needs. The overall market range for roles in this area of Spotter is typically $100K-$500K salary per year. The range listed is just one component of Spotter’s total compensation package for employees. Other rewards may include an annual discretionary bonus and equity.

Spotter is an equal opportunity employer. Spotter does not discriminate in employment on the basis of race, religion, creed, color, national origin, ancestry, citizenship, physical or mental disability, medical condition, genetic characteristics or information, marital status, sex (including pregnancy, childbirth, breastfeeding, and related medical conditions), gender, gender identity, gender expression, age, sexual orientation, military status, veteran status, use of or request for family or medical leave, political affiliation, or any other status protected under applicable federal, state or local laws. 

Equal access to programs, services and employment is available to all persons. Those applicants requiring reasonable accommodations as part of the application and/or interview process should notify a representative of the Human Resources Department.

Top Skills

Apache Flink
Spark
The Company
HQ: Los Angeles, CA
129 Employees
On-site Workplace
Year Founded: 2019

What We Do

Spotter provides cash to Creators to grow or diversify their business while retaining their freedom.

Spotter is dedicated to empowering Creators and growing the Creator Economy. Creators receive cash for their catalogs by licensing their existing videos (and/or future video uploads) and receive a payout instantly. Creators then use the funds to fuel their growth through hiring resources, investing, or any way they choose, all while remaining independent. In addition to funding, Spotter provides Creators with in-depth data insights into the performance of all existing content to further help educate the Creator on the value of their library, the value of future uploads and how they can improve performance in the future.

Founded in 2019 to help YouTube creators scale their brands, as of January 2022, Spotter has deployed over $300 million to YouTube creators to reinvest in themselves and accelerate their growth. Spotter has licensed content that consists of over 100,000 videos, generating 40 billion monthly watch-time minutes. With our curated premium video catalog, we deliver a unique scaled media solution to Advertisers and Ad Agencies that is transparent, efficient and 100% brand safe.

Being at the forefront of the Creator Economy, we have incredible teams doing exceptional work. If you want to find a true sense of belonging with individuals who are incredibly passionate about the work they are doing, where we put people over profits, transparency over mystery and radical change over gradual growth (we go big and take ownership of any risk!), then you’ve come to the right place.
