Senior Data Engineer

Reposted 3 Days Ago
San Francisco, CA
In-Office
170K-240K Annually
Senior level
Edtech
The Role
As a Senior Data Engineer at Speak, you'll design and implement data infrastructure, optimize architectures, support ML projects, and collaborate across teams to enhance language learning experiences.
Summary Generated by Built In
About us

Our mission is to reinvent the way people learn, starting with language.

Learning a language can change a life by opening doors to new cultures, careers, and communities. Two billion people around the world are actively trying to learn a language, but the best way to learn (one-on-one tutoring) is hard to access at scale and hasn’t been meaningfully improved in decades. Speak is building a human-level, AI-powered tutor in your pocket: a conversation-first experience that lets learners actually speak, get instant feedback, and progress through carefully designed lessons. The result is a complete path from beginner to confident speaker across multiple languages.

Speak first launched in South Korea in 2019, where Speak has now become the number one language learning app, and we now serve learners across many markets and 15+ languages. Speak is one of the world’s leading AI companies, with over $150m raised in venture investment from OpenAI, Accel, Founders Fund, Khosla Ventures, and more, with a distributed team across San Francisco, Seoul, Tokyo, Taipei, and Ljubljana.

About this role

As a Data Engineer at Speak, you'll play a pivotal role in shaping the future of digital language learning, propelling us towards our mission of making language proficiency accessible to millions worldwide.

Your responsibilities will span the crucial intersection of data infrastructure and analytics, from managing scalable data pipelines, to deploying sophisticated analytics solutions that drive personalized learning experiences. You'll work closely with our product, engineering, and analytics teams to ensure that our platform, powered by cutting-edge technology, is not only robust but also delivers actionable insights that enhance user engagement and learning outcomes.

What you'll be doing
  • Design and Build Data Infrastructure: You'll architect and implement robust, scalable data pipelines using Airflow for orchestration and dbt for transformation that ensure efficient data flow and processing. Your work will be critical in managing the ingestion, storage, and accessibility of data from various sources, ensuring our platform's backbone is strong and reliable.

  • Enable Data-Driven Decisions: By collaborating with cross-functional teams, you will develop and deploy tools and frameworks that facilitate data access and analysis, empowering product and business teams to make informed decisions. 

  • Optimize Data Architecture: Constantly evaluate and refine the data architecture to support our growing data needs and ensure optimal performance. This includes managing a data warehouse and various data sources, as well as implementing best practices for data modeling, data quality, and data governance.

  • Support Machine Learning Projects: Work closely with analysts and machine learning engineers by providing them with clean, structured data for building and deploying predictive models that enhance personalized learning experiences and engagement strategies.

  • Innovate and Experiment: Stay ahead of the curve by researching and implementing cutting-edge technologies and methodologies in data engineering and analytics.

  • Collaborate Across Teams: As a key player in the engineering team, you'll work closely with product managers, analysts, and other engineers to bring data-driven products and features from concept to launch.

What we're looking for
  • 5+ years of relevant experience

  • Data Modeling: Deep understanding of big data warehouses (BigQuery, Snowflake, Redshift), theories, principles, and practices. Ability to design, implement, and manage data warehouses effectively.

  • Programming Skills: Strong programming skills in Python and SQL. Ability to write efficient, reliable, and maintainable code.

  • Data Pipeline and ETL Development: Experience in building and optimizing data pipelines, architectures, and datasets. Familiarity with ETL (extract, transform, load) processes and tools.

  • Big Data Technologies: Experience with end-to-end data platform beyond creating pipelines, such as data ingestion, reverse ETL, visualization, data observability, etc.

  • Cloud Computing: Knowledge of cloud services (GCP, AWS, dbt) and understanding of how to leverage them for data processing and storage solutions.

  • Data Analysis and Visualization: Ability to analyze data to identify patterns, anomalies, and insights. Proficiency in using data visualization tools (e.g. Mode) to communicate findings clearly.

  • Debugging Skills: Strong problem-solving skills and the ability to approach complex challenges methodically including data inconsistency issues.

  • Effective Communication: Ability to communicate technical information to non-technical stakeholders clearly and effectively. This includes writing documentation, presenting findings, and collaborating on projects.

Office
  • San Francisco, CA

Why work at Speak
  1. Join a fantastic, tight-knit team at the right time: we're growing very quickly, we've most recently raised our Series C from some of the top investors in the valley, and we've achieved product-market fit in our initial markets. You'd join at a magical time when a single person could significantly change the course of the company.

  2. Do your life's work with people you’ll love working with: we care strongly about our craft and want every person at Speak to feel like they're growing every day. We believe in the idea that working with people you both enjoy and have respect for makes everything better. We hire thoughtfully and only work with people we admire deeply.

  3. Global in nature: We're live in over 40 countries and launching in a number of new markets soon. We have dedicated offices in San Francisco, Ljubljana, Seoul, and Tokyo, and you’ll have the opportunity to talk to users in each of these regions on a regular basis as well as travel.

  4. Impact people's lives in a major way: Learning a language is one of the single most life-changing skills one can learn, and right now 99% of people never achieve their goal because the process is broken. We’re helping millions of people achieve their goals and improve their lives.

Speak does not discriminate based upon race, religion, color, national origin, gender (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics.

Top Skills

Airflow
AWS
BigQuery
Dbt
GCP
Mode
Python
Redshift
Snowflake
SQL
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
San Francisco, CA
191 Employees

What We Do

Speak is a language learning app that sets users on the path to fluency with the world’s most advanced AI tutor. Built on the core learning philosophy of getting users speaking out loud as much as possible, Speak's AI language-learning experience encourages dynamic two-way dialogue through personalized content and real-time speech recognition.

Speak was founded by Connor Zwick and Andrew Hsu in 2016 to democratize access to high quality language education through AI. Backed by Y Combinator, Open AI, Founders Fund, Khosla Ventures, Matrix Partners and more, Speak is a series B startup with a global presence and offices in San Francisco, Seoul, Tokyo, and Ljubljana.

Featured by Apple as the ‘App of the Day’ and ‘Best New App’, Speak is hiring across the globe. Come join us as we teach the next billion people English and reinvent the way the world learns, staring with language

Similar Jobs

CrowdStrike Logo CrowdStrike

Senior Data Engineer

Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
Remote or Hybrid
37 Locations
145K-220K Annually

Circle Logo Circle

Data Engineer

Blockchain • Fintech • Payments • Financial Services • Cryptocurrency • Web3
In-Office
San Diego, CA, USA
148K-195K Annually

Circle Logo Circle

Data Engineer

Blockchain • Fintech • Payments • Financial Services • Cryptocurrency • Web3
In-Office
Los Angeles, CA, USA
148K-195K Annually

Circle Logo Circle

Data Engineer

Blockchain • Fintech • Payments • Financial Services • Cryptocurrency • Web3
In-Office
San Francisco, CA, USA
148K-195K Annually

Similar Companies Hiring

ReUp Education Thumbnail
Social Impact • Edtech
Austin, TX
180 Employees
Stepful Thumbnail
Software • Healthtech • Edtech • Artificial Intelligence
New York, New York
60 Employees
Learneo Thumbnail
Software • Machine Learning • Edtech • Artificial Intelligence
NL
397 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account