Senior Data Engineer

Posted Yesterday
Hiring Remotely in United States
Remote
90-125 Annually
Senior level
Digital Media • Professional Services • Consulting • Financial Services
The Role
Design, build, and own data infrastructure and ETL/ELT pipelines (Airflow, S3, Lambda) into a governed Snowflake warehouse. Implement data quality, IaC (Terraform/CDK), CI/CD, and analytics solutions (SQL, Tableau). Enable ML/LLM use cases like forecasting and anomaly detection and deliver automated reporting and data-driven marketing outputs.
Summary Generated by Built In
Who Are We?

The Motley Fool is a purpose-driven financial services company on a mission to make the world smarter, happier, and richer. For 30 years, we’ve been helping people make better investment decisions through transparency, education, and a healthy dose of Foolish fun. We’re a fast-moving, collaborative team that values high-quality work, curiosity, and initiative. We care deeply about what we do, and we’re driven by the impact our work has on real people’s financial futures.

What Does This Team Do?

We are seeking a Senior Data Engineer to design, build, and take full ownership of the data infrastructure powering our investment operations. You will own the complete data lifecycle, spanning ingestion, transformation, warehousing, analytics, and machine learning, serving as the connective tissue between raw data and the decisions that depend on it. If something breaks, you fix it. If a stakeholder needs data, you find it or build it. The Investment Committee and business partners rely on you to ensure they always have trustworthy, timely data in hand.

You will architect ETL/ELT pipelines orchestrated by Apache Airflow (AWS MWAA), build and tune a Snowflake data warehouse fed by S3 data lakes, and develop the analytical models and dashboards that turn raw data into actionable intelligence. You will also play a key role in introducing machine learning capabilities to enhance forecasting, anomaly detection, and portfolio analytics.

What Will You Do in This Role?

You will help lead the transformation of our data infrastructure, moving the team from manual processes and spreadsheet-based workflows into scalable, governed, automated data systems. This role sits at the center of investment operations, analytics, reporting, and emerging AI initiatives.

What Strategic Initiatives You Will Drive?
  • The “Golden Source” Transformation: Migrating data reliance from spreadsheets and manual processes into a governed Snowflake warehouse with documented lineage, quality checks, and self-service analytics.
  • Automated Ingestion Pipelines: Replacing manual file drops with event-driven Airflow DAGs and AWS Lambda functions that ingest, validate, and transform data from external vendors and internal systems in near real-time.
  • Infrastructure as Code: Defining all cloud infrastructure with Terraform or AWS CDK so that development, staging, and production environments are reproducible, version-controlled, and auditable.
  • Data-Powered AI Initiatives: Establishing the data foundations that enable AI across the business: clean, governed, and accessible datasets that feed AI agents, natural-language interfaces, and intelligent automation. Machine learning techniques such as anomaly detection, classification, and forecasting will augment these initiatives where appropriate.
  • Automate how we tell our story with data: Build reusable templates and automation frameworks to close the loop between our database and the materials our team uses to win and retain business. That means pulling live data into branded one-pagers, generating narrative-driven slide decks, websites, populating email campaigns, and producing social-ready content.
Okay, but what will you actually do in this role?

Data Engineering & ETL — 35%

  • Pipeline Design & Orchestration: Design, build, and maintain robust ETL/ELT pipelines using Apache Airflow (MWAA). Author DAGs that handle complex dependencies across external data vendors, internal models, and downstream consumers.
  • Data Integration: Ingest data from diverse sources including SFTP feeds, REST APIs, flat files, and third-party financial data providers. Normalize and conform data into a consistent analytical model.
  • Data Quality & Monitoring: Build “circuit breakers” into pipelines: automated data quality checks that halt downstream processing and alert the team via CloudWatch and Slack when anomalies are detected.
  • Serverless Processing: Implement AWS Lambda functions for lightweight, event-driven tasks such as triggering ingestion when files land in S3 or validating data payloads before loading.
  • Documentation: Maintain and document the data catalog so institutional knowledge lives in the system, not in your head.

Snowflake Data Warehouse & SQL — 35%

  • Warehouse Architecture: Serve as the subject-matter expert for Snowflake. Design schemas, manage data loading via Stages and Snowpipe, and implement role-based access controls.
  • Complex SQL Development: Write advanced analytical SQL: window functions, CTEs, recursive queries, pivots; to support investment reporting, performance attribution, and ad-hoc analysis.
  • Query Tuning & Optimization: Profile and optimize slow-running queries. Leverage clustering keys, micro-partition pruning, materialized views, and result caching to minimize compute cost and maximize performance.

Cloud Infrastructure & CI/CD — 10%

  • Infrastructure as Code: Define and deploy cloud resources using Terraform or AWS CDK. Treat infrastructure as software with version control, peer review, and automated testing.
  • CI/CD Pipelines: Help design and maintain CI/CD workflows with GitHub Actions for automated testing, linting, and deployment of data pipelines, infrastructure, and application code.

Analytics, Reporting & AI — 20%

  • Analytical Modeling: Partner with investment and business teams to translate questions into data models, dashboards, and reports that drive strategic decisions using Tableau.
  • Analytics Presentation: Design and build automated pipelines that pull data from source systems and render it into production-ready marketing outputs: one-pagers, pitch decks, email campaigns, and social content.
  • AI Integration: Be a resource for software engineers to build an AI layer on top of existing data infrastructure, enabling LLMs to securely query fund performance data via APIs and answer natural-language questions for internal stakeholders.
Required Experience and Skills:
  • Python Mastery: 5+ years of professional Python development. Comfortable with object-oriented design, data manipulation libraries (pandas, NumPy).
  • Familiarity with financial research data vendors and feed/API products such as CapIQ Xpressfeed, FactSet, Bloomberg, Thomson Reuters/Refinitiv/LSEG, Russell or MSCI.
  • Familiarity with financial business data and feed/API products from Broadridge, Morningstar and custodian banks and fund administrators.
  • ETL & Orchestration: Proven experience designing and operating ETL/ELT pipelines. Apache Airflow and Lambda experience is a plus.
  • Snowflake & Advanced SQL: Deep expertise in Snowflake architecture (clustering keys, micro-partitions, Snowpipe, Stages). Able to write complex analytical SQL, window functions, CTEs, recursive queries, and optimize them for cost and performance.
  • Infrastructure as Code: Hands-on experience with AWS CDK or Terraform. You define infrastructure in code, not in the AWS Console.
  • Exposure to LLM integration patterns: markdown files and prompt engineering. Experience with RAG, knowledge bases, and embeddings is a plus.
Nice-to-Have/Pluses:
  • Cloud Fluency (AWS): Working knowledge of the AWS ecosystem: Lambda, ECS Fargate, Step Functions, S3, EventBridge, CloudWatch, and RDS.
  • Experience with data visualization tools (Tableau, Streamlit, or similar) for self-service analytics.
  • Background in data governance, data cataloging, or data lineage tooling.
  • CI/CD Experience: Demonstrated experience maintaining CI/CD pipelines for automated testing and deployment of data engineering and application code using Github actions and Terraform.
  • SQL & Database Tuning: Experience profiling and optimizing queries across both OLAP (Snowflake) and OLTP (PostgreSQL/Aurora) systems. Familiarity with EXPLAIN plans, indexing strategies, and database-level performance tuning.
  • Machine Learning Fundamentals: Practical experience building, evaluating, and deploying ML models. Familiarity with common frameworks (scikit-learn, XGBoost) and an understanding of when and how to apply ML to business problems.

Compensation: 

Below is our target compensation range. While we are budget conscious, we’re also eager to find the right person for this role, so if your target is outside of this range, please don’t hesitate to apply and we’d be happy to have a conversation. 

Hourly Pay Range
$90$125 USD

By applying on this site, you acknowledge that The Motley Fool will be collecting the personal data you provide for our recruiting purposes. Please see our Applicant Privacy Notice for additional information about how we process, transfer, and store your data, including where that data is stored, and about any additional privacy rights you may have based on your jurisdiction.

Skills Required

  • 5+ years professional Python development (object-oriented, pandas, NumPy)
  • Proven experience designing and operating ETL/ELT pipelines (Apache Airflow)
  • Deep expertise with Snowflake architecture and advanced SQL (window functions, CTEs, query tuning, Snowpipe, stages)
  • Hands-on Infrastructure as Code using Terraform or AWS CDK
  • Familiarity with financial research data vendors and feeds (CapIQ, FactSet, Bloomberg, Refinitiv/LSEG, Russell, MSCI)
  • Familiarity with financial business data providers (Broadridge, Morningstar, custodian banks, fund administrators)
  • Exposure to LLM integration patterns, prompt engineering, and markdown-based approaches
  • Experience with CI/CD workflows for data pipelines and infra (GitHub Actions)
  • Experience with AWS serverless and data services (S3 event-driven ingestion, Lambda, CloudWatch)
  • Experience with data visualization and self-service analytics (Tableau, Streamlit)
  • Practical ML experience (scikit-learn, XGBoost) and applying ML to business problems
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Alexandria, VA
782 Employees
Year Founded: 1993

What We Do

The Motley Fool’s purpose is to make the world smarter, happier, and richer. While other companies may make you smarter, happier, or richer, we aim to do all three by providing outstanding business and investing advice — with a decidedly Foolish bent. Our name is in homage to the one character in Shakespearean literature — the court jester — who could speak the truth to the king and queen without having his or her head lopped off. The Fools of yore entertained the court with humor that instructed as it amused. More importantly, the Fool was never afraid to question conventional wisdom. We believe in treating every dollar as an investment in the future you want to create. We believe that investing in great businesses, for the long term, is the most effective path to wealth. We believe in the power of a community to learn and grow together. We believe in keeping score and being transparent in our investment performance. We strive to fulfill our purpose by truly serving every Fool, from our employees to our members to our community. We provide a variety of solutions to improve many areas of your financial life, including your investment portfolio, personal finances, real estate holdings, company, and career. The Motley Fool is headquartered in Alexandria, Va., with offices in Denver and Sydney, Australia, and serves investors and businesses in the UK, Australia, Hong Kong, Canada, Singapore, and Germany.

Similar Jobs

NBCUniversal Logo NBCUniversal

Senior Data Engineer

AdTech • Cloud • Digital Media • Information Technology • News + Entertainment • App development
Remote or Hybrid
New York, NY, USA
68000 Employees
115K-145K Annually

Gusto Logo Gusto

Senior Data Engineer

Fintech • HR Tech
Easy Apply
Remote or Hybrid
5 Locations
4405 Employees
155K-220K Annually

Optum Logo Optum

Senior Data Engineer

Artificial Intelligence • Big Data • Healthtech • Information Technology • Machine Learning • Software • Analytics
In-Office or Remote
Eden Prairie, MN, USA
160000 Employees
92K-164K Annually

Jellyfish Logo Jellyfish

Senior Data Engineer

Big Data • Cloud • Productivity • Software • Database • Analytics • Automation
Remote or Hybrid
United States
225 Employees
190K-240K Annually

Similar Companies Hiring

Hanover Park Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
31 Employees
Golden Pet Brands Thumbnail
Digital Media • eCommerce • Information Technology • Marketing Tech • Pet • Retail • Social Media
El Segundo, California
178 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account