Data Engineer

Posted 14 Days Ago
New York, NY
In-Office
Mid level
Artificial Intelligence
The Role
The Data Engineer will design and maintain scalable data pipelines, integrate APIs, support ML teams, and optimize data systems for performance and reliability.
Summary Generated by Built In

Role Overview

We are looking for a high-caliber Data Engineer who can architect and scale the data systems that power our AI workflows. You’ll be responsible for building reliable data pipelines, integrating external APIs, maintaining clean and structured data models, and enabling the product and ML teams to iterate quickly.

You should thrive in ambiguous environments, enjoy wearing multiple hats, and be comfortable designing end-to-end data solutions with minimal direction.

What You’ll Own
  • Design, build, and maintain scalable data pipelines that process and transform large volumes of structured and unstructured data.

  • Manage ingestion from third-party APIs, internal systems, and customer datasets.

  • Develop and maintain data models, data schemas, and storage systems optimized for ML and product performance.

  • Collaborate with ML engineers to prepare model-ready datasets, embeddings, feature stores, and evaluation data.

  • Implement data quality monitoring, validation, and observability.

  • Work closely with product engineers to support new features that rely on complex data flows.

  • Optimize systems for performance, cost, and reliability.

  • Contribute to early architecture decisions, infrastructure design, and best practices for data governance.

  • Build tooling that enables the entire team to access clean, well-structured data.

Who You Are

Builder Mentality

You’re a hands-on engineer who thrives in a fast-paced environment, enjoys autonomy, and takes ownership of problems from start to finish.

Strong Communication

You translate technical complexity into clarity. You work well with ML, product, and go-to-market (GTM) partners.

Practical, Not Academic

You can design elegant systems but default to shipping solutions that work and can be iterated on.

Detail-Oriented & Reliable

You care about clean pipelines, reproducibility, and data correctness.

What You Bring
  • 3+ years of experience as a Data Engineer, ML Engineer, Backend Engineer, or similar.

  • Proficiency in Python, SQL, and modern data tooling (dbt, Airflow, Dagster, or similar).

  • Experience designing and operating ETL/ELT pipelines in production.

  • Experience with cloud platforms (AWS, GCP, or Azure).

  • Familiarity with data lakes, warehouses, and vector databases.

  • Experience integrating APIs and working with semi-structured data (JSON, logs, event streams).

  • Strong understanding of data modeling and optimization.

  • Bonus: experience supporting LLMs, embeddings, or ML training pipelines.

  • Bonus: startup experience or comfort working in fast, ambiguous environments.

What Success Looks Like
  • Stable, documented, testable pipelines powering ML and product features.

  • High-quality data consistently available for analytics, modeling, and core product workflows.

  • Faster iteration cycles for the Engineering and ML teams due to improved tooling.

  • Clear visibility into data quality and reliability.

  • Strong cross-functional collaboration and communication.

Why Artisan
  • Build core systems at the heart of a fast-growing AI company.

  • High autonomy, high impact, zero bureaucracy.

  • Work with a talented, ambitious team solving meaningful problems.

  • Shape the data platform from the ground up.

Top Skills

Airflow
AWS
Azure
Dagster
dbt
GCP
Python
SQL

The Company
HQ: San Francisco, CA
0 Employees

What We Do

At Artisan, we're creating AI Employees, called Artisans, and software that is beautiful, easy to use, and replaces the endless stack of point solutions.

We're starting with outbound sales. Our platform contains every tool needed for outbound sales: B2B data, AI email sequences, deliverability optimization tools, and more.

Ava, our AI BDR, operates within our platform. She automates:
- Lead discovery with access to over 300M B2B contacts
- Lead research across dozens of data sources
- Choosing a sales strategy and writing and sending hyper-personalized emails
- Managing deliverability with a suite of tools, from email warmup to placement tests

We're on a mission to create the final boss of software, with every SaaS product needed for sales and AI employees consolidated together in one exceptional platform.

This is the next Industrial Revolution.
