Data Engineer

Reposted 4 Days Ago
Washington, DC
In-Office
Senior level
Artificial Intelligence
The Role
As a Data Engineer, you will design and build data pipelines and architectures to support AI/ML initiatives, ensuring data quality and scalability.
Summary Generated by Built In
Data Engineer
Washington, DC (Hybrid)

About the Role:

We are looking for a talented Data Engineer to join our growing AI team. As a Data Engineer, you will design and build the data infrastructure and pipelines that power our AI/ML capabilities. Your work will ensure that our data scientists and ML engineers have clean, reliable, and scalable data to train, evaluate, and deploy models. You will be at the center of enabling our platform’s AI capabilities by ensuring robust data systems are in place to support experimentation, production workflows, and ongoing analytics.

Key Responsibilities:
  • Design, build, and maintain scalable ETL/ELT pipelines for structured and unstructured data.
  • Develop data architectures that support large-scale training, inference, and analytics workflows.
  • Ensure data quality, governance, and lineage across multiple sources and systems.
  • Partner with data scientists and ML engineers to deliver high-quality datasets for model development.
  • Optimize data workflows for performance, scalability, and reliability on cloud platforms (AWS, GCP, Azure).
  • Leverage modern data engineering tools (e.g., Spark, Databricks, Airflow, Kafka, dbt) to support pipelines and workflows.
  • Implement monitoring, alerting, and observability for data pipelines to ensure robustness.
  • Work across teams to ensure data systems align with platform and business goals.

Qualifications:
  • 5+ years of experience as a Data Engineer or in a similar role focused on large-scale data systems.
  • Strong programming skills in Python and SQL; familiarity with Java/Scala is a plus.
  • Hands-on experience with big data frameworks (e.g., Spark, Flink, Hadoop) and workflow orchestration (Airflow, Prefect, Dagster).
  • Proven experience with cloud-based data platforms (AWS, GCP, Azure) and data lake/warehouse technologies (Snowflake, BigQuery, Redshift, Delta Lake).
  • Strong understanding of data modeling, ETL/ELT processes, and distributed data systems.
  • Experience with streaming data systems (Kafka, Kinesis, Pub/Sub) preferred.
  • Knowledge of data governance, security, and compliance best practices.
  • Strong analytical and problem-solving skills, with a focus on building maintainable, scalable systems.
  • Excellent collaboration skills and ability to work across engineering, product, and AI teams.

Top Skills

Airflow
AWS
Azure
BigQuery
Dagster
Delta Lake
Flink
GCP
Hadoop
Java
Kafka
Kinesis
Prefect
Pub/Sub
Python
Redshift
Scala
Snowflake
Spark
SQL

The Company
HQ: Washington, DC
30 Employees
Year Founded: 2021

What We Do

We help companies realize the full value of AI by empowering their teams to instantly make any application AI-intelligent.


We believe in equitable access to AI: all people should be able to adopt AI and access AI technology and insights regardless of their ethnicity, socio-economic status, educational or career level, work role, age, physical ability, or any other quality. The goal of our AI Squared technology is to level this playing field by providing AI integration technologies that increase AI adoption and make AI accessible to anyone.

This is our story. Dr. Benjamin Harvey was the Chief of Operations Data Science at the NSA, where he experienced first-hand the challenges of adopting AI, providing equitable access to AI, and unlocking the value of AI for intelligence analysts and military warfighters. His inspiration came from his two brothers, Army officers recently deployed to the Middle East (Kuwait, Iraq, and Qatar) and SW Asia (Afghanistan and Pakistan), who both relied on intelligence to achieve their missions. He set out to develop a solution to these inequities, one that simplifies and accelerates AI integration into applications, ultimately protecting the lives of Americans and of military warfighters in situations similar to his brothers'.

Our vision is to build the world’s most powerful model integration framework and AI integration software, enabling any application to become AI-powered. For the first time, data scientists can describe their models using our model integration framework, and our AI integration software will overlay intelligence from inference results directly into a pre-existing web application. We provide application developers with software (e.g., an SDK) and analysts with a no/low-code solution to customize the user experience.

