The Role
Design and build scalable data lakes, warehouses, and lakehouse architectures. Implement Python ETL/ELT pipelines, orchestrate workflows with Airflow, ingest from third-party APIs, optimize columnar storage formats, support ML initiatives, consult stakeholders, and provision cloud/on-prem infrastructure.
Summary Generated by Built In
AgileEngine is an Inc. 5000 company that creates award-winning software for Fortune 500 brands and trailblazing startups across 17+ industries. We rank among the leaders in areas like application development and AI/ML, and our people-first culture has earned us multiple Best Place to Work awards.
WHY JOIN US
If you're looking for a place to grow, make an impact, and work with people who care, we'd love to meet you!
ABOUT THE ROLE
We are looking for a Senior Data Engineer to design and build scalable data lakes, warehouses, and lakehouse architectures supporting a thematic research platform that processes large volumes of financial data daily. You will implement Python-based ETL/ELT pipelines, orchestrate workflows with Airflow, develop ingestion workflows from third-party APIs, and work with Snowflake, Spark, and AWS to deliver high-performance data infrastructure. The role combines hands-on engineering with technical consulting responsibilities, translating business goals into data architecture roadmaps.
WHAT YOU WILL DO
- Design and implement Python Data Engineering solutions;
- Design and build scalable Data Lakes, Data Warehouses, and Data Lakehouses;
- Design and implement robust ETL/ELT processes at scale using Python, incorporating modern pipeline orchestration tools like Airflow;
- Develop sophisticated ingestion workflows from diverse 3rd party APIs and data sources;
- Manage and optimize various file formats (Parquet, Avro, ORC) and columnar storage to ensure high-performance data retrieval;
- Work with AI development tools to support and accelerate ongoing development, machine learning initiatives and advanced analytics;
- Act as a technical consultant for stakeholders and leadership to gather requirements, understand business goals, and translate them into technical roadmaps;
- Work with Terraform and other tools to build AWS and on-prem infrastructure.
MUST HAVES
- You must be authorized to work for ANY employer in the US (e.g., Green card holders, TN visa holders, GC EAD, H4 EAD, U4U with EAD), as we are unable to sponsor or take over employment visa sponsorship at this time;
- Bachelor’s degree in computer science/engineering or other technical field, or equivalent experience;
- 5+ years of experience with Python;
- 5+ years of experience with data processing, manipulation, and analytics libraries like Pandas, Polars, PySpark or DuckDB;
- 2+ years of experience with Big Data technologies (Spark, Snowflake);
- Expert-level knowledge of pipeline orchestration using Airflow or similar industry-standard tools;
- Deep understanding of Medallion Architecture, columnar file formats, and diverse database technologies (SQL, NoSQL, and Lakehouse architectures);
- Proven ability to work with 3rd party APIs for complex data ingestion tasks;
- Proficiency with modern Cloud platforms (AWS, GCP, Snowflake) and advanced SQL optimization;
- Exceptional soft skills with a proven ability to gather requirements from leadership and collaborate effectively across cross-functional teams;
- Excellence in optimizing complex data pipelines and troubleshooting data latency or consistency issues in massive datasets;
- A self-starter mindset, regularly investigating more efficient data architectures and AI development tools to improve pipeline performance;
- Taking pride in data integrity and the accuracy of the end-to-end pipelines and architectures you build;
- Strong communication skills for seamless global collaboration with stakeholders and distributed teams;
- Upper-intermediate English level.
NICE TO HAVES
- Familiarity with the fintech industry, understanding of financial data, regulatory requirements, and business processes specific to the domain;
- Documentation skills to document data pipelines, architecture designs, and best practices for knowledge sharing and future reference;
- OpenSearch, Elasticsearch;
- AWS Sagemaker Studio, Jupyter for analyze data;
- Terraform;
- Scala.
PERKS AND BENEFITS
- Professional growth: Mentorship, TechTalks, and personalized growth roadmaps.
- Competitive compensation: USD-based pay with education, fitness, and team activity budgets.
- Exciting projects: Modern solutions with Fortune 500 and top product companies.
- Flextime: Flexible schedule with remote and office options.
Skills Required
- Authorized to work for any US employer without sponsorship
- Bachelor's degree in computer science/engineering or equivalent experience
- 5+ years experience with Python
- 5+ years experience with data processing/manipulation libraries (Pandas, Polars, PySpark, DuckDB)
- 2+ years experience with Big Data technologies (Spark, Snowflake)
- Expert-level pipeline orchestration using Airflow or similar
- Deep understanding of Medallion Architecture, columnar file formats (Parquet/Avro/ORC), and SQL/NoSQL/Lakehouse architectures
- Proven ability to build ingestion workflows from 3rd party APIs
- Proficiency with cloud platforms (AWS, GCP, Snowflake) and advanced SQL optimization
- Strong communication, stakeholder requirement gathering, and collaboration skills; upper-intermediate English
- Experience with fintech domain, financial data, or regulatory requirements
- Documentation skills for pipelines and architecture designs
- Familiarity with OpenSearch or Elasticsearch
- Experience with AWS SageMaker Studio or Jupyter for data analysis
- Terraform experience
- Scala knowledge
Am I A Good Fit?
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.
Success! Refresh the page to see how your skills align with this role.
The Company
What We Do
AgileEngine is a privately held company established in 2010 that builds dedicated teams of designers and developers. We turn good ideas into awesome software that people actually want to use. Some of the biggest names and the hottest startups around the world chose us to build their tech.







