Data Engineer – Classical Statistics & Machine Learning

Posted 2 Days Ago
McLean, VA, USA
In-Office
Mid level
AdTech • Agency • Marketing Tech
The Role
Build and maintain ETL/ELT pipelines in a lakehouse/cloud environment, ensure data quality and lineage, and apply classical statistics and ML (anomaly detection, regression, time-series) to support operational decision-making and stakeholder reporting.
Summary Generated by Built In
Job Title: Data Engineer
Company: BLN24
About Us: We find strength in teamwork-a better you is a better us
BLN24 is an award-winning Management Consulting Firm that supports the U.S. Federal Government in successfully achieving their mission and goals. Our service and solutions delivery start with understanding each client’s end-state, and then seamlessly integrating within each Agency’s organization to improve and enhance strategic and technical operations and deployments.

Position Overview:

BLN24 is seeking a mid-level Data Engineer to support a large-scale data and analytics platform modernization effort for a federal statistical agency client. This is a hybrid role: data engineering (building and maintaining the pipelines that bring data into the platform) and applied data science (using classical statistics and machine learning to analyze that data once it’s available).
The ideal candidate is equally comfortable writing production-grade ingestion and
transformation code as they are designing and validating a statistical or ML model.
This role works closely with SMEs across multiple program areas to understand source data, build reliable ETL/ingestion pipelines, and apply analytical methods — anomaly detection, statistical modeling, and machine learning — to support operational decision-making.

Key Responsibilities:
Data Engineering
  • Design, build, and maintain ETL/ELT pipelines to ingest data from multiple source systems into the platform’s central data store
  • Develop and maintain data ingestion workflows for both batch and near-real-time sources
  • Implement data validation, cleaning, and transformation logic to ensure data quality and consistency across pipelines
  • Work within a modern lakehouse/cloud data architecture, optimizing pipeline performance and reliability
  • Build and maintain data models and schemas that support downstream analytics and reporting needs
  • Monitor pipeline health, troubleshoot failures, and implement logging/alerting for data quality issues
  • Document data lineage, transformation logic, and pipeline architecture for governance and reproducibility
Data Science / Statistics & ML
  • Apply classical statistical methods (hypothesis testing, regression, time-series analysis, distributional comparisons) to identify trends, anomalies, and outliers in operational data
  • Design and implement benchmarking approaches that compare production data against historical, modeled, or external reference values
  • Develop and evaluate machine learning models where appropriate, balancing predictive performance with interpretability for non-technical stakeholders
  • Investigate flagged anomalies by digging into underlying data to identify root causes and contributing factors
  • Work with SMEs to translate operational questions into analytical approaches, and clearly communicate statistical/ML findings and their limitations
  • Account for data sensitivity classifications and governance requirements when designing analyses and models
  • Collaborate with visualization-focused team members to ensure outputs of statistical/ML work are presented clearly to stakeholders

Required Qualifications:
  • Bachelor’s degree in Data Science, Statistics, Computer Science, Engineering, or related field (or equivalent experience)
  • 3–5 years of experience spanning both data engineering and data science/statistical analysis
  • Strong proficiency in Python, including experience with data engineering libraries (e.g., pandas, PySpark) and statistical/ML libraries (e.g., scikit-learn, statsmodels)
  • Hands-on experience building and maintaining ETL/ELT pipelines, including ingestion, transformation, and validation logic
  • Solid grounding in classical statistical methods (hypothesis testing, regression, distributional analysis) and practical machine learning techniques
  • Experience working with SQL and relational/distributed data systems
  • Ability to work within a federal data environment, including familiarity with data sensitivity tiers and access/disclosure constraints
  • Strong communication skills, with the ability to explain technical/statistical concepts to non-technical stakeholders

Preferred Qualifications:
  • Prior experience supporting federal statistical agencies or other federal data programs
  • Familiarity with Databricks or modern lakehouse architectures (Spark, Delta Lake, etc.)
  • Experience with workflow orchestration tools (e.g., Airflow, Databricks Workflows)
  • Experience designing anomaly-detection or outlier-detection approaches beyond standard threshold-based methods
  • Exposure to disclosure avoidance concepts or working with regulated/protected government data
  • Experience working across multiple coding environments (Python, R, SAS) within the same analytics platform
  • Background in requirements gathering or systems design for enterprise data platforms

Work Environment:
  • Contract position supporting a federal agency data modernization engagement
  • Collaborative, cross-functional environment working alongside data engineers, data scientists, architects, and program SMEs
  • Requires U.S. citizenship and ability to obtain a public trust or other clearance/suitability determination typical of federal contractor engagements

What BLN24 brings to the Game:
BLN24 benefits are game changing. We like our team to play hard and that means they need to be taken care of — physically, financially, and emotionally. We make sure to keep them in the game by giving them access to generous medical, dental, and vision plans.
  • You can join one of the fastest growing companies headquartered in the Washington DC Metro Area.  We give you the opportunity to work in different sectors, so you have the chance at variety while maintaining stability.
  • Flexibility at BLN24 allows each individual the opportunity to balance quality work and their personal lives. Depending on projects, we allow remote working opportunities so you can always be in the game no matter where you call home.
BLN24 is an Equal Opportunity Employer. We believe people are our strength and understand diverse talents are key to our success. We are an equal opportunity employer and place a high value on diversity and inclusion at our company. We do not discriminate on the basis of any protected attribute, including race, religion, color, national origin, gender, sexual orientation, gender identity, gender expression, age, marital or veteran status, pregnancy or disability, or any other basis protected under applicable law. In accordance with applicable law, we make reasonable accommodations for applicants' and employees' religious practices and beliefs, as well as any mental health or physical disability needs.
 

Skills Required

  • Bachelor's degree in Data Science, Statistics, Computer Science, Engineering, or related field (or equivalent experience)
  • 3-5 years of combined data engineering and data science/statistical analysis experience
  • Strong proficiency in Python (including pandas, PySpark) and familiarity with statistical/ML libraries (scikit-learn, statsmodels)
  • Hands-on experience building and maintaining ETL/ELT pipelines, including ingestion, transformation, and validation logic
  • Solid grounding in classical statistical methods (hypothesis testing, regression, distributional analysis) and practical machine learning techniques
  • Experience working with SQL and relational/distributed data systems
  • Ability to work within a federal data environment, including familiarity with data sensitivity tiers and access/disclosure constraints
  • Strong communication skills to explain technical/statistical concepts to non-technical stakeholders
  • U.S. citizenship and ability to obtain a public trust or other suitability determination
  • Prior experience supporting federal statistical agencies or federal data programs
  • Familiarity with Databricks or modern lakehouse architectures (Spark, Delta Lake)
  • Experience with workflow orchestration tools (Airflow, Databricks Workflows)
  • Experience designing anomaly-detection or outlier-detection approaches beyond threshold-based methods
  • Exposure to disclosure avoidance concepts or working with regulated/protected government data
  • Experience working across multiple coding environments (Python, R, SAS) within the same analytics platform
  • Background in requirements gathering or systems design for enterprise data platforms
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Tysons Corner, VA
26 Employees
Year Founded: 2006

What We Do

BLN24 is a digital creative agency that is an 8(a) minority-owned small disadvantaged business (SDB). We add value by merging our commercial and government experience to create award-winning solutions for clients with the most important missions globally. BLN24 is a first-class communication, marketing, and technology agency for federal agencies and Fortune 500 companies. Founded by two former Booz Allen Hamilton employees, BLN24’s industry-lead marketing and communications strategies along with production and visual design has been on web and print advertisement campaigns as well as television networks across the nation.

Similar Jobs

Comcast Logo Comcast

Account Executive

Digital Media • Information Technology • News + Entertainment
Remote or Hybrid
Virginia, USA
115000 Employees

Comcast Logo Comcast

Senior Client Partner 4th Estate Comcast Government Services

Digital Media • Information Technology • News + Entertainment
Hybrid
Reston, VA, USA
115000 Employees

Comcast Logo Comcast

Servicenow Engineer

Digital Media • Information Technology • News + Entertainment
Hybrid
Reston, VA, USA
115000 Employees

General Motors Logo General Motors

District Manager Parts & Service

Automotive • Big Data • Information Technology • Robotics • Software • Transportation • Manufacturing
Remote or Hybrid
United States
165000 Employees
81K-109K Annually

Similar Companies Hiring

ClickMint Thumbnail
AdTech • eCommerce • Marketing Tech • Generative AI
Malibu, CA
9 Employees
PRIMA Thumbnail
Travel • Software • Marketing Tech • Hospitality • eCommerce
US
15 Employees
Golden Pet Brands Thumbnail
Digital Media • eCommerce • Information Technology • Marketing Tech • Pet • Retail • Social Media
El Segundo, California
178 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account