Mid-Level Data Engineer

Posted Yesterday
Hiring Remotely in USA
Remote
Mid level
Cloud • Information Technology • Software
The Role
Build and maintain AWS-based ETL pipelines (Glue/PySpark, MWAA/Airflow, Lambda) to ingest and process terabytes of financial data into an S3 data lake using Iceberg/Parquet. Implement metadata, monitoring, tests (90% coverage), materialized views (Trino/Athena), CloudFormation deployments, documentation, and support agile delivery and operations for federal clients.
Summary Generated by Built In

At Simple Technology Solutions, our people are our priority. We know our team members are more than employees—they’re parents, friends, volunteers, artists, and athletes. That’s why we offer flexibility to help them thrive personally and professionally while delivering exceptional solutions to our Federal Government clients.

Our culture is built on collaboration, continuous learning, and excellence. We are mentors and thought leaders who share knowledge and foster growth. Recognized as a “Best Place to Work,” we believe a range of perspectives helps us drive innovation and exceed customer expectations. At STS, taking care of our people isn’t a perk—it’s the standard.

As a HUBZone company, we also offer special incentives for team members living in qualified HUBZones. Check out the HUBZone map HERE to see if you qualify!

Simple Technology Solutions is looking for a Mid-Level Data Engineer to add to our team.

Quick Position Overview:

  • US Citizenship is required
  • Bachelor's Degree is required
  • minimum of 3-5 years' position related experience is required

The Role: 

STS is looking for a Mid-Level Data Engineer to join a federal data engineering team. You will work alongside senior engineers building and maintaining ETL pipelines on a cloud-based Enterprise Data Platform (EDP) built on AWS, working at enterprise scale — processing terabytes of financial data across a large portfolio of automated pipelines — as part of an agile team building systems that support critical government functions. A willingness to learn, strong attention to detail, and a team-first mindset are prerequisites for this position. 

 

This position is contingent upon contract award. 

The Mid-Level Data Engineer at STS will: 

  • Develop new ETL pipelines and data ingestion processes alongside senior engineers using AWS Glue (Spark-based, PySpark), MWAA (Airflow), Lambda, and SNS, fully conforming to the agency's Enterprise ETL Standards, ETL Common Library, and PEP 8 Python coding standards 
  • Integrate the agency's ETL Common Library into Glue jobs for standardized orchestration, error handling, metadata recording, and SNS notifications for all success and error job events 
  • Ingest structured and semi-structured datasets (CSV, XML, JSON, Avro, pipe-delimited) into S3 landing, raw, and curated zones using Apache Iceberg tables with Parquet as the default format; enforce transactional loading and prevent duplicate loads per dataset reporting period 
  • Configure static ETL metadata in the centralized PostgreSQL metadata store; ensure dynamic metadata records job status and timestamps for all key execution steps 
  • Monitor assigned production jobs and participate in operations support rotations; identify and escalate failed jobs and performance issues promptly to maintain data availability within contractually required ingestion timelines 
  • Ensure ETL Load Reports are populated in real-time and ETL Gap Reports are updated on a weekly basis covering all gaps from the inception of the initial ingest process 
  • Build and maintain materialized views and semantic layer objects in Trino and Athena to ensure optimized query performance and consistent business logic 
  • Produce and maintain required documentation for each assigned dataset: Business Requirements, ETL Design Documents, Data Models (Mermaid format), Data Dictionaries, Mapping Documents, Deployment Documents, O&M Guides, and ETL Test Plans 
  • Write unit and integration tests achieving the 90% minimum code coverage threshold; complete security scans at least once per sprint as part of the Definition of Done 
  • Deploy ETL resources using CloudFormation templates through the agency CICD pipeline; submit Change Requests to the Change Control Board within required timelines 
  • Support transition of ETL jobs from other agency teams by verifying standards conformance, performing deployments, and validating data loads 
  • Support disaster recovery exercises, pre-production deployments, and ad hoc data requests as assigned 
  • Participate in 2-week sprint ceremonies, quarterly PI planning, backlog refinement, and agile delivery using JIRA and GitHub 

 

Education and Experience: 

 

Required 

 

  • Bachelor's degree or higher in Computer Science, Information Systems, Data Engineering, or a related field 
  • 3-5 years of experience in data engineering or a closely related technical role 
  • Hands-on experience with Python (PEP 8), PySpark, and SQL for ETL pipeline development 
  • Experience with AWS services including Glue, S3, MWAA (Airflow), Lambda, SNS, and SQS 
  • Familiarity with Apache Iceberg, Parquet, and ORC file formats and S3 data lake zone concepts 
  • Experience with PostgreSQL and basic familiarity with Redshift or Oracle 
  • Familiarity with Trino or Athena for query and semantic layer development 
  • Experience with CloudFormation, GitHub branching workflows, and CI/CD-integrated deployments 
  • Ability to produce clear ETL documentation including data models (Mermaid format) and data dictionaries 
  • Understanding of ETL metadata concepts including static and dynamic metadata, load reports, and gap reports 
  • Experience in agile development environments with sprint-based delivery 
  • Experience supporting IV&V and/or User Acceptance Testing (UAT) processes in a federal or technical program environment 
  • Experience with automated testing frameworks; ability to write unit and integration tests achieving defined code coverage thresholds 
  • Familiarity with FISMA, NIST 800-53, and OWASP ASVS Level 2 is a plus 
  • Must be able to work 8am-5pm Eastern Time regardless of home location 
  • Active federal public trust suitability determination or ability to obtain one required 
STS is committed to equal employment opportunity and merit-based employment practices. STS provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination, harassment, and retaliation in all employment practices and decisions in accordance with applicable federal, state, and local laws.

Employment decisions at STS are based on individual qualifications, performance, skills, and business needs, without regard to race, color, religion, sex, national origin, age, disability, protected veteran status, sexual orientation, gender identity, genetic information, marital status, or any other status protected by applicable law.

This policy applies to all terms and conditions of employment, including recruiting, hiring, placement, promotion, compensation, training, transfer, discipline, termination, layoff, recall, and leaves of absence.
---
Applicants may request removal from our applicant database, or specific information about how the data is used by contacting [email protected].

Skills Required

  • US Citizenship
  • Bachelor's degree in Computer Science, Information Systems, Data Engineering, or related field
  • 3-5 years of experience in data engineering or closely related technical role
  • Hands-on experience with Python (PEP 8), PySpark, and SQL for ETL pipeline development
  • Experience with AWS services including Glue, S3, MWAA (Airflow), Lambda, SNS, and SQS
  • Familiarity with Apache Iceberg, Parquet, and ORC file formats and S3 data lake zone concepts
  • Experience with PostgreSQL and basic familiarity with Redshift or Oracle
  • Familiarity with Trino or Athena for query and semantic layer development
  • Experience with CloudFormation, GitHub branching workflows, and CI/CD-integrated deployments
  • Ability to produce clear ETL documentation including data models (Mermaid format) and data dictionaries
  • Understanding of ETL metadata concepts including static and dynamic metadata, load reports, and gap reports
  • Experience in agile development environments with sprint-based delivery (JIRA, GitHub)
  • Experience supporting IV&V and/or User Acceptance Testing (UAT) processes in a federal or technical program environment
  • Experience with automated testing frameworks; ability to write unit and integration tests achieving defined code coverage thresholds
  • Must be able to work 8am-5pm Eastern Time regardless of home location
  • Active federal public trust suitability determination or ability to obtain one
  • Familiarity with FISMA, NIST 800-53, and OWASP ASVS Level 2
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Washington, DC
68 Employees
Year Founded: 2013

What We Do

Simple Technology Solutions (STS) has in-depth experience helping federal & state government agencies in their Agile/DevOps/Cloud adoption & transformation. We understand how people, process, and technology work together to dramatically transform how government IT organizations improve their ability to deliver quality software & cloud-enable IT services to their end-users. In addition to being experienced government contractors, we are experienced coaches and thought-leaders guiding government IT leaders through the Agile/DevOps/Cloud mental, skill & organizational shifts. To our government customers, we are a small business with a big knowledge of the DHS mission and technology landscape. To our industry customers, we are a deeply connected, knowledgeable partner that you can trust as a services provider or strategic guide. STS is transforming complex into simple. KEEP I.T. SIMPLE!

Similar Jobs

MWDN Logo MWDN

Data Engineer

Information Technology • Consulting
Remote
Georgia, USA
143 Employees
Easy Apply
Remote
United States
55 Employees
110K-150K Annually

Zeta Global Logo Zeta Global

Associate Vice President, Paid Search - EDU

AdTech • Artificial Intelligence • Marketing Tech • Software • Analytics
Easy Apply
Remote or Hybrid
United States
2429 Employees
140K-160K Annually

UL Solutions Logo UL Solutions

Senior Project Lead, Energy Storage Advisory

Automotive • Professional Services • Software • Consulting • Energy • Chemical • Renewable Energy
Remote or Hybrid
Chicago, IL, USA
15000 Employees
60K-170K Annually

Similar Companies Hiring

Golden Pet Brands Thumbnail
Digital Media • eCommerce • Information Technology • Marketing Tech • Pet • Retail • Social Media
El Segundo, California
178 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account