Apache Spark Developer

Posted 4 Days Ago
Be an Early Applicant
Herndon, VA, USA
In-Office
Senior level
Aerospace
The Role
Design, develop, and maintain Apache Spark pipelines for data processing and transformation, optimize Spark jobs, and integrate with analytics platforms and AI/ML workflows.
Summary Generated by Built In

Absolute Business Solutions Corp (ABSC) is not just another tech company. We’re a community of innovators, engineers, analysts and business professionals working together with our customers to tackle the most complex challenges. For more than 20 years we’ve supported critical DoD, IC and Federal Civilian missions and global, multi-national corporations. We specialize in supporting our clients in the Intelligence, Technology, Defense, AI/ML, and Data Science fields. As we continue to grow at a rapid pace, we are seeking some amazing new professionals to join our team.

We are actively hiring a TS/SCI-cleared Apache Spark Developer to support NGA’s Data Modernization Services (DMS) mission by building and optimizing large-scale data processing pipelines. This role focuses on developing high-performance Spark applications within a containerized, Kubernetes-based environment, supporting mission analytics, data exploitation, and AI/ML integration. The ideal candidate thrives in distributed data environments, understands performance tuning deeply, and can operate effectively in secure, air-gapped systems.

This role is on-site/flexible hours in Herndon, VA; Springfield, VA; St. Louis, MO; or Aurora, CO.

Clearance Required for this role: TS/SCI eligibility with willingness/ability to obtain CI polygraph.

Core Technology Stack

Data / Processing

  • Apache Spark (PySpark, Scala)
  • Delta Lake, Parquet
  • Structured Streaming

Infrastructure

  • Kubernetes (execution environment)
  • Docker

Storage / Cloud (Abstracted)

  • S3 / object storage
  • AWS / GCP / Azure (environment-dependent)

DevOps (Exposure Level)

  • Git, Jenkins (CI/CD)

Languages

  • Python (PySpark)
  • Scala (preferred)
  • Bash / scripting

Key Responsibilities

  • Design, develop, and maintain Apache Spark pipelines (batch and streaming) using PySpark and/or Scala
  • Process and transform large-scale datasets using modern data lake architectures (Delta Lake, Parquet)
  • Optimize Spark jobs for performance, including:

o Partitioning strategies

o Shuffle optimization

o Memory tuning

o File sizing and storage efficiency

  • Implement Structured Streaming pipelines for near real-time data processing
  • Develop and deploy Spark applications within containerized environments (Docker)
  • Execute workloads in Kubernetes clusters, supporting scalable and distributed processing
  • Integrate Spark pipelines with downstream systems, including:

o Analytics platforms (SQL, notebooks)

o AI/ML workflows and feature engineering pipelines

  • Support data ingestion and storage in object-based systems (e.g., S3-compatible storage)
  • Troubleshoot data pipeline failures and ensure reliability in mission-critical environments
  • Operate within secure, air-gapped environments, including:

o Managing dependencies without internet access

o Working within controlled network and security constraints

Required Qualifications:

  • TS/SCI (eligibility) with ability/willingness to obtain/maintain counterintelligence polygraph
  • Bachelor’s degree plus 5 years’ experience in data engineering or Spark development (will entertain additional years’ experience in lieu of degree)
  • Strong hands-on experience with:

o Apache Spark (mandatory)

o Python (PySpark)

o Data processing at scale

  • Experience working with:

o Parquet and/or Delta Lake

o Distributed data systems

  • Familiarity with:

o Docker / containerization

o Kubernetes (basic to intermediate experience)

  • Experience with object storage systems (e.g., S3 or equivalent)
  • Strong troubleshooting and performance tuning skills
  • Proficiency in Bash or scripting

Preferred Qualifications:

  • Experience with Scala for Spark development
  • Experience with Structured Streaming in production environments
  • Familiarity with Iceberg or lakehouse architectures
  • Experience with CI/CD pipelines (Jenkins, Git)
  • Exposure to Terraform or Infrastructure as Code
  • Experience supporting AI/ML data pipelines
  • Prior experience supporting NGA, IC, or DoD programs

Who we are:
ABSC is a technology and services company that combines the agility of a small business with proven processes refined over more than two decades in business. We specialize in supporting public sector clients in the Intelligence, Defense, Health, and Safety areas. Our team stands ready to deliver the next generation of programs, personnel, and solutions to help advance our federal government customers’ driving innovation, agility, and security across all mission areas.

Some of our benefits include:

  • Generous PTO plus 11 Federal Holidays
  • Retirement Planning – 401k Fully Vested with Match
  • Tuition Assistance Program – Annual contributions to help you pay down your loans
  • Annual Health and Wellness Allowance – buy an Apple Watch, a treadmill, or hit the gym on us
  • Career Development – Annual Funds to spend on Education and Training
  • Volunteer Time Off – Annually, all employees can spend 8 hours directly supporting a charity of choice
  • Charitable Match – ABSC matches an employee’s donation to a qualifying charity
  • Referral Program – We pay for internal and external referrals!
  • LOV Awards – Earn bonus awards throughout the year from our Living Our Values awards program

Apply to join our team today! We are always looking to grow our team - if you know someone who is seeking a new career opportunity, please share this job opening with them! ABSC offers generous external referral bonuses. You don’t need to be an employee to benefit from our Referral Program!

*ABSC is a proud V3, Virginia Values Vets, member which recognizes our commitment to hiring Veterans. If you are a veteran, please be sure to include that in your application. Thank you! *

Equal Opportunity Employer, including veterans and individuals with disabilities.

Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
Herndon, VA
90 Employees
Year Founded: 2001

What We Do

ABSC is a technology and services company that combines the agility of a small business with proven processes refined over more than two decades in business supporting public sector clients in the Intelligence, Defense, Health, and Safety areas. Our big data, behavioral intelligence, and predictive analytics products and services span a wide range of critical mission areas. Our intelligence and predictive analytics foundations have expanded us into various mission areas, launching commercial products, and we continue to invest and prototype AI/ML, and data science solutions. The ABSC ideal solution is comprised of a customized behavioral analytics technology designed to provide strategic and operational decision makers, analysts, and operators with the ability to gain a clearer understanding of the global operational environment. We help customers prepare, plan, and engage in full spectrum operations with confidence. Our commitment to advancing artificial intelligence and machine learning solutions through continual investment, development, and iteration is part of our corporate DNA. Our commitment coupled with our company Values, propels us to put data to work for our clients, allowing them to focus, prepare, and engage in complex global missions with confidence. Interested to learn more about our solutions as we democratize data analytics, set a higher bar for predictive analytics, and grow our team of skilled professionals? Learn more about us on our website or contact us at [email protected] . If you're looking to chat about our career opportunities send us a message at [email protected] - a new and exciting opportunity to grow with us as we expand into larger markets is waiting for you!

Similar Jobs

General Motors Logo General Motors

Sales Manager

Automotive • Big Data • Information Technology • Robotics • Software • Transportation • Manufacturing
Remote or Hybrid
United States
165000 Employees

General Motors Logo General Motors

Designer

Automotive • Big Data • Information Technology • Robotics • Software • Transportation • Manufacturing
Remote or Hybrid
United States
165000 Employees
135K-208K Annually

General Motors Logo General Motors

Commercial Zone Manager North Central Region - GM Fleet

Automotive • Big Data • Information Technology • Robotics • Software • Transportation • Manufacturing
Remote or Hybrid
United States
165000 Employees

MetLife Logo MetLife

Call Center Supervisor

Fintech • Information Technology • Insurance • Financial Services • Big Data Analytics
Remote or Hybrid
United States
43000 Employees
65K-72K Annually

Similar Companies Hiring

Red 6 Thumbnail
Aerospace • Hardware • Software • Virtual Reality • Defense
Orlando, Florida
186 Employees
Turion Space Thumbnail
Aerospace • Artificial Intelligence • Hardware • Information Technology • Software • Defense • Manufacturing
Irvine, CA
150 Employees
Outpost Space Thumbnail
Aerospace • Defense
US
24 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account