Data Scientist

Posted Yesterday
McLean, VA, USA
In-Office
Mid level
Information Technology • Software • Cybersecurity • Defense
The Role
Design, build, and operate production-scale data pipelines and ETL/ELT workflows using Spark/PySpark and Python. Orchestrate workflows (Step Functions, Airflow), containerize and deploy data applications on AWS, optimize SQL and databases, ensure data security/governance, collaborate with stakeholders, and troubleshoot data quality and performance issues.
Summary Generated by Built In

What Impact You'll Have

GRVTY is a member of 100% of the winning teams for the largest technology program in the Intel Community. We've been supporting this customer on many different sub-projects of this program since our founding in 2013. We've grown on this effort by providing the customer with Engineers who have done exceptional work, and we've retained our staff by paying very strong salaries, and working hard to ensure each Engineer is doing work that aligns with their career interest.

What You'll be Owning

GRVTY is seeking a Data Scientist with a TS/SCI + Poly clearance (applicable to this customer) to join one of our top projects in McLean, VA. The Data Scientist will be working in a fast-paced, dynamic, agile software development environment. The multi-disciplinary project team works together on multiple projects that includes automating processing of large forensic images, extracting and enriching metadata, and displaying resulting information in meaningful ways for analysts to conduct assessments. Team members utilize a mix of COTS and GOTS tools and technologies; as well as build integrations with a variety of external partner applications. Most solutions are cloud-based. The Sponsor adheres to Agile Scrum development methodology best practices and has 2-week sprint cycles. 

What You Must Have

  • Demonstrated experience building production data pipelines and ETL/ELT workflows at scale. 
  • Demonstrated experience with Apache Spark and PySpark for distributed data processing. 
  • Demonstrated experience with advanced Python programming skills including data manipulation libraries (Pandas, NumPy) and data engineering best practices. 
  • Demonstrated experience understanding data security, privacy, governance, and compliance principles. 
  • Demonstrated experience with workflow orchestration tools such as Step Functions and Airflow. 
  • Demonstrated experience with containerization such as Docker or Podman, and deploying data applications in cloud environments. 
  • Demonstrated experience with AWS services (in particular S3, Lambda, and Step Functions). 
  • Demonstrated experience with PostgreSQL and MySQL in production environments, including performance tuning and schema design. 
  • Demonstrated experience with SQL and query optimization for complex analytical workloads. 
  • Demonstrated experience with version control (Git) and CI/CD practices for data pipelines. 
  • Demonstrated experience working with stakeholders to understand data requirements, assess feasibility, and design appropriate solutions with minimal oversight. 
  • Demonstrated experience with strong problem-solving and debugging skills for data quality issues, pipeline failures, and performance bottlenecks. 

#LI-BPJ



Why Choose GRVTY

The toughest national security challenges demand vision and ingenuity, not just resources. We deliver mission and technical expertise to outpace our adversaries. We’re purpose-built to tackle the most entrenched, systemic national security issues around the world.

We partner with our customers to help them overcome challenges in every corner of technology and defense—including the ones still being explored. Our growing capabilities create complementary advantages, giving on-the-ground operations the edge they need to succeed. We muster everything we have to answer every challenge presented, every day of our lives.

At GRVTY, we believe that when our employees thrive, our company thrives. That’s why we offer a comprehensive and competitive benefits package designed to support your well-being, growth, and work-life balance.

•    Robust health plan including medical, dental, and vision  

•    Health Savings Account with company contribution  

•    Annual Paid Time Off and Paid Holidays  

•    Paid Parental Leave   

•    401k with generous company match  

•    Training and Development Opportunities  

•    Award Programs  

•    Variety of Company Sponsored Events


EEO Statement

GRVTY, is an Equal Opportunity Employer.  All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or status as a protected veteran and will not be discriminated against on the basis of disability. 

Anyone requiring reasonable accommodations should email [email protected] or call 703-544-7930 with requested details.  A member of the HR team will respond to your request within 2 business days. 

Know Your Rights: Workplace Discrimination is Illegal (eeoc.gov)  

Please review our current job openings and apply for the positions you believe may be a fit. If you are not an immediate fit, we will also keep your resume in our database for future opportunities.



Skills Required

  • Active TS/SCI with Polygraph clearance
  • Building production data pipelines and ETL/ELT workflows at scale
  • Apache Spark and PySpark for distributed data processing
  • Advanced Python programming including Pandas and NumPy
  • Understanding of data security, privacy, governance, and compliance principles
  • Experience with workflow orchestration tools such as Step Functions and Airflow
  • Containerization experience (Docker or Podman) and deploying data applications in cloud environments
  • Experience with AWS services, in particular S3, Lambda, and Step Functions
  • PostgreSQL and MySQL production experience, including performance tuning and schema design
  • SQL and query optimization for complex analytical workloads
  • Version control (Git) and CI/CD practices for data pipelines
  • Experience working with stakeholders to understand data requirements and design solutions with minimal oversight
  • Strong problem-solving and debugging skills for data quality issues, pipeline failures, and performance bottlenecks
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
1,000 Employees
Year Founded: 2025

What We Do

GRVTY is a national security and defense technology company that delivers mission and technical expertise in cyber, space, and spectrum. They provide automated ISR&T platforms, software, and data solutions to defense, intelligence, and homeland security customers, working on the frontlines against cyber adversaries.

Similar Jobs

BAE Systems, Inc. Logo BAE Systems, Inc.

Data Scientist

Aerospace • Hardware • Information Technology • Security • Software • Cybersecurity • Defense
Hybrid
Reston, VA, USA
40000 Employees
97K-165K Annually

Boeing Logo Boeing

Data Scientist

Aerospace • Information Technology • Software • Cybersecurity • Design • Defense • Manufacturing
In-Office
Herndon, VA, USA
170000 Employees
138K-186K Annually

Capital One Logo Capital One

Data Scientist

Fintech • Machine Learning • Payments • Software • Financial Services
Hybrid
McLean, VA, USA
55000 Employees
197K-225K Annually

Capital One Logo Capital One

Data Scientist

Fintech • Machine Learning • Payments • Software • Financial Services
Hybrid
2 Locations
55000 Employees
136K-169K Annually

Similar Companies Hiring

Outpost Space Thumbnail
Aerospace • Defense
US
24 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account