Data Scientist (TS/SCI with Poly Required)

Posted 12 Days Ago
McLean, VA, USA
In-Office
Mid level
Information Technology • Software • Analytics • Cybersecurity
The Role
The Data Scientist will build production data pipelines, work with AWS and databases, and collaborate with stakeholders to meet data requirements.
Summary Generated by Built In

GCI embodies excellence, integrity and professionalism. The employees supporting our customers deliver unique, high-value mission solutions while effectively leveraging the technological expertise of our valued workforce to meet critical mission requirements in the areas of Data Analytics and Software Development, Engineering, Targeting and Analysis, Operations, Training, and Cyber Operations. We maximize opportunities for success by building and maintaining trusted and reliable partnerships with our customers and industry.

At GCI, we solve the hard problems. As a Data Scientist, your typical day will include the following duties:

Required Skills:

  • Demonstrated experience building production data pipelines and ETL/ELT workflows at scale.
  • Demonstrated experience with Apache Spark and PySpark for distributed data processing.
  • Demonstrated advanced Python programming skills, including data manipulation libraries (Pandas, NumPy) and data engineering best practices.
  • Demonstrated understanding of data security, privacy, governance, and compliance principles.
  • Demonstrated experience with workflow orchestration tools such as Step Functions and Airflow.
  • Demonstrated experience with containerization such as Docker or Podman, and deploying data applications in cloud environments.
  • Demonstrated experience with AWS services (in particular S3, Lambda, and Step Functions).
  • Demonstrated experience with PostgreSQL and MySQL in production environments, including performance tuning and schema design.
  • Demonstrated experience with SQL and query optimization for complex analytical workloads.
  • Demonstrated experience with version control (Git) and CI/CD practices for data pipelines.
  • Demonstrated experience working with stakeholders to understand data requirements, assess feasibility, and design appropriate solutions with minimal oversight.
  • Demonstrated strong problem-solving and debugging skills for data quality issues, pipeline failures, and performance bottlenecks.

Desired Skills:

  • Demonstrated experience with data lakehouse architectures using Apache Iceberg.
  • Demonstrated experience configuring, deploying, and integrating data platform components: Apache Ranger (access control and data governance), Trino (distributed SQL query engine), data catalogs (Unity Catalog OSS, Apache Polaris, etc.), and Apache Superset (data visualization and dashboarding).
  • Demonstrated experience with Bash scripting for automation and data processing tasks.
  • Demonstrated experience with Infrastructure as Code (Terraform or CloudFormation) for data infrastructure.
  • Demonstrated experience with tracking data lineage and associated tooling such as OpenLineage.
  • Demonstrated experience with Java.
  • Demonstrated experience with data quality frameworks, testing methodologies, and validation strategies.
  • Demonstrated experience or background with large-scale data migrations or platform modernization efforts.
  • Demonstrated experience integrating AI/ML services and models (translation, OCR, speech-to-text, NLP, language detection, topic modeling), LLMs, and RAG (retrieval-augmented generation) pipelines.
  • Demonstrated experience with geospatial data processing (H3, PostGIS, or similar).
  • Demonstrated experience contributing to data engineering documentation, best practices, or design patterns.
  • Demonstrated experience with NoSQL databases (DynamoDB, etc.).
  • Demonstrated excellent written and verbal communication skills with both technical and non-technical audiences.

*A candidate must be a US Citizen and hold an active/current TS/SCI with Polygraph clearance.

Equal Opportunity Employer / Individuals with Disabilities / Protected Veterans

Qualifications

Education Preferred: BA/BS or better.
Experience Required: Demonstrated related work experience.

Equal Opportunity Employer
This employer is required to notify all applicants of their rights pursuant to federal employment laws. For further information, please review the Know Your Rights notice from the Department of Labor.


The Company
HQ: Reston, Virginia
180 Employees
Year Founded: 1989

What We Do

GCI is an Engineering and IT Services company focusing on Data Analytics, Engineering, Cyber Operations, Targeting and Analysis, Operations Solutions and Training. We help our customers solve their greatest challenges by providing exceptional consulting and mission solutions.
