Staff Data Engineer - TS/SCI Cleared

Posted 14 Days Ago
Be an Early Applicant
Arlington, VA, USA
In-Office
Senior level
Artificial Intelligence • Information Technology • Cybersecurity • Defense
The Role
The role involves leading the development of data lakes, designing schemas, building ETL pipelines, and mentoring engineers while ensuring data quality and performance.
Summary Generated by Built In
About the Company

At Twenty, we're taking on one of the most critical challenges of our time: defending democracies in the digital age. We develop revolutionary technologies that operate at the intersection of the cyber and electromagnetic domains, where the speed of operations exceeds human sensing and complexity transcends conventional boundaries. Our team doesn't just solve problems – we deliver game-changing outcomes that directly impact national security. We're pragmatic optimists who understand that while our mission of protecting America and its allies is challenging, success is possible.

Role Summary

You will own the data infrastructure that powers Twenty’s cyber operations applications and capabilities. This role is about building a durable, high-performance data lake and the pipelines, schemas, and query patterns that make petabyte-scale datasets usable and economical. You’ll partner closely with engineers and intelligence analysts to turn messy, high-volume operational data into reliable, well-modeled systems that drive real missions. You’ll also lead technical initiatives and mentor other engineers as we scale what we can support and ship.

Who You Are
  • You think in systems: data modeling, storage formats, compute engines, and access patterns all have to fit together.

  • You’re opinionated about schema and index design, and you can explain tradeoffs clearly.

  • You default to measurable reliability: data quality, lineage, repeatability, and operational excellence.

  • You’re comfortable working with ambiguous datasets and evolving requirements without lowering standards.

  • You collaborate tightly across roles, especially with engineers and analysts who need fast, correct answers.

  • You take leadership seriously—mentoring others, raising the bar, and driving initiatives to completion.

  • You’re motivated by national security outcomes and want your work to matter in the real world.

What You’ll Do
  • Lead the development and operation of a data lake for cyber operations and intelligence data.

  • Design schemas, partitions, and indexes that make complex datasets performant and cost-effective to query.

  • Partner with engineers and intelligence analysts to define query patterns and data products for mission use cases.

  • Build and evolve ETL pipelines that are observable, recoverable, and resilient to upstream change.

  • Drive technical initiatives end-to-end, from architecture decisions through production rollout and iteration.

  • Establish best practices for data quality, documentation, and operational ownership across the platform.

  • Mentor engineers on data modeling, performance tuning, and production-grade pipeline design.

  • Identify bottlenecks in storage/compute/query layers and ship improvements with clear performance wins.

Must Have
  • You have 8+ years of experience in data engineering and/or data architecture.

  • You have mastery-level expertise building ETL pipelines and operating them in production.

  • You have deep experience with data lake architecture and systems used to query data lakes.

  • You have strong schema and index design skills, including partitioning, indexing, and clustering strategies.

  • You have experience with column-oriented databases in production environments.

  • You have built data systems from scratch (not only maintained existing platforms).

  • You have proven leadership experience mentoring engineers and driving technical initiatives.

  • You are a U.S. citizen and can meet the role’s security requirements.

Nice To Have
  • You have experience with key-value datastores.

  • You have worked with streaming and message queue systems.

  • You have experience with graph database technologies.

  • You have worked with internet/networking datasets (e.g., scan data, DNS, netflow, certificates).

  • You have experience supporting analysts or operational users with high-stakes data needs.

Tech Environment (You Might Work With)
  • Data lakes: Apache Iceberg, Delta Lake, Apache Hive

  • Query engines: Trino, Presto, AWS Athena, Apache Spark

  • Column stores: ClickHouse, Amazon Redshift, Google BigQuery

  • ETL / orchestration: Airflow, AWS Glue, NiFi, ClickPipe

  • Streaming / queues: Kafka, RabbitMQ, NATS, AWS Kinesis

  • Graph: Neo4j, AWS Neptune, Memgraph, Apache AGE

Security / Work Environment

This role requires an active TS/SCI security clearance with appropriate polygraph and the ability to maintain it. This role is on-site in Arlington, VA with occasional travel to Fort Meade, MD.

If this role sounds like you, apply and share with us your interest.

Some positions may require eligibility to obtain a U.S. Government security clearance. Any clearance requirement will be listed in the role description.

Twenty is an equal opportunity employer. We consider all qualified applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, age, veteran status, disability, or any other protected status.

If you need a reasonable accommodation during the hiring process, let us know and we will work with you.

Skills Required

  • 8+ years of experience in data engineering and/or data architecture
  • Mastery-level expertise building ETL pipelines and operating them in production
  • Deep experience with data lake architecture and systems used to query data lakes
  • Strong schema and index design skills, including partitioning, indexing, and clustering strategies
  • Experience with column-oriented databases in production environments
  • Built data systems from scratch, not only maintained existing platforms
  • Proven leadership experience mentoring engineers and driving technical initiatives
  • You are a U.S. citizen and can meet the role's security requirements
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
33 Employees
Year Founded: 2023

What We Do

Twenty builds cyber technologies that protect democracies worldwide, developing revolutionary technologies that operate at the intersection of the cyber and electromagnetic domains.

Similar Jobs

TransUnion Logo TransUnion

Technical Product Manager

Big Data • Fintech • Information Technology • Business Intelligence • Financial Services • Cybersecurity • Big Data Analytics
Hybrid
5 Locations
13000 Employees
169K-281K Annually

Zeta Global Logo Zeta Global

Lead Software Engineer

AdTech • Artificial Intelligence • Marketing Tech • Software • Analytics
Easy Apply
Remote or Hybrid
United States
2429 Employees
150K-200K Annually
Remote or Hybrid
United States
240 Employees
150K-175K Annually

General Motors Logo General Motors

Sales Manager

Automotive • Big Data • Information Technology • Robotics • Software • Transportation • Manufacturing
Remote or Hybrid
United States
165000 Employees
106K-141K Annually

Similar Companies Hiring

Bellagent Thumbnail
Artificial Intelligence • Machine Learning • Business Intelligence • Generative AI
Chicago, IL
20 Employees
Golden Pet Brands Thumbnail
Digital Media • eCommerce • Information Technology • Marketing Tech • Pet • Retail • Social Media
El Segundo, California
178 Employees
Outpost Space Thumbnail
Aerospace • Defense
US
24 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account