Data Engineer

Posted 3 Days Ago
's-Gravenhage, NLD
In-Office
Mid level
Artificial Intelligence • Software • Cybersecurity
The Role
Develop and improve classified sandbox data pipelines to extract, deduplicate, convert and archive documents (office, image, video), extract metadata, build semantic search indexes, automate registrations and transfers to NATO archives, and report processing status and traceability.
Summary Generated by Built In

Spektrum have a wide range of exciting opportunities in several global locations.  We are always looking to add great new talent to our team and look forward to hearing from you.

Spektrum supports apex purchasers (NATO, UN, EU, and National Government and Defence) and their Tier 1 supplier ecosystem with a wide range of specialist services. We provide our clients with professional services, specialised aerospace and defence sales, delivery, and operational subject matter expertise. We are looking for personnel to join our team and support key client projects.

Who we are supporting 

The NATO Communications and Information Agency (NCIA) is responsible for providing secure and effective communications and information technology (IT) services to NATO's member countries and its partners. The agency was established in 2012 and is headquartered in Brussels, Belgium.

The NCIA provides a wide range of services, including:

  • Cyber Security: The NCIA provides advanced cybersecurity solutions to protect NATO's communication networks and information systems against cyber threats.
  • Command and Control Systems: The NCIA develops and maintains the systems used by NATO's military commanders to plan and execute operations.
  • Satellite Communications: The NCIA provides satellite communications services to enable secure and reliable communications between NATO forces.
  • Electronic Warfare: The NCIA provides electronic warfare services to support NATO's mission to detect, deny, and defeat threats to its communication networks.
  • Information Management: The NCIA manages NATO's information technology infrastructure, including its databases, applications, and servers.

Overall, the NCIA plays a critical role in ensuring the security and effectiveness of NATO's communication and information technology capabilities.

The program

Assistance and Advisory Service (AAS)

The NATO Communications and Information Agency (NCI Agency) is NATO’s principal C3 capability deliverer and CIS service provider. It provides, maintains and defends the NATO enterprise-wide information technology infrastructure to enable Allies to consult together under Article IV, and, when required, stand together in the face of attack under Article V.

To provide these critical services in a modern, evolving and dynamic environment, the NCI Agency needs to build and maintain a high-performing, engaged workforce. The NCI Agency workforce strategically consists of three major categories: NATO International Civilians (NICs), Military (Mil), and Interim Workforce Consultants (IWCs). The IWCs are a critical part of the overall NCI Agency workforce and make up approximately 15 percent of the total workforce.

Role ID – 2026-0095

Role Background

The NATO Communications and Information Agency (NCIA), located in The Hague, Netherlands, is currently processing vast amounts of highly varied data coming from theatre for the purpose of efficient archiving. As part of these activities, the Exploiting Data Science and Artificial Intelligence (EDS&AI) team within the NCIA Chief Technology Office is tasked with applying Big Data and AI technology to prepare, run and adjust pipelines that process various source data into archiving formats and metadata, and prepare it for (semantic) search. NATO has an obligation to support national investigations into situations that occurred in theatre. To support the different teams involved as effectively as possible, the EDS&AI team brings the expertise to extract and exploit the vast and varied data on the table, using the Agency’s high-performance computing classified sandbox. The EDS&AI team provides the core data science skills and technology needed for big data analysis and AI. The EDS&AI team applies innovative technology to data whenever it is not possible to extract value with conventional approaches.

Role Duties and Responsibilities

  • Setting up / improving pipelines that process all required documents and uniquely identify and trace decisions and processing steps. This is to be conducted in the provided classified sandbox environment, with the provided high-performance hardware and toolsets.
  • Implementing / improving (missing) pipeline steps for marking duplicate files, based on file attributes, path (structure) and content (similarity), and rules for considering a file or structure a duplicate.
  • Extracting document-format records from Functional Area Systems (FAS) databases and back-ups performed otherwise. Archiving SMEs and system SMEs are available for guidance on target formats, source system structure and data interpretation. Each FAS is processed separately.
  • Processing various office, image and video file types into the accepted archiving formats and monitoring progress, including extracting metadata and preparing semantic search indexes.
  • Automating the registration of all processed documents, with their semantic indexes, in the sandbox natural-language search tool.
  • Automating the final copy of all non-duplicate and extracted archive documents with content and metadata to the NATO archiving system.
  • Reporting status, progress and statistics of the (raw) files being processed to archive formats, metadata and search indexes.
  • Delivering full reporting of results, a trace of pipeline steps taken and (stakeholder-)accepted failures, with quarterly updates.

Essential Skills, Experience and Certifications

  • At least 3 years of practical experience in the field of data science and/or data analytics;
  • Experience using data processing/visualization/analytics software packages and development environments, preferably KNIME, VS Code, GitLab, Power BI, Jupyter Lab, and Docker-based APIs;
  • Experience with Big Data processing, creating and utilizing containerized building blocks and running containers (APIs) on Kubernetes clusters;
  • Experience with programming/scripting in languages like Python, R, SQL and working with data formats like CSV, XML, JSON;
  • Experience performing content extraction from files/databases/systems, (LLM-based) embedding models, entity extraction, keyword extraction and content similarity measures;
  • Creative, flexible and proactive in overcoming obstacles;
  • Good drafting, communication and presentation skills in English, at both technical and non-technical levels;
  • High attention to detail and accuracy.

Education

  • Master's degree in Computer Science, Engineering or a relevant field.
  • A higher degree in Data Science is preferred.

Working Location

  • The Hague, Netherlands

Working Policy

  • On-site

Travel

  • Some travel to other NATO sites may be required

Security Clearance

  • Valid National or NATO Secret personal security clearance

We never know what new opportunities might be just over the horizon. If this opportunity isn't for you, please feel free to send us your resume anyway and be the first to know if something suitable for your skills and experience comes up. 

The Company

What We Do

Spektrum Labs is building the backbone of the cyber resilience ecosystem, unifying cybersecurity, compliance, and insurance into one seamless platform powered by AI, cryptography, and automation.
