Senior AI Data Engineer

Reposted 11 Days Ago
Be an Early Applicant
Hiring Remotely in Sofia, Sofia-grad, BGR
Remote
Senior level
Healthtech
The Role
Lead the design and optimization of data infrastructure for AI initiatives. Collaborate with teams to develop scalable data pipelines and ensure data quality and governance, while implementing advanced data models and monitoring systems.
Summary Generated by Built In
Internal Job Description
Role Description

We are seeking an experienced Senior Data Engineer to join our AI team. In this role, you will lead the development and optimization of data infrastructure supporting our Agentic AI initiatives. You will collaborate with ML engineers, AI scientists, and product managers to architect, implement, and maintain robust data pipelines powering autonomous AI agents. As a senior member of the R&DS AI Innovation Program, you will help shape data strategy and ensure our data solutions scale to meet the demanding requirements of next‑generation AI systems.

Key ResponsibilitiesMandatory
  • Design, develop, and maintain scalable data pipelines and ETL processes supporting AI research and development.

  • Design and maintain scalable data models (e.g., star schemas, feature‑ready datasets, semantic layers) for analytics, ML training, and agent workflows.

  • Collaborate with AI scientists and engineers to gather data requirements and ensure availability and quality.

  • Implement data governance and security measures to protect sensitive information.

  • Establish observability, lineage tracking, and monitoring frameworks to detect anomalies, freshness issues, and operational failures.

  • Implement data partitioning, indexing, and storage optimization techniques for large‑scale AI datasets.

  • Monitor and troubleshoot data pipeline issues to ensure continuity and reliability.

  • Stay current with emerging data engineering and AI technologies.

  • Drive data platform reliability, scalability, and cost optimization across cloud‑based infrastructure.

Preferred
  • Design and implement scalable, resilient data architectures for AI agent training, fine‑tuning, and inference workflows.

  • Build streaming and event‑driven pipelines enabling real‑time agent feedback, telemetry, and adaptive learning.

  • Develop and maintain high‑performance pipelines using modern orchestration frameworks to support real‑time agent interactions.

  • Create specialized storage and retrieval systems for vector embeddings, knowledge graphs, and symbolic reasoning components.

  • Implement automated data validation, schema testing, and quality checks ensuring reliable AI training datasets.

  • Implement comprehensive monitoring and governance frameworks ensuring high‑quality training data and compliance with privacy regulations.

  • Continuously optimize system performance with a focus on reducing latency for agent decision‑making.

Qualifications

Education

  • Bachelor’s or Master’s degree in Computer Science, Data Engineering, or a related field; advanced degree preferred.

Experience

  • 5+ years of professional experience in data engineering, including at least 2 years focused on ML/AI data infrastructure.

Programming & Technologies
  • Advanced proficiency in Python and Scala; experience with Rust, Go, Java, or Julia is valued.

  • Expert‑level knowledge of SQL and NoSQL databases.

  • Hands‑on experience with vector databases (e.g., Pinecone, Weaviate, Milvus).

  • Proficiency with modern data orchestration platforms (e.g., Airflow 2.x).

Cloud & Infrastructure
  • Extensive experience with at least one major cloud platform (AWS, Azure, or GCP).

  • Expertise in containerization and orchestration (Docker, Kubernetes).

  • Experience with Infrastructure as Code tooling (e.g., Terraform).

Data Processing
  • Experience with distributed computing frameworks (Spark, Dask, Ray).

  • Proficiency with streaming technologies (Kafka, Flink).

  • Knowledge of modern data lakehouse architectures.

Preferred Qualifications
  • Certifications in cloud platforms, big data technologies, engineering, or ML operations.

  • Experience collaborating with ML engineers on CI/CD pipelines for data processing and model deployment.

  • Working knowledge of ML frameworks (PyTorch, TensorFlow).

  • Experience with feature stores and experiment‑tracking platforms.

  • Understanding of LLM fine‑tuning data requirements and processing.

  • Experience developing data systems for autonomous AI agents or agentic AI applications.

  • Background in prompt engineering or retrieval‑augmented generation systems.

  • Experience with semantic caching and efficient storage/retrieval of AI‑generated artifacts.

  • Familiarity with LLM evaluation metrics and benchmarking frameworks.

  • Expertise in hybrid architectures combining traditional databases with vector stores.

  • Experience with RAG systems and related data pipelines.

  • Knowledge of RLHF data workflows.

  • Experience mentoring junior engineers, establishing best practices, and contributing to architectural decisions.

IQVIA is a leading global provider of clinical research services, commercial insights and healthcare intelligence to the life sciences and healthcare industries. We create intelligent connections to accelerate the development and commercialization of innovative medical treatments to help improve patient outcomes and population health worldwide. Learn more at https://jobs.iqvia.com

IQVIA is committed to integrity in our hiring process and maintains a zero tolerance policy for candidate fraud. All information and credentials submitted in your application must be truthful and complete. Any false statements, misrepresentations, or material omissions during the recruitment process will result in immediate disqualification of your application, or termination of employment if discovered later, in accordance with applicable law. We appreciate your honesty and professionalism.

Top Skills

Airflow
AWS
Azure
Dask
Docker
Flink
GCP
Go
Java
Julia
Kafka
Kubernetes
NoSQL
Python
Ray
Rust
Scala
Spark
SQL
Terraform
Vector Databases
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
Bangalore, Karnataka
61,500 Employees
Year Founded: 2016

What We Do

IQVIA (NYSE:IQV) is a leading global provider of advanced analytics, technology solutions, and clinical research services to the life sciences industry. IQVIA creates intelligent connections across all aspects of healthcare through its analytics, transformative technology, big data resources and extensive domain expertise. IQVIA Connected Intelligence™ delivers powerful insights with speed and agility — enabling customers to accelerate the clinical development and commercialization of innovative medical treatments that improve healthcare outcomes for patients. With approximately 70,000 employees, IQVIA conducts operations in more than 100 countries. To learn more, visit www.iqvia.com.

Similar Jobs

Smartling Logo Smartling

Don't see the role you're looking for currently available? Apply here.

Artificial Intelligence • Cloud • Information Technology • Machine Learning • Natural Language Processing • Software
Easy Apply
Remote
28 Locations
107 Employees

DraftKings Logo DraftKings

Manager, Database Infrastructure Engineering

Digital Media • Gaming • Information Technology • Software • Sports • Esports • Big Data Analytics
Remote or Hybrid
Sofia, Sofia-grad, BGR
6400 Employees

DraftKings Logo DraftKings

Software Architect

Digital Media • Gaming • Information Technology • Software • Sports • Esports • Big Data Analytics
Remote or Hybrid
Sofia, Sofia-grad, BGR
6400 Employees

DraftKings Logo DraftKings

Software Architect

Digital Media • Gaming • Information Technology • Software • Sports • Esports • Big Data Analytics
Remote or Hybrid
Bulgaria
6400 Employees
8-8 Annually

Similar Companies Hiring

Camber Thumbnail
Social Impact • Healthtech • Fintech
New York, NY
53 Employees
Sailor Health Thumbnail
Healthtech • Social Impact • Telehealth
New York City, NY
20 Employees
Granted Thumbnail
Mobile • Insurance • Healthtech • Financial Services • Artificial Intelligence
New York, New York
23 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account