Internship September 2026 - Agentic AI for Semi-Automating the Data Science Workflow

Posted 6 Days Ago
Be an Early Applicant
Paris, Île-de-France, FRA
Hybrid
Internship
Artificial Intelligence • Database • Analytics
The Role
The intern will design and build a multi-agent system to semi-automate the data science workflow, collaborating with senior data scientists to ensure quality and efficiency in feature engineering and modeling.
Summary Generated by Built In

Ekimetrics is the European leader in data science, with over 500 data scientists and 1,000+ projects since 2006. With offices in Paris, London, New York, and Hong Kong, we conduct projects in over 50 countries across various sectors, including financial services, retail, telecom, healthcare, and more. 

Our mission is to help companies audit their data opportunities, enhance their analytical capital, and deploy actionable solutions that maximize their marketing and operational performance while reinvigorating business models. 

Our absolute focus is on delivering short-term gains while ensuring the long-term development of our clients' data capital. We are committed to offering the most advanced data science approaches and building ethical, sustainable AI practices. 

Key Figures 

  • 18 years of experience in data science 

  • 500+ data scientists, all consultants 

  • 5 offices in Paris, London, New York, Hong Kong, and Shanghai 

  • 350+ clients (CAC40, Fortune500) 

  • $1M+ in profit generated for our clients since 2006 

  • 1,000+ data science projects 

Within Ekimetrics, the Innovation Department works on AI research topics in collaboration with our industrial and academic partners. The department comprises several PhD experts in generative AI, deep learning, computer vision, time series, explainability, and causality. Two CIFRE PhD projects are underway, with two more to start in 2025. Each expert leads teams tasked with testing state-of-the-art algorithms and adapting them to specific business problems, developing new methodologies or algorithms to address identified challenges, and ensuring a seamless handover for integration within Ekimetrics' industrial ecosystem. 

A central focus of the Innovation Department is closing the gap between cutting-edge AI agents and the day-to-day reality of delivering data science at scale. As large language models reshape how software and analytics are produced, the department is investing in agentic systems that can accelerate the full data science lifecycle, from data understanding and feature engineering through modeling, evaluation, and deployment, while keeping data scientists firmly in control of quality, governance, and business relevance. This internship sits at the heart of that effort, combining applied AI research with production-grade engineering on Ekimetrics' modern data stack.

In recent years, foundation models and LLM-based agents have transformed software engineering and knowledge work. Yet the data science workflow itself (problem framing, data preparation, feature engineering, modeling and experimentation, evaluation, deployment, and monitoring) remains largely manual, expert-intensive, and difficult to scale. A typical project follows a recognizable sequence of steps, which means it can be explicitly structured as a workflow that a multi-agent system can accelerate, with human review at the points where judgment matters most. 

This internship will focus on designing and building such a system: a multi-agent orchestration in which specialized agents (e.g., feature engineering, modeling, evaluation) collaborate under a coordinating agent to produce runnable code that a data scientist reviews, refines, and merges. The system is intentionally semi-automated: some steps run human-in-the-loop with explicit approval gates, while others run autonomously with human review of the output. The intern will start with the feature engineering and modeling/experimentation loop, the part of the workflow where iteration is most intensive, and progressively extend coverage toward the end-to-end pipeline. 

The work builds on recent state-of-the-art research in agentic data science, including AIDE [1], which frames machine learning engineering as a tree search over candidate code solutions; Data Interpreter [2], an end-to-end LLM agent that decomposes a data science task into a dynamic dependency graph; and AutoKaggle [3], a multi-agent pipeline whose phase-by-phase design and validated toolkit allow users to intervene at each stage, integrating automated intelligence with human expertise. 

This internship offers an opportunity to engage with cutting-edge AI research and production engineering.

Key Responsibilities  :

Working alongside senior data scientists and AI experts who lead the system's overall design and architecture, the intern will: 

  • Conduct a focused literature review on agentic data science, automated machine learning, and LLM-agent techniques (planning, tool use, multi-agent orchestration), including human-in-the-loop and evaluation methods. 

  • Contribute to the design of the multi-agent system and implement its components, starting with the feature engineering and modeling/experimentation loop and producing runnable notebooks/code for human review. 

  • Participate in defining the human-in-the-loop interaction model (which steps run with explicit human approval versus autonomously with human review) and implement the corresponding control and guardrail logic. 

  • Help integrate the system with the Databricks stack: governed data access via Unity Catalog, execution on Databricks compute, and experiment tracking, agent tracing, and evaluation via MLflow. 

  • Support work on the safe and governed execution of agent-generated code, including isolation and access-control considerations. 

  • Run experiments, evaluate prototypes on real-world datasets, and benchmark against prior art and MLE-bench-style task performance, iterating on architecture and prompts in collaboration with your N+1. 

  • Share findings with the team and contribute to discussions on the long-term strategy for deploying agentic data science tooling in production. 

Profile :

  • Currently in the final year of a Master's or Engineering degree, with strong foundations in machine learning, statistics, and programming. 

  • Strong Python skills and experience with deep learning frameworks (PyTorch, TensorFlow); comfortable with the standard data science toolkit (pandas/polars, scikit-learn, gradient-boosting libraries). 

  • Familiarity with large language models and agentic techniques (prompting, tool/function calling, retrieval, multi-agent orchestration); hands-on experience with an agent framework is a plus. 

  • Solid software engineering practices (version control, testing, clean code) and an interest in building reliable, scalable AI systems. 

  • Exposure to Databricks, Spark, MLflow, or cloud data platforms is a plus. 

  • Strong analytical and problem-solving skills, with the ability to work autonomously on an open-ended research-and-build problem. 

  • Fluency in English. 

Benefits :

  • Hands-on Research & Engineering Experience: Build a pioneering agentic system on real data science workflows, contributing to a tool with direct real-world impact. 

  • Collaborative Learning: Engage with a team of data scientists, AI experts, and PhDs in a dynamic, research-driven environment within Ekimetrics' Innovation Department. 

  • Skill Development: Gain expertise in LLM agents and multi-agent orchestration, modern agent frameworks, and production tooling on Databricks (Unity Catalog, MLflow). 

  • Career Growth: Opportunity to transition into a full-time Data Scientist role in AI at Ekimetrics, or into a PhD program, upon successful completion of the internship. 

  • Scientific Contribution: Opportunity to co-author and publish scientific papers in reputable AI and data science conferences and journals. 

  • Networking: Collaborate with Ekimetrics' industrial and academic partners, enhancing your professional network in the AI research community. 

  • Ethical and Sustainable AI: Learn to develop AI solutions that align with ethical, governed, and sustainable best practices, a core commitment of Ekimetrics. 

Duration: 6 months, Paris 

References :

[1] AIDE: AI-Driven Exploration in the Space of Code. https://arxiv.org/abs/2502.13138 

[2] Data Interpreter: An LLM Agent for Data Science. https://arxiv.org/abs/2402.18679 

[3] AutoKaggle: A Multi-Agent Framework for Autonomous Data Science Competitions. https://arxiv.org/abs/2410.20424 

Why join us?

Joining Ekimetrics means joining a company where values are applied every day:

•    Evolve in an entrepreneurial and non-traditional environment (#curiosity)
•    Be open to both top-down and bottom-up feedback for continuous improvement (#excellence)
•    Receive training upon arrival and continuously through a unique learning experience enriched with numerous resources (internal, external, live, and digital), encompassing technical knowledge and soft skills (#generosity)
•    Be part of a friendly and united community (#enjoyment)
•    Imagine unexpected solutions and step out of your comfort zone (#creativity)
 
In 2023, Ekimetrics has obtained the mission-driven company status, which demonstrates our strong commitment to Corporate Social Responsibility. Our mission statement: Accelerate organisations’ transformation towards sustainability, through the application of data science and artificial intelligence.
 
We are also Great Place to Work certified© in France, the UK, and the US, and received the ‘Best companies to Work for in Asia 2023©’ award in Hong Kong.
 
🤩You will have access to… 
 
•    The Eki.Academy training catalog, which contains programs that will enhance your skills on our solutions and jobs, learning paths on our digital platform, as well as programs dedicated to our priority challenges, including awareness of environmental issues with the Climate School;
•    A sporty, artistic, musical, playful, charitable, and committed life: from our private gym to art exhibitions, video games, and concerts, or even CSR challenges on our dedicated platform (Vendredi);
•    Many events and seminars to stay close to your community;
•    Modern premises in a dynamic area in the heart of Paris and London; 
•    Flexible working-from-home policy.
 
🔄 Our recruitment process
🔸Technical assessment on HackerRank
🔸HR interview with a Talent Acquisition Specialist
🔸Case study 
🔸Final interview with a Manager
 
We would be delighted to provide you with further information during an interview and look forward to receiving your application! As an employer, Ekimetrics offers equal employment opportunities to all, regardless of gender, ethnicity, religion, sexual orientation, social status, disability, or age. Ekimetrics strives to develop an inclusive work environment that reflects diversity within its teams.
 
We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.

Skills Required

  • Strong foundations in machine learning, statistics, and programming
  • Strong Python skills
  • Experience with deep learning frameworks (PyTorch, TensorFlow)
  • Familiarity with large language models and agentic techniques
  • Solid software engineering practices
  • Exposure to Databricks, Spark, MLflow, or cloud data platforms
  • Strong analytical and problem-solving skills
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Brooklyn, NY
377 Employees
Year Founded: 2006

What We Do

Ekimetrics is a pioneering leader in data science and AI-powered solutions for sustainable business performance. We help companies get more from their data and implement pre-packaged AI solutions, so they can combine high impact with long-term business purpose. A focus on impact – steered by purpose Ekimetrics works with many of the world’s leading businesses to accelerate their transformation to sustainability through the application of data science and artificial intelligence. We are uniquely specialized in creating scalable data and analytics solutions that drive high-impact optimizations in alignment with overarching brand strategy and sustainability goals. Our goal: Enable our clients to pursue practical short-term gains combined with long-term value creation. For over 16 years, we’ve pioneered the use of AI and advanced data science applied to unified marketing measurement, holistic business optimization and broad-ranging sustainability goals. In particular, we’ve specialized in using its strengths in balancing multiple constraints in order to reconcile financial KPIs with non-financial goals. A business-first approach to solutions delivery All our 320 data experts and industry specialists work together in integrated squads. We value specific industry and domain expertise and focus on practical solutions in use. This is why we don’t outsource execution: so we can reduce the time to results while maintaining quality, believing in a high-service approach. We maintain our own data science platform, and a collection of pre-packaged solutions that we use to provide insights and to deploy scalable solutions faster. Key figures: • 320+ data experts • 1,000+ data science projects • 50+ countries in which we deliver projects • 4 offices worldwide Visit us at ekimetrics.com.

Similar Jobs

Atlassian Logo Atlassian

Field Marketing Manager

Cloud • Information Technology • Productivity • Security • Software • App development • Automation
In-Office or Remote
Paris, Île-de-France, FRA
11000 Employees

Samsara Logo Samsara

Senior Sales Engineer

Artificial Intelligence • Cloud • Computer Vision • Hardware • Internet of Things • Software
Easy Apply
Remote or Hybrid
France
4000 Employees

ServiceNow Logo ServiceNow

Enterprise Account Executive

Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Remote or Hybrid
Issy-les-Moulineaux, Hauts-de-Seine, Île-de-France, FRA
29000 Employees
10-10 Annually
Hybrid
Clamart, Hauts-de-Seine, Île-de-France, FRA
90000 Employees

Similar Companies Hiring

GC AI Thumbnail
Artificial Intelligence • Legal Tech
San Mateo, California
100 Employees
Idler Thumbnail
Artificial Intelligence
San Francisco, California
6 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account