Senior Data Scientist - Europe

Posted Yesterday
Be an Early Applicant
6 Locations
In-Office or Remote
Senior level
Artificial Intelligence • Information Technology
The Role
Lead development of a document intelligence platform converting scanned PDFs with chemical structures into searchable knowledge. Design ML/DL models, build LLM-based retrieval and RAG systems, process OCR/chemical data, deploy scalable AWS pipelines, and collaborate with engineers and domain experts to productionize and monitor solutions.
Summary Generated by Built In

About us:

Where elite tech talent meets world-class opportunities!

At Xenon7, we work with leading enterprises and innovative startups on exciting, cutting-edge projects that leverage the latest technologies across various domains of IT including Data, Web, Infrastructure, AI, and many others. Our expertise in IT solutions development and on-demand resources allows us to partner with clients on transformative initiatives, driving innovation and business growth. Whether it's empowering global organizations or collaborating with trailblazing startups, we are committed to delivering advanced, impactful solutions that meet today’s most complex challenges.

We are building a community of top-tier experts and we’re opening the doors to an exclusive group of exceptional AI & ML Professionals ready to solve real-world problems and shape the future of intelligent systems.

Location: Remote (Europe - CET)
Experience: 5+ Years

Project Duration: ~ 2 months (B2B contract)
Availability: Immediate Joiner

Role Overview

We are seeking a visionary and highly skilled Senior Data Scientist to lead the development of a cutting-edge document intelligence and discovery platform. This unique role sits at the intersection of advanced Generative AI, Machine Learning, and Cheminformatics. Your primary mission will be to solve a highly complex unstructured data challenge: transforming scanned PDF documents containing intricate chemical structures into highly searchable, interpretable, and actionable knowledge bases.

  • Architect Document Intelligence Solutions: Design and implement advanced Machine Learning and Deep Learning models to parse, extract, and interpret text and complex chemical structures from unstructured, scanned PDF documents.
  • Develop LLM & Retrieval Systems: Build and optimize Large Language Model (LLM) applications, leveraging vector databases to enable semantic search, advanced data interpretation, and retrieval-augmented generation (RAG).
  • End-to-End ML Pipelines: Own the entire machine learning lifecycle, including data preprocessing (specifically for chemical data and OCR outputs), model training, evaluation, deployment, and post-deployment monitoring.
  • Bridge Chemistry & AI: Apply your chemistry domain knowledge to translate molecular structures, diagrams, and chemical data into machine-readable formats, embeddings, and actionable insights.
  • Cloud Architecture & Deployment: Deploy scalable, secure, and production-ready AI/ML pipelines within the AWS ecosystem, ensuring high availability and performance.
  • Cross-Functional Collaboration: Partner closely with software engineers, data engineers, and domain experts to integrate ML models into the core product architecture and align with business goals.

Requirements
  • Professional Experience: 5+ years of proven experience working as a Data Scientist, with a track record of delivering production-grade machine learning models.
  • Domain Expertise: A strong background in Chemistry, Cheminformatics, or a highly related scientific field, with a demonstrated ability to interpret and manipulate complex chemical structures and data types.
  • Core Technical Stack: Advanced proficiency in Python and deep hands-on experience with the AWS cloud stack (e.g., SageMaker, Lambda, S3, EC2).
  • Generative AI & Search: Practical, hands-on experience working with LLMs (fine-tuning, prompt engineering, or API integration) and vector databases (e.g., Pinecone, Milvus, Weaviate, or Qdrant).
  • ML/DL Mastery: Robust experience in model development, validation, deployment, and evaluation framework tools (e.g., PyTorch, TensorFlow, Scikit-Learn).
  • Document Processing (Plus): Prior experience with Computer Vision, Optical Character Recognition (OCR), or Document AI systems is highly desirable given the scanned PDF focus.
  • Soft Skills: Strong analytical problem-solving skills, excellent communication, and the ability to thrive in a highly collaborative, cross-disciplinary project environment.

Benefits

At Xenon7, we're not just building AI systems—we're building a community of talent with the mindset to lead, collaborate, and innovate together.

  • Ecosystem of Opportunity: You'll be part of a growing network where client engagements, thought leadership, research collaborations, and mentorship paths are interconnected. Whether you're building solutions or nurturing the next generation of talent, this is a place to scale your influence.
  • Collaborative Environment: Our culture thrives on openness, continuous learning, and engineering excellence. You'll work alongside seasoned practitioners who value smart execution and shared growth.
  • Flexible & Impact-Driven Work: Whether you're contributing from a client project, innovation sprint, or open-source initiative, we focus on outcomes—not hours. Autonomy, ownership, and curiosity are encouraged here.
  • Talent-Led Innovation: We believe communities are strongest when built around real practitioners. Our Innovation Community isn’t just a knowledge-sharing forum—it’s a launchpad for members to lead new projects, co-develop tools, and shape the direction of AI itself.

Skills Required

  • 5+ years proven experience as a Data Scientist delivering production-grade ML models.
  • Strong background in Chemistry or Cheminformatics (ability to interpret and manipulate molecular structures).
  • Advanced proficiency in Python.
  • Hands-on experience with AWS cloud stack (SageMaker, Lambda, S3, EC2).
  • Practical experience with LLMs (fine-tuning, prompt engineering, API integration).
  • Experience with vector databases (Pinecone, Milvus, Weaviate, Qdrant).
  • Experience with ML/DL frameworks and tooling (PyTorch, TensorFlow, Scikit-Learn).
  • Ownership of end-to-end ML pipelines including preprocessing, training, evaluation, deployment, and monitoring.
  • Experience preprocessing chemical data and handling OCR outputs.
  • Prior experience with Computer Vision, OCR, or Document AI systems.
  • Strong analytical problem-solving, communication, and cross-functional collaboration skills.
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
11 Employees

What We Do

Xenon7 delivers specialized AI operations, AI products and services. Our innovation practice helps you separate initiatives warranting business investment from hype. We operate free from the bloat, weight and pyramidal structure of legacy consulting firms. Xenon7 enables our clients to make better human and technology decisions, and ethically achieve more with less

Similar Jobs

Tulip Logo Tulip

Marketing Manager

Enterprise Web • Hardware • Internet of Things • Software
Easy Apply
Remote or Hybrid
27 Locations
310 Employees

Mondelēz International Logo Mondelēz International

Change Manager o9 MEU, Demand Planning

Big Data • Food • Hardware • Machine Learning • Retail • Automation • Manufacturing
Remote or Hybrid
9 Locations
90000 Employees

Nexthink Logo Nexthink

Senior Software Engineer

Artificial Intelligence • Big Data • Cloud • Information Technology • Machine Learning • Software
Remote or Hybrid
Madrid, Comunidad de Madrid, ESP
1200 Employees

Mondelēz International Logo Mondelēz International

Automation Engineer

Big Data • Food • Hardware • Machine Learning • Retail • Automation • Manufacturing
Remote or Hybrid
2 Locations
90000 Employees
56K-85K Annually

Similar Companies Hiring

Hanover Park Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
42 Employees
Golden Pet Brands Thumbnail
Digital Media • eCommerce • Information Technology • Marketing Tech • Pet • Retail • Social Media
El Segundo, California
178 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account