Xantura Limited

AI NLP Engineer

Posted 24 Days Ago

Be an Early Applicant

London, Greater London, England, GBR

Hybrid

Mid level

Artificial Intelligence • Big Data • Information Technology • Software

The Role

Own and evolve Xantura's text analytics platform: build NLP pipelines, LLM-based extraction and RAG, manage embedding models and vector DBs, maintain classical NLP components, and design agentic AI while ensuring explainability and Responsible AI for sensitive health and social care data.

Summary Generated by Built In

In this role you will work in the Platform team – a function for the deployment and evolution of the backend platform that underpins the core of the Xantura business.

The Role

You'll own and evolve Xantura's text analytics platform (XTA), the NLP system that extracts structured intelligence from unstructured case notes across health, housing, and social care. You'll work across the full spectrum of NLP: from classical text classification and entity extraction through to LLM-based information extraction, embedding models, and retrieval systems. As the platform matures, you'll help shape our agentic AI capabilities.

Key Responsibilities

Own and evolve the core text analytics pipeline; advancing large-scale concept extraction, classification, and information retrieval across complex social and clinical text corpora.
Design and implement LLM-based processing chains for structured information extraction, leveraging prompt engineering, output parsing, and model orchestration to produce high-quality, auditable outputs at scale.
Build and manage embedding infrastructure; training, fine-tuning, and serving embedding models, and setting up and operating vector databases to enable semantic search and retrieval across client data.
Develop and maintain classical NLP components where appropriate; training smaller classifiers, entity recognisers, and domain-specific models for tasks where efficiency and interpretability outweigh generative approaches.
Lay the groundwork for agentic AI capabilities as the platform evolves; contributing to the design of multi-agent orchestration, tool integration, and conversational interfaces over Xantura's services.
Ensure all NLP systems are robust, explainable, and aligned with Responsible AI principles; essential where outputs inform decisions about vulnerable people in health and social care.

Skills, Knowledge & Expertise

Bachelor's or Master's degree in Computer Science, Computational Linguistics, Machine Learning, or a related technical field, or equivalent practical experience.
3+ years of professional experience in an NLP, ML, or AI engineering role
Strong programming skills and production experience in Python.

Clear evidence of practical experience across some or all of the following:

LLM utilisation in production; prompt engineering, output structuring, chaining, and integrating LLMs into data processing pipelines (e.g. via LangChain, PydanticAI, or similar);
Embedding models; training, fine-tuning, or serving embedding models (e.g. sentence-transformers, bi-encoders, cross-encoders), with practical experience setting up and managing vector databases (e.g. Qdrant, Weaviate, Milvus, pgvector) along with understanding trade-offs e.g. when to use sparse or dense embeddings (or both) etc;
Classical NLP training and evaluating text classifiers, NER models, or other supervised/semi-supervised NLP models for domain-specific tasks.

In addition, the following would be an advantage:

Experience with knowledge graphs/triplestores/semantic web frameworks (e.g. Neo4j, RDF/SPARQL/OWL, Apache Jena).
Practical experience with entity linking, concept normalisation, or ontology-driven NLP.
Experience with retrieval-augmented generation (RAG) pipelines
Familiarity with agentic AI frameworks and multi-agent orchestration (e.g. LangGraph, AutoGen).
Good familiarity with the Azure ecosystem (Azure Kubernetes Service, Azure Container Registry, Azure DevOps, Azure Blob Storage, Azure Monitor, Azure Key Vault).

This is a hybrid role based in our office in London (Borough). You would be expected to be able to work from the office at least 1-2 days per week. Some travel is required for on-site client engagements as needed.

Job Benefits

Competitive salary reviewed annually
Work for a passionate, mission-driven company solving society’s big problems
Work flexible hours around life commitments with a focus on delivering company value rather than hours worked
Ability to work remotely (excluding face-to-face Team Meetings and client meetings)
Training and development opportunities
25 days annual leave (plus bank holidays)
Company pension
Private medical insurance
Generous enhanced parental leave policies
Cycle to work scheme
Flu Vaccinations,
Eye Test and contribution towards Glasses for VDU use

Employee Assistance Programme

Mental health and wellbeing support
Remote GP access
Counselling/therapy
Physiotherapy
Medical second opinions

About
At Xantura, we’re on a mission to reduce societal inequality by helping local authorities use data more effectively. Our AI-driven platform empowers frontline workers with the insights they need to prevent complex issues like homelessness or children being taken into care — before they happen. We make this possible by connecting siloed datasets, applying advanced machine learning to enrich the data, and using predictive analytics to identify those most at risk. Our platform then distills this into clear, actionable insights that help frontline staff intervene early and make a real difference. It’s an exciting time to join Xantura. We’re scaling quickly, bringing on new clients, strengthening our platform, and expanding into new areas. While we’re a technology company at heart, our true focus is on improving lives — and we’re looking for people who share that vision.

Skills Required

Bachelor's or Master's degree in Computer Science, Computational Linguistics, Machine Learning, or related field (or equivalent experience)
3+ years professional experience in an NLP, ML, or AI engineering role
Strong programming skills and production experience in Python
Production experience with LLMs: prompt engineering, output structuring, chaining, and integrating LLMs into data pipelines (e.g., via LangChain, PydanticAI)
Experience with embedding models: training, fine-tuning, or serving embeddings (e.g., sentence-transformers, bi-encoders, cross-encoders) and operating vector databases (Qdrant, Weaviate, Milvus, pgvector)
Experience training and evaluating classical NLP models: text classifiers, NER, supervised/semi-supervised domain-specific models
Experience with knowledge graphs/triplestores/semantic web frameworks (Neo4j, RDF/SPARQL/OWL, Apache Jena)
Practical experience with entity linking, concept normalisation, or ontology-driven NLP
Experience with retrieval-augmented generation (RAG) pipelines
Familiarity with agentic AI frameworks and multi-agent orchestration (e.g., LangGraph, AutoGen)
Familiarity with Azure ecosystem: AKS, ACR, Azure DevOps, Blob Storage, Monitor, Key Vault

View all jobs at Xantura Limited

View Xantura Limited Profile

Report Job

Am I A Good Fit?

beta

Get Personalized Job Insights.

Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company

Year Founded: 2008

What We Do

Xantura is a technology company that provides data sharing and advanced analytics to the public sector, using AI and machine learning to help local authorities improve outcomes for vulnerable people and reduce societal inequality.