AI Scientist, Safety - Paris/London

Sorry, this job was removed at 04:08 p.m. (CST) on Friday, Aug 15, 2025
2 Locations
Hybrid
Artificial Intelligence
The Role
About Mistral

At Mistral AI, we believe in the power of AI to simplify tasks, save time, and enhance learning and creativity. Our technology is designed to integrate seamlessly into daily working life.

We democratize AI through high-performance, optimized, open-source and cutting-edge models, products and solutions. Our comprehensive AI platform is designed to meet enterprise needs, whether on-premises or in cloud environments. Our offerings include le Chat, the AI assistant for life and work.

We are a dynamic, collaborative team passionate about AI and its potential to transform society.
Our diverse workforce thrives in competitive environments and is committed to driving innovation. Our teams are distributed across France, the USA, the UK, Germany, and Singapore. We are creative, low-ego, and team-spirited.

Join us to be part of a pioneering company shaping the future of AI. Together, we can make a meaningful impact. See more about our culture on https://mistral.ai/careers.

Role Summary

We are seeking an AI Scientist, Safety to evaluate, enhance, and build safety mechanisms for our large language models (LLMs). This role involves identifying and addressing potential risks, biases, and misuses of LLMs, ensuring that our AI systems are ethical, fair, and beneficial to society. You will work to monitor models, prevent misuse, and ensure user well-being, applying your technical skills to uphold principles of safety, transparency, and oversight.

Location: Paris or London

What you will do

Adversarial & Fairness Testing
• Design and execute adversarial attacks to uncover vulnerabilities in LLMs.
• Evaluate potential risks and harms associated with LLM outputs.
• Assess LLMs for biases and unfairness in their responses, and develop strategies to mitigate these issues.

Tools & Monitoring
• Develop monitoring systems (e.g., moderation tools) to detect unwanted behaviors in Mistral’s products.
• Build robust and reliable multi-layered defenses for real-time improvement of safety mechanisms that work at scale.
• Investigate and respond to incidents involving LLM misuse or harmful outputs, and develop post-incident recommendations.
• Analyze user reports of inappropriate content or accounts.
• Contribute to the development of AI ethics policies and guidelines that govern the responsible use of LLMs.

Safety Fine Tuning
• Work on safety tuning to improve robustness of models.
• Collaborate with the AI development team to create and implement safety measures, such as content filters, moderation tools, and model fine-tuning techniques.
• Keep up to date with the latest research and trends in AI safety, LLMs, and responsible AI, and continuously improve our safety practices.

About you

• You have a degree in Computer Science, AI, Machine Learning, or a related field. Advanced degrees (MSc, PhD) are preferred.
• You are familiar with Python and are a highly proficient software engineer in at least one programming language (e.g., Python, Rust, Go, Java).
• You have hands-on experience with AI frameworks and tools (e.g., TensorFlow, PyTorch, JAX).
• You have high technical engineering competence. This means being able to design complex software and make it usable in production.
• You have a strong scientific track record in a field of science.
• You are a self-starter, autonomous, and low-ego.
• You are collaborative and have a real team-player mindset.

Note that this is neither an exhaustive nor a strictly necessary list of requirements. Please consider applying if you believe you have the skills to contribute to Mistral's mission.

Now, it would be ideal if
• You have proven experience in AI safety, responsible AI, or a related field. Familiarity with LLMs and their potential risks is essential.
• You have hands-on experience with generative AI (e.g., transformer-based models), broad knowledge of the field of AI, and specific knowledge of or interest in fine-tuning and using language models for applications.
• You are able to navigate the full MLOps technical stack, with a focus on architecture development and model evaluation and usage.

Benefits

France

💰 Competitive cash salary and equity
🥕 Food: Daily lunch vouchers
🥎 Sport: Monthly contribution to a Gympass subscription
🚴 Transportation: Monthly contribution to a mobility pass
🧑‍⚕️ Health : Full health insurance for you and your family
🍼 Parental : Generous parental leave policy
🌎 Visa sponsorship

UK

💰 Competitive cash salary and equity
🚑 Insurance
🚴 Transportation: Reimbursement of office parking charges, or £90/month for public transport
🥎 Sport: £90/month reimbursement for gym membership
🥕 Meal voucher: £200 monthly allowance for meals
💰 Pension plan: SmartPension (contributions: 5% employee, 3% employer)

The Company
HQ: Paris
92 Employees
Year Founded: 2023

What We Do

Fast, open-source, and secure language models. Facilitated specialisation of models on business use cases, leveraging private data and usage feedback. Built by a world-class team in Europe, targeting the global market. Join the team! https://jobs.lever.co/mistral/
