AI Scientist, Safety - Paris/London

Posted 5 Days Ago
Be an Early Applicant
Paris, Île-de-France
Hybrid
Mid level
Artificial Intelligence
The Role
The AI Scientist, Safety will evaluate and enhance safety mechanisms for large language models, focusing on risks and biases. Responsibilities include adversarial testing, monitoring tools, and collaborating on safety tuning. The role requires a strong background in AI technologies and frameworks to ensure ethical and safe AI outputs.
Summary Generated by Built In


Role Summary


We are seeking an AI Scientist, Safety to evaluate, enhance, and build safety mechanisms for our large language models (LLMs). This role involves identifying and addressing potential risks, biases, and misuses of LLMs, ensuring that our AI systems are ethical, fair, and beneficial to society. You will work to monitor models, prevent misuse, and ensure user well-being, applying your technical skills to uphold principles of safety, transparency, and oversight.


Location : Paris or London 

Responsibilities

  • Adversarial & Fairness Testing
  • Design and execute adversarial attacks to uncover vulnerabilities in LLMs.
  • Evaluate potential risks and harms associated with LLM outputs.
  • Assess LLMs for biases and unfairness in their responses, and develop strategies to mitigate these issues.

  • Tools & Monitoring
  • Develop monitoring systems (eg. moderation tools) to detect unwanted behaviors in Mistral’s products.
  • Build robust and reliable multi-layered defenses for real-time improvement of safety mechanisms that work at scale.
  • Investigate and respond to incidents involving LLM misuse or harmful outputs, and develop post-incident recommendations.
  • Analyze user reports of inappropriate content or accounts.
  • Contribute to the development of AI ethics policies and guidelines that govern the responsible use of LLMs.

  • Safety Fine Tuning
  • Work on safety tuning to improve robustness of models.
  • Collaborate with the AI development team to create and implement safety measures, such as content filters, moderation tools, and model fine-tuning techniques.
  • Keep up-to-date with the latest research and trends in AI safety, LLMs, and responsible AI, and continuously improve our safety practices.
  • Build robust and reliable multi-layered defenses for real-time improvement of safety mechanisms that work at scale

You may be a good fit if

  • You have a degree in Computer Science, AI, Machine Learning, or a related field. Advanced degrees (MSc, PhD) are preferred.
  • You are familiar with Python and you are a highly proficient software engineer in a least one programming language (e.g. Python, Rust, Go, Java)You have, hands-on experience with AI frameworks and tools (e.g., TensorFlow, PyTorch, Jax)
  • You have high technical engineering competence. This means being able to design complex software and make them usable in production
  • You have a high scientific track record in a field of science.
  • You are self-starter, autonomous and low-ego.
  • Collaborative and have a real team player mindset.

  • Note that this is not an exhaustive or necessary list of requirements, please consider applying if you believe you have the skills to contribute to Mistral's mission.


    Now, it would be ideal if

  • You have proven experience in AI safety, responsible AI, or a related field. Familiarity with LLMs and their potential risks is essential.
  • You have hands-on experience with Generative AI e.g. experience with transformer based models and a broad knowledge of the field of AI, and specific knowledge or interest in fine-tuning and using language models for applications.
  • You are able to navigate the full MLOps technical stack, with a focus on architecture development and model evaluation and usage

Benefits

  • Competitive cash salary
  • Equity

  • France

    🥕 Food : Daily lunch vouchers

    🥎 Sport : Monthly contribution to a Gympass subscription 

    🚴 Transportation : Monthly contribution to a mobility pass

    🧑‍⚕️ Health : Full health insurance for you and your family 

    🍼 Parental : Generous parental leave policy 

    🌎 Visa sponsorship


    UK

    🧑‍⚕️ Health : Competitive Healthcare program

    🏠 Pension : Monthly contribution 

    🚴 Transportation: Monthly contribution

    🥎 Sport: Monthly contribution 

    🌎 Visa sponsorship 

About Mistral 


At Mistral AI, our mission is to make AI ubiquitous and open. We are passionate about bridging the gap between technology and businesses of all sizes. We are a leading innovator in the field of open-source large language models.


Our advanced LLM solutions can be seamlessly deployed on any cloud, allowing for optimized integration and robust performance. Developers are using our API via la Plateforme to build incredible AI-first applications powered by our models that can understand and generate natural language text and code. We are multilingual at our core. We released le Chat, as a demonstrator of our models.


We are a tight-knit, nimble team dedicated to bringing our cutting-edge AI technology to the world. Our teams are distributed between France, UK and USA. We are creative, low-ego, team-spirited, and have been passionate about AI for years. We hire people who thrive in competitive environments, because they find them more fun to work in. We hire passionate women and men from all over the world.

Top Skills

Go
Java
Python
Rust
The Company
HQ: Paris
92 Employees
On-site Workplace
Year Founded: 2023

What We Do

Fast, open-source and secure language models. Facilitated specialisation of models on business use-cases, leveraging private data and usage feedback.

Built from a world-class team in Europe, targeting global market. Join the team ! https://jobs.lever.co/mistral/

Similar Jobs

Hybrid
Paris, Île-de-France, FRA
289097 Employees

Arrow Electronics, Inc. Logo Arrow Electronics, Inc.

Broadcom Market Analyst

Cloud • Enterprise Web • Hardware • Information Technology • Internet of Things • Robotics • Semiconductor
Courbevoie, Hauts-de-Seine, Île-de-France, FRA
22000 Employees

Mirakl Logo Mirakl

Business Value Consultant

eCommerce • Information Technology • Retail • Software • Consulting
Easy Apply
Paris, Île-de-France, FRA
750 Employees
Hybrid
Paris, Île-de-France, FRA
289097 Employees

Similar Companies Hiring

Eastwall Thumbnail
Software • Information Technology • Consulting • Cloud • Big Data Analytics • Artificial Intelligence • App development
Denver, CO
20 Employees
Smartcat Thumbnail
Natural Language Processing • Machine Learning • Conversational AI • Artificial Intelligence
Boston, Massachusetts
242 Employees
RunPod Thumbnail
Software • Infrastructure as a Service (IaaS) • Cloud • Artificial Intelligence
Charlotte, North Carolina
53 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account