Systems Engineer, HPC

Reposted 22 Days Ago
Be an Early Applicant
6 Locations
In-Office or Remote
Mid level
Artificial Intelligence
The Role
The Systems Engineer will design, operate, and scale AI infrastructure, focusing on Linux systems administration, automation, and performance improvements, working with HPC and research teams to ensure reliable cluster operation.
Summary Generated by Built In

About Mistral

 

At Mistral AI, we build high-performance, open, and efficient AI systems designed to power the next generation of applications. Our infrastructure combines large-scale distributed systems, cloud platforms, and HPC environments to support cutting-edge research and production workloads.

We are a collaborative, low-ego, and highly technical team, operating across Europe, the US, and beyond. As we scale rapidly, we are building the foundational infrastructure to support thousands of nodes and petabyte-scale systems.

Join us to be part of a pioneering company shaping the future of AI. Together, we can make a meaningful impact. See more about our culture on https://mistral.ai/careers.

 

About the Role

We are looking for Systems Engineers / System Administrators to help design, operate, and scale the infrastructure behind Mistral’s AI platforms.

This is a hands-on, hybrid role combining:

Systems administration (operating and troubleshooting large-scale Linux environments)

Systems engineering (automation, scalability, and performance improvements)

You’ll work closely with infrastructure, HPC, and research teams to ensure our clusters and platforms run reliably at scale.

 

What You’ll Work OnCore Systems Operations
  • Operate and maintain large-scale Linux environments (bare metal, clusters, cloud)
  • Monitor system health, troubleshoot incidents, and ensure high availability
  • Support production and research workloads across multiple environments
      Scaling Infrastructure
  • Help scale clusters toward hundreds to thousands of nodes

  • Work on systems handling petabyte-scale storage

  • Improve performance, reliability, and resource utilisation

      Automation & Engineering
  • Automate operational tasks using tools like Python, Bash, Ansible, or Terraform

  • Improve deployment, provisioning, and system lifecycle management

  • Contribute to system design and architecture decisions

      Cross-Functional Collaboration
  • Work closely with:

    • HPC / infrastructure teams

    • Platform / DevOps engineers

    • Research teams

  • Act as a bridge between users and infrastructure

What We’re Looking ForMust-have
  • Strong Linux systems administration experience (core requirement)

  • Experience working in large-scale environments:

    • HPC clusters or cloud infrastructure

  • Experience with Job schedulers (e.g. Slurm)

  • Solid troubleshooting skills across systems, hardware, and networks

Nice-to-have (any of these)

We are not expecting everything — strong depth in one area is valuable.

  • Containers / orchestration (e.g. Kubernetes)

  • Storage systems (e.g. Ceph, Lustre, NFS)

  • Networking fundamentals (Ethernet; InfiniBand is a plus)

  • Infrastructure as Code / automation tooling

  • GPU or AI/ML experience

Profile We Value
  • Pragmatic problem solver who can operate in fast-scaling environments

  • Comfortable working across multiple domains (“Swiss army knife” mindset)

  • Able to go deep in one area while learning others

  • Low-ego, collaborative, and hands-on

—------------------------------------------------------------------

Why Join Mistral?
  • Impact: Play a pivotal role in scaling Mistral’s cutting-edge AI infrastructure.

  • Growth: Opportunity to shape data centre operations from the ground up in a high-growth startup environment.

  • Collaboration: Work with a talented, cross-functional team passionate about AI and technology.

  • Flexibility: Competitive compensation, benefits, and the chance to contribute to revolutionary projects.

Skills Required

  • Strong Linux systems administration experience
  • Experience working in large-scale environments: HPC clusters or cloud infrastructure
  • Experience with Job schedulers (e.g. Slurm)
  • Solid troubleshooting skills across systems, hardware, and networks
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Paris
92 Employees
Year Founded: 2023

What We Do

Fast, open-source and secure language models. Facilitated specialisation of models on business use-cases, leveraging private data and usage feedback. Built from a world-class team in Europe, targeting global market. Join the team ! https://jobs.lever.co/mistral/

Similar Jobs

Zscaler Logo Zscaler

Sr. Partner Business Manager

Cloud • Information Technology • Security • Software • Cybersecurity
Easy Apply
Remote or Hybrid
Germany
8697 Employees
105K-150K Annually

CrowdStrike Logo CrowdStrike

Intelligence Intern - Applied Research Cell (Remote)

Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
Remote or Hybrid
2 Locations
10000 Employees

CrowdStrike Logo CrowdStrike

Sr. Security Researcher, TAC Cloud (Remote)

Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
Remote or Hybrid
2 Locations
10000 Employees

CrowdStrike Logo CrowdStrike

Sr. Intelligence Analyst, Recon+ (Remote, GBR)

Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
Remote or Hybrid
5 Locations
10000 Employees

Similar Companies Hiring

Idler Thumbnail
Artificial Intelligence
San Francisco, California
6 Employees
Hanover Park Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
31 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account