Machine Learning Engineer (Post-Training)

Paris, Île-de-France, FRA
In-Office
Mid level
eCommerce • Logistics
The Role
The role involves designing post-training environments for machine learning agents, creating evaluation frameworks, building data pipelines, and optimizing training systems for supply chain applications.
Summary Generated by Built In

Machine Learning Engineer – Post-Training, AI Studio 

About the AI Studio 

The AI Studio's mission is to find the fastest possible path to an autonomous supply chain. We're developing AI agents, learning systems, training models, and more to overcome the biggest challenges remaining in the global supply chain. 

In short, we are having a lot of fun. 

Your mission in this role 

We're looking for an ambitious Machine Learning Engineer specializing in Post-Training to work on environments, evaluations, data pipelines, and tooling for robust training systems. 

Your work will directly impact how our agents learn to make decisions in complex supply chain environments. You'll help shape how we approach reward modeling, environment design, and agent training. 

This role blends research and engineering. You'll implement novel approaches and contribute to our research direction while shipping production-grade systems. If you're energized by pushing the boundaries of what's possible, this is your chance. 

Responsibilities 

  • Design and implement post-training environments for supply chain decision-making 

  • Create evaluation frameworks to measure agent performance and catch failure modes 

  • Build data pipelines for training and human feedback collection 

  • Optimize training infrastructure for throughput, efficiency, and fault tolerance 

  • Debug complex issues in training pipelines and model behavior 

  • Collaborate with the team to translate research ideas into reliable systems 

  • Document what works (and what doesn't) so we can compound our learnings 

  • Stay on top of industry trends and cutting-edge use cases

We want to talk if you 

  • Have trained or fine-tuned LLMs for agents with SFT/DPO 

  • Are proficient in Python, PyTorch and HF Transformers 

  • Can balance research exploration with shipping working code 

  • Are comfortable working with large datasets and building data pipelines at scale 

  • Thrive in fast-moving environments where priorities shift 

  • Are excited about AI-assisted tools and getting the most out of them 

  • Care about craft in your work 

  • Have a deep sense of curiosity and make a habit of learning 

  • Think globally about how your work impacts the entire organization 

Bonus points if 

  • You have hands-on experience with RL techniques (reward shaping and design, PPO, GRPO and other RLHF approaches)

  • You have experience with distributed training systems and techniques (DDP, FSDP, N-D parallelism)

  • You have experience with human-in-the-loop ML systems 

  • You've built evaluation frameworks for open-ended tasks 

  • You're familiar with supply chain, logistics, or operations domains 

  • You have experience with Kubernetes and cloud infrastructure (AWS, GCP) 

  • You've worked on reward hacking detection or robustness problems 

  • You have a side project that shows you can't stop tinkering 

 

We look forward to hearing from you! Please submit applications and resumes in English.

Our Values

If you want to know the heart of a company, take a look at its values. Ours unite us. They are what drive our success – and the success of our customers. Does your heart beat like ours? Find out here: Core Values

All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability or protected veteran status.

Top Skills

AWS
GCP
HF Transformers
Kubernetes
Python
PyTorch

The Company
HQ: Scottsdale, AZ
5,001 Employees
Year Founded: 1985

What We Do

Blue Yonder is the world leader in digital supply chain and omni-channel commerce fulfillment. Our intelligent, end-to-end platform enables retailers, manufacturers and logistics providers to seamlessly predict, pivot and fulfill customer demand. With Blue Yonder, you can make more automated, profitable business decisions that deliver greater growth and re-imagined customer experiences.

Blue Yonder’s tagline, “Fulfill Your Potential,” reflects the company’s mission to empower every organization and person on the planet to fulfill their potential. Each day, our global teams of associates and business partners work together to accelerate global economic growth and increase sustainability and prosperity with a Sonoran Spirit.
