Machine Learning Engineer, International Public Sector

Reposted 5 Days Ago
2 Locations
In-Office
Senior level
Artificial Intelligence • Big Data • Machine Learning
The Data Platform for AI: High quality training and validation data for AI applications.
The Role
The Machine Learning Engineer will design, train, deploy, and evaluate AI models to address public sector challenges, ensuring high-quality data training and evaluating systems.
Summary Generated by Built In

Scale’s mission is to develop reliable AI systems for the world's most important decisions. Our core work consists of:

  • Creating custom AI applications that will impact millions of citizens
  • Generating high-quality training data for national LLMs
  • Upskilling and advisory services to spread the impact of AI

Scale is hiring ML Research Engineers to bridge the gap between frontier research and real-world impact. While we solve critical challenges for global governments, your role will extend beyond implementation. You will lead the charge in research into Agent design, Deep Research and AI Safety/reliability, developing novel methodologies that not only power public sector applications but set new standards across the entire Scale organisation.

Your mission is threefold:

  • Frontier Research & Publication: Leading research into LLM/agent capabilities, reasoning, and safety, with the goal of publishing at top-tier venues (NeurIPS, ICML, ICLR).
  • Cross-Org Impact: Developing generalised techniques in Agent design, AI Safety and Deep Research agents that scale across our commercial and government platforms.
  • Mission-Critical Applications: Engineering high-stakes AI systems that impact millions of citizens globally.

You will:

  • Pioneer Novel Architectures: Design and train state-of-the-art models and agents, moving beyond “off-the-shelf” solutions to create custom architectures for complex public sector reasoning tasks.
  • Lead AI Safety Initiatives: Research and implement robust safety frameworks, including red teaming, alignment (RLHF/DPO), and bias mitigation strategies essential for sovereign AI.
  • Drive Deep Research Capabilities: Develop agents capable of long-horizon reasoning and autonomous information synthesis to solve complex problems for national security and public policy.
  • Publish and Contribute: Represent Scale in the broader research community by publishing high-impact papers and contributing to open-source breakthroughs.
  • Consult as a Subject Matter Expert: Act as a technical authority for public sector leaders, advising on the theoretical limits and safety requirements of emerging AI.
  • Build Evaluation Frontiers: Create new benchmarks and evaluation protocols that define what success looks like for high-stakes, non-commercial AI applications.

Ideally, you’d have:

  • Advanced Degree: PhD or Master’s in Computer Science, Mathematics, or a related field with a focus on Deep Learning.
  • Research Track Record: A portfolio of first-author publications at major conferences (NeurIPS, ICML, CVPR, EMNLP, etc.).
  • Engineering Rigour: Strong proficiency in Python, deep learning frameworks (PyTorch/JAX), with the ability to write production-ready code that scales.
  • Safety Expertise: Experience in alignment, robustness, or interpretability research.

Nice to haves:

  • Experience with large-scale distributed training on massive clusters.
  • Experience in building agentic systems that are reliable.
  • Experience in Sovereign AI or working with highly regulated data environments.
  • A zero-to-one mindset: Comfortable navigating ambiguity and defining research directions from scratch.

PLEASE NOTE: Our policy requires a 90-day waiting period before reconsidering candidates for the same role. This allows us to ensure a fair and thorough evaluation of all applicants.

About Us:

At Scale, our mission is to develop reliable AI systems for the world's most important decisions. Our products provide the high-quality data and full-stack technologies that power the world's leading models, and help enterprises and governments build, deploy, and oversee AI applications that deliver real impact. We work closely with industry leaders like Meta, Cisco, DLA Piper, Mayo Clinic, Time Inc., the Government of Qatar, and U.S. government agencies including the Army and Air Force. We are expanding our team to accelerate the development of AI applications.

We believe that everyone should be able to bring their whole selves to work, which is why we are proud to be an inclusive and equal opportunity workplace. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability status, gender identity or Veteran status. 

We are committed to working with and providing reasonable accommodations to applicants with physical and mental disabilities. If you need assistance and/or a reasonable accommodation in the application or recruiting process due to a disability, please contact us at [email protected]. Please see the United States Department of Labor's Know Your Rights poster for additional information.

We comply with the United States Department of Labor's Pay Transparency provision

PLEASE NOTE: We collect, retain and use personal data for our professional business purposes, including notifying you of job opportunities that may be of interest and sharing with our affiliates. We limit the personal data we collect to that which we believe is appropriate and necessary to manage applicants’ needs, provide our services, and comply with applicable laws. Any information we collect in connection with your application will be treated in accordance with our internal policies and programs designed to protect personal data. Please see our privacy policy for additional information.

Top Skills

C++
JavaScript
Python
Typescript
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
San Francisco, CA
523 Employees
Year Founded: 2016

What We Do

Scale accelerates the development of AI applications by helping machine learning teams generate high-quality ground truth data. Our advanced LiDAR, image, video and NLP annotation APIs allow machine learning teams at companies like OpenAI, Lyft, Pinterest, and Airbnb focus on building differentiated models vs. labeling data.

Similar Jobs

WHOOP Logo WHOOP

Research Manager, WHOOP Labs Doha

Fitness • Hardware • Healthtech • Sports • Wearables
Easy Apply
Hybrid
Doha, Al Doha, QAT
500 Employees

Ericsson Logo Ericsson

Enterprise and MCN Solution Expert

Cloud • Information Technology • Internet of Things • Machine Learning • Software • Cybersecurity • Infrastructure as a Service (IaaS)
In-Office or Remote
51 Locations
89000 Employees

WHOOP Logo WHOOP

Sales Manager

Fitness • Hardware • Healthtech • Sports • Wearables
Easy Apply
Remote or Hybrid
2 Locations
500 Employees

WHOOP Logo WHOOP

Associate Director of Research, Doha

Fitness • Hardware • Healthtech • Sports • Wearables
Easy Apply
Hybrid
Doha, Al Doha, QAT
500 Employees

Similar Companies Hiring

Milestone Systems Thumbnail
Software • Security • Other • Big Data Analytics • Artificial Intelligence • Analytics
Lake Oswego, OR
1500 Employees
Idler Thumbnail
Artificial Intelligence
San Francisco, California
6 Employees
Bellagent Thumbnail
Artificial Intelligence • Machine Learning • Business Intelligence • Generative AI
Chicago, IL
20 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account