AI/ML Software Engineer (RL Environments) (Contract)

Posted Yesterday
Hiring Remotely in United States
Remote
Mid level
Artificial Intelligence • HR Tech • Software • Generative AI
The Role
Design and build reinforcement-learning training environments and diverse tasks to evaluate and improve LLM agents; iterate rapidly on task designs from customer feedback, deliver high-quality outputs with minimal supervision, and maintain PST overlap for collaboration.
Summary Generated by Built In
About the Role

We're seeking experienced Machine Learning Engineers and Software Engineers with ML experience to design and build high-quality RL training environments for LLM agents. As an RL Environment Engineer, you'll create diverse machine learning tasks that challenge and improve language models, working with minimal supervision to deliver consistent, quality outputs.

What You'll Do
  • Design and build tasks for machine learning domains that target specific language models and difficulty distributions

  • Iterate rapidly on task designs based on customer feedback, with 24-hour turnaround times

  • Create diverse, challenging scenarios that test language model capabilities and expose their limitations

  • Hit the ground running with minimal onboarding time

What We're Looking For
  • Strong machine learning background through coursework, previous work experience, or personal projects

  • Python fluency: you write clean, efficient Python code regularly

  • Heavy LLM user who understands current model capabilities and failure modes through daily hands-on experience

  • Self-directed and creative. You can generate novel ML task ideas in your domain without constant guidance

  • High responsibility and integrity. You deliver quality work consistently and meet deadlines

  • Availability overlap with PST 9am-5pm (minimum 3 hours required)

Work Details
  • Location: Remote

  • Type: Contractor

Time Commitment: 40 hours a week. Must have at least 3 hours of overlap with PST business hours (9am-5pm)

Selection Process:
  1. Screening

  2. Hacker rank assessment

  3. 1 Week paid task

  4. Full time

Skills Required

  • Experience as a Machine Learning Engineer or Software Engineer with ML experience
  • Strong machine learning background via coursework, work experience, or personal projects
  • Fluency in Python, writing clean and efficient Python code
  • Daily hands-on use and deep understanding of Large Language Models and their failure modes
  • Experience designing and building reinforcement learning (RL) training environments and tasks
  • Self-directed, creative ability to generate novel ML task ideas with minimal guidance
  • High responsibility, integrity, and consistent on-time delivery of quality work
  • Availability overlap with PST business hours (minimum 3 hours overlap)
  • Ability to work 40 hours per week as a contractor
  • Ability to iterate rapidly on task designs with quick turnaround (e.g., 24-hour turnaround)
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
0 Employees

What We Do

Careerflow.ai is an AI-powered career management platform and 'career copilot' dedicated to helping job seekers land their dream jobs. The company provides a comprehensive end-to-end toolkit featuring an AI resume builder, LinkedIn profile optimizer, and job tracking tools. By streamlining the application process and optimizing professional profiles, Careerflow helps users navigate the competitive job market and get hired at top tech and startup companies faster.

Similar Jobs

Drata Logo Drata

Commercial Account Executive

Security • Software • Cybersecurity • Automation
Remote
United States
600 Employees
170K-210K Annually

CDW Logo CDW

Director, Software Engineering

Information Technology
Remote or Hybrid
TX, USA
15100 Employees
200K-250K Annually

Granted Logo Granted

Engineering Manager

Artificial Intelligence • Healthtech • Insurance • Mobile • Financial Services
Remote or Hybrid
2 Locations
23 Employees
206K-228K Annually

People Inc. Logo People Inc.

Operations Specialist

AdTech • Consumer Web • Digital Media • eCommerce • Marketing Tech
Remote or Hybrid
Des Moines, IA, USA
3500 Employees
20-25 Hourly

Similar Companies Hiring

Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
LTX Thumbnail
Conversational AI • Generative AI
Jerusalem, Israel
360 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account