Math IMO Expert

Posted Yesterday
Be an Early Applicant
11 Locations
Remote
14-14 Hourly
Senior level
Artificial Intelligence • HR Tech • Software • Generative AI
The Role
Design original IMO/AIME/HMMT-level math problems across algebra, number theory, combinatorics, and geometry; write rigorous LaTeX solutions; craft prompts to expose LLM reasoning failures; review and diagnose model outputs; label and classify prompts; contribute to evaluation benchmark design. Remote contractor role, 40 hrs/week with PST overlap.
Summary Generated by Built In

Total Number of Positions: 40

Role Overview

  • In this role, you will work on projects that improve and evaluate large language models by crafting challenging, competition-level mathematics problems and rigorously assessing model reasoning.

  • The ideal candidate has a strong foundation in competitive mathematics at the AIME, HMMT, and IMO (Olympiad) level across the four classic pillars: Algebra, Number Theory, Combinatorics, and Geometry.

  • You should be able to design novel, "Google-proof" problems intended to expose deep reasoning deficiencies in state-of-the-art models, and to diagnose precisely where and why a model's reasoning breaks down.

  • The role combines original problem authoring, rigorous solution writing, and detailed evaluation of model-generated responses.

  • This is your chance to future-proof your career in an AI-first world by working at the frontier of mathematical reasoning evaluation.

What does the day-to-day look like:

  • Design original, challenging mathematics problems at AIME, HMMT, and IMO difficulty that test the reasoning limits of large language models in multi-step, abstract settings, drawn strictly from Algebra, Number Theory, Combinatorics, or Geometry.

  • Author novel prompts that "break" evaluated models, meaning the model arrives at an incorrect final answer; ensure problems cannot be bypassed via brute-force or computationally intensive methods.

  • Solve problems independently and write detailed, logically structured, self-contained solutions with clear justifications, properly rendered in LaTeX.

  • Review model-generated solutions, identify mathematical errors, logical fallacies, or missing arguments, and diagnose the root cause using defined failure categories (Final Answer, Reasoning Steps, Instruction Following).

  • Contribute to defining new evaluation benchmarks across competition and Olympiad-level mathematics curricula.

  • Classify each prompt accurately by domain, sub-domain, topic, and proficiency level within the labeling tool.

Requirements

  • Mathematical Expertise: Strong command of competitive mathematics at the level of AIME, HMMT, and IMO across Algebra, Number Theory, Combinatorics, and Geometry.

  • Writing Proficiency: Excellent structured written communication, including fluency with standard LaTeX delimiters for all mathematical expressions.

  • Analytical Skills: Strong research and analytical skills, with the ability to construct rigorous, proof-based reasoning.

  • Creative Thinking: Creative and lateral thinking abilities to design novel problems that are not adapted from existing competitions or online repositories.

  • Feedback Skills: Ability to provide constructive feedback, precise annotations, and accurate error diagnosis on model outputs.

  • Independence: Self-motivated and able to work independently in a remote setting.

  • Technical Setup: Desktop/Laptop setup with a good internet connection.

Preferred Qualifications

  • Candidates pursuing or holding a Bachelor’s/Master's degree in Mathematics, Applied Mathematics, Statistics, Engineering, or a related field are eligible and encouraged to apply.

  • Prior experience in competitive mathematics (e.g., national or international Olympiads or equivalent competitive examinations) as a participant, coach, or problem setter is a bonus.

  • Ability to analyze and solve complex problems with a structured, logical approach and to express solutions clearly and rigorously.

Perks of Freelancing

  • Work in a fully remote environment.

  • Opportunity to work on cutting-edge AI projects with leading LLM companies.

  • Potential for contract extension based on performance and project needs.

Offer Details

  • Commitments Required: At least 8 hours per day and 40 hours per week, with 4 hours of overlap with PST.

  • Engagement type: Contractor assignment/freelancer (no medical/paid leave).

Skills Required

  • Strong command of competitive mathematics at AIME/ HMMT/ IMO level across Algebra, Number Theory, Combinatorics, Geometry
  • Fluency with LaTeX delimiters and ability to render mathematical expressions in LaTeX
  • Ability to construct rigorous, proof-based solutions and clear logical reasoning
  • Creative problem design to produce novel, non-derivative competition-level problems
  • Ability to author prompts that expose LLM reasoning failures and avoid brute-force bypass
  • Experience reviewing model-generated solutions and diagnosing errors by failure category
  • Excellent structured written communication and provide precise annotations/feedback
  • Self-motivated, able to work independently in a remote setting with required overlap with PST
  • Desktop/laptop with reliable internet connection
  • Bachelor's/Master's in Math, Applied Math, Statistics, Engineering, or related (preferred)
  • Prior participation, coaching, or problem-setting experience in competitive mathematics (preferred)
  • Ability to analyze and solve complex problems with structured, rigorous presentation (preferred)
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company

What We Do

Careerflow.ai is an AI-powered career management platform and 'career copilot' dedicated to helping job seekers land their dream jobs. The company provides a comprehensive end-to-end toolkit featuring an AI resume builder, LinkedIn profile optimizer, and job tracking tools. By streamlining the application process and optimizing professional profiles, Careerflow helps users navigate the competitive job market and get hired at top tech and startup companies faster.

Similar Jobs

Tapestry - Coach and Kate Spade Logo Tapestry - Coach and Kate Spade

Developer, Product Development (Footwear)

eCommerce • Fashion • Retail • Sales • Wearables • Design
Remote or Hybrid
Haiphong, VNM
16000 Employees

Pfizer Logo Pfizer

Senior Health Representative - (Vaccines Hospital - Dong Nai/An Giang/Kien Giang)

Artificial Intelligence • Healthtech • Machine Learning • Natural Language Processing • Biotech • Pharmaceutical
In-Office or Remote
6 Locations
121990 Employees

Tapestry - Coach and Kate Spade Logo Tapestry - Coach and Kate Spade

Fitting Model (Female US6B)

eCommerce • Fashion • Retail • Sales • Wearables • Design
Remote or Hybrid
Haiphong, VNM
16000 Employees

Pfizer Logo Pfizer

Commercial Lead, Vietnam and Thailand

Artificial Intelligence • Healthtech • Machine Learning • Natural Language Processing • Biotech • Pharmaceutical
Remote or Hybrid
2 Locations
121990 Employees

Similar Companies Hiring

Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
LTX Thumbnail
Conversational AI • Generative AI
Jerusalem, Israel
360 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account