Educational Technology AI Rater & Evaluator

Posted 3 Days Ago
Be an Early Applicant
Hiring Remotely in San Francisco, California, USA
In-Office or Remote
Mid level
Artificial Intelligence • Information Technology • Software
Make anything multilingual. Translation, AI data set creation, and human expert evals. For businesses and governments.
The Role
The job involves evaluating AI outputs for educational content, assessing accuracy, clarity, and effectiveness, and providing oversight for AI evaluation processes.
Summary Generated by Built In
Overview

LILT is building a global network of domain experts to support high-quality AI evaluation across training, benchmarking, red-teaming, and ongoing model monitoring. We are seeking education and learning professionals to contribute expert judgment to human-in-the-loop AI evaluation workflows used by leading enterprises and hyperscalers.

This role is designed for professionals who understand how educational content, learning experiences, and instructional systems work in real-world academic and professional learning environments and who can apply that expertise to evaluate, assess, and improve multilingual AI systems.

Your contribution of expertise will directly influence multilingual AI model quality, safety, and deployment readiness.

This role includes two distinct expert tracks, based on experience level and scope of responsibility.

Track A: EdTech AI Rater

Raters execute structured evaluation tasks using clearly defined rubrics and instructions.

Responsibilities

  • Evaluate AI outputs related to educational, instructional, and learning content

  • Perform structured scoring, comparison, classification, and judgment tasks

  • Assess pedagogical accuracy, clarity, appropriateness, and learning effectiveness

  • Identify hallucinations, misleading explanations, factual errors, or unsafe educational guidance

  • Apply domain-specific education and instructional guidelines consistently across tasks

Ideal Background

  • Educators, instructional designers, curriculum developers, or learning professionals

  • Experience with teaching, curriculum design, assessment, or educational technology

  • Strong attention to detail and comfort working with structured evaluation criteria

Track B: EdTech AI Evaluator (Senior Track)

Evaluators provide higher-level domain oversight and help shape how evaluation is performed.

Responsibilities

  • Validate and refine evaluation rubrics and edge-case handling

  • Perform adjudication where raters disagree

  • Conduct error analysis and qualitative reviews of model behavior

  • Partner with LILT research, product, and customer teams on evaluation design

  • Support red-teaming, educational quality review, and model readiness assessments

Ideal Background

  • Senior educators, academic leaders, learning scientists, or education subject matter experts

  • Experience defining instructional standards, reviewing complex edge cases, or advising on learning outcomes

  • Ability to clearly explain nuanced pedagogical reasoning and tradeoffs

Evaluation Focus & Requirements

Types of AI Evaluation Work

Depending on project demands, work may include:

  • Educational and instructional content evaluation

  • Learning accuracy and conceptual understanding assessment

  • Benchmarking and comparative model analysis

  • Red-teaming for misleading or harmful educational content

  • Ongoing model monitoring and regression testing

What We Look For

  • Deep domain expertise in education, instructional design, or learning sciences

  • Strong judgment and ability to apply criteria consistently

  • Comfort working with structured evaluation workflows

  • Ability to explain reasoning clearly, especially in instructional or learner-facing scenarios

  • Reliability, professionalism, and respect for quality standards

Engagement Model

  • Contract-based, flexible participation

  • Project-based work with clear expectations and timelines

  • Opportunities for recurring work based on performance and demand

  • Compensation communicated upfront per project or task type

Why This Work Matters

Your expertise helps ensure that AI systems:

  • Provide accurate, effective, and responsible educational content

  • Align with instructional best practices and learning standards

  • Are trustworthy and supportive for learners across languages

Language Requirements

  • Native or professional fluency in one or more supported languages is required

  • Supported languages span 30+ global languages

  • Language-specific nuance is assessed through screening and task-based evaluation, not separate job descriptions

  • English fluency is required for guidelines, feedback, and collaboration

AI is changing how the world communicates — and LILT is leading that transformation.

LILT's mission is to make the world's information available to everyone, no matter the language they speak. Join our global community who thrive on innovation and excellence. Our collective knowledge, uniqueness, and skills deliver multilingual AI and human-verified services to Enterprises, Governments, and AI Developers around the world.

Earn money. Have fun. Advance human knowledge. Work on diverse projects from anywhere, any time you want. Get paid quickly and fairly, and build your professional network in a supportive community—all through a streamlined application process tailored to your expertise.

Information collected and processed as part of your application process, including any job applications you choose to submit, is subject to LILT's Privacy Policy at https://lilt.com/legal/privacy.

At LILT, we are committed to a fair, inclusive, and transparent hiring process. As part of our recruitment efforts, we may use artificial intelligence (AI) and automated tools to assist in the evaluation of applications, including résumé screening, assessment scoring, and interview analysis. These tools are designed to support human decision-making and help us identify qualified candidates efficiently and objectively. All final hiring decisions are made by people. If you have any concerns, require accommodations, or would like to opt-out of the use of AI in our hiring process, please let us know at [email protected].

LILT is an equal opportunity employer. We extend equal opportunity to all individuals without regard to an individual’s race, religion, color, national origin, ancestry, sex, sexual orientation, gender identity, age, physical or mental disability, medical condition, genetic characteristics, veteran or marital status, pregnancy, or any other classification protected by applicable local, state or federal laws. We are committed to the principles of fair employment and the elimination of all discriminatory practices.

Skills Required

  • Experience with teaching, curriculum design, assessment, or educational technology
  • Strong attention to detail
  • Native or professional fluency in one or more supported languages
  • Experience defining instructional standards or advising on learning outcomes

LILT AI Compensation & Benefits Highlights

The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about LILT AI and has not been reviewed or approved by LILT AI.

  • Fair & Transparent Compensation Pay for full‑time roles is characterized as market‑aligned in engineering and go‑to‑market functions; overall compensation is seen as acceptable to good for these teams.
  • Healthcare Strength Core benefits include full healthcare coverage (medical, dental, vision) for full‑time employees; this provides a solid baseline for the package.
  • Retirement Support U.S. offerings include a 401(k) match; retirement support is clearly part of the total rewards mix.

LILT AI Insights

Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: San Francisco, California
690 Employees
Year Founded: 2015

What We Do

Make anything multilingual. A complete solution for translation and data set creation for businesses and governments. Founded by research scientists who met working on Google Translate, LILT is a global team of engineers, scientists, GTM experts, and operators transforming global business communications.

Similar Jobs

MetLife Logo MetLife

Customer Care Advocate Disability Service- Omaha NE 7.20.26

Fintech • Information Technology • Insurance • Financial Services • Big Data Analytics
Remote or Hybrid
United States
43000 Employees
42K-42K Annually

Airwallex Logo Airwallex

Data Science Director, Growth

Artificial Intelligence • Fintech • Payments • Business Intelligence • Financial Services • Generative AI
Remote or Hybrid
San Francisco, CA, USA
2000 Employees

Airwallex Logo Airwallex

Customer Insights Lead

Artificial Intelligence • Fintech • Payments • Business Intelligence • Financial Services • Generative AI
Remote or Hybrid
San Francisco, CA, USA
2000 Employees

Nexthink Logo Nexthink

Client Director- West

Artificial Intelligence • Big Data • Cloud • Information Technology • Machine Learning • Software
Remote or Hybrid
San Diego, CA, USA
1200 Employees
113K-176K Annually

Similar Companies Hiring

Golden Pet Brands Thumbnail
Digital Media • eCommerce • Information Technology • Marketing Tech • Pet • Retail • Social Media
El Segundo, California
178 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York City, NY
100 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account