Policy and Toxicity Evaluator

Posted 4 Days Ago
Hiring Remotely in Philippines
Remote
Entry level
Machine Learning • Natural Language Processing
The Role
As a Policy and Toxicity Evaluator, you will analyze AI model outputs for quality and safety, identify toxic content, and ensure compliance with project guidelines.
Overview
We are seeking Policy and Toxicity Evaluators to support a pilot project focused on analyzing and grading AI-generated model outputs for a leading AI platform. In this role, you will assess model responses to ensure compliance, safety, and adherence to project-specific guidelines. Evaluators will work across three workflows: refusal analysis, toxicity detection, and policy verification, helping to validate the model’s behavior and improve its safety and utility. 

What you will do: 
Analyze AI model outputs to assess quality and safety. 
Identify instances where the model refuses to answer a prompt and determine if the refusal was necessary or an error (over-refusal).
Identify toxic content such as hate speech, harassment, explicit material, or self-harm encouragement.
Review model responses for compliance with project-specific policy guidelines.
Shift between refusal, toxicity, and policy workflows as needed based on project volume.

Requirements: 
Excellent command of written English to process complex prompts and long-form model responses quickly. 
Strong critical thinking and the ability to make objective decisions. 
Exceptional attention to detail, including identifying borderline violations. 
Comfort navigating ambiguous or complex content requiring deep judgment. 
Ability to internalize and strictly follow complex policy guidelines. 
Prior experience in Trust & Safety, Content Moderation, or RLHF (Reinforcement Learning from Human Feedback) annotation is highly beneficial. 
 
Project Details: 
Contract Type: Freelance 
Location: Philippines (remote)
Duration: 1 to 2 weeks
Schedule: 10 hours weekly; flexible based on client's needs
 
Note: Please do not use VPNs or IP-masking tools during the recruitment process — our security system requires accurate regional verification. 

Why Join Welo Data?  
✨ Limitless Flexibility  
Project-based opportunities that fit your availability. Choose when and how much you want to contribute—fully remote, with complete autonomy.   
🌱 Limitless Growth  
Optional access to AI and Large Language Model workshops designed specifically for professionals like you. No coding required—just your expertise.   
🌍 Limitless Support  
Be part of a global contributor community with responsive guidance and support.   
💡 Real Impact  
Apply your expertise to influence the AI systems shaping the future of your industry, while collaborating with data professionals and expanding your skills.

How to Apply? 
Apply now by answering a few quick questions to join our database and become part of our growing community. 

About Welo Data 
Welo Data, part of Welocalize, is a global AI data company with 500,000+ contributors delivering high-quality, ethical data to train the world’s most advanced AI systems. We’re building smarter, more human AI with a diverse community in 100+ countries.  
At Welo Data, “Limitless AI. Limitless You.” isn’t just a slogan; it’s our promise. We build smarter AI through the power of human contribution, offering limitless opportunities for our global community to grow, contribute, and work on their terms.

Top Skills

AI
English
Model Outputs
Policy Guidelines

The Company
HQ: New York, NY
2,331 Employees
Year Founded: 1997

What We Do

Welocalize accelerates the global business journey by enabling brands and companies to reach, engage, and grow international audiences. Welocalize delivers multilingual content transformation services in translation, localization, and adaptation for over 250 languages with a growing network of over 250,000 in-country linguistic resources. Driving innovation in language services, Welocalize delivers high-quality training data solutions for NLP-enabled machine learning by blending technology and human intelligence to collect, annotate, and evaluate all content types. Our people work across offices in North America, Europe, and Asia serving our global clients in the markets that matter to them.

• Global team of 2,100+
• Offices in North America, Europe and Asia
• Quality Certifications: ISO 9001:2015, ISO/IEC 27001:2013, ISO 17100:2015, ISO 13485:2016, ISO 18587:2017
• Accredited professional translators and interpreters for 250+ languages

www.welocalize.com
