Red Teaming | Generative AI Analyst - California

Posted 21 Days Ago
Be an Early Applicant
Hiring Remotely in California, USA
Remote
47-47 Hourly
Entry level
Artificial Intelligence • Machine Learning • Natural Language Processing • Professional Services
The Role
Evaluate generative AI safety by creating and testing prompts, documenting failures, reviewing multimodal outputs, and applying safety taxonomies and guidelines during high-volume production sprints. Participate in calibration, quality reviews, and flag unclear guidelines or recurring model behaviors.
Summary Generated by Built In
About the Role

We are hiring Red Teaming | Generative AI Analyst to support generative AI safety evaluation. In this role, you will interact with AI models, create and evaluate prompts, and identify where model responses fail against defined safety expectations.

Project Details
  • Job Title: Red Teaming | Generative AI Analyst
  • Location: Remote with the option to work onsite in Santa Clara, CA area.
  • Hours: 40 hours per week
  • Employment Type: W2 Full-Time Employee
  • Pay Rate: $47.44/hour

What You’ll Do

  • Interact with generative AI models using project-provided guidelines, safety taxonomies, and attack-vector guidance.
  • Create and evaluate prompts designed to test model behavior across safety-related categories.
  • Identify where model responses become unsafe, noncompliant, inconsistent, or otherwise problematic.
  • Document model breakability, effort level, point of failure, and relevant category alignment.
  • Review text, image, audio, video, or other multimodal content as required by the workflow.
  • Apply detailed guidelines consistently across short, high-volume production sprints.
  • Use sound judgment to evaluate ambiguous, edge-case, or policy-sensitive outputs.
  • Conduct self-review to ensure work is accurate, complete, and aligned with project expectations.
  • Flag unclear guidelines, tooling issues, or recurring model behavior patterns.
  • Participate in calibration, feedback, and quality review sessions to improve consistency.
  • Maintain readiness to pivot quickly between different red teaming runs when active work is launched.

Requirements:

  • Native-level or near-native English proficiency with excellent written communication skills.
  • Work Authorization is required for the role.
  • Strong creative writing ability and comfort constructing varied prompts.
  • Experience with red teaming, safety data annotation, content evaluation, safety review, content moderation, QA, or AI model evaluation preferred.
  • Strong attention to detail and ability to follow complex project guidelines.
  • Ability to think critically and evaluate open-ended model responses.
  • Comfort working with sensitive, adult, NSFW, or policy-relevant content where required.
  • Interest in generative AI, AI safety, large language models, or emerging AI technologies.
  • Ability to work quickly and accurately during short production windows.
  • Bachelor’s degree or equivalent practical experience preferred.

Ways to Stand Out from the Crowd

  • Background in creative writing, English, linguistics, journalism, communications, policy, trust and safety, or content moderation.
  • Experience evaluating generative AI prompts and responses.
  • Familiarity with AI safety, red teaming, jailbreak testing, RLHF, or model evaluation workflows.
  • Experience working with safety taxonomies, policy guidelines, evaluation rubrics, or defect categories.
  • Prior experience reviewing sensitive, adult, NSFW, or policy-relevant content in a professional setting.
  • Experience with multimodal AI workflows involving text, image, audio, or video.
  • QA/testing experience within AI, data operations, content review, or annotation environments.
  • Ability to explain a repeatable approach for staying consistent during high-volume, judgment-based work.

Skills Required

  • Native-level or near-native English proficiency with excellent written communication skills.
  • Work authorization (must be authorized to work).
  • Strong creative writing ability and comfort constructing varied prompts.
  • Strong attention to detail and ability to follow complex project guidelines.
  • Ability to think critically and evaluate open-ended model responses.
  • Comfort working with sensitive, adult, NSFW, or policy-relevant content.
  • Ability to work quickly and accurately during short production windows.
  • Experience with red teaming, safety data annotation, content evaluation, content moderation, QA, or AI model evaluation.
  • Bachelor's degree or equivalent practical experience.
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
6,543 Employees

Similar Jobs

HopSkipDrive Logo HopSkipDrive

Strategic Talent Lead

Automotive • Edtech • Kids + Family • Mobile • Social Impact • Transportation
Easy Apply
Remote
USA
450 Employees
125K-145K Annually

Liberty Mutual Insurance Logo Liberty Mutual Insurance

Principal SAP PaPM Configuration Engineer

Artificial Intelligence • Fintech • Insurance • Marketing Tech • Software • Analytics
Remote or Hybrid
United States
40000 Employees
120K-225K Annually

Toast Logo Toast

Senior Software Engineer

Cloud • Fintech • Food • Information Technology • Software • Hospitality
Remote
USA
5000 Employees
159K-254K Annually

Toast Logo Toast

Sr. Manager Customer Success I, RMM

Cloud • Fintech • Food • Information Technology • Software • Hospitality
Remote
US
5000 Employees
99K-120K Annually

Similar Companies Hiring

Idler Thumbnail
Artificial Intelligence
San Francisco, California
6 Employees
Hanover Park Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
42 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account