AI Quality Analyst

Posted Yesterday
3 Locations
Remote
Entry level
Edtech • Kids + Family • Sports
The Role
Own end-to-end QA for an AI SMS bot and a call-training system: review production conversations, create JSON test files, edit and maintain the Markdown knowledge base (WRONG/RIGHT examples), run test suites, and verify fixes. Operate existing infrastructure using the CLI and Git, and write realistic conversational examples and pass/fail criteria.
Summary Generated by Built In

We run an AI-powered SMS bot and an AI-graded call training system for a private school network. Both systems have extensive test suites (400+ test files) and a RAG-based knowledge base that drives bot behavior.

We need someone to own the full quality cycle:

- Review production conversations to find bad bot responses

- Create test files (JSON) that reproduce the issue

- Fix the knowledge base content (Markdown files, WRONG/RIGHT examples)

- Run test suites to verify fixes and catch regressions

- Maintain and expand the knowledge base as our program evolves

This is NOT a software engineering role. The infrastructure is built. You're operating it: editing JSON, writing Markdown, running CLI scripts, reading test reports.

What You'll Work With

- JSON test files (you'll write and edit these daily)

- Markdown knowledge base documents

- Terminal commands (copy-paste and run scripts)

- Git (commit, push, basic branching)

- VS Code or similar editor


Requirements

- Native-level English fluency (non-negotiable). You'll be writing realistic SMS conversations between parents and our school. The language has to sound like a real person texting, not a corporate chatbot. You'll also be writing precise pass/fail evaluation criteria.

- Comfortable editing JSON and Markdown in an IDE

- Can run commands in a terminal without hand-holding

- Extreme attention to detail

- Ability to learn a complex domain quickly (education, state government programs, etc.)

Nice-to-Haves

- Experience writing test cases or QA documentation

- Experience with chatbot QA, conversational AI testing, or LLM evaluation

Hours & Ramp-Up

- 30 hrs/week to start, ramping up as needed. US time zones required.

- The first 1-2 weeks will be focused on learning our domain: how our school works, how the state voucher program works, compliance rules, etc. We have extensive internal documentation.

Skills Required

  • Native-level English fluency
  • Comfortable editing JSON and Markdown in an IDE
  • Able to run terminal/CLI commands without hand-holding
  • Extreme attention to detail
  • Ability to learn a complex domain quickly (education, state programs, compliance)
  • Write realistic SMS conversations and precise pass/fail evaluation criteria
  • Basic Git skills (commit, push, basic branching)
  • Availability of ~30 hours/week, working in US time zones
  • Experience writing test cases or QA documentation
  • Experience with chatbot QA, conversational AI testing, or LLM evaluation

The Company
14 Employees

What We Do

Texas Sports Academy is a private K-8 school that combines AI-powered academic learning with elite athletic training to help students excel in both areas.
