Staff Quality Assurance Engineer

Posted Yesterday
Be an Early Applicant
Hoboken, NJ, USA
Hybrid
130K-150K Annually
Senior level
Edtech
The Role
Lead end-to-end quality engineering for AI/LLM systems: define quality strategy, design/execute automated tests (UI, API, backend), evaluate non-deterministic AI outputs, build evaluation frameworks and baselines, integrate tests into CI/CD, and lead QE delivery for complex projects.
Summary Generated by Built In

We are seeking a Senior Expert, Quality Engineer (QE) with deep expertise in quality engineering for AI-powered systems. In this role, you will own and lead end-to-end quality for your project while playing a key role in testing and evaluating complex, non-deterministic AI systems. You will help shape the quality strategy for the project and raise the overall quality bar for the teams you work with.

What You’ll Do

Core Quality Engineering

  • Define quality strategies and own test planning from requirements through release, partnering closely with cross-functional teams
  • Design and execute scalable functional, regression, and integration testing strategies
  • Apply hands-on automation framework expertise across UI, API, and backend layers to support major product initiatives

AI & LLM System Testing

  • Test AI-driven functionality, including LLM agents, multi-agent workflows, and RAG pipelines
  • Validate AI outputs for accuracy, relevance, consistency, and edge cases across non-deterministic systems

  • Own evaluation frameworks (e.g., RAGAS), test datasets, and regression baselines for AI features

  • Stay current with the evolving landscape of LLM evaluation, agent testing, and AI observability, and translate emerging techniques into practical testing approaches

Technical Leadership & Delivery

  • Lead QE team for your project by setting technical direction, defining quality strategy, and ensuring delivery confidence
  • Contribute to QE standards and frameworks within your project scope, establishing practices that other engineers can adopt
  • Integrate automated tests into CI/CD pipelines and partner with engineering leads to embed quality into the delivery process
  • Leverage data, risk signals, and production insights to continuously improve test coverage and effectiveness

What You’ll Bring

  • 8+ years of experience in Quality Engineering or Quality Assurance
  • Proven hands-on experience testing AI/LLM-based systems, including agents, multi-agent workflows, and RAG pipelines
  • Experience with AI evaluation approaches (e.g., RAGAS, LLM-as-judge, regression baselines) and non-deterministic testing strategies

  • Strong automation skills using Playwright or a comparable framework

  • Proficiency in JavaScript/TypeScript and/or Java

  • Solid experience with API testing and data validation

  • Demonstrated track record of leading quality end-to-end for complex projects, from strategy through execution
  • Ability to define and drive QE technical direction within a project or product area
  • Experience working in Agile delivery environments and contributing to fast-moving release cycles

 

Nice to Have

  • Hands-on experience with prompting, embeddings, or vector databases

  • Cloud experience (AWS preferred)

  • Exposure to AI-assisted development or testing tools (e.g., Cursor, Copilot)

  • Experience with AI observability or monitoring tools for production LLM systems

 

This is a hybrid role based in Hoboken, NJ. The ideal candidate should be able to work from the Hoboken office at least 2–3 days per week.

Applications will be accepted through July 7. This window may be extended depending on business needs. 

Compensation at Pearson is influenced by a wide array of factors including but not limited to skill set, level of experience, and specific location. As required by the California, Colorado, Hawaii, Illinois, Maryland, Minnesota, New Jersey, New York State, New York City, Vermont, Washington State, and Washington DC laws, the pay range for this position is as follows:   

The full-time salary range for this position is between $130,000 - $150,000

This position is eligible to participate in an annual incentive program, and information on benefits offered is here.

Skills Required

  • 8+ years of experience in Quality Engineering or Quality Assurance
  • Proven hands-on experience testing AI/LLM-based systems, including agents, multi-agent workflows, and RAG pipelines
  • Experience with AI evaluation approaches (e.g., RAGAS, LLM-as-judge) and non-deterministic testing strategies
  • Strong automation skills using Playwright or a comparable framework
  • Proficiency in JavaScript, TypeScript and/or Java
  • Solid experience with API testing and data validation
  • Demonstrated track record of leading quality end-to-end for complex projects
  • Ability to define and drive QE technical direction within a project or product area
  • Experience working in Agile delivery environments and fast-moving release cycles
  • Ability to work hybrid from Hoboken, NJ office at least 2-3 days per week
  • Hands-on experience with prompting, embeddings, or vector databases
  • Cloud experience (AWS)
  • Exposure to AI-assisted development or testing tools (e.g., Cursor, Copilot)
  • Experience with AI observability or monitoring tools for production LLM systems
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: London
29,811 Employees
Year Founded: 1871

What We Do

We are the world’s learning company with more than 22,500 employees operating in 70 countries. We provide content, assessment and digital services to learners, educational institutions, employers, governments and other partners globally. We are committed to helping equip learners with the skills they need to enhance their employability prospects and to succeed in the changing world of work. We believe that wherever learning flourishes so do people.

Similar Jobs

Domino Data Lab Logo Domino Data Lab

Senior Quality Engineer

Artificial Intelligence • Machine Learning
Easy Apply
Remote or Hybrid
US
200 Employees
145K-175K Annually

ZT Systems Logo ZT Systems

Quality Assurance Engineer

Cloud • Hardware • Manufacturing
In-Office
Secaucus, NJ, USA
2500 Employees
93K-136K Annually

New York Life Insurance Company Logo New York Life Insurance Company

Senior Associate, Pension Risk Transfer & Retirement Income Solutions

Artificial Intelligence • Cloud • Fintech • Information Technology • Insurance • Financial Services • Big Data Analytics
Hybrid
Jersey City, NJ, USA
12000 Employees
115K-140K Annually

New York Life Insurance Company Logo New York Life Insurance Company

Site Reliability Engineer

Artificial Intelligence • Cloud • Fintech • Information Technology • Insurance • Financial Services • Big Data Analytics
Hybrid
Lebanon, NJ, USA
12000 Employees
100K-143K Annually

Similar Companies Hiring

ReUp Education Thumbnail
Social Impact • Edtech
Austin, TX
180 Employees
Learneo Thumbnail
Software • Machine Learning • Edtech • Artificial Intelligence
NL
397 Employees
CodePath.org Thumbnail
Edtech • Social Impact
San Francisco, CA
55 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account