Associate AI QA Engineer

Posted Yesterday
Manchester, Greater Manchester, England
In-Office
Mid level
Information Technology
The Role
As an AI QA Engineer, you'll develop testing strategies and frameworks for GenAI features, create evaluation metrics, conduct tests, and collaborate with teams to ensure AI quality and reliability.
Summary Generated by Built In

At Anaplan, we are a team of innovators focused on optimizing business decision-making through our leading AI-infused scenario planning and analysis platform so our customers can outpace their competition and the market.

What unites Anaplanners across teams and geographies is our collective commitment to our customers’ success and to our Winning Culture.

Our customers rank among the who’s who in the Fortune 50. Coca-Cola, LinkedIn, Adobe, LVMH and Bayer are just a few of the 2,400+ global companies that rely on our best-in-class platform.

Our Winning Culture is the engine that drives our teams of innovators. We champion diversity of thought and ideas, we behave like leaders regardless of title, we are committed to achieving ambitious goals, and we love celebrating our wins – big and small.

Supported by operating principles of being strategy-led, values-based and disciplined in execution, you’ll be inspired, connected, developed and rewarded here. Everything that makes you unique is welcome; join us and let’s build what’s next - together!

We're pioneering a new role focused exclusively on quality assurance for GenAI systems. As our AI QA Engineer, you'll develop testing strategies, evaluation frameworks, and quality metrics specifically designed for LLM-powered applications. This role requires a unique blend of QA expertise, understanding of GenAI behaviour, and automation skills to ensure our AI features are reliable, accurate, and trustworthy.

Your Impact

  • Design and implement comprehensive testing strategies for GenAI features, including conversational AI, agentic systems, and LLM-powered workflows 
  • Develop automated test suites for prompt testing, including regression tests that detect unintended changes in model behaviour 
  • Create evaluation frameworks to measure GenAI quality across multiple dimensions (accuracy, relevance, safety, consistency, latency) 
  • Build and maintain test datasets and golden examples that represent diverse user scenarios and edge cases 
  • Implement monitoring and alerting systems to detect quality degradation in production GenAI features 
  • Perform adversarial testing to identify potential failures, hallucinations, biases, or security vulnerabilities in AI systems 
  • Collaborate with engineers to define acceptance criteria and quality gates for AI feature releases 
  • Develop tools and frameworks that make it easy for engineers to test their GenAI implementations 
  • Conduct user acceptance testing and gather feedback on AI feature performance from internal users 
  • Document testing procedures, known issues, and quality metrics in clear, accessible formats 
  • Partner with Product and Design teams to ensure AI features meet user experience standards 
  • Stay current with GenAI testing methodologies, tools, and industry best practices 

Your Qualifications 

  • 3+ years of QA or test engineering experience, preferably with AI/ML systems
  • Strong understanding of GenAI technologies including LLMs, prompt engineering, and AI application patterns 
  • Experience with test automation frameworks and scripting (Python, JavaScript, Selenium, Pytest) 
  • Knowledge of software testing methodologies (functional, integration, regression, performance, security testing) 
  • Ability to design test cases and evaluation criteria for non-deterministic systems 
  • Strong analytical and problem-solving skills with attention to detail 
  • Experience with API testing tools (Postman, REST Assured) and backend testing 
  • Familiarity with CI/CD pipelines and automated testing integration 
  • Excellent communication skills for documenting issues and collaborating across teams

Preferred Qualifications 

  • Experience testing conversational AI, chatbots, or agentic systems 
  • Knowledge of ML model evaluation metrics and techniques 
  • Familiarity with LLM evaluation frameworks (LangSmith, PromptFoo, Ragas) 
  • Experience with performance testing and load testing AI APIs 
  • Understanding of responsible AI principles, including fairness, transparency, and safety testing 
  • Background in enterprise software or SaaS QA 
  • Experience with test management tools (TestRail, Zephyr, Jira) 
  • Knowledge of security testing methodologies for AI systems 
  • Scripting experience with Python, including working with LLM APIs 

What Makes This Role Exciting 

  • Define QA practices for GenAI applications 
  • Work on cutting-edge AI technologies and help ensure they're reliable and trustworthy 
  • Shape quality standards that will impact millions of enterprise users 
  • Collaborate closely with engineers, data scientists, and product teams 
  • Grow expertise in a highly specialized and increasingly important domain 
  • Influence the entire AI product development lifecycle from design to release 
  • Join a team that values quality as a first-class concern, not an afterthought 

Our Commitment to Diversity, Equity, Inclusion and Belonging (DEIB)

We believe attracting and retaining the best talent and fostering an inclusive culture strengthens our business. DEIB improves our workforce, enhances trust with our partners and customers, and drives business success. Build your career in a place where diversity, equity, inclusion and belonging aren’t just words on paper – this is what drives our innovation, it’s how we connect, and it contributes to what makes us a market leader. We believe in a hiring and working environment where all people are respected and valued, regardless of gender identity or expression, sexual orientation, religion, ethnicity, age, neurodiversity, disability status, citizenship, or any other aspect which makes people unique. We hire you for who you are, and we want you to bring your authentic self to work every day! 

We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, perform essential job functions, and receive equitable benefits and all privileges of employment. Please contact us to request accommodation.  

Fraud Recruitment Disclaimer  

It has come to our attention that fraudulent and fictitious job opportunities are being circulated on the Internet. Prospective candidates are being contacted by certain individuals, mainly through telephone calls, emails and correspondence, claiming they are representatives of Anaplan. The main purpose of these correspondences and announcements is to obtain privileged information from individuals.  

Anaplan does not:  

  • Extend offers to candidates without an extensive interview process with a member of our recruitment team and a hiring manager via video or in person.   
  • Send job offers via email. All offers are first extended verbally by a member of our internal recruitment team whenever possible and then followed up via written communication.  

All emails from Anaplan come from an @anaplan.com email address. If you have any doubts about the authenticity of an email, letter or telephone communication purportedly from, for, or on behalf of Anaplan, please send an email to [email protected] before taking any further action in relation to the correspondence.


Top Skills

JavaScript
Postman
Pytest
Python
REST Assured
Selenium

The Company
HQ: San Francisco, CA
2,194 Employees
Year Founded: 2006

What We Do

Anaplan is building a future where connected leaders and teams are able to constantly adapt, transform and reinvent their businesses. We make it possible to share actionable insights, empower and unleash creativity, and drive innovation. With Anaplan, finance and operational leaders across the organization can model complex scenarios, forecast continuously with added intelligence, and make agile decisions with confidence.
