Anaplan

Associate AI QA Engineer

Posted Yesterday

Easy Apply

Be an Early Applicant

Manchester, Greater Manchester, England

In-Office

Mid level

Information Technology

The Role

As an AI QA Engineer, you'll develop testing strategies and frameworks for GenAI features, create evaluation metrics, conduct tests, and collaborate with teams to ensure AI quality and reliability.

Summary Generated by Built In

At Anaplan, we are a team of innovators focused on optimizing business decision-making through our leading AI-infused scenario planning and analysis platform so our customers can outpace their competition and the market.

What unites Anaplanners across teams and geographies is our collective commitment to our customers’ success and to our Winning Culture.

Our customers rank among the who’s who in the Fortune 50. Coca-Cola, LinkedIn, Adobe, LVMH and Bayer are just a few of the 2,400+ global companies who rely on our best-in-class platform.

Our Winning Culture is the engine that drives our teams of innovators. We champion diversity of thought and ideas, we behave like leaders regardless of title, we are committed to achieving ambitious goals, and we love celebrating our wins – big and small.

Supported by operating principles of being strategy-led, values-based and disciplined in execution, you’ll be inspired, connected, developed and rewarded here. Everything that makes you unique is welcome; join us and let’s build what’s next - together!

We're pioneering a new role focused exclusively on quality assurance for GenAI systems. As our AI QA Engineer, you'll develop testing strategies, evaluation frameworks, and quality metrics specifically designed for LLM-powered applications. This role requires a unique blend of QA expertise, understanding of GenAI behaviour, and automation skills to ensure our AI features are reliable, accurate, and trustworthy.

Your Impact

Design and implement comprehensive testing strategies for GenAI features, including conversational AI, agentic systems, and LLM-powered workflows
Develop automated test suites for prompt testing, including regression tests that detect unintended changes in model behaviour
Create evaluation frameworks to measure GenAI quality across multiple dimensions (accuracy, relevance, safety, consistency, latency)
Build and maintain test datasets and golden examples that represent diverse user scenarios and edge cases
Implement monitoring and alerting systems to detect quality degradation in production GenAI features
Perform adversarial testing to identify potential failures, hallucinations, biases, or security vulnerabilities in AI systems
Collaborate with engineers to define acceptance criteria and quality gates for AI feature releases
Develop tools and frameworks that make it easy for engineers to test their GenAI implementations
Conduct user acceptance testing and gather feedback on AI feature performance from internal users
Document testing procedures, known issues, and quality metrics in clear, accessible formats
Partner with Product and Design teams to ensure AI features meet user experience standards
Stay current with GenAI testing methodologies, tools, and industry best practices

Your Qualifications

3+ years of QA or test engineering experience, preferably with AI/ML systems.
Strong understanding of GenAI technologies including LLMs, prompt engineering, and AI application patterns
Experience with test automation frameworks and scripting (Python, JavaScript, Selenium, Pytest)
Knowledge of software testing methodologies (functional, integration, regression, performance, security testing)
Ability to design test cases and evaluation criteria for non-deterministic systems
Strong analytical and problem-solving skills with attention to detail
Experience with API testing tools (Postman, REST Assured) and backend testing
Familiarity with CI/CD pipelines and automated testing integration
Excellent communication skills for documenting issues and collaboration

Preferred Qualifications

Experience testing conversational AI, chatbots, or agentic systems
Knowledge of ML model evaluation metrics and techniques
Familiarity with LLM evaluation frameworks (LangSmith, PromptFoo, Ragas)
Experience with performance testing and load testing AI APIs
Understanding of responsible AI principles, including fairness, transparency, and safety testing
Background in enterprise software or SaaS QA
Experience with test management tools (TestRail, Zephyr, Jira)
Knowledge of security testing methodologies for AI systems
Scripting experience with Python, including working with LLM APIs

What Makes This Role Exciting

Define QA practices for GenAI applications
Work on cutting-edge AI technologies and help ensure they're reliable and trustworthy
Shape quality standards that will impact millions of enterprise users
Collaborate closely with engineers, data scientists, and product teams
Grow expertise in a highly specialized and increasingly important domain
Influence the entire AI product development lifecycle from design to release
Join a team that values quality as a first-class concern, not an afterthought

#LI-SP1

Our Commitment to Diversity, Equity, Inclusion and Belonging (DEIB)

We believe attracting and retaining the best talent and fostering an inclusive culture strengthens our business. DEIB improves our workforce, enhances trust with our partners and customers, and drives business success. Build your career in a place where diversity, equity, inclusion and belonging aren’t just words on paper – this is what drives our innovation, it’s how we connect, and it contributes to what makes us a market leader. We believe in a hiring and working environment where all people are respected and valued, regardless of gender identity or expression, sexual orientation, religion, ethnicity, age, neurodiversity, disability status, citizenship, or any other aspect which makes people unique. We hire you for who you are, and we want you to bring your authentic self to work every day! 

We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, perform essential job functions, and receive equitable benefits and all privileges of employment. Please contact us to request accommodation. 

Fraud Recruitment Disclaimer 

It has come to our attention that fraudulent and fictitious job opportunities are being circulated on the Internet. Prospective candidates are being contacted by certain individuals, mainly through telephone calls, emails and correspondence, claiming they are representatives of Anaplan. The main purpose of these correspondences and announcements is to obtain privileged information from individuals.  

Anaplan does not: 

Extend offers to candidates without an extensive interview process with a member of our recruitment team and a hiring manager via video or in person.  
Send job offers via email. All offers are first extended verbally by a member of our internal recruitment team whenever possible and then followed up via written communication.

All emails from Anaplan would come from an @anaplan.com email address. Should you have any doubts about the authenticity of an email, letter or telephone communication purportedly from, for, or on behalf of Anaplan, please send an email to [email protected] before taking any further action in relation to the correspondence.  

Top Skills

JavaScript

Postman

Pytest

Python

Rest Assured

Selenium

View all jobs at Anaplan

View Anaplan Profile

Report Job

Am I A Good Fit?

beta

Get Personalized Job Insights.

Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company

HQ: San Francisco, CA

2,194 Employees

Year Founded: 2006

What We Do

Anaplan is building a future where connected leaders and teams are able to constantly adapt, transform and reinvent their businesses. We make it possible to share actionable insights, empower and unleash creativity, and drive innovation. With Anaplan, finance and operational leaders across the organization can model complex scenarios, forecast continuously with added intelligence, and make agile decisions with confidence.