Lead Quality Engineer - AI

Reposted 10 Hours Ago
6 Locations
In-Office
90K-157K Annually
Mid level
Information Technology • Software
The Role
The Lead AI Quality Engineer ensures quality in AI systems by designing tests, validating functionality, and collaborating with teams to monitor and measure system performance.
Summary Generated by Built In

**This is a Hybrid role requiring 2 days a week in a Wolters Kluwer office**

We are seeking a Lead AI Quality Engineer to ensure the quality, reliability, and trustworthiness of AI-powered product experiences in Wolters Kluwer Tax and Accounting. This role goes beyond validating that buttons click—you will design tests that confirm the system behaves correctly, measuring retrieval accuracy, citation correctness, and overall alignment of responses with user intent. You will be a key contributor in helping us deliver a system customers can trust.

Key Responsibilities:

· Design and implement evaluation harnesses to measure retrieval accuracy, citation correctness, response quality, and overall system behavior

· Develop automated tests for APIs, ingestion pipelines, and chat workflows

· Collaborate with developers and product managers to define quality metrics (accuracy, latency, cost, hallucination rate)

· Analyze logs, traces, and feedback signals to identify root causes of failures in AI-driven responses

· Create regression suites to ensure changes to prompts, chunking, or embeddings don’t break existing behavior

· Validate REST APIs and service integrations for resilience, correctness, and security

· Contribute to observability by instrumenting metrics and dashboards for system performance

· Participate in sprint planning and retrospectives, ensuring testability is built into features from day one

Key Requirements:

· Bachelor's degree in Computer Science or equivalent

· 5+ years of experience in software testing, quality engineering, or equivalent engineering roles with a focus on validation and reliability

· Experience with AI evaluation frameworks (e.g. LlamaIndex evals, OpenAI Evals, Ragas, TruLens, or custom harnesses)

· Strong skills in Python testing frameworks (Pytest, unittest, or equivalent)

· Experience testing web applications and APIs

· Familiarity with AI/ML or non-deterministic system testing

· Knowledge of CI/CD pipelines, Git, and automated regression testing

· Strong analytical skills: able to define metrics and success criteria where outputs aren’t deterministic

· Comfortable working in a fast-paced Agile environment with weekly sprints, pairing, and close collaboration with PM/UX/Dev

Desired Qualifications:

· Knowledge of retrieval-augmented generation (RAG) pipelines

· Experience with metrics/observability tooling (Grafana, Prometheus, Datadog)

· Familiarity with containerized environments (Docker, Kubernetes)

· Exposure to performance/load testing tools (Locust, k6, JMeter)

This role is critical in ensuring our AI solutions meet the high standards of accuracy and reliability expected in professional tax and accounting software.

Our Culture:
At Wolters Kluwer, our core values—Focus on Customer Success, Make it Better, Aim High and Deliver, and Win as a Team—guide everything we do. We are committed to driving success for our customers by delivering innovative solutions that exceed expectations. We continually strive to improve our processes and products, aiming for excellence in all our efforts. Collaboration and teamwork are central to our culture, enabling us to achieve great results together.

Our Interview Practices

To maintain a fair and genuine hiring process, we kindly ask that all candidates participate in interviews without the assistance of AI tools or external prompts. Our interview process is designed to assess your individual skills, experiences, and communication style. We value authenticity and want to ensure we're getting to know you, not a digital assistant. To help maintain this integrity, we ask candidates to remove virtual backgrounds, and we include in-person interviews in our hiring process. Please note that use of AI-generated responses or third-party support during interviews will be grounds for disqualification from the recruitment process.

Applicants may be required to appear onsite at a Wolters Kluwer office as part of the recruitment process.


Compensation:

$89,600.00 - $157,000.00 USD
This role is eligible for Bonus.

The compensation range listed is based on the primary location of the position. The actual base salary offer is influenced by a wide array of factors, including but not limited to skills, experience, and actual hiring location. Your recruiter can share more information about the specific offer for the job location during the hiring process.

Additional Information:

Wolters Kluwer offers a wide variety of competitive benefits and programs to help meet your needs and balance your work and personal life, including but not limited to: Medical, Dental, & Vision Plans, 401(k), FSA/HSA, Commuter Benefits, Tuition Assistance Plan, Vacation and Sick Time, and Paid Parental Leave. Full details of our benefits are available upon request.

Top Skills

CI/CD
Datadog
Docker
Git
Grafana
Kubernetes
Prometheus
Python
REST API

The Company
Hagerstown, MD
18,996 Employees

What We Do

Wolters Kluwer (www.wolterskluwer.com) is a global leader in information services and solutions for professionals in the health, tax and accounting, risk and compliance, finance and legal sectors. We help our customers make critical decisions every day by providing expert solutions that combine deep domain knowledge with specialized technology and services.

Founded in 1836 and headquartered in Alphen aan den Rijn, the Netherlands, the company serves customers in over 180 countries, maintains operations in over 40 countries and employs 18,600 people worldwide.

Wolters Kluwer reported 2019 annual revenues of €4.6 billion. Listed on Euronext Amsterdam, Wolters Kluwer shares (WKL) are included in the AEX and Euronext 100 indices. Wolters Kluwer has a sponsored Level 1 American Depositary Receipt program. The ADRs are traded on the over-the-counter market in the U.S. (WTKWY).
