AI Evaluator - POLISH

Posted 7 Hours Ago
Be an Early Applicant
Hiring Remotely in Poland
Remote
Entry level
Artificial Intelligence • Information Technology • Professional Services • Consulting
The Role
Design and run short multi-turn Polish-language prompts to test AI personalization. Evaluate whether the model correctly uses personal signals, check grounding and integration quality, compare paired responses, write structured rationales, verify debug data sources, and maintain strict workflow hygiene. Remote, hourly contract for 30-40 hours/week.
Summary Generated by Built In

About Us

Gramian Consultancy is a boutique consultancy specializing in IT professional services and engineering talent solutions. With a strong background in software engineering and leadership, we help companies build high-performing teams by matching them with professionals who truly fit their needs.

This opening is on behalf of one of our clients, and we’ll work closely with you to make the process clear and straightforward.

Role Overview

We are looking for Polish-speaking AI Content Analysts to support evaluation of a new personalization capability within a leading AI assistant platform.

In this role, you will design realistic conversational prompts based on your own context and experiences, then rigorously evaluate how effectively the AI uses personal signals (such as prior conversations or activity context) to produce relevant, grounded, and helpful responses.

This position combines creative prompt design, analytical evaluation, and structured quality assessment, making it ideal for candidates with experience in AI evaluation, annotation, content review, or analytical research roles.

Responsibilities

  • Design and run short multi-turn conversations (typically 1–5 turns) intended to test AI personalization behavior
  • Create prompts grounded in realistic personal scenarios to evaluate contextual understanding
  • Review AI responses to determine whether personalization is correctly applied
  • Check grounding quality to ensure the model does not invent unsupported claims about the user
  • Evaluate integration quality — confirming personal signals are used naturally (not forced or robotic)
  • Compare two responses side-by-side and determine which is more helpful, natural, and relevant
  • Write clear, structured rationales explaining rankings and referencing specific conversation turns
  • Verify debug information showing whether correct data sources were used
  • Maintain strict workflow hygiene (including deleting evaluation conversations when required)

Notes:

  • The role is 100% remote, working hours within your local time zone (this is a MUST)
  • 30-40 hours/week commitment. Paid by hours logged and approved.
  • Contracting Model
  • Duration: 1 month (possible extension)

Requirements
  • Strong Polish proficiency (reading & writing required) — Polish is the primary evaluation language
  • BS/BA degree or equivalent experience in a relevant field (e.g., Policy, Law, Ethics, Linguistics, Journalism, Computer Science, or a related analytical field)
  • Strong analytical thinking and ability to assess nuanced AI outputs
  • Excellent written communication skills with ability to produce structured evaluation notes
  • High attention to detail when comparing similar responses
  • Ability to work independently in a fully remote environment
  • Reliable desktop/laptop and stable internet connection
  • Willingness to use your primary personal Google account and enable personal data sources for evaluation purposes

Skills Required

  • Strong Polish proficiency (reading & writing)
  • BS/BA degree or equivalent experience in a relevant field (Policy, Law, Ethics, Linguistics, Journalism, Computer Science, or related)
  • Experience in AI evaluation, annotation, content review, or analytical research roles
  • Strong analytical thinking and ability to assess nuanced AI outputs
  • Excellent written communication skills with ability to produce structured evaluation notes
  • High attention to detail when comparing similar responses
  • Ability to work independently in a fully remote environment
  • Reliable desktop or laptop and stable internet connection
  • Willingness to use your primary personal Google account and enable personal data sources for evaluation
  • Work 30-40 hours per week; 100% remote with working hours within your local time zone
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
6 Employees

What We Do

Gramian Consulting Group is a boutique consultancy specializing in IT professional services and engineering talent solutions. With a strong foundation in software engineering and leadership, the firm helps organizations build high-performing teams by matching them with qualified professionals. They specialize in talent augmentation and recruiting, specifically focusing on connecting engineering and data/AI talent with organizations to unlock real business value.

Similar Jobs

Zapier Logo Zapier

Staff Engineer

Artificial Intelligence • Productivity • Software • Automation
Remote
32 Locations
800 Employees
211K-316K Annually

SEON Logo SEON

Senior Site Reliability Engineer

Artificial Intelligence • Cybersecurity
In-Office or Remote
28 Locations
415 Employees

Dropbox Logo Dropbox

Software Engineer

Artificial Intelligence • Cloud • Consumer Web • Productivity • Software • App development • Data Privacy
Remote
Poland
2500 Employees
272K-368K Annually

ServiceNow Logo ServiceNow

Architect

Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Remote or Hybrid
Warsaw, Warszawa, Mazowieckie, POL
29000 Employees

Similar Companies Hiring

Hanover Park Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
42 Employees
Golden Pet Brands Thumbnail
Digital Media • eCommerce • Information Technology • Marketing Tech • Pet • Retail • Social Media
El Segundo, California
178 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account