Machine Learning Engineer, Chakra

Reposted 21 Days Ago
Santa Clara, CA, USA
In-Office
120K-235K Annually
Mid level
HR Tech • Information Technology • Software
The Role
The role involves architecting and developing Chakra, an AI interviewer. Responsibilities include building consistent interview systems, developing evaluation pipelines, fine-tuning models, and ensuring product quality across all candidate experiences.
Summary Generated by Built In

HackerRank helps companies like NVIDIA, Amazon, and Microsoft hire and upskill the next generation of developers based on skills, not pedigree. Our platform is trusted by over 2,500 of the world’s most innovative companies to build strong engineering teams ready for what’s next.
Software has entered an era where humans and AI build side by side. As this shift accelerates, the definition of strong technical talent is changing. We give companies better ways to identify and invest in next-generation skills.
People at HackerRank care deeply about the impact of their work and sweat the small details so our customers can be wildly successful with products they genuinely love to use. We move with urgency and believe great outcomes come from high standards.

About the role

The developer's job is shifting from writing code to directing AI agents, and hiring needs to catch up. HackerRank has shaped how 3000+ companies identify engineering talent, with 30M+ developers assessed on our platform. Chakra is our bet on what the next generation of that looks like: an AI interviewer built for a world where the interview itself has to be as intelligent as the candidates it is evaluating.

Open Problem
An interview that thinks, listens and gets it right every time.

Running an interview is easy. Running a good one is hard.
Chakra is an AI interviewer. It holds a conversation with a candidate, asks follow-up questions, evaluates how they think, and produces a report a hiring manager can actually act on. It is to conduct interviews that are more consistent, more probing, and more fair than most human interviewers manage in practice.

Here is the problem. A great human interviewer can do this. They read the candidate. They push on the right things. They know when an answer is shallow and when it just sounds shallow. Getting a model to do that reliably is genuinely difficult. Not because the technology cannot hold a conversation. It can. The gap is in judgment. Knowing what to probe. Knowing what the answer actually reveals about the candidate. Knowing when to move on.

Now do that 200,000 times. With candidates who speak differently, think differently, and approach problems differently. Without the model drifting. Without it being gamed. Without every report reading like it was written by the same template.

That is where the field currently falls short. Closing that gap is the work.

What you will do

  • Architect and develop Chakra end to end: the agent design, conversation management, real-time response evaluation, scoring methodology, and report generation.
  • Build the systems that ensure interview consistency at scale. Not just model capability, but the infrastructure that makes the 200,000th interview as coherent as the first.
  • Design evaluation and benchmarking pipelines that measure interview quality, candidate experience consistency, and report defensibility.
  • Build fine-tuning and RLHF workflows to push model judgment past what off-the-shelf models deliver for this specific task.
  • Own the quality bar. Define what a good interview looks like, instrument how well the system meets that bar, and close the gap systematically.
  • Work across the full stack: data pipelines, model serving, latency constraints, and the product experience the candidate actually encounters.

Who you are

  • You have built and shipped agentic or conversational AI systems in production, not just prototypes.
  • You have a strong intuition for where LLM behavior breaks down under real-world conditions and how to address it systematically.
  • You think in systems. The conversation architecture, the evaluation model, the serving infrastructure, and the candidate experience are one problem to you.
  • You care about the quality bar at the level of a user who depends on the output, not just a researcher measuring aggregate metrics.
Even better if you have
  • Experience building multi-turn conversational agents or interview-style AI systems.
  • Worked with RLHF, Constitutional AI, or preference-based fine-tuning methods.
  • Background in dialogue systems, conversational evaluation, or rubric-based scoring.
  • Publications or contributions in agentic AI, LLM reliability, or evaluation of generative systems.
You will thrive here if
  • You are energized by the full scope of a hard product problem, from model architecture through the conversation a candidate actually has. 
  • You hold the product bar as high as the technical bar. You want to build something that works extraordinarily well for every single person who uses it.
Compensation

The annual US on-target earnings (OTE) range for this role is $120,000 – $235,000 which includes base salary and target bonus. This range may span multiple career levels at HackerRank and will be refined during the interview process based on factors such as the candidate’s experience, qualifications, and location. Compensation for this role includes base salary, target bonus, and equity.

Want to learn more about HackerRank? Check out HackerRank.com to explore our products, solutions and resources, and dive into our story and mission here.

HackerRank is a proud equal employment opportunity and affirmative action employer. We provide equal opportunity to everyone for employment based on individual performance and qualification. We never discriminate based on race, religion, national origin, gender identity or expression, sexual orientation, age, marital, veteran, or disability status. All your information will be kept confidential according to EEO guidelines. 

Linkedin | X | Blog | Instagram | Life@HackerRank

Notice to prospective HackerRank job applicants:

  • Our Recruiters use @hackerrank.com email addresses.
  • We never ask for payment or credit check information to apply, interview, or work here.

Skills Required

  • Built and shipped conversational AI systems in production
  • Strong intuition for LLM behavior in real-world conditions
  • Experience with multi-turn conversational agents
  • Background in dialogue systems or conversational evaluation

HackerRank Compensation & Benefits Highlights

The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about HackerRank and has not been reviewed or approved by HackerRank.

  • Fair & Transparent Compensation Pay is considered competitive for many U.S. roles, with engineering and sales ranges aligning to market snapshots for base and total compensation. Overall sentiment positions compensation as above average relative to other categories.
  • Healthcare Strength Core coverage includes medical, dental, vision, mental-health/EAP, and life and disability insurance. Healthcare elements are consistently highlighted as solid within the package.
  • Flexible Benefits Flexible PTO, remote-work support, and home-office stipends are offered alongside equity and an annual learning budget. Flexibility in where and how work happens is emphasized across materials.

HackerRank Insights

Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Mountain View, CA
1,053 Employees
Year Founded: 2012

What We Do

HackerRank is a technology hiring platform that is the standard for assessing developer skills for over 2600 companies around the world. HackerRank helps companies hire skilled developers and innovate faster by enabling tech recruiters and hiring managers to objectively evaluate talent at every stage of the recruiting process.

Similar Jobs

Taskrabbit Logo Taskrabbit

Senior Machine Learning Engineer

eCommerce • Information Technology • Sharing Economy • Software
Easy Apply
Hybrid
San Francisco, CA, USA
450 Employees
148K-200K Annually

Micron Technology Logo Micron Technology

Design and Verification Engineer, Pathfinding

Artificial Intelligence • Hardware • Information Technology • Machine Learning
In-Office
Folsom, CA, USA
45000 Employees
100K-213K Annually

Micron Technology Logo Micron Technology

Engineer - HIG - HBM Design

Artificial Intelligence • Hardware • Information Technology • Machine Learning
In-Office
Folsom, CA, USA
45000 Employees
132K-171K Annually

UL Solutions Logo UL Solutions

Field Evaluations Engineer - West US Region

Automotive • Professional Services • Software • Consulting • Energy • Chemical • Renewable Energy
Remote or Hybrid
Cañada De Los Coches, CA, USA
15000 Employees
97K-120K Annually

Similar Companies Hiring

Golden Pet Brands Thumbnail
Digital Media • eCommerce • Information Technology • Marketing Tech • Pet • Retail • Social Media
El Segundo, California
178 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account