JetBrains

Research Engineer (Agentic Models)

Reposted Yesterday

Be an Early Applicant

12 Locations

In-Office or Remote

Mid level

Software

The Role

Develop and maintain SFT and RL post-training pipelines for multi-step coding agents, train and adapt LLMs for agent workflows, build evaluation and simulation environments, design metrics and evaluation frameworks, analyze results to improve models and datasets, and collaborate with research, product, and infra teams to ship models into JetBrains IDEs.

Summary Generated by Built In

At JetBrains, code is our passion. Ever since we started, back in 2000, we’ve been striving to make the strongest, most effective developer tools on earth. Today, AI-powered assistance and agents are becoming a core part of how developers work in our IDEs.

We’re building multi-step coding agents that can understand large codebases, plan changes, call tools, and iterate with the user. As a Research Engineer in the Agentic Models team, you’ll be responsible for the models, training loops, and evaluation pipelines that power these agents.

You’ll work at the intersection of SFT and RL-style post-training, and product-driven evaluation, using our distributed GPU and MapReduce clusters to ship models into JetBrains products.

As part of our team, you will:

Design, implement, and maintain SFT and RL post-training pipelines for multi-step coding agents.
Train and adapt LLMs for agent workflows, including planning, tool use, and multi-step interactions inside JetBrains IDEs.
Build and develop evaluation and simulation environments where coding agents can act, be measured, and compared on realistic developer tasks.
Design evaluation frameworks and metrics for agent behavior, analyze traces and logs, and close the loop from evaluation back into training, data, and reward design.
Analyze training and evaluation results to propose and implement improvements to model architectures, training recipes, and datasets.
Work with large-scale infrastructure, including distributed training on GPU clusters and large MapReduce-style data processing for pre-training and fine-tuning datasets.
Collaborate closely with research, product, and infrastructure teams to turn high-level product visions into concrete models, experiments, and shipped features.

We’ll be happy to bring you on board if you have:

Extensive hands-on experience training LLMs (pre-training, fine-tuning, or post-training) in a research or production setting.
Deep expertise in modern deep learning frameworks such as PyTorch, and specialized LLM training stacks (e.g. Megatron, NeMo, verl, or similar).
Strong theoretical and practical understanding of LLM fundamentals: architectures, tokenization, data pipelines, batching, mixed precision, distributed training, and debugging unstable runs.
The ability to own projects end to end, starting from a high-level problem or product pain point and overseeing it through the design, experimentation, implementation, and iteration phases.
A product-aware mindset – you care about how developers actually use agents and can translate product needs and failure modes into modeling and evaluation work.
At least 3 years of Python experience writing clean, maintainable code in modern ML codebases.

Our ideal candidate would have experience with:

ML orchestrators and workflow tools such as Kubeflow, Dagster, Airflow, ZenML, and/or job schedulers like Kubernetes or SLURM.
Large-scale data and training pipelines, e.g. MapReduce-style clusters, multi-node GPU training, or workloads on the order of 1M+ CPU/GPU hours.
Designing and maintaining evaluation pipelines for LLMs or agents, including metrics, dashboards, experiment tracking, and automated regression checks.
AI agent development, such as tool-using agents, planners, or multi-step coding workflows, and familiarity with agentic frameworks or patterns.
Experiment tracking and observability using tools like Weights & Biases, MLflow, Langfuse, or similar.
Inference optimization and serving optimized models in production.

#LI-KP1

We are an equal opportunity employer
We know great ideas can come from anyone, anywhere. That’s why we do our best to create an open and inclusive workplace – one that welcomes everyone regardless of their background, identity, religion, age, accessibility needs, or orientation.

We process the data provided in your job application in accordance with the Recruitment Privacy Policy.

Skills Required

Extensive hands-on experience training LLMs (pre-training, fine-tuning, or post-training) in a research or production setting
Deep expertise in modern deep learning frameworks such as PyTorch and specialized LLM training stacks (e.g., Megatron, NeMo, verl)
Strong theoretical and practical understanding of LLM fundamentals: architectures, tokenization, data pipelines, batching, mixed precision, distributed training, and debugging unstable runs
Ability to own projects end to end from problem definition through design, experimentation, implementation, and iteration
Product-aware mindset; translate product needs and failure modes into modeling and evaluation work
At least 3 years of Python experience writing clean, maintainable code in modern ML codebases
Experience with ML orchestrators and workflow tools such as Kubeflow, Dagster, Airflow, ZenML, or job schedulers like Kubernetes or SLURM
Experience with large-scale data and training pipelines, MapReduce-style clusters, multi-node GPU training, or very large compute workloads
Designing and maintaining evaluation pipelines for LLMs or agents, including metrics, dashboards, experiment tracking, and automated regression checks
Experience in AI agent development (tool-using agents, planners, multi-step coding workflows) and familiarity with agentic frameworks or patterns
Experiment tracking and observability experience using tools like Weights & Biases, MLflow, Langfuse, or similar
Inference optimization and serving optimized models in production

JetBrains Compensation & Benefits Highlights

The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about JetBrains and has not been reviewed or approved by JetBrains.

Fair & Transparent Compensation — Fair & Transparent Compensation: Pay is considered market-competitive for many product and engineering roles in key locations, with published salary ranges on some postings aiding expectation-setting. Feedback suggests total compensation feels fair relative to local markets even if packages are positioned as “competitive but not top‑of‑big‑tech”.
Flexible Benefits — Flexible Benefits: Work setup includes hybrid/remote options and flexible hours across many locations. The ability to work abroad for part of the year adds practical flexibility to where work gets done.
Leave & Time Off Breadth — Leave & Time Off Breadth: Time off includes additional vacation days beyond local minimums in many countries. U.S. materials also highlight PTO, sick leave, and holidays, underscoring breadth beyond statutory baselines.

Learn more about JetBrains's Compensation & Benefits →

JetBrains Insights

What's It Like to Work at JetBrains? JetBrains Culture & Values JetBrains Career Growth & Development What's the Work-Life Balance Like at JetBrains? JetBrains Leadership & Management JetBrains Company Growth, Stability & Outlook

View all jobs at JetBrains

View JetBrains Profile

Report Job

Am I A Good Fit?

beta

Get Personalized Job Insights.

Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company

HQ: Prague

2,209 Employees

Year Founded: 2000

What We Do

JetBrains creates intelligent software development tools consistently used and trusted by 11.4 million professionals and 88 Fortune Global Top 100 companies. Our lineup of more than 30 products includes IDEs for most programming languages and technologies, such as IntelliJ IDEA, PyCharm, and others, as well as products for team collaboration, like YouTrack and TeamCity. JetBrains is also known for creating the Kotlin programming language, a cross-platform language used by more than 5 million developers worldwide yearly and recommended by Google as the preferred language for Android development. The company is headquartered in Prague, Czech Republic, and has offices around the world. JetBrains IDEs * IntelliJ IDEA (Java and Kotlin Developers) * PyCharm (Python developers) * PhpStorm (PHP developers) * GoLand (Go developers) * Rider (.NET developers) * CLion (C and C++ developers) * Rust Rover (Rust developers) * WebStorm (JavaScript & TypesScript developers) * RubyMine (Ruby and Rails developers) * DataGrip (Tool for multiple databases) * ReSharper (Extension for Visual Studio) * Fleet (Multilingual IDE and code editor) * Aqua (IDE for test automation engineers) .NET & Visual Studio: * Rider (IDE for .NET developers) * ReSharper (Extension for Visual Studio) * ReSharper C++ (Visual Studio Extension for C++ developers) * dotCover (.NET Unit Test Runner and Code Coverage Tool) * dotMemory (.NET Memory Profiler) * dotTrace (.NET Performance Profiler) * dotPeek (.NET decompiler and assembly browser) Team Tools: * TeamCity (Powerful CI out of the box) * YouTrack (Project management for all your teams) * Space (Intelligent code collaboration platform) * Datalore (Collaborative data science platform) * Qodana (Code quality platform for teams) Programming Languages: * Kotlin (Programming Language for the JVM and Android) * MPS (Create Your Own Domain-Specific Language) Education: * JetBrains Academy (Learn and Teach Computer Science) Profile by JetBrains s.r.o.