LLM Applied Data Scientist (RAG/ NLP)

Posted Yesterday
Be an Early Applicant
3 Locations
In-Office or Remote
Senior level
Blockchain • Fintech • Software • Cryptocurrency • Metaverse
The Role
Advance reasoning and planning of foundation models by building data acquisition, SFT, reward modelling, and RL workflows. Design scalable retrieval, indexing, vector search, reranking, and RAG pipelines; research multimodal/graph retrieval and advanced decoding (MCTS, A*); teach models to use external tools/APIs; build agents and evaluation methodologies; collaborate to productionise large-scale LLM/VLM/agent systems.
Summary Generated by Built In
Binance is a leading global blockchain ecosystem behind the world’s largest cryptocurrency exchange by trading volume and registered users. We are trusted by 300+ million people in 100+ countries for our industry-leading security, user fund transparency, trading engine speed, deep liquidity, and an unmatched portfolio of digital-asset products. Binance offerings range from trading and finance to education, research, payments, institutional services, Web3 features, and more. We leverage the power of digital assets and blockchain to build an inclusive financial ecosystem to advance the freedom of money and improve financial access for people around the world.


About the Role
We are seeking a highly skilled Research Scientist/Engineer to advance the reasoning and planning capabilities of large foundation models. In this role, you will enhance model performance across the entire development lifecycle—including data acquisition, supervised fine-tuning (SFT), reward modelling, and reinforcement learning—while driving innovations in reasoning and decision-making. You will synthesise large-scale, high-quality datasets through rewriting, augmentation, and generation techniques to strengthen foundation models during pretraining, SFT, and RL stages. A key part of the role involves solving complex tasks using System 2 thinking and applying advanced decoding strategies such as MCTS and A*. You will design and implement robust evaluation methodologies, teach models to interact with external tools, APIs, and code interpreters, and build agents and multi-agent systems capable of addressing sophisticated real-world problems.

Responsibilities

  • Design, develop, and optimize data processing and retrieval pipelines for enterprise-level generative tasks and mode training applications   (Customer Service, Token Report, Web3 Domain Models). This includes embedding, reranking, context engineering, and query rewriting models.
  • Research and evaluate advanced AI-native retrieval algorithms (e.g., low-latency, multimodal retrieval, hierarchical retrieval, GraphRAG) to strengthen large-scale LLM/VLM/Agentic AI capabilities in Binance products.
  • Collaborate with infrastructure and application teams to integrate RAG pipelines into production systems, ensuring scalability, reliability, and measurable business impact.
  • Develop and optimize retrieval and ranking pipelines (indexing, vector search, retrieval scoring, reranking) to improve user experience.
  • Participate in LLM training and RAG system, staying current with techniques such as pre-training, SFT, and reinforcement learning, and apply them to retrieval and generation tasks.
  • Apply NLP, CV, and multimodal methods to analyze user-generated content (classification, quality evaluation, trend detection, comment analysis).

Requirement

  • Master’s in Information Retrieval, NLP, Machine Learning, Computer Vision, Multimodal Learning, or related fields.
  • Proficient in PyTorch with strong coding skills in Python or C++.
  • Strong communication skills, intellectual curiosity, and passion for lifelong learning. Able to identify opportunities and drive cutting-edge retrieval & RAG technologies into real-world applications.
  • Solid theoretical foundation in information retrieval, NLP, and deep learning (experience with embeddings, reranking, query understanding preferred).
  • Hands-on experience with RAG, vector databases, multimodal/graph retrieval, or large-scale AI systems.
  • Strong engineering ability to translate research into scalable, production-level systems.
  • Self-driven, able to own projects end-to-end (design → implementation → deployment).
  • Publications in top-tier conferences/journals (NeurIPS, ICML, ACL, CVPR, SIGIR, KDD, WWW) are a plus; awards in ACM/ICPC or similar competitions preferred.

Why Binance
• Shape the future with the world’s leading blockchain ecosystem
• Collaborate with world-class talent in a user-centric global organization with a flat structure
• Tackle unique, fast-paced projects with autonomy in an innovative environment
• Thrive in a results-driven workplace with opportunities for career growth and continuous learning
• Competitive salary and company benefits
• Work-from-home arrangement (the arrangement may vary depending on the work nature of the business team)

Binance is committed to being an equal opportunity employer. We believe that having a diverse workforce is fundamental to our success.
By submitting a job application, you confirm that you have read and agree to our Candidate Privacy Notice.

Why Binance
• Shape the future with the world’s leading blockchain ecosystem
• Collaborate with world-class talent in a user-centric global organization with a flat structure
• Tackle unique, fast-paced projects with autonomy in an innovative environment
• Thrive in a results-driven workplace with opportunities for career growth and continuous learning
• Competitive salary and company benefits
• Work-from-home arrangement (the arrangement may vary depending on the work nature of the business team)

Binance is committed to being an equal opportunity employer. We believe that having a diverse workforce is fundamental to our success.
By submitting a job application, you confirm that you have read and agree to our Candidate Privacy Notice.

Skills Required

  • Master's in Information Retrieval, NLP, Machine Learning, Computer Vision, Multimodal Learning, or related field
  • Proficient in PyTorch
  • Strong coding skills in Python or C++
  • Solid theoretical foundation in information retrieval, NLP, and deep learning (embeddings, reranking, query understanding)
  • Hands-on experience with RAG, vector databases, multimodal/graph retrieval, or large-scale AI systems
  • Strong engineering ability to translate research into scalable, production-level systems (design -> implementation -> deployment)
  • Experience with retrieval/ranking pipelines (indexing, vector search, retrieval scoring, reranking) and building data/retrieval pipelines
  • Ability to design and implement evaluation methodologies and teach models to interact with external tools/APIs/code interpreters
  • Strong communication skills, intellectual curiosity, and self-driven ownership of end-to-end projects
  • Publications in top-tier conferences/journals (NeurIPS, ICML, ACL, CVPR, SIGIR, KDD, WWW)
  • Awards in ACM/ICPC or similar competitions

Binance Compensation & Benefits Highlights

The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about Binance and has not been reviewed or approved by Binance.

  • Career-Linked Recognition & Rewards Performance-linked bonuses can be sizable in favorable crypto cycles, lifting total compensation. Attractive packages in engineering and specialized roles indicate strong rewards for in-demand skills.
  • Flexible Benefits Remote-first flexibility and work-from-anywhere options add meaningful value to the overall rewards package. Flexible schedules and location independence are presented as core perks.
  • Retirement Support Binance.US includes a 401(k) as part of its benefits. This provides a conventional retirement pillar alongside cash and bonus components.

Binance Insights

Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
Melbourne, VIC
7,696 Employees
Year Founded: 2017

What We Do

Binance is the world’s leading blockchain and cryptocurrency infrastructure provider with a financial product suite that includes the largest digital asset exchange by volume. Trusted by millions worldwide, the Binance platform is dedicated to increasing the freedom of money for users, and features an unmatched portfolio of crypto products and offerings, including: trading and finance, education, data and research, social good, investment and incubation, decentralization and infrastructure solutions, and more. For more information, visit: https://www.binance.com

Similar Jobs

Remote
Hong Kong
25000 Employees

Collectors Logo Collectors

Business Development Manager

Consumer Web • eCommerce • Machine Learning • Software • Sports • Analytics
Remote
Hong Kong China
2246 Employees

TransUnion Logo TransUnion

Business Analyst

Big Data • Fintech • Information Technology • Business Intelligence • Financial Services • Cybersecurity • Big Data Analytics
Remote or Hybrid
Hong Kong
13000 Employees

Citadel Logo Citadel

Quantitative Researcher

Information Technology • Software • Financial Services • Big Data Analytics
In-Office or Remote
2 Locations
4000 Employees

Similar Companies Hiring

Hanover Park Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
42 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account