Responsibilities
- Own the full LLM pipeline from data preparation to production real case usage.
- Design, iterate and optimize prompts (zero-/few-shot, chain-of-thought, tool-calling, etc.) to maximize model utility and safety across products and languages.
- Build and maintain Retrieval-Augmented Generation (RAG) QA/search systems that connect to multi-source knowledge bases.
- Familiar with vLLM/SGLang inference architectures and have proven experience deploying and operating LLM services on multi‑GPU or cluster environments.
- Design, implement and operate multi‑agent LLM architectures (e.g. LangGraph, CrewAI, AutoGen) including task decomposition, agent orchestration, memory sharing and tool‑calling workflows.
- Develop evaluation pipelines (automatic metrics & human feedback) to measure prompt and model quality, bias, and hallucination rates.
- Collaborate with product and CS teams to integrate AI models into conversational Chatbot in different scenarios.
- Track cutting-edge research, author tech blogs, and keep improve current architecture.
Requirements
- Master’s Degree or higher in Computer Science, Data Science or related field..
- At least 2 years of deep-learning/NLP experience, including 1+ year practical LLM work (SFT, DPO, RAG, quantization, inference optimization, etc.).
- Demonstrated prompt engineering & tuning expertise (few-shot design, structured prompting, prefix-/p-tuning, reward re-ranking, safety filtering).
- Practical experience building and deploying multi‑agent LLM workflows, with understanding of agent‑orchestrator patterns, shared memory, long‑horizon planning and guard‑rail design.
- Proficient in both English and Chinese communication for efficient cross team collaboration
Skills Required
- Master's Degree or higher in Computer Science, Data Science or related field
- At least 2 years deep-learning/NLP experience including 1+ year practical LLM work (SFT, DPO, RAG, quantization, inference optimization)
- Demonstrated prompt engineering and tuning expertise (few-shot, structured prompting, prefix-/p-tuning, reward re-ranking, safety filtering)
- Practical experience building and deploying multi-agent LLM workflows and agent-orchestrator patterns
- Familiarity with vLLM/SGLang inference architectures and proven experience deploying/operating LLM services on multi-GPU or cluster environments
- Proficiency in both English and Chinese for cross-team communication
Binance Compensation & Benefits Highlights
The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about Binance and has not been reviewed or approved by Binance.
-
Career-Linked Recognition & Rewards — Performance-linked bonuses can be sizable in favorable crypto cycles, lifting total compensation. Attractive packages in engineering and specialized roles indicate strong rewards for in-demand skills.
-
Flexible Benefits — Remote-first flexibility and work-from-anywhere options add meaningful value to the overall rewards package. Flexible schedules and location independence are presented as core perks.
-
Retirement Support — Binance.US includes a 401(k) as part of its benefits. This provides a conventional retirement pillar alongside cash and bonus components.
Binance Insights
What We Do
Binance is the world’s leading blockchain and cryptocurrency infrastructure provider with a financial product suite that includes the largest digital asset exchange by volume. Trusted by millions worldwide, the Binance platform is dedicated to increasing the freedom of money for users, and features an unmatched portfolio of crypto products and offerings, including: trading and finance, education, data and research, social good, investment and incubation, decentralization and infrastructure solutions, and more. For more information, visit: https://www.binance.com







