Data Scientist - Reinforcement Learning

Posted Yesterday
Be an Early Applicant
Hiring Remotely in United States
Remote or Hybrid
Mid level
Information Technology • Database • Consulting
The Role
Design and implement reinforcement learning models and sequential behavioral models to optimize collections and treatment strategies. Build scalable ML pipelines (Databricks), run simulations and offline policy evaluation, and collaborate with stakeholders to deploy adaptive decisioning systems under uncertainty.
Summary Generated by Built In

Key Responsibilities
•    Design and develop Reinforcement Learning models to optimize collections strategies, customer treatment paths, and recovery outcomes. 
•    Build adaptive decisioning systems using techniques such as: 
o    Q-Learning 
o    Deep Q Networks (DQN) 
o    Policy Gradient Methods 
o    Contextual Bandits 
o    Markov Decision Processes (MDP) 
•    Develop sequential and behavioral models for customer engagement, repayment prediction, and collections prioritization. 
•    Apply stochastic modeling and probabilistic methods to optimize dynamic treatment strategies under uncertainty. 
•    Collaborate with business stakeholders to translate collections and risk management problems into scalable AI/ML solutions. 
•    Build and maintain machine learning pipelines in Databricks or similar distributed computing environments. 
•    Conduct experimentation, simulation, and offline policy evaluation to validate RL strategies before deployment. 
 

Responsibilities

Preferred / Good-to-Have Skill
•    Experience in collections, credit risk, customer analytics, or financial services domains. 
•    Familiarity with: 
o    Deep Learning frameworks (TensorFlow, PyTorch) 
o    MLOps and CI/CD workflows 
o    Real-time decision systems 
o    Cloud platforms such as AWS, Azure, or GCP

Qualifications

Must-Have Qualifications
•    Strong experience in Reinforcement Learning and sequential decision-making systems. 
•    Hands-on expertise with: 
o    Reinforcement Learning algorithms (Q-Learning, DQN, PPO, Bandits, etc.) 
o    Markov Decision Processes (MDP) 
 

Skills Required

  • Strong experience in Reinforcement Learning and sequential decision-making systems
  • Hands-on expertise with RL algorithms (Q-Learning, DQN, PPO, Bandits)
  • Experience with Markov Decision Processes (MDP)
  • Build and maintain machine learning pipelines in Databricks or similar distributed computing environments
  • Experience in collections, credit risk, customer analytics, or financial services domains
  • Familiarity with Deep Learning frameworks (TensorFlow, PyTorch)
  • Familiarity with MLOps and CI/CD workflows
  • Familiarity with real-time decision systems
  • Familiarity with cloud platforms such as AWS, Azure, or GCP
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
30,246 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account