EXL

Data Scientist - Reinforcement Learning

Posted Yesterday

Be an Early Applicant

Hiring Remotely in United States

Remote or Hybrid

Mid level

Information Technology • Database • Consulting

The Role

Design and implement reinforcement learning models and sequential behavioral models to optimize collections and treatment strategies. Build scalable ML pipelines (Databricks), run simulations and offline policy evaluation, and collaborate with stakeholders to deploy adaptive decisioning systems under uncertainty.

Summary Generated by Built In

Key Responsibilities
•    Design and develop Reinforcement Learning models to optimize collections strategies, customer treatment paths, and recovery outcomes.
•    Build adaptive decisioning systems using techniques such as:
o    Q-Learning
o    Deep Q Networks (DQN)
o    Policy Gradient Methods
o    Contextual Bandits
o    Markov Decision Processes (MDP)
•    Develop sequential and behavioral models for customer engagement, repayment prediction, and collections prioritization.
•    Apply stochastic modeling and probabilistic methods to optimize dynamic treatment strategies under uncertainty.
•    Collaborate with business stakeholders to translate collections and risk management problems into scalable AI/ML solutions.
•    Build and maintain machine learning pipelines in Databricks or similar distributed computing environments.
•    Conduct experimentation, simulation, and offline policy evaluation to validate RL strategies before deployment.

Responsibilities

Preferred / Good-to-Have Skill
•    Experience in collections, credit risk, customer analytics, or financial services domains.
•    Familiarity with:
o    Deep Learning frameworks (TensorFlow, PyTorch)
o    MLOps and CI/CD workflows
o    Real-time decision systems
o    Cloud platforms such as AWS, Azure, or GCP

Qualifications

Must-Have Qualifications
•    Strong experience in Reinforcement Learning and sequential decision-making systems.
•    Hands-on expertise with:
o    Reinforcement Learning algorithms (Q-Learning, DQN, PPO, Bandits, etc.)
o    Markov Decision Processes (MDP)

Skills Required

Strong experience in Reinforcement Learning and sequential decision-making systems
Hands-on expertise with RL algorithms (Q-Learning, DQN, PPO, Bandits)
Experience with Markov Decision Processes (MDP)
Build and maintain machine learning pipelines in Databricks or similar distributed computing environments
Experience in collections, credit risk, customer analytics, or financial services domains
Familiarity with Deep Learning frameworks (TensorFlow, PyTorch)
Familiarity with MLOps and CI/CD workflows
Familiarity with real-time decision systems
Familiarity with cloud platforms such as AWS, Azure, or GCP

View all jobs at EXL

View EXL Profile

Report Job

Am I A Good Fit?

beta

Get Personalized Job Insights.

Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.