Senior Data Scientist

Posted 2 Days Ago
5 Locations
In-Office
Senior level
AdTech • Marketing Tech
The Role
Design, develop, and deploy end-to-end ML pipelines and semantic AI solutions (RAG, embeddings, dense/sparse retrieval). Migrate cloud pipelines to Azure Databricks, optimise embedding storage and vector indexes, build forecasting and NLP models, and deliver production-grade solutions for classification, clustering, and recommendation use cases.
Summary Generated by Built In

Job Description:

Senior Data Scientist — ML & Semantic AI

Technologies: Azure · NLP · RAG · Semantic Matching · Python

Role Summary

We are looking for a Senior Data Scientist with expertise in Python, Azure Cloud, and NLP to build and enhance machine learning models at scale. The role covers embedding optimisation, semantic matching, LDA topic modelling, RAG architectures, dense and sparse retrieval pipelines, and the migration of cloud-native data pipelines to Azure Databricks.

Core Requirements
  • Design and execute end-to-end machine learning pipelines including data extraction, preprocessing, feature engineering, model development, tuning, and deployment.
  • Develop machine learning pipelines using Azure Synapse, Databricks, and Snowflake.
  • Build and deploy classification, regression, and clustering models.
  • Develop and deploy proof-of-concept solutions for client use cases.
  • Implement semantic matching and similarity search using cosine similarity, dot-product scoring, and bi-encoder/cross-encoder architectures (e.g., SBERT, sentence-transformers).
  • Build embedding models by fine-tuning pre-trained models, and optimise embedding storage in vector stores and indexes such as Chroma DB, FAISS, and Azure AI Search.

Model Development & Optimisation
  • Train and optimise models for new data providers with dynamic input handling.
  • Improve LDA model performance for large-scale topic modelling.
  • Implement hybrid semantic search by combining dense and sparse retrieval methods.
  • Optimise RAG architectures and retrieval QA systems for chatbot and recommendation performance.
  • Enable semantic query understanding using intent classification and query expansion techniques.

Forecasting & NLP
  • Develop forecasting models for marketing, demand prediction, and trend analysis.
  • Apply NLP-based forecasting techniques that use sentiment signals and external data sources.
  • Use semantic similarity for audience intelligence, including zero-shot and few-shot classification techniques.

Data Pipeline & Cloud Migration
  • Migrate data pipelines from Azure Synapse to Azure Databricks and retrain models accordingly.
  • Optimise embedding storage and retrieval within Azure AI Search.
  • Perform vector index tuning including HNSW optimisation and ANN benchmarking for production systems.

Required Skills & Tools

Python, Azure Databricks, Azure ML, Azure Synapse, Azure Blob Storage, Scikit-learn, NumPy, Pandas, Hugging Face, sentence-transformers, FAISS, Chroma DB, Azure AI Search, LangChain, TensorFlow, PyTorch, Statsmodels, Azure OpenAI.

Location:

DGS India - Mumbai - Thane Ashar IT Park

Brand:

Merkle

Time Type:

Full time

Contract Type:

Permanent

Skills Required

  • Expertise in Python
  • Experience with Azure Cloud platform (Azure Databricks, Azure ML, Azure Synapse, Azure Blob Storage, Azure AI Search)
  • Experience building end-to-end ML pipelines including data extraction, preprocessing, feature engineering, model development, tuning, and deployment
  • Experience with Snowflake
  • Experience in NLP, semantic matching, and similarity search (cosine similarity, dot-product, bi-encoder/cross-encoder architectures)
  • Experience with embeddings: fine-tuning pre-trained models and optimising embedding storage (Chroma DB, FAISS, Azure AI Search)
  • Experience with retrieval-augmented generation (RAG) architectures and retrieval QA systems
  • Experience implementing hybrid dense and sparse retrieval pipelines and vector index tuning (HNSW, ANN benchmarking)
  • Experience with sentence-transformers / SBERT and Hugging Face ecosystem
  • Experience building and deploying classification, regression, clustering, and forecasting models (including NLP-based forecasting)
  • Proficiency with ML libraries: Scikit-learn, NumPy, Pandas, Statsmodels
  • Experience with deep learning frameworks: TensorFlow and/or PyTorch
  • Experience with LangChain and Azure OpenAI
  • Experience migrating cloud-native data pipelines (Azure Synapse to Azure Databricks) and retraining models

dentsu Compensation & Benefits Highlights

The following summarizes recurring compensation and benefits themes found in responses that popular LLMs generated to common candidate questions about dentsu. It has not been reviewed or approved by dentsu.

  • Parental & Family Support: Paid parental leave at full pay and caregiver supports (including backup care) are emphasized as standout elements. Feedback suggests family-oriented benefits are a strong part of the package.
  • Leave & Time Off Breadth: Flexible or unlimited PTO, extensive paid holidays, and a year-end office closure are established components. Feedback suggests time-off policies are generous and add meaningful flexibility.
  • Retirement Support: A large, established 401(k) plan with employer matching is clearly documented. Feedback suggests retirement benefits feel competitive and straightforward.

The Company
15,492 Employees

What We Do

We are dentsu. We team together to help brands predict and plan for disruptive future opportunities and create new paths to growth in the sustainable economy. We know people better than anyone else and we use those insights to connect brand, content, commerce and experience, underpinned by modern creativity. We are the network designed for what’s next.
