Researcher, Vision

Posted Yesterday
Be an Early Applicant
Bengaluru, Bengaluru Urban, Karnataka, IND
In-Office
Senior level
Artificial Intelligence • Software
The Role
Conduct research across vision-language model lifecycles: architecture, training (pretraining, SFT, RLHF, DPO), data strategies, evaluation and benchmarks for Indic multimodal tasks; diagnose failures, improve robustness and interpretability, prototype at scale, and engage with open-source and research community.
Summary Generated by Built In
About Sarvam

Sarvam is building the bedrock of Sovereign AI for India. The company is developing India's full-stack sovereign AI platform, building across research, models, infrastructure and applications with a singular focus on making AI genuinely work for India. Sarvam works with leading enterprises and public institutions and is backed by Lightspeed, Peak XV, and Khosla Ventures. Sarvam partners with India's leading brands, including Tata Capital, SBI Life, CRED, IDFC, and LIC.

About the Role

You will work across the full lifecycle of vision-language model (VLM) development — data, training, evaluation, and production. The team's scope will evolve as the field does; we want researchers who are comfortable with that and can lead.

What You'll Do
  • Research vision-language architectures — encoders, fusion mechanisms, pretraining objectives, and scaling behaviour

  • Design training methods (pretraining, SFT, RLHF, DPO) adapted for multilingual VLMs

  • Investigate data strategies — what mixtures, quality signals, and synthetic data approaches actually move the needle

  • Build evaluation frameworks and benchmarks, especially for Indic multimodal tasks

  • Study model failure modes, robustness, and interpretability

  • Work closely with engineers to ensure ideas are testable at scale — prototype fast, then validate properly

  • Engage with the broader research community through open-source contributions and collaborations

What We're Looking For
  • Deep understanding of vision-language models — training dynamics, architecture tradeoffs, and failure modes

  • Track record of good research — through publications, technical reports, or impactful shipped work

  • Rigorous experimental design — able to isolate variables and draw defensible conclusions

  • Strong PyTorch skills — runs experiments end to end

  • Intellectual range — willing to work across data, training, and evaluation problems

Bonus Points
  • PhD/Master's with relevant research experience in ML, Computer Vision, NLP, or related field

  • Research papers published at A/A* venues

  • Experience with multilingual or low-resource language modelling

  • Familiarity with document understanding, OCR, or structured visual prediction

  • Experience with large-scale data curation and its effect on model quality

Why Sarvam?

Sarvam is a fast-moving, high talent-density team building full-stack AI for India, working on problems that push the frontiers of AI with real population-scale impact.

  • Work alongside researchers, engineers, builders, and business leaders who move fast and hold each other to a very high bar

  • High ownership and high impact, from day one

  • Everything we do is AI-first, from the way we build and ship to the way we think about problems

  • You can work on problems that could change how an entire country learns, works, and communicates

If you want to work on problems at the frontier of AI in India, Sarvam is the place to be.

Skills Required

  • Deep understanding of vision-language models, training dynamics, architecture tradeoffs, and failure modes
  • Track record of research via publications, technical reports, or impactful shipped work
  • Rigorous experimental design skills to isolate variables and draw defensible conclusions
  • Strong PyTorch skills with end-to-end experiment execution
  • Ability to work across data, training, and evaluation problems for VLMs and productionize research
  • PhD or Master's with relevant ML/CV/NLP research experience
  • Publications at top-tier (A/A*) venues
  • Experience with multilingual or low-resource language modelling
  • Familiarity with document understanding, OCR, or structured visual prediction
  • Experience with large-scale data curation and its impact on model quality
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Bangalore, Karnataka
50 Employees
Year Founded: 2023

What We Do

We are an AI/ML research and development company on a mission to build reliable, performant, enterprise-grade AI systems at scale for India. We are committed to build the full-stack for generative AI for the rich & diverse landscape of India, mainly investing in: 1) Models: developing both efficient large scale Indic language models as well as bespoke enterprise models 2) Platform: building an enterprise-grade platform that empowers organisations to develop and ship creative and performant genAI applications at scale 3) Ecosystem: contributing to open-source models and datasets, as well as leading efforts for large scale data curation in public-good space

Similar Jobs

Optum Logo Optum

Software Engineering Lead

Artificial Intelligence • Big Data • Healthtech • Information Technology • Machine Learning • Software • Analytics
In-Office
Bengaluru, Bengaluru Urban, Karnataka, IND
160000 Employees

Cleo Logo Cleo

EDI - Technical Solutions Manager

Cloud • eCommerce • Information Technology • Professional Services • Software
Hybrid
Bengaluru, Bengaluru Urban, Karnataka, IND
500 Employees

Ericsson Logo Ericsson

Senior Engineer

Cloud • Information Technology • Internet of Things • Machine Learning • Software • Cybersecurity • Infrastructure as a Service (IaaS)
In-Office
Bangalore, Bengaluru Urban, Karnataka, IND
88000 Employees

LogicMonitor Logo LogicMonitor

Software Engineer

Artificial Intelligence • Cloud • Information Technology • Machine Learning • Software
Easy Apply
Hybrid
2 Locations
1100 Employees
3-3 Annually

Similar Companies Hiring

Hanover Park Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
42 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account