Weather Data Scientist (Data Assimilation)

Reposted 11 Days Ago
Be an Early Applicant
New Delhi, Delhi, IND
In-Office
Senior level
Artificial Intelligence • Machine Learning • Energy • Utilities
The Role
The Weather Data Scientist will develop and implement data assimilation pipelines for weather forecasting, focusing on observational data quality and machine learning-driven forecasts.
Summary Generated by Built In
Weather Data Scientist: Data Assimilation


Working hours: The team is distributed across India and the US, so expect a few hours of evening overlap with US Pacific Time on most workdays.


Overview

About Pravāh

Pravāh is an AI lab building foundational intelligence for the electric grid. We apply modern machine learning to complex physical infrastructure problems spanning grid operations, weather, and geospatial systems.

Our work sits at the intersection of computer vision, physical systems, and large-scale ML, with deployments across utilities in the United States and India. We leverage multimodal data including satellite imagery, LiDAR, and street-level data to build high-fidelity representations of grid assets and their surroundings.

We are backed by Khosla Ventures, Pear VC, and Conviction - some of the most ambitious investors in Silicon Valley.

More about who we are, what we are building, and why we are excited: Website, Pravāh on Notion.


The role

We are hiring a Weather Data Scientist to advance the next generation of weather forecasting systems for India, with strong attention to observational data quality and geospatial consistency. You will work closely with machine learning and software engineers on three core threads:

1. Data assimilation: contribute hands-on to data assimilation for weather forecasting models.

2. ML-ready datasets: procure, process, and create ML-ready global and regional weather datasets at large scale (high volume, multi-source, long time horizons), with explicit focus on data-sparse regions.


What you'll work on

· Build and operate a cycling data assimilation pipeline for our operational forecasting models, and produce the high-resolution gridded products it enables downstream.

· Choose, deploy, and adapt a modern DA framework (e.g. JEDI/UFO, GSI, DART, PDAF) for our regional and global needs.

· Develop observation quality control, bias correction (VarBC), and thinning workflows that hold up at operational data volumes and degrade gracefully when feeds drop out.

· Contribute to AI-based data assimilation pipelines.

· Tailor weather prediction models to renewable-sector needs, particularly solar (GHI) and wind generation (100m winds).

· Assist in training AI-based weather prediction models.

· Work at the intersection of physics-based modeling and machine learning—hybrid physics–ML systems, learned parameterizations, and emulators.

Who you areRequired qualifications

· A master's or PhD in geophysical sciences, physics, applied mathematics, computer science, statistics, or a related field. A bachelor's degree with 3+ years of relevant research or operational experience is also acceptable.

· Demonstrated depth in data assimilation, evidenced by operational work, model contributions, research projects, publications, or technical reports.

· Hands-on experience across the DA toolkit: observation operators and error specification; variational (3D-/4D-Var) or ensemble (EnKF, LETKF, EDA) methods; cycling workflows and innovation statistics; and assimilation of satellite, radar, radiosonde, or station observations.

· Hands-on experience with at least one operational DA framework: JEDI/UFO, GSI, DART, PDAF, or an in-house equivalent, including building observation operators and forward models.

· Working knowledge of bias correction (VarBC), adaptive QC, and gross-error rejection.

· Experience contributing to or maintaining assimilation code, or holding responsibility in an operational or quasi-operational forecasting pipeline.

· Experience working with TB-scale, high-dimensional observational and modeling datasets (reanalysis, satellite, radar, weather-station, and sounding data) and the geospatial pipework (grids, reprojection, masks) around them.

· Hands-on experience with widely used reference datasets such as ERA5, MERRA-2, IMDAA, IMERG/GPM, and GOES/INSAT/Himawari.

· Practical experience on High Performance Computers (HPCs).

· Fluency in the modern geoscience Python stack—xarray, dask, zarr, netCDF.

· Experience building reproducible, production-grade pipelines.

· Excellent written and verbal communication, including the ability to explain technical work to both domain experts and cross-disciplinary collaborators.

Nice to have

· Prior work on projects specific to Indian geography.

· Familiarity with coupled earth-system models.

· Experience with any of: ensemble and probabilistic forecasting, regional downscaling, or subseasonal-to-seasonal (S2S) prediction.

· Experience working with operational forecasting agencies (IMD, NCMRWF, ECMWF, NOAA, etc.).

· Familiarity with AI-based weather prediction models and data assimilation techniques.

· Comfort using agentic AI tools to accelerate development.

· Publications in respected atmospheric, oceanic, or climate science venues.


What you'll gain

· Part of development of weather forecasting models deployed for real-time applications.

· Experience working on hard, open-ended problems at the intersection of AI and physical infrastructure.

· Exposure to how teams set priorities and push the frontier of AI weather prediction.

· Close collaboration with a deeply technical team.


Why this role

This role sits at the frontier of the AI weather revolution, applying modern machine learning to earth system modeling. The next decade of progress in weather and climate prediction will be built by scientists who understand the physics and the data and have learned to wield generative AI. You will work in data-sparse regions where data is heterogeneous, ground truth is incomplete, and progress requires both technical depth and first-principles thinking.

Skills Required

  • Master's or PhD in geophysical sciences, physics, applied mathematics, computer science, statistics, or related field
  • Demonstrated depth in data assimilation
  • Hands-on experience with operational DA frameworks
  • Fluency in the modern geoscience Python stack
  • Experience working with operational forecasting agencies
  • Publications in atmospheric, oceanic, or climate science venues
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
170 Employees
Year Founded: 1993

What We Do

Pravāh is a Stanford-founded, AI-powered grid intelligence company building foundational intelligence for the electric grid. They develop real-time forecasting and optimization systems to make electricity cleaner, more affordable, and reliable.

Similar Jobs

CrowdStrike Logo CrowdStrike

Regional Sales Manager

Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
Hybrid
Delhi, Connaught Place, New Delhi, Delhi, IND
10000 Employees

Capco Logo Capco

BA/PM_Loans

Fintech • Professional Services • Consulting • Energy • Financial Services • Cybersecurity • Generative AI
Remote or Hybrid
India
6000 Employees

Tufin Logo Tufin

Devops Engineer

Security • Cybersecurity
Remote or Hybrid
India
500 Employees

CrowdStrike Logo CrowdStrike

Sr. Google Workspace Administrator (Remote, IND)

Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
Remote or Hybrid
India
10000 Employees

Similar Companies Hiring

Idler Thumbnail
Artificial Intelligence
San Francisco, California
6 Employees
Hanover Park Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
42 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account