Cloud Quant

Sorry, this job was removed at 04:11 p.m. (CST) on Wednesday, May 21, 2025
New York, NY
In-Office
Machine Learning • Generative AI
The Role
About Us:

Modal is building the serverless compute platform to support the next generation of AI companies. In order to deliver the developer experience we wanted, we went deep and built our own infrastructure—including our own custom file system, container runtime, scheduler, container image builder, and much more.

We're a small team based out of New York, Stockholm and San Francisco. In just one year, we've reached 8-figure revenue, tripled our headcount, scaled to support thousands of GPUs, and raised over $32M in funding.

Working at Modal means joining one of the fastest-growing AI infrastructure organizations at an early stage, with many opportunities to grow within the company. Our team includes creators of popular open-source projects (e.g. Seaborn, Luigi), academic researchers, international olympiad medalists, and engineering and product leaders with decades of experience.

The Role:

Modal is looking for someone to fill a role that lacks a conventional title but could roughly be described as a "Cloud Quant" or "Coding CFO".

The background is that Modal spends millions of dollars every month on cloud costs, and there are many opportunities to optimize these costs:

  • Engaging with new cloud providers and getting quotes

  • Finding pockets of cheap capacity in existing cloud providers

  • Improving GPU utilization

  • Negotiating with existing vendors

  • Optimizing pricing by picking the right trade-off between growth and revenue

This is a role that mixes vendor negotiations, data science, software engineering, and trading. We want people who are deep systems thinkers and love optimizing things. The actual work spans a very wide range of activities, from deep coding to lots of vendor management.

Besides what's mentioned above, the job will also entail things like:

  • Reaching out to potential new vendors and getting the best pricing

  • Pulling data from various cloud APIs and analyzing it

  • Downloading large datasets of dollar spend and analyzing them

  • Working with software engineers at Modal to roll out optimizations to how we manage our large GPU fleet

  • Figuring out new creative ways to improve our pricing, maybe by bundling similar GPU types, having "surge pricing", or offering discounts for nightly batch jobs

  • Reporting directly to the CEO

Requirements:

We think the ideal candidate has a mix of skills. In particular, we expect you:

  • Have several years of Python and SQL experience

  • Have done a fair bit of data science, data visualization, and some statistics

  • Have experience working with cloud vendors (AWS, etc.)

  • Love thinking about how to optimize stuff, especially the bottom line

  • Don't mind getting on the phone with vendors and negotiating big contracts

  • Love thinking about businesses as big complex systems

  • Want to work out of our HQ in NYC (or for exceptional candidates, in SF)

  • Love telling stories about how you made your previous company lots of money

The Company
HQ: New York City, New York
50 Employees

What We Do

Deploy generative AI models, large-scale batch jobs, job queues, and more on Modal's platform. We help data science and machine learning teams accelerate development, reduce costs, and effortlessly scale workloads across thousands of CPUs and GPUs.

Our pay-per-use model ensures you're billed only for actual compute time, down to the CPU cycle. No more wasted resources or idle costs—just efficient, scalable computing power when you need it.
