Staff ML Data Engineer (Datagrid)

Posted 11 Hours Ago
7 Locations
In-Office or Remote
227K-313K Annually
Senior level
Cloud • Software
We are powering progress for our customers in the construction industry by connecting them on a global platform.
The Role
The Staff ML Data Engineer will design and build scalable data systems for machine learning, collaborating closely with researchers and engineers to enhance data pipelines and ensure data quality.
Summary Generated by Built In

We’re looking for a Staff ML Data Engineer to join Procore’s AI & Frontier Models organization. In this role, you’ll be responsible for designing and building the data systems that power frontier‑scale machine learning research and applied AI products, with a particular focus on spatial intelligence and multimodal data. The primary goal of this role is to ensure that researchers and engineers can reliably discover, curate, transform, and operate on large‑scale datasets that move from experimentation to production.

As a Staff ML Data Engineer, you’ll work closely with ML researchers, applied ML engineers, and system architects to turn ambiguous research needs into scalable, production‑ready data pipelines. You’ll remain deeply hands‑on while providing technical leadership in data architecture, quality, and operational excellence. This is an opportunity to shape how Procore builds, evaluates, and deploys frontier models by ensuring the underlying data systems are robust, observable, and designed for iteration.

This position reports into an Engineering Manager within Procore AI and will be based in our San Francisco office. We’re looking for someone to join us immediately.

What you’ll do
  • Act as the technical lead for data engineering efforts supporting frontier model research and applied ML systems.

  • Design, build, and maintain scalable batch and streaming pipelines for multimodal data (e.g., documents, images, spatial metadata).

  • Partner closely with researchers and architects to translate experimental workflows into reliable, repeatable data systems.

  • Lead the development of dataset curation, versioning, and lineage workflows that support rapid experimentation and reproducibility.

  • Establish and uphold standards for data quality, validation, observability, and cost efficiency across AI data pipelines.

  • Contribute to data architecture decisions spanning research environments and production systems.

  • Identify gaps or inefficiencies in existing data workflows and run proofs‑of‑concept to evaluate improvements.

  • Mentor other engineers through code reviews, design discussions, and hands‑on collaboration.

What we’re looking for
  • Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field, or equivalent practical experience.

  • 8+ years of experience designing and operating complex data systems in production or research‑adjacent environments.

  • Strong proficiency in SQL and Python; experience with data‑intensive or distributed systems.

  • Proven experience building scalable data pipelines that support machine learning training, evaluation, or inference workflows.

  • Solid understanding of data modeling, dataset lifecycle management, and data quality best practices.

  • Comfort operating in highly ambiguous problem spaces and collaborating closely with researchers and architects.

  • Demonstrated ability to lead through direct technical contribution, mentorship, and setting engineering standards.

  • Strong communication skills, with the ability to explain technical tradeoffs to both research and engineering audiences.

Nice to have experience with technologies such as:

  • ML & Research Data: Large‑scale dataset curation, annotation workflows, experiment tracking, reproducibility tooling

  • Data Platforms: Databricks, Spark, lakehouse architectures, cloud data warehouses

  • Streaming & Pipelines: Kafka, Pub/Sub, event‑driven data architectures

  • Orchestration & Observability: Airflow, Dagster, data quality and lineage tools

  • Cloud & Infrastructure: AWS or GCP, containerized data workloads, CI/CD, infrastructure‑as‑code

  • Performance & Cost: Optimizing data pipelines for GPU‑backed training and large‑scale inference workloads

Additional Information

Base Pay Range:

227,332.00 - 312,581.50 USD Annual

This role may also be eligible for Equity Compensation and/or Bonus Incentive Compensation. Procore is committed to offering competitive, fair, and commensurate compensation. Actual compensation will be based on a candidate’s job-related skills, experience, education or training, and location.

For Los Angeles County (unincorporated) Candidates:

Procore will consider for employment all qualified applicants, including those with arrest or conviction records, in accordance with the requirements of applicable federal, state, and local laws, including the City of Los Angeles’ Fair Chance Initiative for Hiring Ordinance, the Los Angeles County Fair Chance Ordinance for Employers, and the California Fair Chance Act.

A criminal history may have a direct, adverse, and negative relationship on the following job duties, potentially resulting in the withdrawal of the conditional offer of employment: 1. appropriately managing, accessing, and handling confidential information including proprietary and trade secret information, as well as accessing Procore's information technology systems and platforms; 2. interacting with and occasionally having unsupervised contact with internal/external customers, stakeholders, and/or colleagues; and 3. exercising sound judgment.

Top Skills

Airflow
AWS
Dagster
Databricks
GCP
Kafka
Pub/Sub
Python
Spark
SQL
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Carpinteria, CA
4,500 Employees
Year Founded: 2002

What We Do

At Procore Technologies, we’re collectively building towards what’s next for our employees, industry, customers, and global communities. Our cloud-based construction management software streamlines the entire lifecycle of a construction project, connecting field and office teams, centralizing data to mitigate risks, providing real-time financials, and more to help clients efficiently build everything from skyscrapers to hospitals to airports. Procore was founded in 2002, and we’ve since grown into a global company of groundbreakers working throughout North America, EMEA, and APAC. Coming together from across diverse backgrounds to be our best, we embrace a culture of ownership and excellence that gives our teams the tools to grow and thrive as they shape their careers – and the Procore of tomorrow. To learn more about Procore and how you can build what comes next for your career, visit us at https://careers.procore.com/.

Why Work With Us

We make each other better at Procore. Here, your career is not pre-defined and it can take many paths. While you own your career, we provide you with the support and opportunities to help you succeed. You can help us transform an industry while you are transforming your career.

Gallery

Gallery

Similar Jobs

Wells Fargo Logo Wells Fargo

Branch Manager Katy District

Fintech • Financial Services
Remote or Hybrid
16 Locations
205000 Employees

Coinbase Logo Coinbase

Senior Software Engineer

Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
Easy Apply
Remote
Canada
4700 Employees
191K-191K Annually

Affirm Logo Affirm

Software Engineer

Big Data • Fintech • Mobile • Payments • Financial Services
Easy Apply
Remote
Canada
2200 Employees
125K-175K Annually

Affirm Logo Affirm

Customer Advocacy Specialist I

Big Data • Fintech • Mobile • Payments • Financial Services
Easy Apply
Remote
Canada
2200 Employees
55K-75K Annually

Similar Companies Hiring

Milestone Systems Thumbnail
Software • Security • Other • Big Data Analytics • Artificial Intelligence • Analytics
Lake Oswego, OR
1500 Employees
Fairly Even Thumbnail
Software • Sales • Robotics • Other • Hospitality • Hardware
New York, NY
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account