Senior Software Engineer

Reposted 6 Days Ago
Be an Early Applicant
Hiring Remotely in Headquarters, AZ, USA
In-Office or Remote
180K-250K Annually
Senior level
Artificial Intelligence • Information Technology • Software
The Role
The Open Source Engineer will expand LanceDB's integration with big data systems, manage dataset operations, and enhance open-source community efforts.
Summary Generated by Built In
About LanceDB

LanceDB is the preeminent data platform for multimodal AI use cases. From hyper-scalable vector search to advanced retrieval for RAG, from streaming training data to interactive exploration of large-scale AI datasets, LanceDB is the best foundation for your AI application, and powers some of the most groundbreaking applications and challenging requirements today.

About the Role

We’re looking for a Senior Software Engineer to help expand the reach of Lance and LanceDB within the broader data infrastructure ecosystem. You’ll work at the intersection of high-performance computing, big data, and open-source systems. You will contribute scale and performance improvements, integrations with the wider data and AI ecosystem, simplifying distributed operations, and usability and maintainability enhancements.

You’ll be responsible for
  • Designing and maintaining efficient distributed Lance dataset operations

  • Building efficient indices to enable predicate pushdown and accelerate queries in Spark, Ray, or Trino

  • Working on table formats, data encodings, and various aspects of the Lance format in Rust

  • Driving open-source community efforts to integrate the Lance format with Spark, Hive Metastore, Presto, Trino, Ray, and other data infrastructure systems

  • Operating and improving internal data processing infrastructure

  • Promoting the Lance format in open-source communities and at Big Data conferences

Requirements
  • 10+ years of experience building high-performance databases, big data systems, or large-scale data services

  • Deep understanding of internals of open-source Big Data or AI training systems (e.g., Hadoop, Spark, Flink, Ray, Iceberg, Delta Lake, Hudi, ClickHouse, Trino, Presto, PyTorch, or JAX)

  • Strong experience with high-performance computing in C++, Java, and/or Scala

  • Experience with Rust (or willingness to learn it)

  • Proven ability to move fast, work independently, and collaborate with a high-caliber team

Nice to Have
  • Contributor, committer, or PMC member in Apache or other large open-source projects

  • Experience with Apache Arrow, DataFusion, Parquet, Iceberg, or Delta Lake

  • Track record of driving large features or integrations in distributed systems

  • Strong community presence and passion for open-source collaboration

What We Offer
  • A key role shaping an open-source project with real production usage

  • Remote-first team with flexible hours

  • Competitive compensation, equity, and benefits

  • Generous learning budget and support for open-source contributions

Why Join Us

You’ll join a world-class team of open-source builders, including co-authors of pandas, and contributors to HDFS, Arrow, Iceberg, and HBase. You’ll collaborate on systems that power next-generation AI workloads while shaping how LanceDB operates and scales production environments.

Skills Required

  • 5+ years of experience building high-performance databases, big data systems, or large-scale data services
  • Deep understanding of internals of open-source Big Data or AI training systems
  • Strong experience with high-performance computing in Java or Scala
  • Experience with Rust (or willingness to learn it)
  • Proven ability to move fast, work independently, and collaborate with a high-caliber team
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: San Francisco, California
29 Employees
Year Founded: 2022

What We Do

LanceDB is a developer-friendly, open source database for multimodal AI. From hyper scalable vector search to advanced retrieval for RAG, from streaming training data to interactive exploration of large scale AI datasets, LanceDB is the best foundation for your AI application.

Similar Jobs

Coinbase Logo Coinbase

Senior Software Engineer

Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
Easy Apply
Remote
USA
4700 Employees
186K-219K Annually

MongoDB Logo MongoDB

Senior Software Engineer

Big Data • Cloud • Software • Database
Easy Apply
Remote or Hybrid
2 Locations
5550 Employees
126K-248K Annually

Upstart Logo Upstart

Senior Software Engineer

Artificial Intelligence • Fintech • Machine Learning • Social Impact • Software
Easy Apply
Remote
United States
1500 Employees
167K-231K Annually

MongoDB Logo MongoDB

Senior Software Engineer

Big Data • Cloud • Software • Database
Easy Apply
Remote or Hybrid
United States
5550 Employees
147K-210K Annually

Similar Companies Hiring

Golden Pet Brands Thumbnail
Digital Media • eCommerce • Information Technology • Marketing Tech • Pet • Retail • Social Media
El Segundo, California
178 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account