Founding Data Engineer, AI Platform

Posted Yesterday
Hiring Remotely in San Mateo, CA, USA
In-Office or Remote
165K-350K Annually
Senior level
Artificial Intelligence • Legal Tech
The AI for in-house counsel
The Role
Founding data engineer responsible for consolidating multiple data sources into a BigQuery warehouse, building ETL/ELT pipelines, creating self-serve data tools (including natural-language/LLM agents), enabling analytics and personalization, and defining data engineering standards and infrastructure for a growing AI product.
Summary Generated by Built In

GC AI is the fastest-growing and most trusted legal AI platform for in-house legal teams. We're building the future of legal work, and we're doing it fast. You'll join at a pivotal moment—when decisions matter, impact is immediate, and the runway to shape your career is wide open. We’re a high-performing team where you'll have real ownership and influence from day one.

 

More than 1,500 companies use GC AI to drive their business forward, including 150+ public companies, 25+ unicorns, and brands such as News Corp, Miro, Bass Pro Shops, Snyk, Skims, Liquid Death, Vercel, Zscaler, and TIME.

 

We've 10x'd revenue in 12 months, raised a $60 million Series B ($555 million valuation), and are growing faster than ever. We are backed by incredible investors, including Scale Venture Partners, Northzone, Sound Ventures, and Guillermo Rauch, CEO of Vercel.

 

If you thrive when the stakes are high and the path isn't paved, you'll love it here. Our six guiding principles are: 1% better every day, customer obsession, ship today, find a way, care deeply, and own it completely. Come shape the future of legal work with us.

About the Role

You'll be GC AI's first dedicated data hire, which means you'll own the entire data stack from day one. Right now, our data lives in multiple systems: product usage, CRM, billing, customer data, user analytics. Your first job is to consolidate all of it into a single, well-modeled warehouse in BigQuery so the company can actually use it.

From there, you'll build the infrastructure that makes data accessible to everyone. Think less "build dashboards for the exec team" and more "build the internal data agent that lets anyone in the company ask a question and get an answer." If you've seen what Vercel did with d0 (their text-to-SQL agent on top of their warehouse), that's the direction. We want someone who reaches for code, GCP primitives, and simple maintainable systems before defaulting to expensive enterprise platforms.

Longer term, you'll build toward a data lake that supports personalization and fine-tuning for the GC AI platform. You'll work closely with Product and Engineering to make sure the data infrastructure serves both internal operations and the product itself.

This is a founding role. You'll set the patterns, choose the tools, and eventually build the team around you.

What You'll Do
  • Take ownership of the data warehouse in BigQuery: modeling, pipeline development, data quality, and performance.

  • Build pipelines that consolidate product usage data, CRM data, billing, customer contract data, and user analytics into a single source of truth.

  • Design and build internal data tools using applied AI, including natural-language query interfaces and automated reporting, so the rest of the company can self-serve without waiting on an analyst.

  • Set up the warehouse so business teams can run their own queries and pull their own numbers without filing a ticket.

  • Build toward a data lake architecture that supports personalization and model fine-tuning for the GC AI product.

  • Keep the stack lean. Use what's available in BigQuery and the broader GCP ecosystem and make smart decisions to reduce complexity and cost without introducing tool sprawl.

  • Define data engineering practices, tooling, and standards as the first hire on what will become a team.

What We Value
  • Builder instinct. Your default response to a data problem is to write code and build infrastructure, not to evaluate vendors and sign contracts.

  • Applied AI fluency. You're comfortable using LLMs and AI tooling to solve data problems: building agents that query warehouses, automating data quality checks, generating reports. You don't need to train models, but you should know how to put them to work.

  • Simplicity bias. You'd rather build a clean, maintainable pipeline with standard tools than an over-engineered stack that requires a team of five to operate.

  • Ownership. You treat the data warehouse like a product: it should be reliable, well-documented, and useful to the people who depend on it.

What We Require
  • 5+ years of experience in data engineering, with hands-on experience building and maintaining data warehouses and pipelines.

  • Strong SQL skills and deep experience with BigQuery or comparable analytical databases.

  • Proficiency in Python for pipeline development, scripting, and tooling.

  • Experience building ETL/ELT pipelines that consolidate data from multiple source systems (SaaS APIs, event streams, databases).

  • Experience working within GCP or a comparable cloud ecosystem.

  • Ability to design data models that are clean, performant, and usable by non-engineers.

Nice to Have
  • Experience building internal data tools or agents using LLMs (text-to-SQL, natural language interfaces, automated reporting). This is a strong differentiator.

  • Experience as the first or early data hire at a startup, where you owned the full stack.

  • Familiarity with legaltech, legal operations, or SaaS product analytics.

  • Experience setting up self-serve analytics layers (semantic layers, BI tool configuration, data documentation).

  • Experience with data infrastructure that supports ML workflows (feature stores, training data pipelines, data lakes).

  • Experience with infrastructure as code, especially Terraform, for managing GCP data infrastructure.

Location Policy

This is a remote role unless you fall within the following parameters. If you live within approximately 50 miles of our San Mateo, CA or Provo, UT office, the position follows a hybrid schedule with in-office days on Tuesdays, Wednesdays, and Thursdays.

 
Equal Opportunity Employment

GC AI is an equal opportunity employer that supports workplace diversity and does not discriminate on the basis of race, color, religion, gender identity/expression, national origin, age, military service eligibility, veteran status, sexual orientation, marital status, physical or mental disability, or any other protected class. GC AI is committed to working with and providing reasonable accommodation to applicants with physical and mental disabilities. #LI-GCAI

 
Fraud Notice to GC AI Applicants

To protect yourself against phishing and recruitment fraud, please note that GC AI only accepts job applications through our official careers page at https://gc.ai/careers and through sponsored jobs on LinkedIn. All legitimate communication from our team regarding job opportunities will come from a GC AI team member with a @gc.ai or @getgc.ai email address.

 

GC AI will never:

  • Refer you to external websites to apply

  • Conduct interviews over email, chat platforms, or messaging apps

  • Ask you to provide payment or purchase equipment

  • Request personal or financial information such as your mailing address, social security number, credit card numbers, or banking information during the application process

 

Examples of fraudulent email addresses:

 

If you are contacted by someone claiming to be from GC AI via an unofficial channel or from a suspicious email address, please do not share any information. Mark the communication as "phishing" or "spam" and do not respond.

Skills Required

  • 5+ years of experience in data engineering, with hands-on experience building and maintaining data warehouses and pipelines.
  • Strong SQL skills.
  • Deep experience with BigQuery or comparable analytical databases.
  • Proficiency in Python for pipeline development, scripting, and tooling.
  • Experience building ETL/ELT pipelines that consolidate data from multiple source systems (SaaS APIs, event streams, databases).
  • Experience working within GCP or a comparable cloud ecosystem.
  • Ability to design data models that are clean, performant, and usable by non-engineers.
  • Applied AI fluency (using LLMs and AI tooling to solve data problems).
  • Experience building internal data tools or agents using LLMs (text-to-SQL, natural language interfaces, automated reporting).
  • Experience as the first or early data hire at a startup.
  • Familiarity with legaltech, legal operations, or SaaS product analytics.
  • Experience setting up self-serve analytics layers (semantic layers, BI tool configuration, data documentation).
  • Experience with data infrastructure that supports ML workflows (feature stores, training data pipelines, data lakes).
  • Experience with infrastructure as code, especially Terraform, for managing GCP data infrastructure.

GC AI Compensation & Benefits Highlights

How does GC AI ensure its pay and bonus plans are competitive?

GC AI ensures its pay and bonus plans are competitive through a structured and transparent compensation approach. The company publishes pay transparency, uses defined pay bands with transparent earning potential, and sets defined commission tiers and policies for applicable roles. Its compensation package also includes company equity and a 401(k), giving employees a clear view of both immediate pay and long-term financial upside.

GC AI reinforces that compensation philosophy with practices that create consistency and clarity across the employee experience. The company promotes from within, supports learning and development, and maintains defined working hours, availability expectations, and goal-setting through an OKR operating model. For job seekers, that signals a workplace where compensation is tied to clearly established expectations, growth opportunities, and a rewards structure designed to stay competitive as the company scales.

GC AI Insights

Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: San Mateo, California
100 Employees
Year Founded: 2023

What We Do

GC AI is the leading legal AI platform built specifically for in-house teams to solve the high-precision workflows they face every day. With powerful features like Easy Prompt™, Exact Quote™ citations, and native Microsoft Word integration, GC AI enables legal professionals to be strategic business partners through faster and more accurate drafting, reviewing, researching, and redlining. Purpose-built for sensitive, high-stakes matters, the platform leverages five large language models while maintaining enterprise-grade security as a SOC 2 Type II certified provider that never uses confidential data for training. Founded by three-time General Counsel Cecilia Ziniti and AI engineer Bardia Pourvakil, GC AI is trusted by over 1,600 legal teams globally, including brands like News Corp, Skims, TIME Inc., Liquid Death, and Vercel. Discover the difference of becoming an AI-powered lawyer. Try it free or book a demo at gc.ai.

Why Work With Us

GC AI offers a rare opportunity to join a high-growth category leader at the intersection of generative AI and deep legal expertise. Our culture is defined by a "1% better every day" philosophy, favoring curious, customer-obsessed builders who thrive in a high-ownership environment where they can see the immediate impact of their work.

Gallery

Gallery
Gallery
Gallery

GC AI Offices

Hybrid Workspace

Employees engage in a combination of remote and on-site work.

Many roles are eligible for remote work in the U.S. and Canada. For folks who live within 50 miles of our San Mateo, CA, or Provo, UT offices, we value in-person collaboration and work in the office on Tuesday, Wednesday, and Thursday.

Typical time on-site: 3 days a week
Company Office Image
HQSan Mateo, CA
Company Office Image
Provo, UT
Learn more

Similar Jobs

GC AI Logo GC AI

Director Of Product Marketing

Artificial Intelligence • Legal Tech
In-Office or Remote
San Mateo, CA, USA
100 Employees
220K-260K Annually

GC AI Logo GC AI

Forward Deployed Engineer

Artificial Intelligence • Legal Tech
Remote
2 Locations
100 Employees
165K-350K Annually

GC AI Logo GC AI

Design Engineer

Artificial Intelligence • Legal Tech
Remote
United States
100 Employees
165K-350K Annually

GC AI Logo GC AI

Solutions Advisor, Legal Ops

Artificial Intelligence • Legal Tech
In-Office or Remote
San Mateo, CA, USA
100 Employees
150K-260K Annually

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account