Software Engineer, Data Foundations

7 Locations
In-Office or Remote
140K-265K Annually
Mid level
Artificial Intelligence • Software • Generative AI
The Role
As a Software Engineer on Glean's Data Foundations team, you'll build and scale data ingestion connectors, transforming unstructured content into structured data while ensuring system reliability and security throughout the data processing pipeline.

About Glean:

Founded in 2019, Glean is an innovative AI-powered knowledge management platform designed to help organizations quickly find, organize, and share information across their teams. By integrating seamlessly with tools like Google Drive, Slack, and Microsoft Teams, Glean ensures employees can access the right knowledge at the right time, boosting productivity and collaboration. The company’s cutting-edge AI technology simplifies knowledge discovery, making it faster and more efficient for teams to leverage their collective intelligence.

Glean was born from Founder & CEO Arvind Jain’s deep understanding of the challenges employees face in finding and understanding information at work. Seeing firsthand how fragmented knowledge and sprawling SaaS tools made it difficult to stay productive, he set out to build a better way: an AI-powered enterprise search platform that helps people quickly and intuitively access the information they need. Since then, Glean has evolved into the leading Work AI platform, combining enterprise-grade search, an AI assistant, and powerful application- and agent-building capabilities to fundamentally redefine how employees work.

About the Role

We are looking for a Software Engineer to join Glean’s Data Foundations team — the group that owns the end-to-end data ingestion and management layer powering Glean’s Search, AI Assistant, and Agent products across thousands of enterprise apps and billions of documents.

Your work will directly determine the quality, freshness, and trustworthiness of the knowledge that every Glean user interacts with every day.

You will work on:

Ingestion & Connectivity

  • Build and scale connectors to a wide variety of SaaS and on-prem systems (Google Workspace, Microsoft 365, Slack, Salesforce, Jira, ServiceNow, GitHub, etc.).
  • Handle full syncs, low-latency incremental updates via webhooks/APIs, rate-limiting, and complex authentication flows.
  • Build advanced datasource capabilities such as actions, live-fetch, and query-language support.

Data Processing & Modeling

  • Transform raw, unstructured enterprise content into rich, structured, permission-aware representations optimized for search and LLM reasoning.
  • Design document schemas and enrichment pipelines (entity extraction, access-graph propagation, redactions, etc.).
  • Expand the capabilities of AI products through deep integrations that allow us to automate tasks, perform complex queries grounded in enterprise data, and enhance our indexed corpus with live data.

Reliability & Distributed Systems

  • Own end-to-end correctness, freshness, and performance for petabyte-scale data flows.
  • Solve hard problems in ordering, idempotency, exactly-once processing, backpressure, and retries across distributed queues, workers, and storage.

Security & Permissions

  • Preserve fine-grained ACLs, deletions, and sensitivity constraints so AI answers are always grounded in what users are actually allowed to see.

Cross-Functional Impact

  • Partner closely with Search Serving, Product, Platforms, and Security teams to define how enterprise context is exposed to LLMs and agents.
  • Continuously improve observability, alerting, and automation to onboard larger customers and more data sources with confidence.

About you:

  • 3+ years building production backend or data infrastructure systems (Java, Go, C++, Python, etc.).
  • Hands-on experience with distributed systems, data pipelines, queues, and large-scale storage (SQL/NoSQL).
  • You think in SLOs, error budgets, failure modes, and correctness guarantees — not just features.
  • Comfortable with strict consistency and permission-modeling challenges.
  • Prior work on enterprise connectors, search/indexing, information retrieval, or security-sensitive systems is a strong plus.
  • Passionate about making AI trustworthy by building the rock-solid data foundation underneath it.
  • Power user of LLMs and AI tools in your own workflow.

Location:

  • This role is hybrid (4 days a week in one of our SF Bay Area offices).

Compensation & Benefits:

The standard base salary range for this position is $140,000 - $265,000 annually. Compensation offered will be determined by factors such as location, level, job-related knowledge, skills, and experience. Certain roles may be eligible for variable compensation, equity, and benefits.

We offer a comprehensive benefits package including competitive compensation; medical, vision, and dental coverage; a generous time-off policy; and the opportunity to contribute to a 401(k) plan to support your long-term goals. When you join, you'll receive a home office improvement stipend, as well as annual education and wellness stipends to support your growth and wellbeing. We foster a vibrant company culture through regular events and provide healthy lunches daily to keep you fueled and focused.

We are a diverse group of people, and we want to continue attracting and retaining a diverse range of people in our organization. We're committed to being an inclusive and diverse company. We do not discriminate based on gender, ethnicity, sexual orientation, religion, civil or family status, age, disability, or race.


Top Skills

C++
Go
Java
NoSQL
Python
SQL

The Company
HQ: Palo Alto, CA
224 Employees
Year Founded: 2019

What We Do

Glean searches across all your company’s apps to help you find exactly what you need and discover the things you should know.

🔍 AI-powered workplace search.
💡 Personalized results and knowledge discovery.
⚡ Easy to use and ready to go, right out of the box.
