Distributed Systems Engineer

Reposted 6 Days Ago
Be an Early Applicant
San Francisco, CA, USA
In-Office
Senior level
Generative AI
The Engine for Agentic Work: Build Reliable Agentic Applications and Document Processing Workflows with Ease.
The Role
Build core infrastructure for a durable application runtime: implement schedulers, runtimes, and data plane components in Rust; optimize cluster scheduling, resource utilization, and performance; extend SDKs; collaborate across product and engineering from research to production.
Summary Generated by Built In

Our core mission at Tensorlake is to unlock your data wherever it is. We believe that people should have access to the best tools to parse, extract, and manipulate data, run data applications, so they can spend more time putting knowledge into action.

We’re looking for engineers who want to build the operating system for AI Data Applications and Workflows.

About the role

We're looking for experienced distributed systems engineers to build the core infrastructure for our durable application runtime. This is a systems programming role—you'll be writing the schedulers, runtimes, and data plane components that other engineers build applications on top of. Some of the things you'll work on in this role

  • Build and evolve our durable application runtime to support advanced data processing and machine learning workflows
  • Design and implement core components of our cluster scheduler to improve resource utilization, reduce costs, and maximize performance
  • Write systems-level code in Rust for our data plane and execution engine
  • Design and build new capabilities for our SDKs
  • Work closely with the rest of the engineering team to take something from an idea to a polished product
About you
  • You have 5+ years of experience building distributed systems infrastructure—not configuring or operating it, but designing and implementing it from scratch
  • You've written production systems in Rust or other systems programming languages (C, C++, Go at the systems level)
  • You understand how cluster schedulers, databases, and runtimes work at the implementation level because you've built or contributed to them
  • You can autonomously lead, design, and build fault-tolerant systems
  • You enjoy diving deep into performance challenges at the systems level—memory allocation, concurrency primitives, network protocols
  • You want to be part of the entire product development process, from customer research to implementation

This role is not a fit if...

  • Your experience is primarily in DevOps, SRE, or platform operations (Terraform, Kubernetes administration, CI/CD pipelines)
  • You're looking for a role focused on automation, tooling, or infrastructure-as-code rather than building core systems
  • You haven't written substantial code in a systems programming language
Things you should know
  • We’re a startup, and we expect people to be able to wear multiple hats at any given time.
  • We’re distributed across the US and Europe, and everyone is self-sufficient to get work done even when nobody else is around.
  • We do not expect people to work all the time, but we expect everyone to follow up on their commitments.
  • We’re a small team with high ownership and we’re passionate about what we do.
  • Our tech stack is somehow diverse. You’d be mostly working with Rust, Python and FoundationDB on your day to day. But you’ll also need to understand TypeScript, Go, and Terraform enough to touch parts of our backend infrastructure.


Skills Required

  • 5+ years building distributed systems infrastructure (designing and implementing, not just operating)
  • Production experience writing systems-level code in Rust or C, C++, Go
  • Deep understanding of cluster schedulers, databases, and runtimes at implementation level
  • Ability to autonomously lead, design, and build fault-tolerant systems
  • Strong systems performance skills (memory allocation, concurrency, network protocols)
  • Day-to-day experience with Rust, Python, and FoundationDB
  • Familiarity with TypeScript, Go, and Terraform sufficient to touch backend infrastructure
  • Willingness to participate in full product development lifecycle from research to implementation
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
13 Employees
Year Founded: 2023

What We Do

Tensorlake is a platform for building Agentic automation and applications in Enterprises. We provide three foundational primitives to build reliable Agents: serverless compute with durable execution, code sandboxes and accurate document ingestion for applications. Agentic Runtime: Build agents with any framework, and deploy them on our serverless platform to expose them as an HTTP API. The runtime includes durable execution to replay requests and resume execution from where they crashed. The runtime offers code sandboxes to securely run LLM generated code in agents. Document AI: VLM-powered extraction that understands document semantics. Handles multi-page tables, handwriting, nested layouts, and strikethrough text. Returns layout-aware Markdown or validated JSON. Enterprise-Ready: HIPAA | SOC 2 Type II | Used by financial services, healthcare, insurance, logistics and legal tech where accuracy and reliability are non-negotiable. No Messy Infrastructure Tax: No Airflow. No Spark. No queue orchestration. No container management. Just Python that runs durably at global scale.

Similar Jobs

Capital One Logo Capital One

Lead Software Engineer

Fintech • Machine Learning • Payments • Software • Financial Services
Hybrid
5 Locations
55000 Employees
230K-286K Annually

Rubrik Logo Rubrik

Software Engineer

Artificial Intelligence • Big Data • Cloud • Information Technology • Software • Cybersecurity • Data Privacy
In-Office
Palo Alto, CA, USA
3000 Employees
158K-237K Annually

Rubrik Logo Rubrik

Senior Software Engineer

Artificial Intelligence • Big Data • Cloud • Information Technology • Software • Cybersecurity • Data Privacy
In-Office
Palo Alto, CA, USA
3000 Employees
189K-283K Annually

Capital One Logo Capital One

Lead Software Engineer

Fintech • Machine Learning • Payments • Software • Financial Services
Hybrid
5 Locations
55000 Employees
209K-286K Annually

Similar Companies Hiring

Northslope Thumbnail
Artificial Intelligence • Information Technology • Software • Analytics • Consulting • Generative AI
London, GB
100 Employees
ClickMint Thumbnail
AdTech • eCommerce • Marketing Tech • Generative AI
Malibu, CA
9 Employees
Bellagent Thumbnail
Artificial Intelligence • Machine Learning • Business Intelligence • Generative AI
Chicago, IL
20 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account