Staff Data/AI Engineer

Posted 2 Days Ago
3 Locations
Remote or Hybrid
Expert/Leader
Artificial Intelligence • Software • Biotech • Pharmaceutical
Sleuth offers done-for-you competitive intelligence for biopharma leaders
The Role
As a Staff Data/AI Engineer, you will design and develop data pipelines and machine learning infrastructure for biopharma, optimize data stores, and apply AI techniques to enhance knowledge retrieval.
Summary Generated by Built In

About us

Sleuth is building a modern, agentic, and intelligent decision-making platform for the biopharma and life sciences industry. We’re using AI to automate workflows and deliver crucial insights and bespoke reports that answer our user’s critical questions about their investments.

You should join us, because:

  • Traction: we’ve generated outsized demand for a startup of our size, and have already signed deals with some of the world’s leading biopharma companies.
  • Talent density: we’re growing thoughtfully and only work with incredibly smart, driven people.
  • Velocity: to meet the demand we’ve generated, we ship fast. You’ll learn a lot and constantly take on new challenges.
  • Frontier technology & product: we’re developing cutting-edge AI systems. You’ll truly be building the future of how biopharma and life sciences companies generate insights to power their business.
About the role

We are looking for an experienced Data, AI/ML Engineer and Architect to lead the design, development, and operation of our data platform and machine learning infrastructure. In this role, you’ll be working with data scientists and engineers to design, develop, and operate robust data pipelines, manage and optimize data stores (relational, vector, and graph databases), and develop the infrastructure and tooling that supports model training, fine-tuning, and deployment. This is a unique opportunity to go both broad and deep across data engineering and ML infrastructure to enable a modern GenAI system that serves the biopharma and life sciences industry.

What you'll do
  • Work with other members of the team to design, develop, and operate data and AI solutions as part of our products; specifically, how to scale the most comprehensive and accurate biopharma intelligence knowledge base.
  • Fine-tune and/or evaluate models for accuracy, safety, and relevance.
  • Apply techniques like few-shot learning, prompt chaining, and retrieval-augmented generation (RAG) to enhance knowledge retrieval.
  • Leverage data integration tools, or develop custom ones if needed, to connect to public and private data repositories.
  • Design, develop, and operate a scalable data platform that ingests, processes, and serves a large amount of structured and unstructured public and private data.
  • Develop and deploy a user feedback collection and data annotation mechanism.
  • Develop and deploy CI/CD for model and data pipelines, continuous testing, observability, and monitoring systems to guarantee the integrity and resilience of our data platform.
  • Establish data governance and related controls to ensure the confidentiality, integrity, and availability of our data.
  • Set data/ML engineering best practices.
What we're looking for
  • Experience: 10+ years of data and AI/ML engineering experience in the industry.
  • Infrastructure: experience with Cloud infrastructure and tools (AWS and/or GCP).
  • Stack: expert-level proficiency in Python and Spark.
  • Data stores: relational databases (specifically PostgreSQL), graph databases (e.g., Neo4J, AWS Neptune), and vector databases (e.g., Pinecone, Chroma).
  • Data definition: proficient in developing and applying data ontologies using industry standards and best practices (e.g., OWL, RDF).
  • AI/ML: understanding of modern AI applications and experience in fine-tuning and evaluating AI models and model-based solutions with techniques like RAG, prompt chaining, or agentic workflows using frameworks and protocols such as LangChain, LangGraph, LangSmith, AutoGen, MCP, etc.
  • DataOps: experience in data platform architecture, data pipelines, and commercial and open source ETL/ELT tools.
  • MLOps: experience in model deployment and serving models, CI/CD pipelines, containerization and orchestration (e.g., Airflow, Nextflow), AI/ML platforms (e.g., Vertex AI).
  • Compliance: familiar with SOC2 or regulated software development environments.
  • Education: BS, MS, or PhD in Computer Science, Engineering, Math, or related scientific field. Additional hands-on certificates are great to have.
  • Domain Knowledge: familiarity with biopharma, biotech, or life sciences environments is a plus.
What we offer
  • Competitive compensation, healthcare benefits with generous employer contribution, and flexible hybrid/remote work setup.
  • Hands-on experience at the frontier of AI, building agentic systems that don’t just support but transform how insights are generated and applied.
  • An opportunity to directly partner with leading biopharma companies and see your work shape how this industry makes billion-dollar decisions using our software.

Top Skills

Autogen
AWS
Aws Neptune
Chroma
GCP
Langchain
Langgraph
Langsmith
Mcp
Neo4J
Pinecone
Postgres
Python
Spark
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
11 Employees
Year Founded: 2023

What We Do

Sleuth is a startup that delivers done-for-you competitive intelligence insights and reports to biopharma companies and investors. We use AI agents and a proprietary database of over 100M data points and documents to power bespoke analyses for executives and a range of business-focused teams across the industry. Among our current customers are some of the largest pharma companies in the world, cutting-edge biotech companies and leading investors.

Why Work With Us

We move fast, which means we're always shipping new, game-changing features to our customers. We also prioritize effectiveness over everything else. That means freedom to work when and where you're most productive, while using in-person time for strategic planning and team building.

Similar Jobs

Rula Logo Rula

Data Engineer

Healthtech • Other • Social Impact • Software • Telehealth
Remote
United States
595 Employees
Remote
United States
3661 Employees
50K-130K Annually
Remote
USA
458 Employees
194K-253K Annually
In-Office or Remote
8 Locations
60 Employees

Similar Companies Hiring

Standard Template Labs Thumbnail
Software • Information Technology • Artificial Intelligence
New York, NY
10 Employees
PRIMA Thumbnail
Travel • Software • Marketing Tech • Hospitality • eCommerce
US
15 Employees
Scotch Thumbnail
Software • Retail • Payments • Fintech • eCommerce • Artificial Intelligence • Analytics
US
25 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account