Bioinformatics Engineer, Pipelines

San Francisco, CA, USA
In-Office
150K-200K Annually
Senior level
Information Technology • Machine Learning • Natural Language Processing • Software
The Role
Design and maintain bioinformatics pipelines to process various biological data types, ensuring reproducibility and integration with analysis systems.
Summary Generated by Built In

ABOUT MITHRL

We imagine a world where new medicines reach patients in months, not years, and where scientific breakthroughs happen at the speed of thought.

Mithrl is building the world’s first commercially available AI Co-Scientist. It is a discovery engine that transforms messy biological data into real insights in minutes. Scientists ask questions in natural language, and Mithrl responds with analysis, novel targets, hypotheses, and patent-ready reports.

Our traction speaks for itself:

  • 12X year-over-year revenue growth

  • Trusted by leading biotechs and big pharma across three continents

  • Driving real breakthroughs from target discovery to patient outcomes

ABOUT THE ROLE

We are looking for a Lead Bioinformatics Pipeline Engineer to build and scale Mithrl’s multimodal scientific processing pipelines. You will own the workflows that transform raw biological data into clean, reproducible outputs that power Mithrl’s AI Co-Scientist. These workflows include microarray, imaging, spatial transcriptomics, genomics, epigenomics, flow cytometry, and more.

This role sits at the center of our technical stack. You will architect Nextflow and nf-core-style pipelines, implement modality-specific validation and QC layers, and collaborate with the Tabular Data Team and Knowledge Curation Team to ensure downstream data harmonization, variable ID mapping, and schema alignment. Your work ensures that scientists can ask questions and receive accurate, data-backed answers instantly.

If you enjoy building robust scientific workflows and want to work on high-impact problems, you will thrive here.

WHAT YOU WILL DO

  • Design and maintain production-grade bioinformatics pipelines for a wide range of data modalities, including microarray, cell painting, WGS and WES, spatial transcriptomics, flow cytometry, ATAC-seq, and methyl-seq

  • Build workflows using Nextflow, nf-core modules, or similar engines with a focus on reproducibility, validation, and scalability

  • Implement quality control, validation, and provenance tracking for all supported modalities

  • Collaborate with the Tabular Data Team to ensure pipeline outputs map cleanly into Mithrl’s internal schemas, including variable ID coercions, metadata normalization, and feature name harmonization

  • Work with the Knowledge Curation Team to align outputs with reference genomes, annotations, and biological ontologies

  • Produce structured output artifacts so users can download processed data and supporting metadata directly through the platform

WHAT YOU BRING

Required Qualifications

  • 6 to 8 years of experience in bioinformatics workflow engineering or computational biology

  • Strong experience with Nextflow, nf-core, WDL, CWL, Snakemake, or similar workflow systems

  • Proficiency in Python or R for data processing, QC, and pipeline logic

  • Hands-on experience building pipelines for multiple biological data types, including genomics, single cell, imaging, flow cytometry, spatial data, or epigenomics

  • Ability to design pipelines that are reproducible and containerized using Docker or Singularity

  • Strong understanding of secondary and tertiary data layers and how they integrate with downstream analysis systems

  • Experience integrating pipeline outputs with data stores, schemas, or ML-ready formats

Nice to Have

  • Experience executing pipelines in cloud environments such as AWS Batch, ECS, Tower, or Nextflow Cloud

  • Experience with imaging workflows such as CellProfiler, DeepCell, or Squidpy

  • Familiarity with genomic reference databases, annotation formats, and biological ontologies

  • Previous work in a techbio startup, biotech R&D group, or scientific software company

WHAT YOU WILL LOVE AT MITHRL

  • You will build the core pipelines that transform raw biological data into insights used by the AI Co-Scientist

  • Team: Join a tight-knit, talent-dense team of engineers, scientists, and builders

  • Culture: We value consistency, clarity, and hard work. We solve hard problems through focused daily execution

  • Speed: We ship fast (2x/week) and improve continuously based on real user feedback

  • Location: Beautiful SF office with a high-energy, in-person culture

  • Benefits: Comprehensive PPO health coverage through Anthem (medical, dental, and vision) + 401(k) with top-tier plans

We encourage you to apply even if you do not believe you meet every single qualification. Not all strong candidates will meet every single qualification as listed. Research shows that people who identify as being from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of their candidacy, so we urge you not to exclude yourself prematurely and to submit an application if you're interested in this work. We think AI systems like the ones we're building have enormous social and ethical implications. We think this makes representation even more important, and we strive to include a range of diverse perspectives on our team.


The Company
HQ: San Francisco, California
12 Employees
Year Founded: 2023

What We Do

Scientific labs waste weeks learning and coding pipelines that do not carry over to the next experiment. Using just natural language, Mithrl builds custom workflows for their NGS data on demand, in minutes -- not weeks. This lets them focus their time on running higher-quality experiments.
