Snr Data Scientist, Computational Genomics/DNA Modelling

Posted 2 Days Ago
Be an Early Applicant
London, Greater London, England, GBR
In-Office
Senior level
Healthtech • Machine Learning • Biotech
The Role
Develop and evaluate machine learning models linking DNA sequence and genetic variation to molecular phenotypes and disease. Build sequence-to-function models (variant effect and regulatory activity prediction), integrate sequence representations with transcriptomic and epigenomic data, collaborate with experimental teams to validate hypotheses, communicate results, and contribute to publications and documentation.
Summary Generated by Built In
About Relation

Relation is a sector defining TechBio company developing transformational medicines, with technology at our core. Our ambition is to understand human biology in unprecedented ways, discovering therapies to treat some of life’s most devastating diseases. We leverage single-cell multi-omics from patient tissue, functional assays, and machine learning to drive disease understanding, from cause to cure.

We are scaling rapidly and building a team of exceptional individuals to push the boundaries of drug discovery. You will work in highly interdisciplinary teams where biology, computation, and engineering come together to solve complex problems that have not been solved before. Our state-of-the-art wet and dry labs in the heart of London are designed to accelerate this integration and translate insight into impact.

We are committed to building diverse and inclusive teams. Relation is an equal opportunities employer and does not discriminate on the basis of gender, sexual orientation, marital or civil partnership status, gender reassignment, race, colour, nationality, ethnic or national origin, religion or belief, disability, or age.

By joining Relation, you will help define how medicines are discovered and deliver meaningful impact for patients.

The opportunity

This is a unique opportunity for a senior data scientist to work on DNA sequence modelling, statistical machine learning, and functional genomics in support of target discovery and mechanism elucidation. You will develop and apply computational models that connect genetic variation and regulatory DNA sequence to downstream molecular phenotypes, cellular programmes, and disease biology. Embedded within the Rosalind team you will build and evaluate models that link sequence to function - including variant effect prediction,regulatory activity modelling and representation learning for genomic data.

Day to Day you will:
  • Develop and implement machine learning models for DNA sequence, regulatory elements, and genetic variation in disease-relevant contexts.

  • Build, evaluate and benchmark models for sequence-to-function tasks such as variant effect prediction, regulatory activity prediction, and the interpretation of non-coding disease signals.

  • Integrate sequence-derived representations with transcriptomic, epigenomic, and perturbational datasets to uncover disease mechanisms and support target prioritisation.

  • Partner closely with experimental and machine learning researchers to validate hypotheses, interpret results, and guide downstream studies.

  • Communicate findings clearly to internal stakeholders, including presenting methods, results, and recommendations.

  • Contribute to publications, scientific communications, and project documentation, supporting scientific excellence and external visibility.

Professionally, you will have
  • PhD in computational biology, machine learning, statistics, genomics, bioinformatics, or a related quantitative discipline.

  • Post-PhD experience, ideally including time in an industry, biotech, or pharmaceutical environment.

  • Experience with DNA language models, genomic foundation models, or transformer-based sequence models.

  • Understanding of statistical machine learning and probabilistic modelling, with experience selecting and evaluating appropriate modelling approaches for complex biological datasets.

  • Knowledge of statistical genetics methods (e.g fine-mapping, colocalisation,, or variant to gene approaches).

  • High proficiency in Python (preferred) and R, with experience working in high-performance computing environments.

  • Ability to operate independently, driving projects from concept through delivery.

  • Bonus experience:

    • Familiarity with single-cell transcriptomics or patient-derived datasets.

    • Experience working in interdisciplinary teams within biotech or pharma settings.

    • Experience integrating machine learning models with genomics, single-cell, or perturbation datasets.

    • Familiarity with 3D genome / chromatin interaction data where relevant (Hi-C, Capture-C, etc.).

Personally, you:
  • Are comfortable working in a matrixed environment, balancing multiple stakeholders and contributing effectively across teams.

  • Take ownership of your work, proactively seek opportunities to contribute, and enable others to do their best work.

  • Communicate openly and directly, give and receive feedback constructively, and handle challenging conversations with respect.

  • Actively seek out diverse perspectives, build strong working relationships, and contribute to shared goals across teams.

  • Embrace challenges with openness and resilience, set high standards for yourself, and strive to deliver meaningful outcomes.

Working Style & Culture at Relation

At Relation, we operate in a matrixed, interdisciplinary environment, where impact is driven through collaboration across scientific, technical, and operational domains. We collaborate, and you will partner with colleagues across multiple teams and projects, contributing your expertise while aligning to shared company priorities. We work together and win together!

The patient is waiting!

RECRUITMENT AGENCIES: Please note that Relation does not accept unsolicited resumes from agencies. Resumes should not be forwarded to our job aliases or employees. Relation will not be liable for any fees associated with unsolicited CVs.

Relation is a committed equal opportunities employer.

Skills Required

  • PhD in computational biology, machine learning, statistics, genomics, bioinformatics, or related quantitative discipline.
  • Post-PhD research experience
  • Industry, biotech, or pharmaceutical experience
  • Experience with DNA language models, genomic foundation models, or transformer-based sequence models
  • Understanding of statistical machine learning and probabilistic modelling
  • Knowledge of statistical genetics methods (e.g., fine-mapping, colocalisation, variant-to-gene)
  • High proficiency in Python
  • Proficiency in R
  • Experience working in high-performance computing environments
  • Ability to operate independently and drive projects from concept through delivery
  • Familiarity with single-cell transcriptomics or patient-derived datasets
  • Experience integrating machine learning models with genomics, single-cell, or perturbation datasets
  • Familiarity with 3D genome / chromatin interaction data (Hi-C, Capture-C)
  • Experience working in interdisciplinary teams within biotech or pharma settings
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: London
57 Employees
Year Founded: 2019

What We Do

We are an end-to-end biotech developing transformational medicines, with technology at our core. Our ambition is to understand human biology in an unprecedented way; discovering therapies that will treat some of life's most devastating diseases. We leverage single-cell and multi-omics measurements directly from patient tissue, functional assays and machine learning to drive disease understanding — from cause to cure. Medicines born from high-resolution biology, machine learning and clinical insights. Our Lab-in-the-Loop seamlessly integrates single-cell analysis, genomics and machine learning. We have developed powerful technologies which we apply to select druggable targets and develop transformational medicines.

Similar Jobs

Boeing Logo Boeing

Design Engineer

Aerospace • Information Technology • Software • Cybersecurity • Design • Defense • Manufacturing
Hybrid
Frimley, Surrey Heath, Surrey, England, GBR
170000 Employees

NBCUniversal Logo NBCUniversal

Space Planning Coordinator

AdTech • Cloud • Digital Media • Information Technology • News + Entertainment • App development
Hybrid
London, Greater London, England, GBR
68000 Employees

NBCUniversal Logo NBCUniversal

Site Reliability Engineer

AdTech • Cloud • Digital Media • Information Technology • News + Entertainment • App development
Remote or Hybrid
London, Greater London, England, GBR
68000 Employees

NBCUniversal Logo NBCUniversal

EHS Advisor (EMEA & APAC)

AdTech • Cloud • Digital Media • Information Technology • News + Entertainment • App development
Hybrid
London, Greater London, England, GBR
68000 Employees

Similar Companies Hiring

Credal.ai Thumbnail
Software • Security • Productivity • Machine Learning • Artificial Intelligence
Brooklyn, NY
Granted Thumbnail
Mobile • Insurance • Healthtech • Financial Services • Artificial Intelligence
New York, New York
23 Employees
OneImaging Thumbnail
Healthtech
Miami, FL
62 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account