Scientist - Computational Biophysics

Reposted Yesterday
Be an Early Applicant
Emeryville, CA, USA
In-Office
100K-180K Annually
Entry level
Artificial Intelligence • Machine Learning • Biotech • Generative AI
The Role
The Scientist will develop protein representations and bioinformatic pipelines, integrating ML models and collaborating with experimental partners to enhance biological insights.
Summary Generated by Built In
About Astera

Astera is a private foundation on a mission to steer science and technology toward an abundant future. We believe the coming years will bring an era of unprecedented scientific and technological advancement as exponential progress in AI converges with central advances in other fields to dramatically accelerate innovation. This inflection point provides an unparalleled opportunity to fundamentally rethink the institutions, systems, and tools that drive scientific progress.

Unlike traditional non-profit research organizations, projects supported by Astera operate like high-velocity startups, allowing us to focus on ambitious goals, match structure to problem, and attract strong technical talent and leadership. You can read more about our mission, vision, and programming here.

Position Summary

We are seeking a scientist to join the diffUSE Project, which focuses on developing next-generation protein representations that bridge dynamic structural biology to downstream functional applications. The diffUSE Project is an ambitious initiative designed to advance our understanding of protein dynamics by building the experimental methods, computational models, and global infrastructure needed to capture molecular motion at scale. Our goal is to establish dynamic structural biology as a foundational pillar of modern science, as transformative and indispensable as the Protein Data Bank has been for static structures.

In this role, you will build ensemble-aware protein representations to enable the integration with downstream inputs such as protein language model (PLM), LLMs, or other functional predictions. You will design and maintain large-scale bioinformatic pipelines, manage complex datasets, develop metrics for dynamics and/or fine-tune or architect ML models to capture sequence-structure-function relationships. A key part of the role involves synthesizing diverse data sources to improve biological relevance. You will work closely with experimental collaborators to ground computational insights in real biological systems.

Key Responsibilities
  • Build ensemble-aware protein representations that integrate PLM and LLM embeddings with experimentally derived structural heterogeneity for functional prediction

  • Design, develop, and maintain large-scale bioinformatic pipelines capable of processing and managing complex, high-dimensional datasets

  • Fine-tune or architect ML models to capture sequence-structure-function relationships, with a focus on dynamic and conformational features

  • Synthesize diverse data sources spanning evolutionary history, binding affinity, allostery, and functional annotations to improve model performance and biological relevance

  • Collaborate closely with experimental partners to ground computational representations in real biological measurements and ensure models are continuously refined against experimental ground truth

  • Contribute to the broader diffUSE infrastructure, helping establish community-wide standards and tools for dynamic structural biology

Required Skills and Qualifications
  • PhD in bioinformatics, computational biology, machine learning, or a related field.

  • Strong understanding of protein structure and function.

  • Demonstrated experience building large bioinformatic pipelines and managing high-dimensional datasets.

  • Proficiency in fine-tuning or modifying ML models (e.g., transformer-based architectures).

  • Familiarity with protein language models (ESM, AlphaFold, etc.) is a plus.

  • Collaborative, team-oriented mindset with the ability to drive research questions from conception to execution.

Compensation

The posted salary range is based on location in the Bay Area. The successful candidate will receive a competitive compensation package, commensurate with their experience and location.

  • Benefits summary

Skills Required

  • PhD in bioinformatics, computational biology, machine learning, or a related field
  • Strong understanding of protein structure and function
  • Experience building large bioinformatic pipelines and managing datasets
  • Proficiency in fine-tuning or modifying ML models
  • Familiarity with protein language models like ESM or AlphaFold
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
57 Employees
Year Founded: 2020

What We Do

Astera is a private foundation with a $2.5B endowment focused on steering science and technology toward an abundant future for all. They operate like a high-velocity startup, integrating neuroscience, AI, and bioengineering for AGI research.

Similar Jobs

Merge Labs Logo Merge Labs

Scientist

Artificial Intelligence • Biotech
In-Office or Remote
7 Locations
32 Employees
200K-270K Annually

Genentech Logo Genentech

Scientist

Healthtech • Biotech
In-Office
South San Francisco, CA, USA
20069 Employees
121K-224K Annually

Zscaler Logo Zscaler

Senior Value Advisor

Cloud • Information Technology • Security • Software • Cybersecurity
Easy Apply
Remote or Hybrid
USA
8697 Employees
165K-235K Annually
In-Office
City of Industry, CA, USA
2450 Employees
24-25 Hourly

Similar Companies Hiring

Idler Thumbnail
Artificial Intelligence
San Francisco, California
6 Employees
Bellagent Thumbnail
Artificial Intelligence • Machine Learning • Business Intelligence • Generative AI
Chicago, IL
20 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account