Black Canyon Consulting is seeking a Staff Scientist to work with a Principal Investigatory in the National Institutes of Health at the National Library of Medicine to support the development of high-fidelity artificial intelligence models designed to decode the functional landscape of the human and mouse genomes. This effort will leverage Telomere-to-Telomere (T2T) reference assemblies to advance understanding of gene regulation, particularly within complex and repetitive genomic regions.
This position requires a unique combination of computational genomics expertise, machine learning proficiency, and scalable software engineering capabilities to support large-scale data integration and model development.
Responsibilities- Lead the design, development, and implementation of AI-driven models for gene regulation analysis
- Architect and scale a TREDNet-based framework for cloud-native execution
- Optimize models for distributed, multi-GPU training environments
- Integrate and analyze large-scale genomic and epigenomic datasets, including:
- ENCODE / modENCODE
- NIH Roadmap Epigenomics
- UCSC Genome Database
- Apply AI methodologies to functionally annotate repetitive genomic regions, including centromeres and telomeres
- Develop and maintain scalable, containerized pipelines using Docker and/or Singularity
- Implement MLOps best practices, including experiment tracking, model versioning, and reproducibility
- Deploy and manage workflows in cloud environments (AWS, GCP, or Azure)
- Collaborate with interdisciplinary teams across computational and life sciences domains
- PhD in Computer Science, Computational Biology, Bioinformatics, or a related field
- Minimum of 5 years of experience developing and deploying machine learning or deep learning models
- Strong experience with cloud platforms (AWS, GCP, or Azure)
- Proficiency in deep learning frameworks (PyTorch preferred; TensorFlow or HuggingFace acceptable)
- Deep understanding of neural network architectures (CNNs, transformers, sequence models)
- Strong programming skills in Python and experience working in Linux-based environments
- Experience with MLOps practices, including experiment tracking and model versioning
- Experience building and deploying containerized workflows (Docker and/or Singularity)
- Experience with distributed training across GPUs or multi-node environments
- Strong knowledge of genomics, gene regulation, and epigenomics
- Experience working with large-scale biological datasets (e.g., ENCODE, Roadmap Epigenomics, UCSC Genome Browser)
- Familiarity with genomics data formats (FASTA, VCF, BAM/CRAM, BED)
- Experience with Telomere-to-Telomere (T2T) genome assemblies
- Experience analyzing repetitive genomic regions (e.g., centromeres, telomeres)
- Background in regulatory, functional, or comparative genomics (e.g., human vs. mouse)
- Experience with hyperparameter tuning and large-scale model optimization
- Familiarity with genomic foundation models or sequence-based deep learning approaches
- Experience running ML workloads on GPU-enabled cloud or HPC environments
- Familiarity with workflow orchestration tools (e.g., Nextflow, Snakemake, Airflow)
- Experience transitioning research models into production-grade pipelines
- Familiarity with CI/CD and infrastructure-as-code tools (e.g., Terraform)
- Experience working in interdisciplinary teams
- Develop a containerized (Docker/Singularity) TREDNet pipeline capable of scaling across multiple GPU nodes in a cloud environment
- Produce a comprehensive functional map of the T2T reference genome, identifying regulatory motifs in previously unresolved regions
- Develop comparative models between human and mouse cell lines to identify conserved regulatory mechanisms
We attract the best people in the business with our competitive benefits package, including medical, dental, and vision coverage; a 401(k) plan with employer contribution; paid holidays, vacation, and tuition reimbursement.
We offer a competitive salary commensurate with experience and location. The targeted range for this position is $110,000 - $140,000.
If you enjoy being part of a high-performing, professional, technology-focused organization, please apply today!
Top Skills
What We Do
Black Canyon Consulting, LLC (BCC) is a privately held, Woman-Owned Small Business, whose team of professionals bring over 125 years of experience providing innovative, tactical, and strategic Program and IT Management Support Services. At our core, BCC values Integrity, Impact and Innovation over all. We are an agile, adaptive and completely customer focused team who has cultivated and implemented a corporate philosophy of innovation and high value execution in our approach to support Federal R&D programs. Utilization of our proprietary technologies, workflows, and IT system tools has resulted in a vast network of resources, through which BCC leverages a distributed network and consulting workforce. These comprehensive, agile, and turn-key platforms along with our extensive development experience, subject matter expertise, and technical and systems-level knowledge ensure the delivery of enterprise and program services and solutions. Company Information: DUNS: 08-081-9702 CAGE: 7YLH4 SAM ID: WB3LFMA4KM45 GSA MAS: Contract #47QTCA18D00JA Primary NAICS: 54151 - Information Technology Professional Services 541511 - Custom Computer Programming Services 541512 - Computer Systems Design Services 541519 - Other Computer Related Services 1 541519 - Information Technology Value Added Resellers 2 541611 - Administrative Management and General Management Consulting Services 541618 - Other Management Consulting Services 541690 - Other Scientific and Technical Consulting Services 541990 - All Other Professional, Scientific, and Technical Services Certifications/Compliance: Woman-Owned Small Business (WOSB) ISO 9001:2015 ISO 20000-1:2018 ISO/IEC 27001:2013 FISMA Compliant SOC Readiness & Audit Independent Verification and Validation (IV & V) Payment Card Industry’s Data Security Standards (PCI-DSS) FIPS 199 FIPS 200 NIST Special Publications 800 Series DoD 8500 Series









.png)