Informatics Data Scientist Lead

Posted 9 Days Ago
Be an Early Applicant
Washington, DC
3-5 Years Experience
Healthtech
The Role
The Informatics Data Scientist Lead is responsible for developing and maintaining Python code for ETL processes and bioinformatics pipelines, managing automated testing, and collaborating with various teams to structure and prepare data for analysis and modeling in a high-performance computing environment.
Summary Generated by Built In

Informatics Data Scientist Lead
Prometheus Federal Services (PFS), a trusted partner to federal health and social services agencies, has an opening for an Informatics Data Scientist Lead. This position is responsible for developing and maintaining our Python codebase, focusing on Extract-Transform-Load (ETL) processes and bioinformatics pipelines. The role requires a blend of technical expertise in data science and bioinformatics, with a strong emphasis on Python programming, data processing, and high-performance computing.
Essential Duties and Responsibilities
The successful candidate may be responsible for, among other things:

  • Develop, maintain, and document Python code for ETL processes and bioinformatics pipelines
  • Ensure that code is well-documented, version-controlled, and adheres to industry standards such as PEP8
  • Implement automated testing frameworks (e.g., pytest) to ensure the reliability and performance of code
  • Create logging mechanisms to monitor processes and troubleshoot issues
  • Design and implement ETL processes to extract data from various sources, transform it as needed, and load it into relational databases
  • Enhance and maintain existing ETL processes, ensuring they are well-documented and tested
  • Align and harmonize data from multiple sources for integration into master datasets
  • Develop bioinformatics pipelines for tasks such as variant calling, gene expression analysis, and data annotation
  • Work within a Linux-based high-performance computing environment using command-line tools
  • Utilize tools like Python’s Snakemake to create and manage complex workflows
  • Perform testing and validation of bioinformatics pipelines, ensuring accuracy and efficiency
  • Collaborate with cross-functional teams, including data engineers, researchers, and project managers
  • Participate in regular meetings to discuss project progress, challenges, and goals
  • Provide support to research and data teams, helping to structure and prepare data for analysis and modeling

Minimum Qualifications

  • Bachelor’s Data Science, Computer Science, Bioinformatics, or a related field
  • Minimum of eight (8) years of experience
  • Minimum of five (5) years of federal consulting
  • Strong experience in Python programming, particularly in the context of ETL processes and bioinformatics
  • Familiarity with version control systems (e.g., Git) and workflow management tools like Snakemake
  • Experience working in Linux-based high-performance computing environments
  • Knowledge of relational databases and data integration techniques
  • Experience with automated testing and logging best practices
  • Strong analytical and problem-solving skills
  • Excellent communication and documentation skills
  • Ability to work both independently and as part of a team
  • Authorized to work in the U.S. indefinitely without sponsorship
  • Ability to obtain a public trust 

Preferred Qualifications

  • Experience in healthcare, life sciences, or related industries
  • Master’s degree in Data Science, Computer Science, Bioinformatics, or a related field
  • VHA Experience
  • Knowledge of bioinformatics tools and pipelines
  • Familiarity with AI/ML concepts and their application to data science

Top Skills

Python
The Company
HQ: Washington D.C. Metro Area, DC
95 Employees
On-site Workplace
Year Founded: 2016

What We Do


PFS is a public sector administrative and professional services firm serving federal health and social services agency clients.

With leadership that brings a combined 20 years of military health care experience, our vision is to positively impact populations in need through transformative work in health care improvement, planning and technical assistance, business transformation planning and support, strategic communications, and learning and performance.

Jobs at Similar Companies

Cencora Logo Cencora

Vendavo Developer

Healthtech • Logistics • Pharmaceutical
Pune, Maharashtra, IND
46000 Employees

Sage Logo Sage

Senior Fullstack Software Engineer, Care Platform

Hardware • Healthtech • Software • Analytics
New York, NY, USA
25 Employees

Zealthy Logo Zealthy

Medical Director (NY, NY)

Healthtech • Social Impact • Pharmaceutical • Telehealth
New York, NY, USA
13 Employees

Similar Companies Hiring

Sage Thumbnail
Software • Healthtech • Hardware • Analytics
New York, NY
25 Employees
Zealthy Thumbnail
Telehealth • Social Impact • Pharmaceutical • Healthtech
New York City, NY
13 Employees
Cencora Thumbnail
Pharmaceutical • Logistics • Healthtech
Conshohocken, PA
46000 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account