Data Scientist

Posted Yesterday
Long Island City, New York, NY, USA
In-Office
Mid level
Healthtech
The Role
Maintain and modernize data science infrastructure and integrations for the Enterprise Data Platform. Support cloud migrations to Azure, build automated ETL/data lake pipelines, design scalable hybrid architectures, and develop, test, deploy, and document statistical and machine learning models. Serve as liaison across IT, developers, and leadership to ensure secure access, platform standards, and adoption of best practices.
Summary Generated by Built In

Company Overview:

With an annual budget of $2.3 billion and more than 7,000 employees throughout the five boroughs, the New York City Department of Health and Mental Hygiene (NYC DOHMH) is one of the largest public health agencies in the world, serving 8 million New Yorkers from diverse ethnic and cultural backgrounds. We're tackling a broad range of public health issues with innovative policies and programs and getting exceptional results, but our work is never finished. The breadth of our innovative programs provides the widest range of choices for every member of our team. 

With grant funds from the Centers for Disease Control and Prevention (CDC), DOHMH is undertaking a new initiative that will meet critical infrastructure needs and make possible strategic investments that will have lasting effects on public health. Investments and improvements through this initiative will help modernize DOHMH’s foundational capabilities and data infrastructure, enabling it to partner in complex health and health care environments and, in turn, support better public health outcomes, including for COVID-19. This initiative supports larger efforts to rebalance investments in public health and more equitably serve communities and populations.

The Center for Population Health Data Science (CPHDS), launched in October 2023, aims to catalyze critical data modernization work and enable the agency to make progress toward linking public health, healthcare, and social service data for timely and effective public action. We are working toward making these data more accessible, timely, equitable, meaningfully usable, and protected, and toward ensuring they are actively used to protect and promote the health and wellbeing of New Yorkers. We aim to strengthen agency-wide data capabilities by empowering our workforce, enhancing intra- and inter-agency data sharing, and using modern technology to yield trusted and integrated data and insights. A real-time, comprehensive view of city needs is essential to enhance public health actions and improve health outcomes for the most vulnerable New Yorkers.

Job Description:

DOHMH has an opening for a Data Scientist. Reporting to the Director, Applied Data Science and Solutions, the Data Scientist maintains the software and integrations that power the Enterprise Data Platform (EDP), including the Master Patient Index, and serves as a key liaison across agency IT, EDP developers, and leadership, ensuring secure access management, best-practice adoption, and adherence to platform standards.

Duties

  • Provide data engineering and infrastructure configuration support for complex Python applications
  • Aid migration of complex Python applications from on-premises environments to Azure, including transformation of applications to cloud-native architecture
  • Build and oversee automated extract, transform, and load (ETL) processes to support analytics databases
  • Build resources to ingest, extract, or analyze data housed in a data lake environment
  • Design, build, and document scalable hybrid technology architecture using both on-premises and Azure cloud resources
  • Identify, explore, and help build emerging free, open-source technologies
  • Design, implement, test, deploy, update, and document statistical, machine learning, and deep learning models
  • Propose and implement improvements to the DOHMH data science infrastructure and processes

Qualifications:

  • 3+ years of hands-on experience with Python version 3.x 
  • 3+ years of hands-on experience with SQL databases
  • 3+ years of hands-on experience with mathematical, statistical, machine learning, or artificial intelligence models in Python
  • 2+ years of hands-on experience performing data science tasks using cloud-based technologies
  • 2+ years of experience building Python applications that leverage cloud-based technologies such as Docker, Azure Container Apps, or Azure App Service
  • 3+ years of experience with Data Lake analytics platforms such as Azure Synapse or Databricks
  • Strong organization and time management skills
  • Good written and verbal communication skills
  • Ability to work independently as well as part of a team
  • Undergraduate degree or certificate in Data Science, Mathematics, Applied Mathematics, Statistics, Applied Statistics, Computer Science, Computer Engineering, Electrical Engineering, Physics, or a similar field of study
  • Proactive and self-motivated, with the ability to work in teams and in a highly dynamic environment with multiple stakeholders and timelines

Additional Desired Qualities 

  • 5+ years of experience performing data science tasks using cloud-based technologies
  • 5+ years of hands-on experience in Python
  • 3+ years of experience building applications in Python web frameworks such as FastAPI, Django, or Flask
  • 3+ years of experience using ETL platforms such as Azure Data Factory or Airflow
  • Graduate degree in Data Science, Mathematics, Applied Mathematics, Statistics, or Applied Statistics

Benefits: 

  • Hybrid Work Schedule. 
  • Generous Paid Time Off and Holidays. 
  • An attractive and comprehensive benefits package including Medical, Dental and Vision. 
  • Flexible Spending Accounts and Commuter Benefits. 
  • Company Paid Life Insurance and Disability Coverage. 
  • 403(b) + employer matching and discretionary company contributions. 
  • College Savings Plan. 
  • Ongoing training and continuous opportunities for professional growth and development. 

Additional Information: 

  • This is a temporary grant-funded position ending in November 2027.
  • This individual must reside in the tri-state area (NY, NJ, CT) by their confirmed start date. 
  • Preference may be given to individuals residing in New York City (5 boroughs) or surrounding New York State counties. 
  • This individual will be expected to work non-business hours during emergencies.

At Public Health Solutions (PHS), we value diversity within our teams and understand that varied backgrounds and experiences strengthen our community and propel us toward our goals. If you don’t have experience in all the areas listed above, we still encourage you to apply and share your background and experiences in your application. We are eager to discover how your unique perspective can bring positive change to our team and help advance our mission of creating healthier, more equitable communities. 

We look forward to learning more about you!

PHS is proud to be an equal opportunity employer and encourages applications from women, people of color, persons with disabilities, LGBTQIA+ individuals, and veterans. 


Monday-Friday
35 Hours Per Week


The Company
HQ: New York, New York
563 Employees

What We Do

Public Health Solutions (PHS) is the largest public health nonprofit serving New York City. For over 60 years, PHS has improved health outcomes and helped families thrive by providing services directly to the city’s most vulnerable populations, publishing groundbreaking research that moves public health policy and practice forward, and supporting over 200 community-based organizations through our long-standing government partnerships. We are a leader in addressing crucial public health issues, including food and nutrition, health insurance access, maternal and child health, reproductive health, tobacco control, and HIV/AIDS prevention. PHS has a strong focus on health equity to ensure NYC families have the basics for a healthier life. For more information, visit healthsolutions.org.
