Data Engineer

Posted Yesterday
Hiring Remotely in USA
Remote
119K-185K Annually
Mid level
Healthtech
The Role
As a Data Engineer, you will build and maintain data pipelines, convert data into metrics, and operate the platform on AWS. Responsibilities include writing production code, processing data from APIs, and enhancing the orchestration layer using Dagster, while ensuring compliance in a HIPAA-regulated environment.

We are looking to hire a Data Engineer to join our team as we build the data backbone for a mental and behavioral health practice. This role will build the platform that turns appointments, assessments, billing, and patient engagement data into the metrics our clinical and operations teams rely on. As a mid-level data engineer, you'll own meaningful pieces of our pipeline end-to-end: from pulling data out of third-party APIs, through medallion architecture transformations in dbt, to exposing curated metrics through our semantic layer.

This is a hands-on role on a small team. You'll write code that runs in production every day, ship improvements weekly, and have direct visibility into how the data is used. We work in a HIPAA-regulated environment, so thoughtfulness about data handling is part of the job.

Location

Hybrid role. Preference for candidates located in the San Francisco Bay Area, San Diego, or Salt Lake City. Remote is possible, with the expectation of regular in-person collaboration.


What You'll Work On

  • Maintaining and improving the orchestration layer: Dagster assets, jobs, schedules, sensors, and the dependency graph that ties extraction → loading → transformation together.

  • Adding new data sources to the pipeline: extracting from APIs (GraphQL, REST), Google Drive folders, and CSV/JSONL drops on S3, then landing them in our bronze schemas via Dagster assets.

  • Building silver and gold dbt models that transform raw source data into our unified entity model following the medallion architecture.

  • Extending our semantic layer so business metrics are available to downstream consumers (BI tool dashboards, AI agents, ad-hoc analysis) without re-deriving logic.

  • Operating the platform on AWS: ECS Fargate services, RDS, S3, Secrets Manager, CloudFormation templates, and the CodePipeline-based CI/CD that deploys our data platform. All of our data platforms are deployed with IaC tools.

  • Writing tests (pytest for Python, dbt tests for models, data quality tests) and contributing to internal documentation as new patterns emerge.

What We're Looking For

Required
  • 4-7 years of professional experience building and operating data pipelines in production

  • From conversation to shipped data product: you're comfortable owning a request end-to-end, from scoping it with a non-technical stakeholder and writing requirements clear enough that you (and others) can build against them, to implementing the models or metrics and verifying with the stakeholder that what shipped solves their problem.

  • Strong Python: comfortable writing modules, structuring code for reuse and testability, and debugging issues across an async or orchestrated pipeline.

  • Solid SQL skills, including window functions, CTEs (including recursive ones), and the ability to reason about query performance.

  • Hands-on experience with dbt: building models, writing tests, and understanding materializations.

  • Working knowledge of an orchestration framework (Dagster, Airflow, Prefect, or similar), including the mental model of assets/tasks, dependencies, and scheduling.

  • Comfort with AWS fundamentals: S3, IAM, Secrets Manager, and either ECS or Lambda for compute.

  • Git-based workflows: participating in code review and writing PRs that are easy to review.

Nice to Have
  • Experience with Dagster specifically.

  • Experience with semantic layer tools (Cube.js, dbt Semantic Layer/MetricFlow, LookML)

  • Healthcare data experience (HIPAA, EHR systems, ICD-10/CPT codes)

  • CloudFormation, Terraform, or another IaC tool

  • Experience with GraphQL APIs as a consumer (pagination, introspection, dealing with rate limits and retries)

  • Familiarity with identity resolution patterns or slowly-changing dimension modeling

How We Work

  • Small, focused team; your work ships and gets used quickly

  • Pragmatic engineering: we favor readable code, clear naming conventions, and well-documented patterns over clever abstractions. Our internal "how to add X" guides are first-class artifacts.

  • Tests on everything; CI runs dbt parse, dg check defs, and pytest on every PR.

Company Mission & Vision

We are the brain health company of the future that integrates care delivery, technology innovation and research breakthroughs to better understand brain biology and diagnose, treat and ultimately cure brain disorders for all stages of life.

Who We Are

Salma Health is reimagining brain healthcare. We bring together advanced diagnostics, evidence-based treatments and continuous support under one connected system—so every person can receive the right care at the right moment.

Our multidisciplinary team of psychiatrists, neurologists, neuropsychiatrists, therapists and technologists works together to deliver personalized, compassionate care for people living with brain and mental health conditions. By combining cutting-edge science with human understanding, we’re creating a new model of care that replaces fragmentation with connection and uncertainty with clarity.

Headquartered in California, Salma Health is expanding access to innovative brain health care across the U.S., beginning with clinics in San Diego and Orange County, with clinics in the Bay Area and Los Angeles opening soon.

Compensation & Benefits

  • Base: The base salary range for this role is $119,000–$185,000, depending on geographic location, experience, and qualifications. Salma Health uses a tiered compensation structure based on candidate location. Specific range details are available during the interview process.

  • Incentives: Discretionary bonus based on company and individual performance

  • Benefits: Medical, dental, vision, PTO, and additional benefits

Work Authorization

Sponsorship for employment authorization may be considered on a case-by-case basis depending on the role and candidate qualifications.

Equal Opportunity & Accessibility Statement

We are committed to providing a workplace that is inclusive, respectful, and free from discrimination. We welcome applicants of all backgrounds and make employment decisions without regard to race, color, religion, sex (including pregnancy, childbirth, and related medical conditions), sexual orientation, gender identity or expression, national origin, ancestry, citizenship, age, physical or mental disability, medical condition, genetic information, marital status, military or veteran status, or any other characteristic protected by California or federal law.

In accordance with the California Fair Chance Act, we will consider qualified applicants with arrest and conviction records.

If you require a reasonable accommodation during the application or hiring process, please contact us directly; we’re happy to help.


The Company
HQ: San Mateo, California
54 Employees

What We Do

Salma Health is a comprehensive brain health clinic combining advanced diagnostics, personalized treatment and continuous care under one roof. Our team specializes in mental health, cognitive health and neurological recovery, offering evidence-based therapies for conditions like depression, anxiety, PTSD and brain injury. From your first evaluation to ongoing care, we guide every step of your journey with clarity, compassion and science-driven support.
