Salma Health

Data Engineer

Reposted 22 Days Ago

Hiring Remotely in USA

Remote

119K-185K Annually

Mid level

Healthtech

The Role

As a Data Engineer, you will build and maintain data pipelines, convert data into metrics, and operate the platform on AWS. Responsibilities include writing production code, processing data from APIs, and enhancing the orchestration layer using Dagster, while ensuring compliance in a HIPAA-regulated environment.

Summary Generated by Built In

We are looking to hire a Data Engineer to join our team as we build the data backbone for a mental and behavioral health practice. This role will build the platform that turns appointments, assessments, billing, and patient engagement data into the metrics our clinical and operations teams rely on. As a mid-level data engineer, you'll own meaningful pieces of our pipeline end-to-end: from pulling data out of third-party APIs, through medallion architecture transformations in dbt, to exposing curated metrics through our semantic layer.

This is a hands-on role on a small team. You'll write code that runs in production every day, ship improvements weekly, and have direct visibility into how the data is used. We work in a HIPAA-regulated environment, so thoughtfulness about data handling is part of the job.

Location

Hybrid role. Preference for candidates located in the San Francisco Bay Area, San Diego, or Salt Lake City. Remote is possible, with the expectation of regular in-person collaboration.
What You'll Work On

Maintaining and improving the orchestration layer: Dagster assets, jobs, schedules, sensors, and the dependency graph that ties extraction → loading → transformation together.
Adding new data sources to the pipeline; extracting from APIs (GraphQL, REST), Google Drive folders, and CSV/JSONL drops on S3, then landing them in our bronze schemas via Dagster assets.
Building silver and gold dbt models that transform raw source data into our unified entity model following the medallion architecture.
Extending our semantic layer so business metrics are available to downstream consumers (BI tool dashboards, AI agents, ad-hoc analysis) without re-deriving logic
Operating the platform on AWS: ECS Fargate services, RDS, S3, Secrets Manager, CloudFormation templates, and the CodePipeline-based CI/CD that deploys our data platform. All of our data platforms are deployed with IaC tools.
Writing tests (pytest for Python, dbt tests for models, data quality tests) and contributing to internal documentation as new patterns emerge.

What We're Looking For

Required

4-7 years of professional experience building and operating data pipelines in production
From conversation to shipped data product: you're comfortable owning a request end-to-end: scoping it with a non-technical stakeholder, writing requirements clear enough that you (and others) can build against them, implementing the models or metrics, and verifying with the stakeholder that what shipped solves their problem.
Strong Python: comfortable writing modules, structuring code for reuse and testability, and debugging issues across an async or orchestrated pipeline.
Solid SQL skills, including window functions, CTEs (including recursive ones), and the ability to reason about query performance.
Hands-on experience with dbt: building models, writing tests, and understanding materializations.
Working knowledge of an orchestration framework: (Dagster, Airflow, Prefect, or similar), including the mental model of assets/tasks, dependencies, and scheduling.
Comfort with AWS fundamentals: S3, IAM, Secrets Manager, and either ECS or Lambda for compute.
Git-based workflows: code review, and writing PRs that are reviewable.

Nice to Have

Experience with Dagster specifically.
Experience with semantic layer tools (Cube.js, dbt Semantic Layer/MetricFlow, LookML)
Healthcare data experience (HIPAA, EHR systems, ICD-10/CPT codes)
CloudFormation, Terraform, or another IaC tool
Experience with GraphQL APIs as a consumer (pagination, introspection, dealing with rate limits and retries)
Familiarity with identity resolution patterns or slowly-changing dimension modeling

How We Work

Small, focused team; your work ships and gets used quickly
Pragmatic engineering: we favor readable code, clear naming conventions, and well-documented patterns over clever abstractions. Our internal "how to add X" guides are first-class artifacts.
Tests on everything; CI runs dbt parse, dg check defs, and pytest on every PR.

Company Mission & Vision
We are the brain health company of the future, integrating care delivery, technology innovation, and research breakthroughs to better understand brain biology and diagnose, treat, and ultimately cure brain disorders across all stages of life.

Who We Are

Salma Health is reimagining brain healthcare by building a comprehensive, end-to-end brain health system that integrates advanced diagnostics, rapid-acting interventions, and continuous care coordination under one roof.

Our multidisciplinary teams of psychiatrists, neurologists, neuropsychiatrists, therapists, and technologists collaborate to deliver personalized, compassionate care. By leveraging real-world data and precision brain health research, we deliver measurable, evidence-based outcomes at scale.

Headquartered in California, Salma Health is expanding access to innovative brain health services across the U.S., beginning with clinics in San Diego, Orange County, the Bay Area, and Los Angeles.

Compensation & Benefits

Base: The base salary range for this role is $119,000–$185,000, depending on geographic location, experience, and qualifications. Salma Health uses a tiered compensation structure based on candidate location. Specific range details are available during the interview process.
Incentives: Discretionary bonus based on company and individual performance
Benefits: Medical, dental, vision, PTO, and additional benefits

Work Authorization

Sponsorship for employment authorization may be considered on a case-by-case basis depending on the role and candidate qualifications.

Equal Opportunity & Accessibility Statement

We are committed to providing a workplace that is inclusive, respectful, and free from discrimination. We welcome applicants of all backgrounds and make employment decisions without regard to race, color, religion, sex (including pregnancy, childbirth, and related medical conditions), sexual orientation, gender identity or expression, national origin, ancestry, citizenship, age, physical or mental disability, medical condition, genetic information, marital status, military or veteran status, or any other characteristic protected by California or federal law.

In accordance with the California Fair Chance Act, we will consider qualified applicants with arrest and conviction records.

If you require a reasonable accommodation during the application or hiring process, please contact us directly - we’re happy to help.

Skills Required

4-7 years of professional experience building and operating data pipelines in production
Strong Python skills
Solid SQL skills
Hands-on experience with dbt
Working knowledge of an orchestration framework
Comfort with AWS fundamentals
Git-based workflows

View all jobs at Salma Health

View Salma Health Profile

Report Job

Am I A Good Fit?

beta

Get Personalized Job Insights.

Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company

HQ: San Mateo, California

54 Employees

What We Do

Salma Health is a comprehensive brain health clinic combining advanced diagnostics, personalized treatment and continuous care under one roof. Our team specializes in mental health, cognitive health and neurological recovery, offering evidence-based therapies for conditions like depression, anxiety, PTSD and brain injury. From your first evaluation to ongoing care, we guide every step of your journey with clarity, compassion and science-driven support.