Top Data Engineer Jobs in San Diego, CA

24 Days AgoSaved
Remote
San Diego, CA
210K-240K Annually
Senior level
210K-240K Annually
Senior level
Fintech • Analytics • Financial Services
Lead architecture and scaling of the Databricks data warehouse and self-service data platform. Build reusable batch and streaming frameworks, enforce governance, SLOs, lineage, observability, CI/CD, and multi-tenancy. Mentor engineers, run design reviews, and ship documented patterns to enable analytics, data science, and app teams to move quickly without blocking on platform tickets.
Top Skills: AugmentAWSAzureAzure Event HubCi/CdClaudeDatabricksDbtGCPInfrastructure-As-CodePythonTerraform
24 Days AgoSaved
Remote
San Diego, CA
109K-125K Annually
Mid level
109K-125K Annually
Mid level
Cloud • Healthtech • Information Technology • Software • Consulting
Design, build, and maintain backend systems and data interactions for healthcare reporting. Develop APIs, data models, ETL pipelines, and interfaces; optimize performance, ensure data integrity and security, and support automated testing and CI/CD. Collaborate in Agile teams and reduce technical debt while enabling integrations with frontend and third-party systems.
Top Skills: SparkCi/Cd PipelinesDistributed ComputingETLOrchestrationPython
24 Days AgoSaved
Remote
San Diego, CA
170K-190K Annually
Senior level
170K-190K Annually
Senior level
Software
Build end-to-end data and analytics features: ingest and model data, build transformations and pipelines, create client-facing dashboards and interfaces, and develop LLM-powered workflows to accelerate analysis and app development. Partner with stakeholders, perform exploratory analyses, and ensure data quality, observability, and maintainability while influencing data engineering strategy.
Top Skills: Ai/Llm Development ToolsAutomation FrameworksCode AgentsLlmsPythonReactSQLVue
24 Days AgoSaved
Remote
San Diego, CA
Senior level
Senior level
Database
Lead design, build, and maintain scalable batch and streaming data pipelines and data warehouse models. Ensure data quality, observability, performance, and security. Own workstreams, support production incidents, and enable analysts and downstream applications.
Top Skills: AirflowDbtDockerGoogle AnalyticsJSONKafkaKubernetesPostgresPythonSalesforceSnowflakeSQL
24 Days AgoSaved
Remote
San Diego, CA
148K-203K Annually
Senior level
148K-203K Annually
Senior level
Artificial Intelligence • Information Technology • Machine Learning • Marketing Tech • Software • Biotech • Design
Administer, stabilize, and scale Oura's global Databricks environment. Manage workspace governance, cluster policies, access control, cost monitoring, job scheduling, and incident SLAs. Onboard teams, resolve performance and reliability issues, and partner with IT and Data Architecture. Over time contribute to pipeline development, Spark optimization, CI/CD, and infrastructure-as-code to support domain-oriented data mesh initiatives.
Top Skills: SparkAws GlueAws IamAws KinesisAws S3Ci/CdDatabricksInfrastructure As CodeJob OrchestrationUnity CatalogWorkflow Orchestration
New

Track Smarter, Apply Better.

Ditch the spreadsheets. Organize your job search with our freeApplication Tracker.

Use For Free
Application Tracker Preview
24 Days AgoSaved
Remote
San Diego, CA
248K-310K Annually
Senior level
248K-310K Annually
Senior level
Real Estate • Travel • PropTech
Lead the vision and execution for scalable data governance and quality across Airbnb. Architect future-proof data ecosystems, define best practices and tooling, drive cross-functional policy implementation (privacy, legal, infosec), mentor senior engineers, and influence executive strategy to embed stewardship into the data product lifecycle.
Top Skills: Amazon S3Apache IcebergSparkJavaRelational DatabasesScalaSQL
25 Days AgoSaved
Remote
San Diego, CA
Senior level
Senior level
Cloud • Information Technology • Software
Design, build, and operate large-scale AWS-based ETL pipelines using Glue/PySpark to ingest, transform, and curate multi-format data into Apache Iceberg tables. Ensure data quality, metadata management, semantic layer creation (Trino/Athena), CloudFormation deployments, documentation, and compliance within a FedRAMP federal agile environment.
Top Skills: Amazon AthenaAmazon EventbridgeAmazon Mwaa (Airflow)Amazon RedshiftAmazon S3Amazon SnsAmazon SqsApache HiveApache IcebergAPIsAvroAws GlueAws LambdaAws Service CatalogAws Step FunctionsCi/CdCloudFormationCsvDeequDirect ConnectEmrFedrampFismaFtpGitHttpsJIRAJSONKnowledge BasesMermaidNist 800-53NoSQLOracleOrcOwasp Asvs Level 2ParquetPep 8PostgresPysparkPythonSftpTrinoVector StoresWeb ScrapingXMLZero Trust Architecture
25 Days AgoSaved
Remote
San Diego, CA
Mid level
Mid level
Cloud • Information Technology • Software
Serve as a data quality and QA engineer on a federal data platform: build automated AWS Glue/Deequ data quality checks, validate schemas and ETL pipelines, create and run unit/integration tests with 90% coverage, support IV&V and UAT, perform static/security scans, produce test documentation, monitor production ETL, and collaborate with data stewards and IV&V teams.
Top Skills: AthenaAws GlueCi/CdDeequFismaGitGreat ExpectationsJIRANist 800-53Owasp Asvs Level 2PythonSQLTrinoXML
25 Days AgoSaved
In-Office or Remote
San Diego, CA
Expert/Leader
Expert/Leader
Fitness • Information Technology • Software • Sports • Wearables
Define and lead the technical architecture for a lakehouse on AWS, build production pipelines and data models for time-series and multi-source data, introduce data governance and catalog foundations, author platform playbooks (ADRs, runbooks, Terraform modules), deliver projects end-to-end, and mentor engineers while representing the platform to senior leadership and non-technical stakeholders.
Top Skills: AWSDatabricksDelta LakeEmrGlueHudiIamIcebergLambdaPythonS3SnowflakeSparkTerraformTrino
25 Days AgoSaved
In-Office or Remote
San Diego, CA
141K-173K Annually
Senior level
141K-173K Annually
Senior level
Fintech • Payments
Design and build scalable, semantically consistent 360 entity data models and transformation pipelines. Implement cleansing, enrichment, KPIs, scoring, and business rules in SQL and Python/Scala, ensure data quality, lineage, and performance at scale. Use AI coding assistants and SDD to accelerate development, create tests and documentation, and collaborate with domain experts, data scientists, and product teams to deliver trusted, reusable data assets.
Top Skills: Ai AgentsClaudeContext EngineeringCursorGithub CopilotLlmsMdmPrompt DesignPythonScalaSpec-Driven Development (Sdd)SQL
All Filters
JobType
New Jobs
Job Category
Experience
Industry
Company Name
Company Size

Sign up now Access later

Create Free Account