Role Summary
We are seeking a skilled Databricks Data Engineer to design, build, and optimize scalable data pipelines and data platforms using the Databricks Lakehouse architecture. The role involves working closely with data scientists, analysts, and business stakeholders to enable data-driven decision-making through robust, high-quality data solutions.
A Databricks Data Engineer primarily focuses on building and maintaining pipelines that transform raw data into usable insights using platforms like Apache Spark and Databricks.
Key Responsibilities
1. Data Pipeline Development
- Design, develop, and maintain scalable batch and streaming data pipelines on Databricks
- Implement end-to-end ingestion, transformation, and modeling (Bronze, Silver, Gold layers)
- Build ETL/ELT workflows for structured and unstructured data
2. Data Processing & Transformation
- Process large-scale data using Apache Spark (PySpark/Scala)
- Perform data cleansing, transformation, and enrichment
- Enable efficient data modeling for analytics and reporting
3. Performance Optimization
- Optimize Spark jobs for performance (partitioning, joins, caching)
- Troubleshoot and resolve pipeline performance bottlenecks
- Tune clusters, workloads, and resource utilization
4. Data Quality & Governance
- Ensure data quality, consistency, and reliability across pipelines
- Implement validation checks, monitoring, and alerting mechanisms
- Apply governance practices such as schema enforcement and versioning
5. Platform & Workflow Management
- Develop and manage Databricks notebooks, jobs, and workflows
- Orchestrate pipelines using tools like Databricks Workflows / Airflow
- Monitor pipeline execution and ensure SLAs are met
6. Collaboration & Stakeholder Engagement
- Work closely with data scientists, analysts, and architects to gather requirements
- Translate business requirements into scalable data solutions
- Support downstream reporting, analytics, and ML use cases
Required Skills & Competencies
Technical Skills
- Strong experience with Databricks & Apache Spark
- Proficiency in Python / PySpark / Scala / SQL
- Experience with ETL/ELT and Data Warehousing concepts
- Familiarity with Delta Lake, Lakehouse architecture
- Experience with Cloud platforms (Azure / AWS / GCP)
- Knowledge of big data technologies (Kafka, Hadoop, etc.)
Engineering Practices
- Experience with CI/CD, Git, and SDLC practices
- Exposure to pipeline orchestration tools (Airflow, ADF, etc.)
- Understanding of data governance and security
Soft Skills
- Strong problem-solving and analytical capabilities
- Effective communication and stakeholder management
- Ability to work in cross-functional teams
Qualifications
- Bachelor's / Master's degree in Computer Science, Engineering, or related field
- Typically 3+ years of experience in data engineering or big data environments
- Certifications in Databricks or cloud platforms are a plus
Nice-to-Have
- Experience with ML pipelines or feature engineering
- Exposure to real-time streaming frameworks
- Knowledge of Power BI / analytics tools (for enterprise setups)
Tools & Technologies
- Databricks (Lakehouse Platform)
- Apache Spark (PySpark / Scala)
- SQL
- Delta Lake
- Azure Data Factory / Airflow
- Cloud Storage (ADLS, S3, GCS)
About MetLife
Recognized on Fortune magazine's list of the "World's Most Admired Companies" and Fortune World's 25 Best Workplaces™, MetLife, through its subsidiaries and affiliates, is one of the world's leading financial services companies; providing insurance, annuities, employee benefits and asset management to individual and institutional customers. With operations in more than 40 markets, we hold leading positions in the United States, Latin America, Asia, Europe, and the Middle East.
Our purpose is simple - to help our colleagues, customers, communities, and the world at large create a more confident future. United by purpose and guided by our core values - Win Together, Do the Right Thing, Deliver Impact Over Activity, and Think Ahead - we're inspired to transform the next century in financial services. At MetLife, it's #AllTogetherPossible . Join us!
#BI-Hybrid
Skills Required
- 3+ years of experience in data engineering or big data environments
- Bachelor's / Master's degree in Computer Science, Engineering, or related field
- Strong experience with Databricks & Apache Spark
- Proficiency in Python / PySpark / Scala / SQL
- Experience with ETL/ELT and Data Warehousing concepts
- Familiarity with Delta Lake, Lakehouse architecture
- Experience with Cloud platforms (Azure / AWS / GCP)
- Knowledge of big data technologies (Kafka, Hadoop, etc.)
- Experience with CI/CD, Git, and SDLC practices
- Exposure to pipeline orchestration tools (Airflow, ADF, etc.)
- Certifications in Databricks or cloud platforms
What We Do
We're honored to be No. 10 on Great Place to Work's World's Best Workplaces and recognized in the Fortune 100 Best Companies to Work For® list in 2025. At MetLife, we're leading the global transformation of an industry we’ve defined for over 157 years. At MetLife, every innovation and line of code is a lifeline for our customers and their families—from victims of natural disasters to people living with disabilities and beyond. With operations in more than 40 markets and leading positions across the globe, MetLife fosters an inclusive culture where our people are energized and inspired to deliver for our customers and communities. Join our remarkable journey—one in which you help write the next century of innovation in financial services—because with MetLife, making the world a better place is All Together Possible.
Why Work With Us
At MetLife, you’ll be working for a company whose purpose is to help customers throughout their life’s journey, and often in their most critical time of need. You’ll be a part of developing leading-edge platforms that will have a lasting impact on the lives and well-being of tens of millions of customers.
Gallery
MetLife Teams
MetLife Offices
Hybrid Workspace
Employees engage in a combination of remote and on-site work.
MetLife's current workplace policies classify roles as Office, Hybrid or Virtual based on the nature of work, encouraging new ways of working together



.png)
















.png)























