Senior Data Engineer – Databricks Expert

Posted 22 Days Ago
Be an Early Applicant
Hyderabad, Telangana
Senior level
Artificial Intelligence • Information Technology
The Role
The Senior Data Engineer will act as a subject matter expert for Databricks, designing and managing ETL pipelines using tools like Python and Apache Spark. Responsibilities include optimizing data architectures, implementing CI/CD pipelines, and ensuring data governance while mentoring junior engineers.
Summary Generated by Built In

Company Description

We are an Artificial Intelligence (AI) focused product engineering company, providing our customers in healthcare, retail & e-commerce, manufacturing, and hospitality sectors with cutting-edge products & solutions, harnessing Big Data Analytics, Vision Analytics, and IoT.

Ever since our inception in March 2010, Tech Vedika has been

  • Great Place To Work Certified™(May 2022-May 2023) Organization 
  • Top 50 I Mid-Size India’s Best Workplaces for Women 2022 !
  • Top 10 Most Disruptive Face & Image Recognition Solution Providers’2020 – Analytics Insights
  • Top 10 Healthcare Analytics Solution Providers’ 2019- Healthcare Outlook Magazine
  • Top 20 most amazing AWS Service Providers – CIO Review India 2018

We strive for simple, elegant tech solutions to perform complex tasks. As a scalable technology partner, we enable organizations to improve operational efficiency and unleash new business potential


Job Description

We are seeking a Senior Data Engineer with in-depth knowledge of Databricks and Unity Catalog to serve as the subject matter expert for all things Databricks within our organization. This role requires deep expertise in Databricks, including CI/CD setup for data bricks, data lineage through Unity Catalog, and strong proficiency in ETLSQL, and modern data engineering practices. You will be the go-to person for designing, implementing, and optimizing data solutions with Databricks.


Key Responsibilities:

· Serve as the point of contact and subject matter expert for all Databricks-related activities, including architecture, development, and operational best practices.

· Design, develop, and manage ETL/ELT pipelines in Databricks using Python (PySpark), integrating various data sources to support business operations.

· Leverage Unity Catalog to ensure data lineagesecurity, and governance are properly managed across the Databricks environment.

· Implement and maintain CI/CD pipelines for Databricks, ensuring smooth deployments, version control, and automation using Git and other DevOps tools.

· Build scalable data architectures, including Data LakesLakehouses, and Data Warehouses, ensuring efficient data management and accessibility.

· Configure and optimize Databricks clustersjobs, and workflows for both batch and streaming data processing to handle large-scale datasets.

· Stay up-to-date with the latest Databricks features and advancements, continuously enhancing our data engineering practices.

· Collaborate with cross-functional teams to implement data governance and ensure compliance with security and industry regulations.

· Monitor and tune Databricks workloads to ensure high performance and scalability, adapting to business needs as required.

· Provide guidance and mentorship to junior engineers, ensuring adherence to best practices and fostering a collaborative environment.

Qualifications

Qualifications:

· 5+ years of experience in data engineering with significant expertise in Databricks and Apache Spark.

· Proficient in Unity Catalog for managing data lineage, security, and governance within the Databricks ecosystem.

· Experience building and optimizing ETL pipelines using tools like Azure Data FactoryInformatica, or similar.

· Strong understanding of CI/CD practices with experience in Git for version control and integration with Databricks.

· Expertise in SQL development and performance tuning for large-scale datasets.

· Knowledge of the Azure ecosystem, including data services like Azure Data Lake and Azure Storage.

· Ability to work with both batch and streaming data processing pipelines.

· Experience with data modeling and dimensional design (e.g., star schema).

· Good understanding of data governance, compliance, and security best practices.

· Excellent communication and problem-solving skills, with the ability to manage multiple priorities.

· Ability to stay current on Databricks innovations and proactively introduce new features and capabilities to the team.

Additional Information

At Tech Vedika, we are looking for talented individuals who want to work with driven people. Attain success while working on interesting projects with a culturally diverse group of individuals.
Perks & Benefits: 

  • Health Insurance
  • Meal Vouchers
  • Learning Aids
  • Client/Customer Interactions
  • Working with great minds 

If you want an exciting and dynamic career with unlimited growth potential, then Tech Vedika is the place for you!

Top Skills

Python
The Company
HQ: San Jose, CA
247 Employees
On-site Workplace
Year Founded: 2010

What We Do

We are an Artificial Intelligence (AI) and Amazon Web Services (AWS) focused Services Company providing Innovative Technology Solutions to our customers.

We have helped organizations build cutting-edge solutions and offer consulting services in a range of market verticals including Supply Chain & Logistics, Retail, Manufacturing and Healthcare harnessing AI/ML, Computer Vision, IoT, and Hybrid/Cloud/Edge infra.

TechVedika is a Certified AWS Services Partner with vast experience in managing cloud environments. Our cloud practice offers profound expertise in AWS and all leading cloud technologies.

We strive for simple, elegant tech solutions leveraging state-of-the-art and upcoming advances in technologies/platforms/architectures combined with a proven track record in delivery using Agile and DevOps.

Our global clients include early-stage start-ups to medium and large enterprises.
As a scalable technology partner, we enable organizations to improve operational efficiency and unleash new business potential.

Awards & Recognitions:
- Great Place to Work-Certified™ May 2023-May 2024
- Great Place to Work-Certified™ May 2022-May 2023
- Top 50 I Mid-Size India’s Best Workplaces for Women™ 2022
- Top 10 Most Disruptive Face & Image Recognition Solution Providers’2020 – Analytics Insights
- Top 10 Healthcare Analytics Solution Providers’ 2019- Healthcare Outlook Magazine
- Top 20 most amazing AWS Service Providers – CIO Review India 2018
- Top 100 Mobile App development vendors India – Silicon India 2013

Similar Jobs

Warner Bros. Discovery Logo Warner Bros. Discovery

Staff Data Engineer- C360, Hyderabad

Artificial Intelligence • Digital Media • Gaming • Machine Learning • News + Entertainment • Software
Hybrid
Hyderabad, Telangana, IND
40000 Employees

Warner Bros. Discovery Logo Warner Bros. Discovery

Senior Data Engineer (Growth Engineering)

Artificial Intelligence • Digital Media • Gaming • Machine Learning • News + Entertainment • Software
Hybrid
Hyderabad, Telangana, IND
40000 Employees

Crunchyroll Logo Crunchyroll

Staff Data Engineer

Digital Media • eCommerce • Gaming • Mobile • News + Entertainment
Hyderabad, Telangana, IND
1200 Employees

Warner Bros. Discovery Logo Warner Bros. Discovery

Senior Data Engineer - C360, Hyderabad

Artificial Intelligence • Digital Media • Gaming • Machine Learning • News + Entertainment • Software
Hybrid
Hyderabad, Telangana, IND
40000 Employees

Similar Companies Hiring

Jobba Trade Technologies, Inc. Thumbnail
Software • Professional Services • Productivity • Information Technology • Cloud
Chicago, IL
45 Employees
InCommodities Thumbnail
Renewable Energy • Machine Learning • Information Technology • Energy • Automation • Analytics
Austin, TX
234 Employees
RunPod Thumbnail
Software • Infrastructure as a Service (IaaS) • Cloud • Artificial Intelligence
Charlotte, North Carolina
53 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account