Lead AI Data Engineer

Office, Machaze, Manica
Senior level
Information Technology • Software
The Role
The Lead AI Data Engineer will design and build data pipelines, manage Airflow workflows, ensure data reliability, and collaborate with the analytics and AI teams, working primarily with Spark, Python, and AWS services.
Location: Bengaluru, Karnataka, India

About the Role:
We’re looking for an AI Data Engineer with 4-8 years of experience to join our data team. In this role, you’ll build and maintain our data infrastructure on AWS, enabling analytics and AI teams to extract actionable insights. You’ll design and manage end-to-end data pipelines that deliver high-quality, reliable, real-time data, and you’ll contribute to ML/GenAI workflows and model deployment pipelines.

What You'll Do:

  • Design and build scalable data pipelines/transformations using Spark / PySpark / Scala.

  • Manage and optimize Airflow DAGs for complex data workflows.

  • Clean, transform, and prepare data for analytics, AI, and ML use cases.

  • Use Python for automation, data processing, and internal tooling.

  • Work with AWS services (S3, Redshift, EMR, Glue, Athena) to maintain robust data infrastructure.

  • Collaborate with Analytics and AI teams to design pipelines for ML/GenAI projects.

  • Contribute to Node.js (TypeScript) backend development for data services.

  • Automate deployments using CI/CD pipelines (GitHub Actions).

  • Monitor, troubleshoot, and ensure data quality, consistency, and reliability across systems.

  • Build and maintain data warehouses/lakes and handle real-time streaming data using Kafka or similar technologies.

What You'll Need:

  • Bachelor’s or Master’s in Computer Science, Engineering, or a related field.

  • 4-8 years of hands-on experience in data engineering.

  • Strong expertise in Spark / Scala for large-scale data processing.

  • Proficiency in Airflow for managing and optimizing complex DAGs.

  • Advanced Python skills for data manipulation, automation, and tool development.

  • Proven experience with AWS cloud services (S3, Redshift, EMR, Glue, Athena, IAM, EC2).

  • Solid understanding of ETL/ELT, data preparation, and analytics workflows.

  • Familiarity with Node.js and TypeScript for backend data services.

  • Experience with automated CI/CD (GitHub Actions).

  • Familiarity with change data capture (CDC) tools such as Debezium.

  • Strong SQL, knowledge of data warehousing and streaming (Kafka, Flink, Kinesis), and excellent communication skills.

Bonus Points:

  • Experience with data lake technologies (Delta Lake, Apache Iceberg).

  • Knowledge of ML/GenAI model deployment pipelines.

  • Familiarity with data governance, quality frameworks, and statistics.

  • Experience with infrastructure as code (Terraform).

  • Familiarity with containers (Docker, Kubernetes).

  • Experience with monitoring and logging tools (Prometheus, Grafana).

Top Skills

Airflow
Athena
AWS
EMR
GitHub Actions
Glue
Kafka
Node.js
PySpark
Python
Redshift
S3
Scala
Spark
SQL
TypeScript
YAML
The Company
HQ: Tokyo, Tokyo
178 Employees

What We Do

Josys is the SaaS Management Platform that simplifies how IT works. Our holistic approach equips IT with 360° control over their software and hardware portfolio by making it easier to visualize assets, analyze utilization trends, and automate provisioning processes that will make IT operations run more efficiently.

By leveraging APIs and integrating with hundreds of applications, Josys empowers IT with a single portal for assigning licenses and devices to employees, monitoring user access, and tracking adoption. IT teams can save time by eliminating dependencies on multiple spreadsheets and disparate tools, easily optimize IT costs, and securely govern access to company data with Josys.
