Data Engineer

Reposted 11 Days Ago
Hiring Remotely in Hyderabad, Telangana, IND
In-Office or Remote
Senior level
Database • Analytics
The Role
The Data Engineer will design and optimize data pipelines and workflows to support a Media Mix Optimization platform, ensuring data integrity and governance while collaborating with Data Science and BI teams.
Company Description

Blend360 is a data and AI services company specializing in data engineering, data science, MLOps, and governance to build scalable analytics solutions. It partners with enterprise and Fortune 1000 clients across industries including financial services, healthcare, retail, technology, and hospitality to drive data-driven decision making. Headquartered in Columbia, Maryland, the company is recognized for rapid growth and global delivery of AI solutions through the integration of people, data, and technology.

We are seeking a hands-on Data Engineer with deep expertise in distributed systems, ETL/ELT development, and enterprise-grade database management. The engineer will design, implement, and optimize ingestion, transformation, and storage workflows to support the MMO platform. The role requires technical fluency across big data frameworks (HDFS, Hive, PySpark), orchestration platforms (NiFi), and relational systems (Postgres), combined with strong coding skills in Python and SQL for automation, custom transformations, and operational reliability.

Job Description

We are implementing a Media Mix Optimization (MMO) platform designed to analyze and optimize marketing investments across multiple channels. This initiative requires a robust on-premises data infrastructure to support distributed computing, large-scale data ingestion, and advanced analytics. The Data Engineer will be responsible for building and maintaining resilient pipelines and data systems that feed into MMO models, ensuring data quality, governance, and availability for Data Science and BI teams. The environment integrates HDFS for distributed storage, Apache NiFi for orchestration, Hive and PySpark for distributed processing, and Postgres for structured data management.

This role is central to enabling seamless integration of massive datasets from disparate sources (media, campaign, transaction, customer interaction, etc.), standardizing data, and providing reliable foundations for advanced econometric modeling and insights.

Responsibilities:

 

Data Pipeline Development & Orchestration
  o Design, build, and optimize scalable data pipelines in Apache NiFi to automate ingestion, cleansing, and enrichment from structured, semi-structured, and unstructured sources.
  o Ensure pipelines meet low-latency and high-throughput requirements for distributed processing.

Data Storage & Processing
  o Architect and manage datasets on HDFS to support high-volume, fault-tolerant storage.
  o Develop distributed processing workflows in PySpark and Hive to handle large-scale transformations, aggregations, and joins across petabyte-level datasets.
  o Implement partitioning, bucketing, and indexing strategies to optimize query performance.

Database Engineering & Management
  o Maintain and tune Postgres databases for high availability, integrity, and performance.
  o Write advanced SQL queries for ETL, analysis, and integration with downstream BI/analytics systems.

Collaboration & Integration
  o Partner with Data Scientists to deliver clean, reliable datasets for model training and MMO analysis.
  o Work with BI engineers to ensure data pipelines align with reporting and visualization requirements.

Monitoring & Reliability Engineering
  o Implement monitoring, logging, and alerting frameworks to track data pipeline health.
  o Troubleshoot and resolve issues in ingestion, transformations, and distributed jobs.

Data Governance & Compliance
  o Enforce standards for data quality, lineage, and security across systems.
  o Ensure compliance with internal governance and external regulations.

Documentation & Knowledge Transfer
  o Develop and maintain comprehensive technical documentation for pipelines, data models, and workflows.
  o Provide knowledge sharing and onboarding support for cross-functional teams.

 

Qualifications

  • Bachelor’s degree in Computer Science, Information Technology, or related field (Master’s preferred).

  • Proven experience as a Data Engineer with expertise in HDFS, Apache NiFi, Hive, PySpark, Postgres, Python, and SQL.

  • Strong background in ETL/ELT design, distributed processing, and relational database management.

  • Experience with on-premises big data ecosystems supporting distributed computing.

  • Solid debugging, optimization, and performance tuning skills.

  • Ability to work in agile environments, collaborating with multi-disciplinary teams.

  • Strong communication skills for cross-functional technical discussions.

Preferred Qualifications:

  • Familiarity with data governance frameworks, lineage tracking, and data cataloging tools.

  • Knowledge of security standards, encryption, and access control in on-premises environments.

  • Prior experience with Media Mix Modeling (MMM/MMO) or marketing analytics projects.

  • Exposure to workflow schedulers (Airflow, Oozie, or similar).

  • Proficiency in developing automation scripts and frameworks in Python for CI/CD of data pipelines.

Top Skills

Apache NiFi
HDFS
Hive
Postgres
PySpark
Python
SQL

The Company
HQ: Columbia, MD
390 Employees
Year Founded: 2016

What We Do

Our vision is to build a company of world-class people that helps our clients optimize business performance through data, technology, and analytics. Blend360 has two divisions:
  o Data Science Solutions: We work at the intersection of data, technology, and analytics.
  o Talent Solutions: We live and breathe the digital and talent marketplace.
