Data Engineer

Posted 5 Days Ago
Be an Early Applicant
Lahore, Punjab
In-Office
Mid level
Software
The Role
Design and build data pipelines, manage financial datasets, and ensure data quality for machine learning forecasting models.
Summary Generated by Built In
About CodeNinja

CodeNinja is a full-stack AI delivery company that helps enterprises, governments, and software acquirers build and operate intelligence-driven systems for mission-critical workflows. We specialize in deploying AI into real operations—combining strong engineering fundamentals with AI-native delivery to create measurable value, resilience, and long-term ownership for our clients. Our global footprint and delivery model are supported by AI Labs, AI Pods, and Global Capability Centers, enabling teams to co-engineer scalable platforms across regions and time zones.

Role Overview

We are seeking a skilled Data Engineer with 3+ years of experience to design and build robust, scalable data pipelines supporting financial machine learning forecasting models.

You will be responsible for ingesting, cleaning, validating, and structuring complex multi-source financial datasets to enable high-quality time-series model training and analytics.

This role requires strong technical expertise in ETL/ELT pipelines, financial data processing, and data quality assurance within secure corporate environments.

Key Responsibilities
  • Design, develop, and maintain automated ETL/ELT pipelines from CSV/Excel exports within secure enterprise environments.
  • Perform data cleaning, normalization, validation, and integrity checks on financial transaction datasets.
  • Execute entity mapping, currency standardization, and data synthesis across multiple legal entities and G/L accounts.
  • Build exploratory data analysis (EDA) pipelines, including statistical analysis of financial flow patterns and seasonality.
  • Develop feature stores with pre-computed lag and rolling statistics for ML forecasting consumption.
  • Ensure high data quality through validation frameworks and automated integrity checks.
  • Document data pipelines, quality reports, and technical handoff materials for ML Engineering teams.
  • Collaborate closely with ML engineers, domain subject matter experts, and stakeholders.

Requirements
  • 3+ years of hands-on experience building production-grade data engineering pipelines.
  • Strong Python expertise (Pandas, NumPy) and SQL proficiency.
  • Experience using Jupyter notebooks for EDA and reporting.
  • Proven experience handling complex financial datasets (transactions, general ledger, settlements, multi-currency data).
  • Strong knowledge of data validation frameworks and quality assurance methodologies.
  • Experience parsing and processing large CSV/Excel datasets at scale.
  • Familiarity with corporate network security protocols and access control environments.
  • Bachelor’s or Master’s degree in Computer Science, Data Engineering, or a related field.
Nice to Have
  • Financial domain experience (intercompany transactions, treasury operations).
  • Experience preparing time-series datasets for ML forecasting models.
  • Exposure to orchestration tools such as Airflow, Luigi, or Prefect.
  • Experience working with cloud data platforms (AWS S3, GCP BigQuery, Azure Data Lake).
Why Join CodeNinja?
  • Work on cutting-edge AI and financial forecasting solutions.
  • Collaborate with high-performing engineers and AI specialists.
  • Exposure to enterprise-grade secure environments and global clients.
  • Opportunity to contribute to impactful, data-driven transformation initiatives.
  • A culture that values ownership, growth, and continuous learning.
  • Competitive compensation and career progression opportunities.
Equal Opportunity & Disclaimer

CodeNinja is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. All employment decisions are made based on business needs, job requirements, and individual qualifications, without regard to race, color, religion, gender, national origin, disability status, or any other characteristic protected by applicable laws.

Only shortlisted candidates will be contacted. CodeNinja reserves the right to modify the job description based on business requirements.


Benefits
  • Provident Fund
  • Gym Membership
  • Leaves as per the company policy
  • Company-paid trips
  • Easy Loan Facility for Employees
  • Yearly increment
  • Maternity Benefits (Leaves & WFH)
  • Health Insurance (Maternity covered) – includes spouse and parents (till age 80)

Top Skills

Airflow
Aws S3
Azure Data Lake
Gcp Bigquery
Jupyter
Numpy
Pandas
Python
SQL
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
Noida, Uttar Pradesh
175 Employees
Year Founded: 2014

What We Do

A digital transformation company, driven to solve some of the most complex problems in the world today through technology. Our core service portfolio includes staff augmentation, software development and cloud services with over 300+ projects delivered to date. Established in 2014, with a continuously growing team of 250+ experts, operating across 3 regions with Americas, Middle East, and Southeast Asia, CodeNinja is one of the fastest growing technology companies recognized by Forbes Technology Council. With value addition being a core focus for the company, CodeNinja ranks as one of the top and best reviewed software development and services company. Now offering enterprise cloud solutions to some of the leading technology companies in the world, the company has secured funding of $1.6M and is poised to invest in redefining workplace dynamics through use of advanced AI. CodeNinja has earned the trust of more than 240 clients spread across 15 different countries and 9 different industries. The company is also one of the top-rated outsourcing services provider in Pakistan, forming dedicated technology teams for some of the world's leading organizations like Microsoft, Lifeforce (Tony Robbins) and OTL alongside unicorns such as 24Seven, ABHI, and Graana. In addition, to ensure quality and commitment to client data security, the company is now ISO 270001 certified. Services: · Offshore Engineering Teams · Custom Software Development · Dedicated Development Center · Application Modernizations · Cloud Services and Solutions · Modern Workplace Solutions · Digital Transformation Strategy · AI Consulting · AR/VR and Digital Twins · eCommerce Solutions Awards and Achievements: Top 1000 Companies Clutch Global 2023 Top Company in .Net Developers 2023 Clutch Global Fall 2023 Certifications & Partnerships: Adobe Commerce Cloud Solution Partner Microsoft Solution partner for business applications. Microsoft Solution Partner for Azure Data and AI.

Similar Jobs

Strategic Systems International Logo Strategic Systems International

Data Engineer

Professional Services • Software
In-Office
Lahore, Punjab, PAK
200 Employees

Strategic Systems International Logo Strategic Systems International

Data Engineer

Professional Services • Software
In-Office
Lahore, Punjab, PAK
200 Employees

Strategic Systems International Logo Strategic Systems International

Data Engineer

Professional Services • Software
In-Office
Lahore, Punjab, PAK
200 Employees

Northbay Logo Northbay

Senior Data Engineer

Software • Database • Analytics
In-Office or Remote
2 Locations
324 Employees
6-10 Annually

Similar Companies Hiring

Scotch Thumbnail
Software • Retail • Payments • Fintech • eCommerce • Artificial Intelligence • Analytics
US
25 Employees
Milestone Systems Thumbnail
Software • Security • Other • Big Data Analytics • Artificial Intelligence • Analytics
Lake Oswego, OR
1500 Employees
Fairly Even Thumbnail
Software • Sales • Robotics • Other • Hospitality • Hardware
New York, NY

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account