Responsibilities
Direct Responsibilities
· Design & develop robust ingestion, transformation, and enrichment pipelines with Python, PySpark, and SQL.
- Write and optimize complex SQL queries, analytical UDFs, and window functions for data aggregation and reporting.
- Collaborate with CEFS data architects, data scientists, and business analysts to translate functional requirements into technical specifications.
- Unit‑test, integrate‑test, and review code.
- Maintain CI/CD pipelines (Git, Jenkins, Docker) for automated build, test, and deployment of jobs.
- Monitor production workloads and troubleshoot performance bottlenecks, memory issues, and job failures.
- Document data lineage, pipeline design, and operational run‑books in Confluence/SharePoint.
· Keep up to date with latest technologies and trends and provide input, expertise and recommendations.
Contributing Responsibilities
· Contribute towards innovation (e.g. AI/ML); suggest new technical practices for efficiency improvement.
- Participate in Agile ceremonies (sprint planning, daily standups, retrospectives) and help groom the backlogs.
- Mentor junior engineers and champion best practices in Python coding, Spark optimization, and data‑engineering patterns.
- Evaluate emerging technologies and deliver proof‑of‑concepts for CEFS.
Technical & Behavioral Competencies
· Resourceful to quickly understand complexities involved and provide the way forward.
· Good experience in technical analysis of n-tier applications with multiple integrations using object oriented, APIs & Microservices approaches.
· Strong knowledge about design patterns and development principles.
· Inclination and prior experience of working across SQL, Python and ETL.
· Strong Hands-on experience - Python (NumPy, pandas, Python Frameworks, Restful APIs, MS-SQL or Oracle.
· PySpark - DataFrames, Spark SQL, Structured Streaming, performance tuning (partitioning, caching, broadcast joins).
· Advanced SQL – complex queries, stored procedures, query optimization.
· Good Knowledge and experience to use Python packages such as Pandas, NumPy, etc. for cleaning up of Data, Data Wrangling, Analysis of Data and Visualization of Data.
· Good experience in development and maintenance of code/scripts in both functional and technical specifications of all applications component, bug fixing and production support.
· Good knowledge on Linux/Unix environment (basic commands, shell scripting, etc.), testing phases, documentation and new framework.
· Some experience of working with build tools like Maven & DevOps tools like Bitbucket, Jenkins.
· Knowledge of Agile, Scrum, DevOps.
· Development experience in Data Engineering environment.
· Ability & willingness to learn & work on diverse technologies (languages, frameworks, and tools).
· Self-motivated, good interpersonal skills and inclination to constantly upgrade new technologies and frameworks.
· Good communication and co-ordination skills.
Specific Qualifications:
· Good to have knowledge of front-end technologies preferably Flask.
Skills Referential (Required knowledge, skills and abilities)
Technical Skills:
- Python
- Pyspark
- SQL
- ETL
Behavioral Skills:
· Ability to synthetize / simplify
· Ability to collaborate / Teamwork
· Attention to detail / rigor
· Ability to deliver / Results driven
Education Level: Bachelor’s degree or equivalent
Location: Chennai
Skills Required
- Design and develop ingestion, transformation, and enrichment pipelines with Python, PySpark, and SQL.
- Write and optimize complex SQL queries, analytical UDFs, and window functions.
- Unit-test, integration-test, and perform code reviews.
- Maintain CI/CD pipelines (Git, Jenkins, Docker) for automated build, test, and deployment.
- Monitor production workloads and troubleshoot performance bottlenecks, memory issues, and job failures.
- Document data lineage, pipeline design, and operational run-books in Confluence/SharePoint.
- Strong hands-on experience with Python (NumPy, pandas, Python frameworks) and building RESTful APIs; experience with MS-SQL or Oracle.
- PySpark expertise: DataFrames, Spark SQL, Structured Streaming, and performance tuning (partitioning, caching, broadcast joins).
- Advanced SQL skills: complex queries, stored procedures, query optimization.
- Good knowledge of Linux/Unix environment and shell scripting.
- Experience with build and DevOps tools such as Maven, Bitbucket, Jenkins, Docker, and Git.
- Knowledge of Agile, Scrum, and DevOps practices.
- Development experience in a Data Engineering environment and production support.
- Bachelor's degree or equivalent.
- Knowledge of front-end technologies, preferably Flask.
What We Do
Infotel UK Consulting: Shaping technical innovation, digital transformation and data management We are at the forefront of technical innovation and digital transformation, providing data management systems, process improvement applications, software design, systems migration, and systems integration. We are also a software vendor, with our GDPR compliance software—Deepeo. We work with clients across numerous business sectors, including automotive, the financial sector, the public sector, and supply chain and logistics. Infotel UK is a subsidiary of The Infotel Conseil Group. Infotel UK was established in 2015 and is headquartered in Newcastle upon Tyne. With the growth of our UK business, we invested in a purpose-built innovation lab, allowing us to further support businesses with their IT requirements. Being part of a global group gives us access to 20 service centres throughout Europe and the USA, making us the perfect local partner to team up with.








