Lead Data Engineer (GenAI / LLM Applications)

Posted 21 Days Ago
Be an Early Applicant
Hiring Remotely in Bangalore, Bengaluru Urban, Karnataka, IND
In-Office or Remote
Senior level
Healthtech • Software
The Role
The role involves designing and maintaining scalable data architectures and pipelines, collaborating with teams to deliver data-driven solutions, and optimizing complex SQL across various databases.
Summary Generated by Built In

We are looking for a skilled and motivated Lead Engineer to join our Data Science and Delivery group at Clario, a part of Thermo Fisher Scientific. This role combines software development, data engineering, and analytical problem‑solving to design, build, and maintain scalable data platforms that support clinical trial operations and business intelligence. You will work across the full software development lifecycle (SDLC)—from requirements gathering through production support—collaborating closely with data scientists, analysts, product managers, and engineering teams to deliver high‑quality, data‑driven solutions.

What We Offer

  • Competitive compensation aligned with local market practices

  • Comprehensive health and wellness benefits

  • Paid time off and company holidays

  • Opportunities for professional development, learning, and career growth

  • The flexibility of working from Bangalore or remotely within India, while collaborating with global teams

What You’ll Be Doing

  • Design, develop, and maintain scalable software architectures and data pipelines that integrate with analytical and operational systems.

  • Write clean, reusable, and well‑tested Python code using frameworks such as Flask and related libraries.

  • Leverage AI‑assisted development tools, including GitHub Copilot and LangChain, to design, build, and integrate LLM‑powered solutions such as retrieval‑augmented generation (RAG) pipelines, intelligent agents, and automated workflows using AWS Bedrock or similar services.

  • Develop and optimize complex SQL across Oracle, MS SQL Server, PostgreSQL, and Snowflake, including procedures, functions, views, analytical functions, and dynamic SQL.

  • Design and implement ETL pipelines using Snowflake and related data processing technologies.

  • Implement scheduling and orchestration using Apache Airflow or similar workflow orchestration frameworks.

  • Establish and maintain data quality frameworks, versioning, and governance practices to ensure data reliability, integrity, and compliance.

  • Develop and maintain data architectures and models for both structured and unstructured data sources.

  • Troubleshoot production issues and drive continuous improvement in software quality, performance, and reliability.

  • Deploy, manage, and support solutions on AWS, including storage, compute, and pipeline services.

  • Create source‑to‑target mappings and support data and code migration initiatives.

  • Partner with stakeholders to gather requirements, translate business needs into technical solutions, and produce clear, well‑structured documentation.

  • Collaborate with product managers, analysts, and cross‑functional teams to deliver data‑driven insights and reporting using tools such as Plotly and Power BI.

What We Look For

  • Bachelor’s or higher degree in Computer Science, Information Technology, or a related technical field.

  • 5+ years of professional experience in software engineering, data engineering, or data‑focused development roles.

  • Strong proficiency in Python, including frameworks and libraries such as Django or Flask, pandas, NumPy, Plotly, and ag‑Grid.

  • Strong SQL expertise with Oracle, MS SQL Server, PostgreSQL, and/or Snowflake.

  • Proven experience writing complex SQL, including analytical and window functions, subqueries, all join types, DML/DDL/TCL statements, CASE expressions, and performance tuning.

  • Working knowledge of cloud platforms, with a preference for AWS (S3, EC2, Secrets Manager, Bedrock, Lambda).

  • Experience using AI‑assisted development tools and frameworks such as GitHub Copilot and LangChain for building LLM‑powered applications and workflows.

  • Experience with Git‑based version control systems and CI/CD pipelines.

  • Familiarity with data modeling concepts for both structured and unstructured data.

  • Strong analytical thinking, problem‑solving abilities, and communication skills.

  • Willingness to work across all phases of the SDLC, including requirements gathering, design, development, deployment, and production support.

  • Preferred experience includes exposure to the clinical trial lifecycle or clinical data management, data visualization tools (Plotly, Power BI), front‑end technologies (HTML5, CSS3, JavaScript), collaboration tools (Jira, Confluence, Microsoft Teams), and hands‑on data analysis or data cleansing using programming languages, SQL, and Excel.

At Clario, our purpose is to transform lives by unlocking better evidence. It’s a cause that unites and inspires us. It’s why we come to work—and how we empower our people to make a positive impact every day. Whether you’re starting your clinical data career or building long‑term expertise, your work helps bring life‑changing therapies to patients faster.

Skills Required

  • Bachelor's or higher degree in Computer Science, Information Technology, or related field
  • 5+ years of professional experience in software engineering or data engineering
  • Strong proficiency in Python, including frameworks and libraries like Flask and Django
  • Strong SQL expertise with Oracle, MS SQL Server, PostgreSQL, or Snowflake
  • Experience with Git-based version control systems and CI/CD pipelines

Clario Compensation & Benefits Highlights

The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about Clario and has not been reviewed or approved by Clario.

  • Leave & Time Off Breadth Time off is described as generous, including flexible PTO for some roles and up to 26–31 PTO days for eligible staff. The calendar includes 15 paid company holidays and a Winter Break.
  • Healthcare Strength Healthcare offerings include multiple medical plan options via Highmark with coordinated care navigation, plus dental and vision through established carriers. Employer HSA contributions and mental-health support broaden the package.
  • Inclusive Benefits Coverage Coverage highlights include domestic-partner eligibility, transgender-inclusive care, and HIV prevention/treatment. These inclusive elements are presented alongside broader inclusion commitments.

Clario Insights

Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Philadelphia, PA
6,733 Employees
Year Founded: 1972

What We Do

-- Clario has been named a Top Workplace by Energage for the 2022 Top Workplaces USA national awards. -- Clario generates the richest clinical evidence by fusing our deep scientific expertise and global scale into the broadest endpoint technology platform. By doing this, we empower our partners to transform lives. With almost 50 years of experience, 19,000 clinical trials, and 870 regulatory approvals, Clario has mastered the ability to generate rich evidence across a Trial Anywhere™ portfolio: decentralized (DCT), hybrid and site-based clinical trials. With 30 facilities in nine countries across North America, Europe, and Asia Pacific, Clario delivers the power of certainty. Partners ————— Clario brings the best of ERT and Bioclinica together to work alongside our partners to solve some of their biggest questions on topics such as: - eCOA vs. paper - Decentralized Clinical Trial (DCT) - Rescue a clinical trial - Broad endpoint technology: cardiac safety, imaging, respiratory And many more. People ———— We are so honoured to be named a 2022 Top Workplace by Energage. One of our leading values at Clario is People First Always. We help individuals build meaningful careers at Clario as they serve to help transform patients lives. Join us on this journey and check out our careers page: https://clario.com/careers/

Similar Jobs

Airwallex Logo Airwallex

Associate Account Executive

Artificial Intelligence • Fintech • Payments • Business Intelligence • Financial Services • Generative AI
In-Office or Remote
Bangalore, Bengaluru Urban, Karnataka, IND
2000 Employees

Airwallex Logo Airwallex

Associate Account Executive

Artificial Intelligence • Fintech • Payments • Business Intelligence • Financial Services • Generative AI
In-Office or Remote
Bangalore, Bengaluru Urban, Karnataka, IND
2000 Employees

Airwallex Logo Airwallex

Sales Representative

Artificial Intelligence • Fintech • Payments • Business Intelligence • Financial Services • Generative AI
In-Office or Remote
Bangalore, Bengaluru Urban, Karnataka, IND
2000 Employees

Airwallex Logo Airwallex

Associate Account Executive

Artificial Intelligence • Fintech • Payments • Business Intelligence • Financial Services • Generative AI
In-Office or Remote
Bangalore, Bengaluru Urban, Karnataka, IND
2000 Employees

Similar Companies Hiring

Fairly Even Thumbnail
Hardware • Other • Robotics • Sales • Software • Hospitality
New York, NY
30 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account