Senior Data / RAG Engineer

Posted 6 Days Ago
Chennai, Tamil Nadu
In-Office
8-10 Annually
Senior level
Software
The Role
The Senior Data Engineer will design and manage RAG Vector Databases, modernize data ingestion pipelines, ensure data synchronization, and collaborate across teams for AI insights.

Banyan Software provides the best permanent home for successful enterprise software companies, their employees, and customers. We are on a mission to acquire, build and grow great enterprise software businesses all over the world that have dominant positions in niche vertical markets. In recent years, Banyan was named the #1 fastest-growing private software company in the US on the Inc. 5000 and amongst the top 10 fastest-growing companies by the Deloitte Technology Fast 500. Founded in 2016 with a permanent capital base set up to preserve the legacy of founders, Banyan focuses on a buy-and-hold-for-life strategy for growing software companies that serve specialized vertical markets.

Role Overview:
We’re looking for a Senior Data Engineer with deep expertise in RAG (Retrieval-Augmented Generation) and Vector Database design to build and manage the knowledge backbone for AI compliance and insights. This role focuses on modernizing archival data ingestion and enabling real-time contextual retrieval for AI-driven systems.

Key Responsibilities:

  • Design and implement RAG vector databases (e.g., OpenSearch, Pinecone) populated from archival data in S3 / Glacier, with overall data management in MS SQL Server.
  • Modernize existing data ingestion pipelines, replacing legacy OCR-based processes with scalable ETL/ELT frameworks.
  • Ensure data synchronization and consistency between RDS (MS SQL Server) and the vector database for real-time AI context.
  • Collaborate with AI, backend, and infrastructure teams to optimize retrieval performance and model access.
  • Drive data integrity, schema evolution, and compliance readiness across systems.

Required Skills & Experience:

  • Proven expertise in data engineering pipelines (Kafka / MSK, ETL / ELT).
  • Hands-on experience with Vector Databases and RAG implementations (OpenSearch, Pinecone, FAISS, Chroma).
  • Strong proficiency in SQL, data modeling, and Python / C# / Go.
  • Experience with AWS data ecosystem (S3, RDS, Glue, Lambda and related technologies).
  • 8–10 years of experience in data engineering or AI data platforms.

Diversity, Equity, Inclusion & Equal Employment Opportunity at Banyan: Banyan affirms that inequality is detrimental to our Global Teams, associates, our Operating Companies, and the communities we serve. As a collective, our goal is to impact lasting change through our actions. Together, we unite for equality and equity. Banyan is committed to equal employment opportunities regardless of any protected characteristic, including race, color, genetic information, creed, national origin, religion, sex, affectional or sexual orientation, gender identity or expression, lawful alien status, ancestry, age, marital status, or protected veteran status and will not discriminate against anyone on the basis of a disability. We support an inclusive workplace where associates excel based on personal merit, qualifications, experience, ability, and job performance.


Beware of Recruitment Scams

We have been made aware of individuals fraudulently posing as members of our Talent Acquisition team and extending fake job offers. These scams may involve requests for personal information or payment for equipment. 

Protect yourself by following these steps:

  • Verify that all communications from our recruiting team come from an @banyansoftware.com email address.
  • Remember, employers will never request payment or banking information during the hiring process.
  • If you receive a suspicious message, do not respond — instead, forward it to [email protected] and/or report it to the platform where you received it.

Your safety and security are important to us. Thank you for staying vigilant.

Top Skills

AWS
C#
ELT
ETL
Glue
Go
Kafka
Lambda
MS SQL Server
OpenSearch
Pinecone
Python
RAG
RDS
S3
SQL
Vector Database Design

The Company
HQ: Atlanta, GA
118 Employees
Year Founded: 2016

What We Do

Banyan Software provides the best permanent home for successful enterprise software businesses, their employees, and customers to preserve the legacy of founders, while helping grow the business into the future.

We are on a mission to acquire, build and grow great software businesses that have dominant positions in niche markets all over the world. Today Banyan has over 750 employees throughout the US, Canada, UK, Europe, Australia and New Zealand. Founded in 2016 with permanent capital to preserve the legacy of founders, Banyan focuses on a buy, hold and grow for life strategy. For more information on Banyan Software, Inc. visit: http://www.banyansoftware.com

What We Look For:
- Great enterprise software businesses that have dominant positions in niche markets
- We work with owners who are thinking about an exit today or further down the road
- We are flexible and can be creative when we find a business that is a good fit
- The businesses in the Banyan family all share a similar profile:
  - Annual revenues in excess of $2M-$30M
  - A high percentage of recurring revenue
  - Positive operating margins and cash flow
  - High customer retention and satisfaction
  - Happy and committed employees
