Senior Software Engineer - Retrieval-Augmented Generation (RAG)

Reposted 5 Hours Ago
4 Locations
In-Office or Remote
95K-180K Annually
Senior level
Artificial Intelligence • Healthtech • Information Technology • Other • Analytics
The Role
The role involves architecting and implementing production-scale retrieval-augmented generation systems, focusing on document retrieval and contextual response generation in healthcare. Responsibilities include developing APIs, optimizing workflows, ensuring security and compliance, and collaborating with data engineers on retrieval quality.
Summary Generated by Built In

Job title: Senior Software Engineer II – Retrieval-Augmented Generation (RAG) System

About the role, we are seeking an experienced engineer to work with a team to build and support a healthcare centered production-scale RAG system that combines document retrieval with response generation to deliver accurate, context-aware answers. This engineer we be expected to design, implement, and operate end-to-end RAG pipelines— LLM interaction, API creation, and high-performance, secure delivery of knowledge-grounded capabilities. You will collaborate with data engineers, platform teams, and product partners to ship reliable, scalable, and observable systems.

About the team;  This collaborative team is entrusted with building the Next Generation Health Solutions through the utilization of cutting-edge technology.

Role and responsibilities

  • Architecting, implementing, testing, and operating end-to-end RAG workflows:

  • Ingesting and normalizing documents from diverse sources

  • Generating and managing embeddings; index and query vector databases
    Retrieve relevant passages, apply reranking or fusion strategies, and feed prompts to LLMs

  • Building scalable, low-latency services and APIs (Python preferred; other languages acceptable) and ensure production-grade reliability (monitoring, tracing, alerting)

  • Integrating with vector databases and embedding pipelines  and optimize for latency, throughput, and cost

  • Designing and implementing ML Ops workflows: model/version management, experiments, feature stores, CI/CD for ML-enabled services, rollback plans

  • Developing robust data pipelines and governance around ingestion, provenance, quality checks, and access controls

  • Collaborating with data engineers to improve retrieval quality (embedding strategies, reranking, cross-encoder models, prompt engineering) and implement evaluation metrics (precision/recall, MRR, QA accuracy, user-centric metrics)

  • Implementing monitoring and observability for RAG components (latency, success rate, cache hit rate, retrieval quality, data drift)

  • Ensuring security, privacy, and compliance (authentication, authorization, data masking, PII handling, audit logging)

Required qualifications

  • 5+ years of professional software engineering experience designing and delivering production systems

  • Strong programming skills (Python required; NodeJs a plus)

  • Deep understanding of retrieval-augmented or application-scale NLP systems and practical experience building RAG-like pipelines

  • Hands-on experience with ML workflow tooling and MLOps concepts (model serving, versioning, experiments, feature stores, reproducibility)

  • Proficiency with cloud infrastructure and modern software practices (AWS/GCP/Azure; Docker; Kubernetes; CI/CD)

  • Strong problem-solving skills, excellent communication, and ability to work with cross-functional teams

  • Familiarity with data governance, privacy, and security best practices

Preferred qualifications

  • Experience with agentic workflow tools (LangGraph) and familiarity with prompt engineering for LLMs

  • Exposure to working with and evaluating different  LLMs

  • Knowledge of evaluation methodologies for retrieval and QA systems and the ability to set up A/B tests and dashboards

  • Experience with data processing frameworks (SQL, Pandas, Spark) and working with large-scale data pipelines

  • Background in performance optimization for low-latency AI services (MLflow)

  • Experience with monitoring and logging via New Relic, K9s, Portkey, etc

  • Experience with minimizing token usage and cost optimization

  • Comfortable with design and implementation of security controls for data-intensive AI systems

Elsevier is a renowned global information analytics company that primarily focuses on providing scientific, technical, and medical (STM) research content, tools, and services. It is one of the largest publishers of academic journals and scholarly literature in the world.

Elsevier operates in various domains, including science, technology, medicine, social sciences, and more. They publish a vast number of peer-reviewed journals covering a wide range of disciplines. These journals act as platforms for researchers and academics to share their findings and contribute to the advancement of knowledge in their respective fields.

U.S. National Base Pay Range: $95,300 - $158,800. Geographic differentials may apply in some locations to better reflect local market rates. If performed in New Jersey, the base pay range is $112,574 - $179,826. This job is eligible for an annual incentive bonus.

We know your well-being and happiness are key to a long and successful career. We are delighted to offer country specific benefits. Click here to access benefits specific to your location.

We are committed to providing a fair and accessible hiring process. If you have a disability or other need that requires accommodation or adjustment, please let us know by completing our Applicant Request Support Form or please contact 1-855-833-5120.

Criminals may pose as recruiters asking for money or personal information. We never request money or banking details from job applicants. Learn more about spotting and avoiding scams here.

Please read our Candidate Privacy Policy.

We are an equal opportunity employer: qualified applicants are considered for and treated during employment without regard to race, color, creed, religion, sex, national origin, citizenship status, disability status, protected veteran status, age, marital status, sexual orientation, gender identity, genetic information, or any other characteristic protected by law.

USA Job Seekers:

EEO Know Your Rights.

Top Skills

AWS
Azure
Ci/Cd
Docker
GCP
Kubernetes
Mlflow
Node.js
Pandas
Python
Spark
SQL
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
0 Employees
Year Founded: 1880

What We Do

Elsevier is a world-leading provider of information solutions that enhance the performance of science, health, and technology professionals, empowering them to make better decisions, and deliver better care.

Because informed decisions lead to better outcomes, Elsevier is a leader in information and analytics for customers across the global research and health ecosystems.

Elsevier helps researchers and healthcare professionals advance science and improve health outcomes for the benefit of society.

We do this by facilitating insights and critical decision-making for customers across the global research and health ecosystems.

Similar Jobs

Elsevier Logo Elsevier

Senior Software Engineer

Artificial Intelligence • Healthtech • Information Technology • Other • Analytics
In-Office or Remote
4 Locations
87K-163K Annually

Quantum Metric, Inc. Logo Quantum Metric, Inc.

Account Executive

eCommerce • Enterprise Web • Information Technology • Software • Database • Analytics • Business Intelligence
Remote
United States
426 Employees
140K-160K Annually

Dandy Logo Dandy

Senior Software Engineer

Computer Vision • Healthtech • Information Technology • Logistics • Machine Learning • Software • Manufacturing
Remote
USA
1800 Employees
201K-237K Annually

Dandy Logo Dandy

Software Engineering Manager

Computer Vision • Healthtech • Information Technology • Logistics • Machine Learning • Software • Manufacturing
Remote
USA
1800 Employees
201K-237K Annually

Similar Companies Hiring

Idler Thumbnail
Artificial Intelligence
San Francisco, California
6 Employees
Fairly Even Thumbnail
Software • Sales • Robotics • Other • Hospitality • Hardware
New York, NY
Bellagent Thumbnail
Artificial Intelligence • Machine Learning • Business Intelligence • Generative AI
Chicago, IL
20 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account