Principal Engineer - RAG Database & Embeddings Architect

Posted Yesterday
Charlotte, NC, USA
In-Office
Expert/Leader
Big Data • Fintech • Mobile • Payments • Financial Services • Data Privacy
The Role
Lead design and governance of enterprise Retrieval-Augmented Generation data stores and retrieval architecture. Define embedding strategies, vectorization workflows, indexing, metadata models, hybrid retrieval, relevance tuning, and observability. Evaluate vector DB platforms, set standards and guardrails, partner with pipeline and context engineers, mentor teams, and balance innovation with security, scalability, and operational reliability.

Job Description:

At Bank of America, we are guided by a common purpose to help make financial lives better through the power of every connection. We do this by driving Responsible Growth and delivering for our clients, teammates, communities and shareholders every day.
Being a Great Place to Work and providing a culture of caring is core to how we drive Responsible Growth. We are intentional about fostering an inclusive workplace where every teammate has the opportunity to succeed, build a career and contribute to our shared success. This includes attracting and developing exceptional talent, recognizing and rewarding performance, and supporting our teammates’ physical, emotional, and financial wellness through affordable, competitive and flexible benefits.
We value the unique perspectives individuals bring from all backgrounds and career paths - whether shaped by military service, community college education, or a wide range of work and life experiences. These journeys foster resilience, leadership and innovation, strengthening our workforce and positively impacting the communities we serve.
Bank of America is committed to an in-office culture that supports collaboration, engagement, and career development. Our approach includes clear in-office expectations, while providing an appropriate level of flexibility based on role-specific responsibilities and business needs.
At Bank of America, you can build a successful career with opportunities to learn, grow, and make an impact. Join us!

This job is responsible for defining and leading the engineering approach for solutions at the program or portfolio level, to deliver significant business outcomes. Key responsibilities include continuously improving the design, quality, and reuse of the solution and delivering technology enablers that improve development efficiencies for the solution. Job expectations include familiarity with at least one area of engineering, acting as a “go to” reference across the organization, and applying knowledge to improve technical competencies through recruitment and development activities.

Developer Experience (DevEx) provides enterprise technical standards and common technical services, platforms, and tools that are leveraged by delivery teams across all lines of business. Within the Software Delivery Lifecycle (SDLC) program, this role leads portfolio product delivery strategy and execution for enterprise software delivery capabilities, ensuring the right investments, operating model, governance, and prioritization are in place to improve how internal technical users build, test, and deliver software at scale.

The RAG Database & Embeddings Architect is responsible for designing, building, and governing the vector database and retrieval architecture that powers enterprise Retrieval-Augmented Generation systems. This role focuses on embeddings, vectorization strategies, semantic search, indexing, metadata modeling, hybrid retrieval, relevance tuning, and performance optimization.

This engineer will define how enterprise knowledge is represented, stored, retrieved, ranked, and refreshed for use by LLM-powered applications.

Responsibilities:

  • Develops the engineering approach for the entire program/portfolio solution and works with Architecture to develop, analyze, and deliver the implementation of technical enablers
  • Leads the planning, definition, and design of complex features that span multiple teams and explores solution alternatives
  • Generates ideas for designing complex technology and solution development approaches
  • Leads the technical oversight for teams in solution development, including design and code reviews within own domain
  • Defines the technology tool stack for the solution within the range of internally approved and supported technologies
  • Explores state-of-the-art technologies to improve development efficiencies, quality of test/QA coverage, and release management
  • Leads and is responsible for the end-to-end test strategy/creation/adherence, and the integration between teams for a program/portfolio solution
  • Improve the experience for our developers, making it easier to deliver industry-leading solutions, while managing work efficiently and with the right controls
  • Advance our technology platforms through innovation
  • Reduce risk and improve quality across our technology portfolio by aligning to a single enterprise architecture strategy and delivering governance that enables consistency, integration and automation
  • Design and own the architecture for enterprise RAG data stores, including vector databases, document stores, metadata stores, and hybrid search layers.
  • Define embedding strategies across structured, semi-structured, and unstructured content.
  • Evaluate and select embedding models based on accuracy, latency, cost, domain fit, multilingual needs, and operational constraints.
  • Design vectorization workflows including chunking, embedding generation, indexing, versioning, and re-embedding lifecycle management.
  • Implement semantic, keyword, metadata-filtered, and hybrid retrieval patterns.
  • Optimize retrieval quality using similarity metrics, reranking, query expansion, metadata boosting, and relevance feedback.
  • Establish standards for vector schema design, namespace strategy, document lineage, source attribution, and access-control-aware retrieval.
  • Partner with data pipeline engineers to ensure ingestion processes produce high-quality, retrievable content.
  • Partner with context engineers to tune retrieval outputs for downstream LLM consumption.
  • Define observability for retrieval quality, including recall, precision, latency, cost, freshness, and hallucination risk indicators.
  • Lead technical evaluation of vector database platforms and retrieval frameworks.
  • Provide engineering leadership, design reviews, mentoring, and architectural guidance across AI platform teams.
  • Serve as a senior technical authority for enterprise AI platform engineering.
  • Own architecture decisions that impact multiple teams, systems, or domains.
  • Create reusable patterns, reference architectures, standards, and engineering guardrails.
  • Mentor senior engineers and influence technical direction without requiring direct reporting authority.
  • Balance innovation with operational reliability, security, compliance, scalability, and cost management.
  • Communicate complex AI and data engineering concepts clearly to engineering, product, risk, security, and executive stakeholders.

Required Qualifications:

  • 10+ years of software engineering, data engineering, platform engineering, or AI engineering experience.
  • 5+ years designing large-scale enterprise systems.
  • 2+ years working with LLM, RAG, vector search, semantic search, or AI platform capabilities.
  • Experience operating systems in regulated, security-conscious, or enterprise-scale environments.
  • Extensive experience designing production-grade search, indexing, or database systems.
  • Strong understanding of vector databases, embeddings, similarity search, approximate nearest neighbor algorithms, and retrieval optimization.
  • Experience with RAG architectures and enterprise-scale knowledge retrieval.
  • Hands-on experience with platforms such as Azure AI Search, Cosmos DB vector search, Pinecone, Weaviate, Milvus, OpenSearch, Elasticsearch, PostgreSQL/pgvector, or equivalent.
  • Experience with embedding models from providers such as OpenAI, Azure OpenAI, Cohere, Hugging Face, or open-source model ecosystems.
  • Strong background in distributed systems, database design, API design, and performance tuning.
  • Experience designing metadata models, access control filtering, document provenance, and auditability.
  • Ability to define engineering patterns and standards used across multiple teams.
  • Proven ability to lead architecture across multiple engineering teams.
  • Strong written and verbal communication skills.
  • Bachelor’s degree in Computer Science, Engineering, Information Systems, Applied Mathematics, or a related technical field.

Desired Qualifications:

  • Experience with hybrid retrieval, reranking models, knowledge graphs, or entity-aware retrieval.
  • Experience supporting regulated or enterprise environments with security, compliance, lineage, and privacy requirements.
  • Experience with LLM evaluation, retrieval evaluation, and automated relevance testing.
  • Familiarity with model drift, embedding drift, and re-indexing strategies.
  • High-quality retrieval architecture that improves LLM answer accuracy and reduces hallucinations.
  • Scalable vector database strategy supporting multiple enterprise domains.
  • Clear standards for embeddings, metadata, indexing, and retrieval evaluation.
  • Measurable improvements in retrieval precision, recall, latency, and cost efficiency.
  • Enterprise architecture
  • Distributed systems design
  • AI platform engineering
  • Data governance and security
  • Cloud-native engineering
  • Observability and operational excellence
  • Technical strategy and roadmap development
  • Cross-functional influence
  • Vendor and platform evaluation
  • Production support and continuous improvement

Skills:

  • Automation
  • Influence
  • Result Orientation
  • Stakeholder Management
  • Technical Strategy Development
  • Application Development
  • Architecture
  • Business Acumen
  • Risk Management
  • Solution Design
  • Agile Practices
  • Analytical Thinking
  • Collaboration
  • Data Management
  • Solution Delivery Process

Shift:

1st shift (United States of America)

Hours Per Week: 

40

Bank of America Compensation & Benefits Highlights

The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about Bank of America and has not been reviewed or approved by Bank of America.

  • Fair & Transparent Compensation: The $25/hour U.S. minimum wage, reaffirmed in recent company materials, sets a clear compensation floor that lifts entry-level and operations pay. Public salary information and disclosures provide visible benchmarks for pay across roles.
  • Parental & Family Support: Parental leave extends up to 26 weeks, with 16 weeks fully paid for eligible teammates, alongside backup child and adult care and a dedicated Life Event Services team. Family-building assistance offers up to a $20,000 lifetime reimbursement, and bereavement leave provides 20 paid days for loss of a spouse, partner, or child.
  • Retirement Support: Retirement programs include a 401(k) match up to 5% of eligible pay plus an additional 2–3% annual company contribution based on service. These employer contributions add meaningful long-term value beyond base pay.

The Company
HQ: Charlotte, NC
208,000 Employees
Year Founded: 1784

What We Do

We make financial lives better for our clients and our communities through the power of every connection. Our employees are at the heart of this purpose, and are key to driving responsible growth. Every day, across the globe, our employees bring a commitment to our purpose and to driving responsible growth by living our values: deliver together, act responsibly, realize the power of our people and trust the team. A key aspect of driving responsible growth is doing so in a sustainable manner, a critical pillar of which is being a great place to work for our teammates.
