Senior Data Architect - AI-Powered Data Platforms

Bengaluru, Bengaluru Urban, Karnataka
In-Office
Senior level
Energy • Manufacturing • Solar • Renewable Energy
GE Vernova is accelerating the path to more reliable, affordable, and sustainable energy.
The Role
As a Data Architect, you'll transform enterprise data platforms, focusing on AI integration with structured and unstructured data, while overseeing RAG architecture and optimization of data processing systems.
Summary Generated by Built In
Job Description Summary

DESCRIPTION
We are seeking an experienced Data Architect who specializes in modernizing enterprise data platforms for the AI era. This role requires someone who deeply understands both traditional data architectures and the emerging requirements of AI systems, with expertise in bridging existing data lakes to support modern AI capabilities like RAG (Retrieval-Augmented Generation), vector search, and multi-modal AI applications. You'll be the architect who transforms our wealth of structured and unstructured data assets into AI-ready infrastructure.
The ideal candidate will have 10+ years of experience with enterprise data platforms and proven expertise in handling both structured and unstructured data at scale. You understand the complexities of existing data lake architectures and can architect the evolution path to support AI workloads without disrupting current operations.
As a GE Vernova accelerator, GE Vernova Advanced Research is driving strategy and leading research & development efforts to execute on the business's mission to help power the energy transition. We forge the collaborations and help invent the technologies required to electrify and decarbonize for a zero-carbon future.
Representing virtually every major scientific and engineering discipline, our researchers are collaborating with GE Vernova's businesses, the U.S. government, and more than 420 entities at the forefront of technology to execute on 150+ energy-focused projects. Collectively, these research programs and initiatives aim to solve near-term technical challenges, deliver next-generation product advances, and drive long-term breakthrough innovation to enable more affordable, reliable, sustainable, and secure energy.

Job Description

Key job responsibilities

Unstructured Data & AI Enablement

  • Design scalable architectures for processing and indexing unstructured data (PDFs, documents, emails, logs, images) for AI consumption 
  • Architect document processing pipelines that leverage multi-modal LLMs (GPT-4V, Claude, Gemini) for direct document understanding without traditional OCR preprocessing 
  • Implement intelligent document extraction using LLMs' native vision and context capabilities to handle complex layouts, tables, and mixed media 
  • Design metadata extraction and enrichment pipelines that enhance discoverability of unstructured assets 
  • Build architectures for multi-modal AI applications that combine structured and unstructured data sources

RAG & Knowledge Platform Architecture

  • Design end-to-end RAG architectures that leverage existing data lakes and enterprise knowledge bases 
  • Architect hybrid search systems combining traditional keyword search with semantic/vector search capabilities 
  • Implement chunking strategies and embedding pipelines for diverse document types and data sources
  • Build architectures for continuous learning where RAG systems are updated with new data in near real-time
  • Design security and access control models that work across legacy systems and modern AI platforms 
  • Create data governance frameworks that ensure compliance while enabling AI innovation

Platform Optimization & Scale

  • Optimize storage strategies for cost-effective management of structured and unstructured data 
  • Design tiered storage architectures that balance performance needs with storage costs 
  • Implement caching layers for frequently accessed embeddings and AI model inputs

QUALIFICATIONS

  • Bachelor's degree in Computer Science, Information Systems, or related field
  • 10+ years of experience as a Data Architect, Data Platform Engineer, or similar role with enterprise data systems 
  • 5+ years of experience working with both structured data (SQL databases, data warehouses) and unstructured data (documents, logs, multimedia)
  • Understanding of modern document processing using multi-modal LLMs and traditional extraction methods 
  • Proficiency in Python and SQL, with experience in data processing libraries
  • Must be willing to work out of the office located at the JFWTC Campus in Bangalore
  • You must submit your application for employment on the careers page at www.careers.gevernova.com to be considered.

PREFERRED QUALIFICATIONS

  • 12+ years of experience modernizing legacy data architectures for cloud and AI workloads
  • Deep expertise in unstructured data processing using both multi-modal LLMs and traditional methods
  • Experience with multi-modal LLMs for document understanding and their cost/performance trade-offs
  • Background in information retrieval, search engineering, or content management systems 
  • Experience with multi-modal AI architectures combining text, image, and structured data
  • Master's degree in Computer Science, Information Systems, or related field

Technical Stack

Document Processing: Multi-modal LLMs (GPT-4V, Claude Vision, Gemini), LlamaParse, Unstructured.io, Azure Document Intelligence, AWS Textract (for legacy/high-volume), direct PDF-to-context pipelines

Vector/Search: Pinecone, Weaviate, pgvector

Lake Technologies: AWS S3, Azure ADLS

Languages: Python, SQL, Scala, Java 

APIs: OpenAI, Anthropic, Google Vertex AI, AWS Bedrock, Azure OpenAI

Additional Information

Relocation Assistance Provided: Yes

Top Skills

Anthropic
AWS Bedrock
AWS S3
AWS Textract
Azure ADLS
Azure Document Intelligence
Azure OpenAI
Google Vertex AI
Java
Multi-Modal LLMs
OpenAI
pgvector
Pinecone
Python
Scala
SQL
Weaviate

The Company
HQ: Cambridge, MA
75,000 Employees
Year Founded: 2024

What We Do

GE Vernova is a planned purpose-built company on a mission to electrify the planet while simultaneously working to decarbonize it.

If we want our energy future to be different…we must be different.

Our mission is embedded in our name. We retain our treasured legacy, “GE,” in our name as an enduring and hard-earned badge of quality and ingenuity. “Ver” / “verde” signal Earth’s verdant and lush ecosystems. “Nova,” from the Latin “novus,” nods to a new, innovative era of lower carbon energy that GE Vernova will help deliver.

GE Vernova brings together GE’s portfolio of energy businesses including Power, Wind, Electrification and Digital businesses. With focus, GE Vernova is accelerating the path to more reliable, affordable, and sustainable energy, while helping our customers power economies and deliver the electricity that is vital to health, safety, security, and improved quality of life.

Together, we have The Energy to Change the World.

Why Work With Us

Join our team to evolve and grow, surrounded by some of the brightest minds in the industry, who help you get better every day. You’ll get the chance to rewrite the rules, work on cutting-edge technology, and be part of a global team for positive change.
