Staff Software Development Engineer (Data Engineer)

Posted 2 Days Ago
Be an Early Applicant
Bengaluru, Bengaluru Urban, Karnataka, IND
In-Office
Senior level
Consumer Web • Digital Media • News + Entertainment
A legacy of entertainment, now united as one. Welcome to JioStar - where stories and experiences are infinite!
The Role
Lead design and development of scalable hybrid retrieval architectures, build data pipelines, innovate retrieval techniques, and collaborate on database technologies for AI integration.
Summary Generated by Built In
About the Role: As a Staff Software Development Engineer in the video engineering team, you will be leading the charge in how we organize and retrieve the world's information. You will be the architect of the "brain" of our platform, solving the hardest problems in semantic search and graph-based reasoning at a scale few companies ever reach. This role offers a unique opportunity to define the data infrastructure strategy that powers our GenAI future. 
 
About the Team: 

You are a retrieval-oriented engineer with deep expertise in high-dimensional data, relational structures, and large-scale knowledge representation. You thrive on the challenge of bridging the gap between raw data and semantic understanding, building the backbone for next-generation AI and discovery systems. You are passionate about data topology, latent space optimization, and the performance tuning of complex query engines. You constantly strive to reduce "time-to-insight" and maximize the precision of information retrieval at scale.

The pace of our growth is incredible—if you want to tackle the foundational challenges of RAG (Retrieval-Augmented Generation), knowledge graphs, and semantic search at a global scale, join us!


Key Responsibilities:

  • Lead the design and development of hybrid retrieval architectures combining vector similarity search with structured graph traversals.

  • Architect scalable data pipelines for the ingestion, embedding, and indexing of massive, multi-modal datasets.

  • Innovate and prototype advanced retrieval techniques, including multi-stage re-ranking, graph-tooling for LLMs, and dynamic metadata filtering.

  • Design and implement schemas for complex knowledge graphs, ensuring high-performance relationship mapping and ontological integrity.

  • Build automated data validation and drift detection systems to monitor the quality of embeddings and the health of the vector space.

  • Drive technical implementation of "Memory" systems for AI agents, focusing on long-term persistence, observability, and sub-second latency.

  • Champion data organization standards, ensuring that disparate data sources are unified into a coherent, searchable knowledge base.

  • Collaborate with AI Research and Product teams to evaluate emerging database technologies (e.g., HNSW optimizations, GraphRAG) and integrate them into production.

Skills and Attributes for Success:

  • 7+ years of experience in data engineering or backend systems with a focus on high-performance data retrieval and storage.

  • BE/B.Tech in Computer Science, Mathematics, or equivalent. MS or PhD in a related field is a plus.

  • Expert proficiency in Python, Java, or Go, with a strong grasp of distributed system design patterns.

  • Deep understanding of Vector Databases, including indexing strategies (HNSW, IVFFlat, PQ) and distance metrics (Cosine, Euclidean, Dot Product). Experience with Pinecone, Milvus, Weaviate, or Qdrant.

  • Strong background in Graph Databases (Neo4j, AWS Neptune, or ArangoDB) and query languages like Cypher or Gremlin.

  • Experience with Data Modeling and organization, specifically in building semantic layers, ontologies, and taxonomies.

  • Hands-on experience with LLM orchestration frameworks (LangChain, LlamaIndex) and embedding models (OpenAI, HuggingFace, Cohere).

  • Proficiency in large-scale data processing using Spark, Flink, or Kafka for real-time indexing and ETL.

  • Understanding of Information Retrieval (IR) fundamentals, including BM25, TF-IDF, and reciprocal rank fusion.

  • Experience with cloud-native infrastructure (AWS/GCP/Azure) and container orchestration (Kubernetes).

Preferred Education and Experience:

  • Bachelors/master's in computer science or a related field with 7-9 years of professional experience

Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Thane, Maharashtra
809 Employees
Year Founded: 2010

What We Do

A legacy of entertainment, now united as one. Welcome to JioStar - where stories and experiences are infinite!

Similar Jobs

JioStar Logo JioStar

Development Engineer

Consumer Web • Digital Media • News + Entertainment
In-Office
Bengaluru, Bengaluru Urban, Karnataka, IND
809 Employees

TransUnion Logo TransUnion

Sr Analyst, Data Operations

Big Data • Fintech • Information Technology • Business Intelligence • Financial Services • Cybersecurity • Big Data Analytics
Hybrid
2 Locations
13000 Employees

ServiceNow Logo ServiceNow

Sales Manager

Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Hybrid
Bangalore, Bengaluru Urban, Karnataka, IND
28000 Employees

ServiceNow Logo ServiceNow

Development Manager

Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Remote or Hybrid
Bangalore, Bengaluru Urban, Karnataka, IND
28000 Employees

Similar Companies Hiring

bet365 Thumbnail
Digital Media • Gaming • Software • Esports • Automation
Denver, Colorado
9000 Employees
Hedra Thumbnail
Software • News + Entertainment • Marketing Tech • Generative AI • Enterprise Web • Digital Media • Consumer Web
San Francisco, CA
14 Employees
Golden Pet Brands Thumbnail
Digital Media • eCommerce • Information Technology • Marketing Tech • Pet • Retail • Social Media
El Segundo, California
178 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account