Multimodal Embeddings for Telecom Images

Sorry, this job was removed at 02:12 p.m. (CST) on Monday, Jan 19, 2026
Massy, Palaiseau, Essonne, Île-de-France, FRA
In-Office
Cloud • Information Technology • Internet of Things • Machine Learning • Software • Cybersecurity • Infrastructure as a Service (IaaS)
We are shaping the future of digital connectivity the world relies on.
The Role
Multimodal Embeddings and Tokenization for Telecom-Oriented Images
Ericsson is building a new R&D team in Massy, France, covering many cutting-edge technologies such as AI/ML, cloud, and 5G advanced/6G technologies.
Within this new lab, the Standards & Technology unit is part of Development Unit Networks' global Standards & Technology organization.
At Development Unit Networks, Standards & Technology secures technology leadership in Radio Access Networks (RAN) by actively driving New Concepts, Standardization, Software and Hardware Research, Architecture, and Testbeds.
Technical focus includes 5G evolution, IoT, Digital Twins, Automation / Machine Learning, and Security.
We are looking for motivated interns to join this young and talented research team and contribute to our research activities on Generative AI and Multimodal Intelligence for Telecom Systems.
Context
Modern telecommunication research and engineering rely on a variety of technical images and diagrams, including network architectures, protocol sequence charts, RF signal plots, coverage maps, and KPI dashboards.
These visual representations are essential for understanding system design and performance, yet they remain poorly captured by existing vision-language models (VLMs), which are trained mostly on natural images.
Generic models such as CLIP, Qwen2-VL, or LLaVA are powerful but struggle to interpret telecom-specific visual symbols, legends, and numerical content (e.g., "256-QAM EVM plots" or "handover flow diagrams").
To enable intelligent retrieval, reasoning, and automation in the telecom domain, there is a need for domain-adapted multimodal embeddings and tokenizers that can understand these technical visual cues.
This internship explores the training and evaluation of a multimodal embedding or tokenizer model specialized for telecom-oriented images, aiming to bridge the gap between visual and textual representations in this technical domain.
Research Questions
• How can we design visual tokenizers or embeddings that effectively represent structured, symbolic telecom images (diagrams, plots, dashboards)?
• What pretraining or fine-tuning strategies (contrastive, masked, or alignment-based) best adapt general-purpose VLMs to telecom data?
• How can these embeddings be integrated with textual knowledge bases or LLMs to enable multimodal reasoning (e.g., "Explain Figure 5: RRC connection procedure")?
• How can we evaluate visual grounding and retrieval quality for telecom-specific multimodal datasets?
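As a rough illustration of the contrastive option named in the research questions, a CLIP-style symmetric loss over paired image and caption embeddings might be sketched as follows. The function names, the temperature value, and the toy vectors are assumptions made for illustration and are not part of the internship description; actual training would rely on a deep learning framework such as PyTorch.

```python
import math

def normalize(vector):
    # Scale a vector to unit length (assumes the vector is non-zero).
    norm = math.sqrt(sum(x * x for x in vector))
    return [x / norm for x in vector]

def log_softmax_at(logits, index):
    # Numerically stable log-softmax value for one position.
    peak = max(logits)
    log_sum = peak + math.log(sum(math.exp(x - peak) for x in logits))
    return logits[index] - log_sum

def symmetric_contrastive_loss(image_embs, text_embs, temperature=0.07):
    # Row i of image_embs and row i of text_embs form a matching pair;
    # every other row in the batch acts as a negative.
    images = [normalize(v) for v in image_embs]
    texts = [normalize(v) for v in text_embs]
    n = len(images)
    # Cosine similarities scaled by the (assumed) temperature.
    sim = [[sum(a * b for a, b in zip(img, txt)) / temperature
            for txt in texts] for img in images]
    # Image-to-text direction: each image should pick its own caption.
    i2t = -sum(log_softmax_at(sim[i], i) for i in range(n)) / n
    # Text-to-image direction: each caption should pick its own image.
    t2i = -sum(log_softmax_at([sim[j][i] for j in range(n)], i)
               for i in range(n)) / n
    return (i2t + t2i) / 2

# Toy batch: two hypothetical image embeddings and caption embeddings.
toy_images = [[1.0, 0.0], [0.0, 1.0]]
aligned_captions = [[1.0, 0.0], [0.0, 1.0]]
swapped_captions = [[0.0, 1.0], [1.0, 0.0]]
print(symmetric_contrastive_loss(toy_images, aligned_captions))
print(symmetric_contrastive_loss(toy_images, swapped_captions))
```

The loss is small when each image is closest to its own caption and grows when the pairs are mismatched, which is the signal a contrastive fine-tuning run would minimize.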
Objectives
In this internship we are looking for talented students to help us design, train, and evaluate a multimodal embedding model for telecom images.
• Collect and preprocess a dataset of telecom-oriented images (network diagrams, RF plots, dashboards) with accompanying captions or textual context.
• Explore domain-adaptive fine-tuning of existing models (CLIP, SigLIP-2, Qwen2-VL, or LLaVA) using telecom data.
• Train and evaluate a visual tokenizer or encoder capable of generating robust embeddings for structured technical images.
• Design evaluation benchmarks for telecom image understanding (e.g., retrieval, captioning, or visual grounding tasks).
• Integrate the resulting embeddings into a GraphRAG or Hi-RAG pipeline for cross-modal retrieval (image, clause, entity).
• Analyze performance against generic VLMs and report improvements in domain alignment and factual grounding.
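One common way to score the retrieval benchmarks mentioned in the objectives is Recall@K: the fraction of queries whose correct item appears among the top K results. The sketch below is a minimal, framework-free version; the function names, the toy embeddings, and the choice of cosine similarity are assumptions for illustration, not a prescribed evaluation protocol.

```python
import math

def cosine(a, b):
    # Cosine similarity between two non-zero vectors.
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def recall_at_k(query_embs, gallery_embs, relevant_idx, k):
    # relevant_idx[i] is the gallery index of the correct match for query i.
    hits = 0
    for query, target in zip(query_embs, relevant_idx):
        # Rank gallery items from most to least similar to the query.
        ranked = sorted(range(len(gallery_embs)),
                        key=lambda j: cosine(query, gallery_embs[j]),
                        reverse=True)
        if target in ranked[:k]:
            hits += 1
    return hits / len(query_embs)

# Toy data: three hypothetical caption queries against three image embeddings.
toy_queries = [[1.0, 0.1], [0.1, 1.0], [0.9, 0.6]]
toy_gallery = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
toy_relevant = [0, 1, 0]

print(recall_at_k(toy_queries, toy_gallery, toy_relevant, k=1))
print(recall_at_k(toy_queries, toy_gallery, toy_relevant, k=2))
```

Running the same metric on a generic VLM and on the domain-adapted model, over the same telecom query set, gives a direct comparison of retrieval quality.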
To be successful in the role you need to have:
• Basic understanding of telecommunication systems and network architectures (4G/5G or similar).
• Interest in computer vision, multimodal learning, and representation learning.
• Familiarity with Transformer-based architectures (Vision Transformers, CLIP, or similar).
• Experience with Python and deep learning frameworks (PyTorch, TensorFlow, or similar).
• Experience with cloud-based AI platforms, preferably AWS and Amazon Bedrock (or similar cloud LLM/VLM services).
• Experience with version control and collaboration platforms (Git/GitLab, or similar).
• Curiosity about Generative AI, Large Multimodal Models, and their applications in telecom.
• Qualities of fast learning, critical thinking, autonomy, and teamwork.
• Willingness to work in an inclusive, research-oriented, and multicultural environment.
• Fluent English language skills in both writing and conversation.
• French language skills are a plus.

The Company
HQ: Stockholm
88,000 Employees
Year Founded: 1876

What We Do

Ericsson builds the digital connectivity the world relies on. Our technology underpins the mobile networks, platforms, and systems that billions of people, businesses, and societies depend on every day. We are a global leader in communications technology, delivering mobile network infrastructure, cloud software, and wireless connectivity solutions for service providers and enterprises worldwide. Our networks support connectivity across 180+ countries, helping power everyday communication as well as critical digital services at global scale.
Connectivity has evolved far beyond consumer mobile use. Today, nearly 80% of the world’s population accesses the internet via mobile networks, and Ericsson is helping shape what comes next. We are advancing 5G and 5G Advanced, developing network APIs that open connectivity to the global developer ecosystem, and applying automation and AI to make networks more intelligent, efficient, and resilient. Ericsson was the first company to launch live 5G networks on five continents, and our 5G platform is now commercially live in 150+ networks across 60+ countries. We also support more than 36,000 enterprise customers, enabling secure, high-performance connectivity for industries such as manufacturing, aviation, logistics, utilities, and public safety, where reliability and performance are mission critical.
Innovation is central to how we work. Ericsson has approximately 28,000 employees in research and development, backed by one of the strongest intellectual property portfolios in the industry with 60,000+ granted patents. Our engineers, researchers, and technologists work across 100+ global R&D sites, helping define how networks evolve and how digital infrastructure is built for the long term.
As the world moves toward a mobile-first, AI-powered, and cloud-driven future, connectivity becomes the foundation for digital transformation across every industry.
Ericsson is building that foundation, shaping the future of digital connectivity through technology that operates at global scale and supports real-world impact, today and for what comes next.

Why Work With Us

Ericsson is a place for people who want to work on technology that powers everyday life. You’ll contribute to large-scale systems used every day, tackle complex challenges in live environments, and keep developing your skills and career according to your own vision.

Ericsson Offices

Hybrid Workspace

Employees engage in a combination of remote and on-site work.

Ericsson adopts a hybrid work model globally because we know balance matters. Sometimes things are better in real life. Other times we can be more productive at home. Our hybrid approach gives you the best of both worlds.

Typical time on-site: Flexible
Ericsson HQ - Kista, Sweden
Yokohama Office
Mexico City - Mexico Head Office
Indaiatuba-SP 5G Smart Factory
Athlone Software Campus
Austin ASIC Design Center
Beijing - China Headquarters
Bellevue Office
Bengaluru Hub
Boise - Ericsson Enterprise Wireless Solutions (formerly Cradlepoint) Headquarters
Bucharest Main Site
Budapest - Ericsson House
Chennai Hub
Gurugram Hub - Ericsson India Global Services (Delhi NCR)
Gurgaon Head Office - Ericsson India (Delhi NCR)
Irvine Office
Ottawa R&D Site
Kolkata Hub
Krakow R&D Center
Lewisville 5G Smart Factory
Łódź R&D Site
Montreal AI & R&D Hub
Morristown Office
Nanjing R&D & Manufacturing Hub
Noida Hub (Delhi NCR)
Overland Park Office
Plano Office - US Headquarters
Pune Hub
Santa Clara D-15 Innovation Center
São Paulo - Brazil Main Office
Tokyo Head Office