Ericsson is building a new R&D team in Massy, France, covering cutting-edge technologies such as AI/ML, cloud, and 5G Advanced/6G.
Within this new lab, the Standards & Technology unit is part of Development Unit Networks' global Standards & Technology organization.
At Development Unit Networks, Standards & Technology secures technology leadership in Radio Access Networks (RAN) by actively driving New Concepts, Standardization, Software and Hardware Research, Architecture, and Testbeds.
Technical focus includes 5G evolution, IoT, Digital Twins, Automation / Machine Learning, and Security.
We are looking for motivated interns to join this young and talented research team and contribute to our research activities on Generative AI and Multimodal Intelligence for Telecom Systems.
Context
Modern telecommunication research and engineering rely on a variety of technical images and diagrams, including network architectures, protocol sequence charts, RF signal plots, coverage maps, and KPI dashboards.
These visual representations are essential for understanding system design and performance, yet they remain poorly captured by existing vision-language models (VLMs), which are trained mostly on natural images.
Generic models such as CLIP, Qwen2-VL, or LLaVA are powerful but struggle to interpret telecom-specific visual symbols, legends, and numerical content (e.g., "256-QAM EVM plots" or "handover flow diagrams").
To enable intelligent retrieval, reasoning, and automation in the telecom domain, there is a need for domain-adapted multimodal embeddings and tokenizers that can understand these technical visual cues.
This internship explores the training and evaluation of a multimodal embedding or tokenizer model specialized for telecom-oriented images, aiming to bridge the gap between visual and textual representations in this technical domain.
Research Questions
• How can we design visual tokenizers or embeddings that effectively represent structured, symbolic telecom images (diagrams, plots, dashboards)?
• What pretraining or fine-tuning strategies (contrastive, masked, or alignment-based) best adapt general-purpose VLMs to telecom data?
• How can these embeddings be integrated with textual knowledge bases or LLMs to enable multimodal reasoning (e.g., "Explain Figure 5: RRC connection procedure")?
• How can we evaluate visual grounding and retrieval quality for telecom-specific multimodal datasets?
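As an illustration of the last question, one common way to score retrieval quality is Recall@K: the fraction of text queries whose matching image appears among the K most similar images. The sketch below is only a minimal example under stated assumptions, not the evaluation protocol of this internship. The function names and the toy 2-D embeddings are hypothetical, and it assumes that query i is paired with image i.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def recall_at_k(query_embs, image_embs, k):
    """Fraction of queries whose paired image (same index) ranks in the top k."""
    hits = 0
    for i, query in enumerate(query_embs):
        scores = [cosine(query, image) for image in image_embs]
        # Indices of the k images most similar to this query.
        top_k = sorted(range(len(scores)), key=lambda j: scores[j], reverse=True)[:k]
        if i in top_k:
            hits += 1
    return hits / len(query_embs)

# Hypothetical toy embeddings: caption i describes image i.
CAPTION_EMBS = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
IMAGE_EMBS = [[0.9, 0.1], [0.2, 0.8], [0.1, 1.0]]

if __name__ == "__main__":
    for k in (1, 2, 3):
        print(f"Recall@{k} = {recall_at_k(CAPTION_EMBS, IMAGE_EMBS, k):.3f}")
```

In practice the embeddings would come from the model under test, and the same metric can be computed in the image-to-text direction by swapping the two arguments.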
Objectives
In this internship we are looking for talented students to help us design, train, and evaluate a multimodal embedding model for telecom images.
• Collect and preprocess a dataset of telecom-oriented images (network diagrams, RF plots, dashboards) with accompanying captions or textual context.
• Explore domain-adaptive fine-tuning of existing models (CLIP, SigLIP-2, Qwen2-VL, or LLaVA) using telecom data.
• Train and evaluate a visual tokenizer or encoder capable of generating robust embeddings for structured technical images.
• Design evaluation benchmarks for telecom image understanding (e.g., retrieval, captioning, or visual grounding tasks).
• Integrate the resulting embeddings into a GraphRAG or Hi-RAG pipeline for cross-modal retrieval (linking images, clauses, and entities).
• Analyze performance against generic VLMs and report improvements in domain alignment and factual grounding.
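For candidates unfamiliar with contrastive fine-tuning, the sketch below shows the symmetric softmax contrastive loss popularized by CLIP, written in plain Python for readability. It is only one possible objective and is shown as an assumption for illustration; the project may use a different one (SigLIP-style models, for instance, use a sigmoid-based loss). The function names and the default temperature are hypothetical.

```python
import math

def l2_normalize(vec):
    """Scale a vector to unit length."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec]

def log_softmax(row):
    """Numerically stable log-softmax of a list of logits."""
    m = max(row)
    lse = m + math.log(sum(math.exp(x - m) for x in row))
    return [x - lse for x in row]

def clip_style_loss(image_embs, text_embs, temperature=0.07):
    """Symmetric contrastive loss; image i and text i form the positive pair."""
    images = [l2_normalize(v) for v in image_embs]
    texts = [l2_normalize(v) for v in text_embs]
    n = len(images)
    # logits[i][j]: scaled cosine similarity between image i and text j.
    logits = [
        [sum(a * b for a, b in zip(img, txt)) / temperature for txt in texts]
        for img in images
    ]
    # Image-to-text direction: softmax over each row.
    loss_i2t = -sum(log_softmax(logits[i])[i] for i in range(n)) / n
    # Text-to-image direction: softmax over each column.
    columns = [[logits[i][j] for i in range(n)] for j in range(n)]
    loss_t2i = -sum(log_softmax(columns[j])[j] for j in range(n)) / n
    return 0.5 * (loss_i2t + loss_t2i)

if __name__ == "__main__":
    imgs = [[1.0, 0.0], [0.0, 1.0]]
    print("aligned pairs: ", clip_style_loss(imgs, [[1.0, 0.0], [0.0, 1.0]]))
    print("shuffled pairs:", clip_style_loss(imgs, [[0.0, 1.0], [1.0, 0.0]]))
```

Correctly paired embeddings yield a lower loss than shuffled ones, which is the signal that pulls matching telecom images and captions together during fine-tuning.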
To be successful in the role you need to have:
• Basic understanding of telecommunication systems and network architectures (4G/5G or similar).
• Interest in computer vision, multimodal learning, and representation learning.
• Familiarity with Transformer-based architectures (Vision Transformers, CLIP, or similar).
• Experience with Python and deep learning frameworks (PyTorch, TensorFlow, or similar).
• Experience with cloud-based AI platforms, preferably AWS and Amazon Bedrock (or similar cloud LLM/VLM services).
• Experience with version control and collaboration platforms (Git/GitLab, or similar).
• Curiosity about Generative AI, Large Multimodal Models, and their applications in telecom.
• Ability to learn quickly, think critically, and work both autonomously and in a team.
• Willingness to work in an inclusive, research-oriented, and multicultural environment.
• Fluency in written and spoken English.
• French language skills are a plus.
What We Do
Ericsson builds the digital connectivity the world relies on. Our technology underpins the mobile networks, platforms, and systems that billions of people, businesses, and societies depend on every day. We are a global leader in communications technology, delivering mobile network infrastructure, cloud software, and wireless connectivity solutions for service providers and enterprises worldwide. Our networks support connectivity across 180+ countries, helping power everyday communication as well as critical digital services at global scale.
Connectivity has evolved far beyond consumer mobile use. Today, nearly 80% of the world’s population accesses the internet via mobile networks, and Ericsson is helping shape what comes next. We are advancing 5G and 5G Advanced, developing network APIs that open connectivity to the global developer ecosystem, and applying automation and AI to make networks more intelligent, efficient, and resilient. Ericsson was the first company to launch live 5G networks on five continents, and our 5G platform is now commercially live in 150+ networks across 60+ countries. We also support more than 36,000 enterprise customers, enabling secure, high-performance connectivity for industries such as manufacturing, aviation, logistics, utilities, and public safety, where reliability and performance are mission critical.
Innovation is central to how we work. Ericsson has approximately 28,000 employees in research and development, backed by one of the strongest intellectual property portfolios in the industry with 60,000+ granted patents. Our engineers, researchers, and technologists work across 100+ global R&D sites, helping define how networks evolve and how digital infrastructure is built for the long term.
As the world moves toward a mobile-first, AI-powered, and cloud-driven future, connectivity becomes the foundation for digital transformation across every industry. Ericsson is building that foundation, shaping the future of digital connectivity through technology that operates at global scale and supports real-world impact, today and for what comes next.
Why Work With Us
Ericsson is a place for people who want to work on technology that powers everyday life. You’ll contribute to large-scale systems used every day, tackle complex challenges in live environments, and keep developing your skills and career according to your own vision.
Ericsson Offices
Hybrid Workspace
Employees engage in a combination of remote and on-site work.
Ericsson adopts a hybrid work model globally because we know balance matters. Sometimes things are better in real life. Other times we can be more productive at home. Our hybrid approach gives you the best of both worlds.














