Computer Vision Research Internship: Image to Sequence Modeling (e.g. Transformers)

Reposted 9 Days Ago
Be an Early Applicant
Tampere
Hybrid
Internship
Computer Vision • Machine Learning • Mobile • Productivity • Software
Scandit is the leader in smart data capture giving superpowers to workers, customers and businesses.
The Role
The internship focuses on advancing machine learning methods for computer vision, involving designing ML architectures, conducting experiments, and implementing improvements on image-to-sequence modeling techniques.
Summary Generated by Built In

Duration: Minimum 6 months; ideally 9–12 months, depending on the candidate’s experience

Scandit gives people superpowers. Whether enabling delivery drivers to make quicker deliveries, matching a patient with their medication, or allowing retailers to make store operations more efficient, our technology automates workflows and provides actionable insights to help businesses in a variety of industries. Join us, as we continue to expand, grow and innovate, and help take Scandit to the next level.

About the Internship

We are offering a research-focused internship aimed at advancing machine learning methods for complex visual understanding tasks. The project centers on deep learning architectures for image-to-sequence modelling, such as Transformers, attention mechanisms, and modern sequence and representation-learning frameworks, to address challenging and highly structured computer vision problems. This project contributes to long-term research efforts aimed at achieving even higher performance, robustness, and generalization in large-scale visual applications.

What you will do

You will work closely with experienced ML researchers and engineers on cutting-edge research at the intersection of computer vision and sequence modeling. Your work will include:

  • Designing and experimenting with new ML architectures for structured visual data.
  • Evaluating alternative modeling paradigms (e.g., encoder–decoder, hybrid Transformer models, sequence-based representations).
  • Investigating techniques for improving robustness, generalization, and multi-view reasoning.
  • Running systematic experiments, ablations, and error analyses to validate research hypotheses.

This project provides opportunities for novel model design, extensive experimentation, and scholarly research. You will contribute to long-term innovation in our technology, with potential real-world impact for millions of users. An ideal position for experienced master’s students, PhD collaborations, or candidates preparing for a research career in industry or academia.

Who you are

MSc or PhD student in Computer Science, Machine Learning, Artificial Intelligence, or a related field with a strong research focus. Candidates should have a solid foundation in machine learning theory, neural networks, and computer vision.

Essential Skills:

  • Proficiency in Python and deep learning frameworks such as PyTorch.
  • Practical experience designing, training, and evaluating neural networks, including CNNs and Transformer-based architectures.
  • Strong analytical and problem-solving abilities, with the capability to interpret experimental results and iterate effectively.
  • Familiarity with research best practices, including reproducibility, controlled experiments, and ablation studies.

Desirable Skills:

  • Prior research experience in computer vision, pattern recognition, sequence modeling, or image-to-sequence architectures.
  • Experience training large-scale models or working with foundation-style architectures.
  • Contributions to publications, preprints, or open-source machine learning projects.

Strong communication skills and the ability to work independently in a research-oriented environment.

What We Offer
  • We are certified as a “Great Place to Work” in 10 countries!
  • A highly skilled team and a fun environment where you can put your enthusiasm for computer vision challenges and cutting-edge technologies to use
  • Hackathons, summer parties, company outings and other regular events
  • Office in the city center of Tampere
Who We Are

Could your code give superpowers? Whether enabling delivery drivers to make quicker deliveries, matching a patient with their medication or allowing retailers to make store operations more efficient, our technology automates workflows and provides actionable insights to help businesses in a variety of industries. This means we have no shortage of technical challenges for engineers like you. Join us, as we continue to expand, grow and innovate, and help take Scandit to the next level.

“Everybody is welcome here” - Is a celebrated component of our DNA.

At Scandit we strive to create an inclusive environment that empowers our employees. We believe that our products and services benefit from our diverse backgrounds and experiences and are proud to be a safe space for all.

All qualified applications will receive consideration for employment without regard to race, colour, nationality, religion, sexual orientation, gender, gender identity, age, physical [dis]ability or length of time spent unemployed.

Top Skills

Deep Learning
Python
PyTorch
Transformers
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Zurich
400 Employees
Year Founded: 2009

What We Do

Scandit supports its customers by providing actionable insights and automating end-to-end processes.

Our Smart Data Capture platform enables smart devices, such as smartphones, drones, digital eyewear and robots to interact with physical items by capturing data from barcodes, text, IDs and objects with unmatched speed, accuracy and intelligence. Scandit accurately scans up to 3x faster than dedicated scanners in challenging light or at angles, on damaged labels, across multiple codes, on any smart device.

We enable innovation that delivers significant cost savings, increases employee retention and customer loyalty. Scandit partners with customers at every step with trials, solution design, integration and customer success support included.

Specialties:
Image Recognition, Optical Character Recognition (OCR), Computer Vision, Transportation and Logistics, Healthcare, Barcode Scanning Software, Augmented Reality, Retail Software, Enterprise Software, Last Mile Solutions, BYOD Solutions, Digital Transformation and Machine Learning.

Visit scandit.com to learn why market leaders across retail, transport and logistics, healthcare and manufacturing like Instacart, Levi’s Strauss, Sephora, NHS and FedEx trust us.

We are hiring, take a look at our career opportunities - https://www.scandit.com/careers/

Why Work With Us

At Scandit, we pride ourselves on building amazing products that revolutionize our customers’ business processes. We are a highly collaborative company with talented, driven, and passionate people all across the globe. We strive to foster a sense of individuality, innovation and fun, and view that as the core of our dynamic culture.

Gallery

Gallery

Similar Companies Hiring

Standard Template Labs Thumbnail
Software • Information Technology • Artificial Intelligence
New York, NY
15 Employees
PRIMA Thumbnail
Travel • Software • Marketing Tech • Hospitality • eCommerce
US
15 Employees
Milestone Systems Thumbnail
Software • Security • Other • Big Data Analytics • Artificial Intelligence • Analytics
Lake Oswego, OR
1500 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account