Data Quality Intern (d/f/m)

Posted 3 Days Ago
Be an Early Applicant
27 Locations
Remote
Internship
Artificial Intelligence • Machine Learning • Productivity • Software • Database
The electronics supply chain platform
The Role
Investigate and quantify data-quality issues for parts and components, translate them into customer-impact metrics, run read-only queries on ClickHouse/Postgres, build dashboards and ad-hoc visualizations, run small AI-assisted scripts against external sources, and implement safe, reversible data fixes and documented findings during a 3–6 month internship.
Summary Generated by Built In
👑 Your opportunity

Our software platform is built to solve engineering and procurement issues in the trillion-dollar electronics industry. In other words: Luminovo is redefining the software stack used to bring any type of electronics to live.

  • To understand what we do, check out our website, plus two of our blog posts (here and here).

🚀 Your role
  • Our data quality mission is product discovery applied to our part and component data. You take a fuzzy quality problem, figure out what it actually means for customers, measure it honestly, and hand a well-scoped, evidence-backed finding to the team that delivers the larger fix.

  • The hard part isn't running a query (our AI tooling helps with that). It's reframing "x% of parts have no pin count" into "y% of a customer's costings can't complete because of it," then giving other teams a result they can act on without them having to re-check it.

  • You'll be a junior version of this discovery loop: sharp, honest, and data-fluent. You'll follow threads the team doesn't have time to chase, turn them into decision-ready findings, and grow into more autonomy across your internship. You work within a clear direction, and you can take a fix all the way into production when it's a data-level change you can script, like manufacturer merges or backfills. You won't need to be a Rust engineer or own large refactors. AI tooling does the heavy lifting on unfamiliar code and scripting. Your judgment and rigor are what matter most.

  • This role is an internship with a duration of three to six months.

     
🎯 Your performance objectives
  • Turn ambiguous data-quality questions into customer-relevant findings by reframing part-level observations into business/customer impact (e.g. tenant-aware "what actually blocks costing"), defining a sensible metric or proxy, and producing a measured, caveated answer to the question set by the product manager.

  • Independently size problems and test hypotheses against our data by writing read-only queries over the data warehouse (ClickHouse) and production Postgres, and producing numbers you can defend (knowing when a result is double-counted, misleading, or too good to be true)

  • Make the effect of fixes and experiments visible by extending our dashboards and building ad-hoc visualizations that show trends, baselines, and whether an intervention actually moved coverage/correctness.

  • Run small experiments to gather evidence by writing scripts (with AI assistance) against external sources such as SiliconExpert and DigiKey, e.g., to check whether a missing-data gap is fetchable, calibrate a finding, or do spot checks on interesting cases.

  • Verify assumptions in the product itself by navigating the epibator (Rust/TS) codebase with AI tooling to confirm how data is actually resolved/used, and occasionally adding light instrumentation we find we need, without owning large refactors.

  • Apply the fixes you've scoped, safely by writing AI-assisted scripts that correct production customer data at scale: e.g. automating the research to decide whether two manufacturers are the same record and then executing thousands of merges. Make every change safe by construction: dry-run and validate against samples first, work in reversible/checkpointed batches, and put guardrails in place so we never introduce regressions or corrupt manufacturing/costing data.

  • Leave behind durable, trustworthy knowledge by following the mission's loop (brief, investigate, report, distill), citing evidence, dating facts, and writing findings other teams and stakeholders can act on without re-deriving them.

  • Be your own harshest critic by reconciling and sanity-checking your own results, clearly separating "what's proven" from "what's still a hypothesis," and flagging loudly when a finding overturns a prior assumption (incl. your own).

💪 What you bring

Discovery instinct and intellectual honesty are what set this role apart, sitting on top of an analytics baseline. Deep technical and domain depth is coachable and AI-assisted. Calibration and judgment are not.

  • You can pull a trustworthy number out of messy data with SQL

  • You own the question, not just the query, and reframe it toward what actually matters

  • You sanity-check your own work and say "not proven yet" when that's the truth

  • You communicate findings clearly enough for a non-analyst to act on

  • You use AI with real verification, and write small Python scripts to fix data safely

  • You bring high agency, learn fast, and don't drop threads

  • Bonus: you read code with AI help, and you're curious about the electronics domain

💻 Our tech stack

We always try to use the best tool for the job. Don't worry, we don't need you to be familiar with all of these:

  • ClickHouse as our data warehouse and PostgreSQL for our production data, the two main places you'll query.

  • Python for scripting, data transformation, and safe bulk fixes.

  • Rust and TypeScript in our main product codebase, which you'll read (with AI help) to verify behavior.

  • External data sources such as SiliconExpert and DigiKey, reached through their APIs.

  • AI tooling (such as Claude Code) for code navigation, querying, and scripting.

 
🤓 Whom you’ll be working with
  • You'll report to and take direction from Mike, who owns the data quality mission and sets the questions you'll investigate.

  • You'll be working with other Luminerds like Shamir, Igor, and Tiko.

⚡️ Skills, knowledge, and expertise

Just so you know – we purposefully did not create a standard list of minimum qualifications for this role. We care much more about your motivation and ability to help us accelerate technical progress than we care just about your CV. If you're interested to find out more about why we banned requirements from our performance profiles – see here.

⚡️ About Luminovo

Luminovo is the first electronics supply chain platform that unites all data, processes, and stakeholders in the trillion-dollar electronics industry.

We fundamentally believe that technology is the best tool that humankind has developed to tackle the biggest challenges we face as a society today. And we want to do our share in accelerating technological progress.

We bring innovations to life faster and cheaper by creating a more connected and resilient electronics supply chain.

To date, over 300 active customers worldwide are working with us towards this shared goal. Supported by leading investors such as Cherry Ventures, General Catalyst/LaFamiglia, Chalfen Ventures, and others, we have raised more than €20 million.

Our core principles: putting people first & building great things

As a remote-first company headquartered in Munich, our brilliant team is spread across cities including Munich, New York, Austin, Calgary, Berlin, London, Valencia, Nairobi, and Verona, among others. 🌍

Our team includes a mix of product enthusiasts, people advocates, business masterminds, and engineering experts. Among us are graduates from institutions like Stanford University and the University of Cambridge, as well as talented individuals with unconventional CVs. What matters most is their drive to build great things.

We provide an authentic environment based on psychological safety to empower Luminerds from various backgrounds to succeed, grow, and focus on impact. Putting people first means that we deeply care about who our employees are, what makes them unique, and what they excel at.

See what employees and candidates have to say about Luminovo on kununu and Glassdoor.

 

Skills Required

  • Proficient with SQL to extract trustworthy numbers from messy data
  • Experience writing read-only queries against ClickHouse and PostgreSQL
  • Ability to write small Python scripts for data transformation and safe bulk fixes
  • Ability to reframe ambiguous data questions into customer-relevant metrics and findings
  • Ability to sanity-check results, clearly separate proven facts from hypotheses
  • Communicate findings clearly for non-analyst stakeholders
  • Use AI tooling productively while verifying outputs
  • Familiarity reading code (Rust/TypeScript) with AI assistance (bonus)
  • High agency, fast learner, follows investigative threads to completion
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Munich
51 Employees
Year Founded: 2017

What We Do

Luminovo is the electronics supply chain platform that helps you quote and procure faster and better by bringing all your data, processes, and players together in one place. Our products for OEM (Design-to-source) and for EMS (Configure-price-quote) are modular and easily tailored to your needs thanks to an array of add-ons specifically built for the electronics industry challenges. We are a mix of deep learning natives, business talents and product enthusiasts. Our current team not only includes top graduates from institutions such as Stanford University, Cambridge, ETH Zurich, TUM, and CDTM, but also motivated individuals with rather unconventional CVs who are burning for what they do. Luminovo was born in Munich but has evolved into a remote-first company operating globally and with our team spread from Amsterdam to Valencia, Tirana, and Nairobi. 🌍 If you're someone working in the electronics industry or looking to grow a software startup – join Luminovo as we pioneer a modern setup of the all-encompassing electronics industry. 🚀

Similar Jobs

Mondelēz International Logo Mondelēz International

o9 Change Readiness Lead

Big Data • Food • Hardware • Machine Learning • Retail • Automation • Manufacturing
Remote or Hybrid
11 Locations
90000 Employees

Zapier Logo Zapier

Manager or Sr. Manager, Sales Assist

Artificial Intelligence • Productivity • Software • Automation
Remote
30 Locations
800 Employees
Remote
26 Locations
393 Employees
179K-179K Annually

Mondelēz International Logo Mondelēz International

Manager, Procurement Data Science and Analytics (F/M/X)

Big Data • Food • Hardware • Machine Learning • Retail • Automation • Manufacturing
Remote or Hybrid
3 Locations
90000 Employees

Similar Companies Hiring

Hanover Park Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
42 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account