Software Engineer, AI Data Infrastructure

Reposted 20 Days Ago
Be an Early Applicant
Munich, Bayern
In-Office
112K-153K Annually
Mid level
Hardware • Software
The Role
The Data Engineer will design and maintain data ingestion pipelines, develop transformation processes, build secure APIs, and contribute to MLOps tooling for machine learning models and data governance.
Summary Generated by Built In

About the Company:

World is building a real human network designed to accelerate people in the age of AI. As bots and autonomous agents reshape the internet, people, institutions, and applications need a trusted way to confirm who is a real human while preserving privacy. Our products make this possible: the Orb verifies real people, World ID proves it privately, and World App enables and distributes the new applications made possible by this technology. Together, they form a new layer for AI internet.

We’re one of the fastest-growing networks in tech. More than 17 million people across 160 countries have verified with World ID, and we complete over 350,000 verifications each week. World App is already among the most used wallets globally. Developers are integrating World ID to build safer online experiences and create spaces where real people can participate, earn, and be recognized in ways AI simply can’t replicate.

World was founded in 2019 and launched globally in 2023. We are more than 400 people across hardware, software, AI, cryptography, mobile engineering, and global operations. Our teams come from OpenAI, Tesla, SpaceX, Apple, Google, Stripe, Meta, Coinbase, Palantir and MIT Media Lab. We’re backed by leading investors, including a16z, Khosla Ventures, Bain Capital Crypto, Blockchain Capital, Variant, Tiger Global, and Coinbase Ventures, as well as prominent operators and founders across fintech and AI.

World has been featured on the cover of TIME Magazine, highlighted in Fast Company’s Next 5 in Fintech, and explored in a Bloomberg deep dive. The New York Times, Bankless and TechCrunch have all recognized our progress in identity, cryptography, AI, and global-scale hardware deployment. Our leadership is also named to the Time AI 100.

This opportunity would be with Tools for Humanity

About the AI & Biometrics Team:

The AI & Biometrics team is building a biometric recognition system that can work reliably with more than a billion users and enables them to claim their free share of WLD. We use cutting-edge machine learning models deployed on custom hardware to enable high-quality image acquisition, identification, and fraud prevention, all while requiring minimal user interaction.

We are building a biometric recognition and fraud detection engine that works on the 1bn people scale. Therefore, its performance needs to out-perform all the current recognition technologies. We leverage our powerful custom-made iris recognition and presentation attack detection device, the Orb, combined with the latest research from the field of AI and Deep Learning.

About the Opportunity:

You will join a high-impact team that maintains and evolves the data platform powering our AI pipelines. This is an all-rounder role that combines backend development, data engineering, infrastructure, and lightweight frontend work.

Your work will span the ingestion layer, transformation workflows, and the warehouse itself: designing resilient pipelines, building secure APIs, and creating services that make our datasets reliable, discoverable, and ready for large-scale training.

You will be a key contributor to the infrastructure that feeds and monitors our machine learning models in production: ensuring that data flows seamlessly, services run reliably, and governance standards are never compromised. Every solution you build will follow the highest security standards and rigorous data governance principles, ensuring sensitive biometric data is handled with absolute care.

This role is onsite 5 days/week and sits in our Munich or San Francisco office

Key Responsibilities:

  • Design and maintain ingestion pipelines that move data from edge devices and internal services into the data platform with traceability, versioning, and high reliability

  • Develop and refine transformation processes to deliver clean, well‑structured tables ready for analytics, model training, and evaluation workflows - production-grade datasets ready for AI training and evaluation workflows, with strong schema contracts and lineage guarantees

  • Build internal APIs and backend services that provide secure, performant access to large datasets while upholding strict governance and privacy controls

  • Instrument systems with metrics, automated checks, and recovery mechanisms that detect issues early and enable self‑healing responses

  • Contribute to MLOps tooling for dataset monitoring and model training pipelines, ensuring smooth iteration cycles for research teams

  • Raise engineering standards by improving CI/CD pipelines, integration tests, and dependency management

  • Build lightweight dashboards (Streamlit/Next.js) to make datasets and metrics accessible internally

  • Design optimized and scalable, fault-tolerant real-time or near real time data pipelines using distributed processing tools.

  • Own the lifecycle of critical data assets — including lineage tracking, access control, and schema enforcement

  • You will work with both structured and semi-structured data, combining SQL-based platforms like Snowflake with NoSQL sources like MongoDB. You’ll build resilient pipelines that handle versioning, schema evolution, and are GDPR compliant.

About You:

  • 4 - 6 years proficiency in both Python and Go, with experience building production services

  • Comfortable with containerization and orchestration tools like Docker and Kubernetes

  • Experienced with AWS services (S3, KMS, IAM) and Terraform for infrastructure as code

  • Skilled in designing and operating data ingestion and transformation workflows, with exposure to Snowflake or other SQL‑based analytics platforms

  • Familiar with CI/CD pipelines and version control practices, ideally using GitHub Actions or similar tools

  • Committed to building systems that are secure, observable, and follow strong data governance principles

  • Able to contribute lightweight internal dashboards using frameworks like Streamlit or Next.js

  • Obsessed with reliability, observability, and data governance — you care deeply about logs, metrics, and traceability

  • Strong fundamentals in data modeling, schema design, and backward-compatible schema evolution

  • Comfortable working with NoSQL systems like MongoDB, especially for building ingestion frameworks, managing schema evolution, or integrating Change Streams into ETL pipelines

Nice to have:

  • Experience with event‑driven data pipelines using SQS, SNS, Lambda, or Step Functions

  • Knowledge of data partitioning strategies, schema evolution, and large‑scale dataset optimization for analytics and ML

  • Familiarity with metadata management, dataset versioning, and lineage tracking in production environments

  • Exposure to monitoring and alerting stacks such as Datadog or Prometheus

  • Proficiency in Rust or an interest in learning it

What we offer:
  • The reasonably estimated salary for this role at Tools for Humanity in our Munich office ranges from €125,000 to €153,000. plus a competitive long-term incentive package. Actual compensation is based on factors such as the candidate's skills, qualifications, and experience. In addition, Tools for Humanity offers a wide range of best-in-class, comprehensive, and inclusive employee benefits for this role, including healthcare, dental, vision, 401(k) plan and match, life insurance, flexible time off, commuter benefits, and much more.

    By submitting your application, you consent to the processing and internal sharing of your CV within the company, in compliance with the GDPR.

    If you don't think you meet all of the criteria but are still interested in the job, please apply. Nobody checks every box, and we're looking for someone excited to join the team.

Top Skills

AWS
Docker
Go
Kubernetes
Python
Snowflake
Terraform
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: San Francisco, CA
66 Employees
Year Founded: 2019

What We Do

Headquartered in Germany, Tools for Humanity is a global software and hardware development company building the tools to bring blockchain and digital identity technologies and products to billions of people, including by contributing to the development and growth of the Worldcoin ecosystem and other projects.

Similar Jobs

Worldcoin Logo Worldcoin

Software Engineer

Artificial Intelligence • Blockchain • Cryptocurrency
In-Office
Munich, Bayern, DEU
288 Employees
112K-153K Annually

Boeing Logo Boeing

Working student Engineer (Multi-Skill) (m/f/d)

Aerospace • Information Technology • Cybersecurity • Defense • Manufacturing
In-Office
Munich, Bayern, DEU
141000 Employees

Boeing Logo Boeing

Aerospace Structural/Design Engineering Intern (m/f/d)

Aerospace • Information Technology • Cybersecurity • Defense • Manufacturing
In-Office
Munich, Bayern, DEU
141000 Employees

Magna International Logo Magna International

Praktikant (m/w/x) im Bereich HR

Automotive • Hardware • Robotics • Software • Transportation • Manufacturing
Hybrid
Sailauf, Bayern, DEU
171000 Employees

Similar Companies Hiring

Standard Template Labs Thumbnail
Software • Information Technology • Artificial Intelligence
New York, NY
10 Employees
PRIMA Thumbnail
Travel • Software • Marketing Tech • Hospitality • eCommerce
US
15 Employees
Scotch Thumbnail
Software • Retail • Payments • Fintech • eCommerce • Artificial Intelligence • Analytics
US
25 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account