Company Overview:
We are building Protege to solve the biggest unmet need in AI — getting access to the right training data. The process today is time intensive, incredibly expensive, and often ends in failure. The Protege platform facilitates the secure, efficient, and privacy-centric exchange of AI training data.
Solving AI’s data problem is a generational opportunity. We’re backed by world-class investors and already powering partnerships with some of the most ambitious teams in AI. The company that succeeds will be one of the largest in AI — and in tech.
We’re a lean, fast-moving, high-trust team of builders who are obsessed with velocity and impact. Our culture is built for people who thrive on ambiguity, own outcomes, and want to shape the future of data and AI.
Role Overview:
As a Software Engineer, you’ll play a pivotal role in shaping our product and infrastructure. Your work will span from building customer-facing features to optimizing backend systems to working with our data warehouses. You’ll be expected to own projects and features and be able to engineer them from the ground up.
We're looking for software engineers that range from the mid to staff level!
Key Responsibilities:
Design, build, and ship end-to-end features across our stack, balancing speed with technical excellence
Identify and solve complex problems with creative, pragmatic solutions, knowing when to build quick fixes versus long-term, scalable systems
Take ownership of projects from idea to production, including testing, deployment, and iteration based on user feedback
Be a part of shaping our engineering culture and processes, introducing and iterating best practices for code quality, collaboration, and scalability
About You:
You have 2+ years of software engineering experience
You are biased towards action
You are comfortable with ambiguity and with a fast-paced environment that can require rapid prototyping, experimenting, and iterating
Strong communication skills, with the ability to explain complex technical concepts to diverse audiences
Strong experience in at least some of Python, Typescript, AWS, SQL, data orchestrators
Excited to work in a company that deals with moving, processing, and transforming large volumes of data
Bonus if you have these attributes:
Experience with Snowflake
Experience with other cloud providers like GCP and Azure
Experience with Spark
Prior startup experience
Top Skills
What We Do
The biggest unmet need in AI today is getting access to the right training data. Data holders often don’t know where to start and are rightly concerned about governance, intellectual property, and security implications. AI companies can spend years finding and negotiating access to the data they need.
Protege is solving these problems by providing an easy-to-use platform to connect data holders with vetted data users.








