Kiddom’s Content & AI Systems team is building the data layer that powers the next generation of AI-assisted curriculum authoring and content delivery. This role sits at the intersection of data engineering and content systems — owning the pipelines, schemas, and validation frameworks that turn raw curriculum content into structured, AI-ready data products.
This is not a traditional data engineering role. Curriculum content is messy, inconsistent, and deeply domain-specific. You will work closely with Instructional Designers, AI engineers, and the Content Agents team to define data requirements, design schemas, and build the infrastructure that makes AI-powered authoring workflows possible.
You will...
Design and own the schema and data models representing Kiddom’s curriculum content (lessons, activities, standards alignments) for downstream use
Build ingestion pipelines that process content from varied, inconsistent source formats — XML, JSON, PDF-derived, and API-delivered
Develop Python-based parsers, transformers, and validation scripts that enforce schema conformance and content quality at scale
Collaborate directly with Instructional Designers and product teams to translate content authoring workflows into data engineering requirements
Build and maintain embedding and vector database pipelines that feed Kiddom’s AI-powered content features as they scale
Work in Git-based workflows — treating data artifacts with the same rigor as software: versioned, reviewed, and documented
What we're looking for...
4+ years of data engineering experience with strong Python skills — you’ve written parsers, validators, and transformation scripts for real-world messy data
Schema design instincts — you think carefully about how data should be structured for downstream use, not just how to move it
Data quality mindset — you build validation and completeness checks in from the start, not as an afterthought
Cross-functional collaborator — comfortable working with non-engineers to define requirements and translate domain knowledge into data structures
Infrastructure provisioning and monitoring for data systems, with familiarity with IaC tools such as Terraform and Terragrunt
Experience operating the AWS infrastructure behind those data systems: ECS and EKS clusters, Lambda functions, and S3 buckets
Bonus:
Background in education, curriculum design, or ed-tech — understanding how instructional content is authored and structured is a genuine differentiator
Experience with vector databases (Pinecone, Weaviate, pgvector) or embedding pipeline tooling
Familiarity with agentic AI patterns or Model Context Protocol (MCP)
Skills Required
- 3+ years of experience as a data engineer
- 8+ years of software engineering experience
- Strong understanding of data engineering principles including infrastructure deployment, governance and security
- Experience with MySQL, Snowflake, Cassandra and familiarity with Graph databases
- Proficiency in SQL, Python, Golang
- Proficient with AWS offerings such as AWS Glue, EKS, ECS and Lambda
What We Do
Kiddom unleashes the full potential of high-quality instructional materials paired with effective tech enhancements and hand-in-hand support to help teachers do what teachers do best: teach with a human touch. For over 10 years, we’ve been combining valuable open educational resources with simple, flexible technology. Our feature-rich learning platform gives teachers time back, so they can do what they do best: teach. Accelerate planning, instructional delivery, assessment, and reporting while using the proven curriculum teachers already know and love.
Why Work With Us
We innovate at the speed of technology. We are introducing new AI features to streamline routine teacher tasks and more equitably serve student needs. We’re dedicated to developing AI enhancements that drastically improve the teacher’s ability to unpack, understand, and implement high-quality instructional materials (HQIM) with fidelity.