Must-Haves
- Proven experience deploying Production RAG pipelines against real-world, messy datasets.
- Deep expertise in Agentic system design (tool-use, multi-agent orchestration).
- Strong Python engineering skills—writing clean, scalable, and maintainable code
- Experience operating within AWS/GovCloud environments.
Nice-to-Haves
- Experience fine-tuning NLP or object detection models.
- Familiarity with LLM evaluation frameworks (hallucination detection, drift monitoring).
- Knowledge of government security standards and working in different classification environments and on-prem
- Security Clearance: Existing Secret/TS clearance or eligibility is a significant plus.
Your Technical Toolkit
- Languages: Python (expert-level), SQL
- LLM & Agentic Frameworks: LangChain, LangGraph, CrewAI, or similar orchestration frameworks
- RAG Stack: Retrieval with vector databases (Pinecone, Weaviate, Chroma, pgvector), graph databases (Neo4J), Elasticsearch, BM25, and Sentence-Transformers; NLP enrichment with spaCy, GLiNER, and Transformers; optimization using embedding models, reranking pipelines, and DSPy
- Evaluation & Observability: RAGAS, DeepEval, Arize Phoenix, and synthetic annotations
- Cloud & Infrastructure: AWS, SageMaker, Bedrock, S3, Lambda, Docker, and FastAPI
- Data Processing: Complex pipelines for unstructured and multimodal data, including PDFs, scanned documents, images, and audio.
Skills Required
- Proven experience deploying Production RAG pipelines against real-world, messy datasets
- Deep expertise in Agentic system design
- Strong Python engineering skills
- Experience operating within AWS/GovCloud environments
unstructured.io Compensation & Benefits Highlights
The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about unstructured.io and has not been reviewed or approved by unstructured.io.
-
Healthcare Strength — Health, dental, and vision coverage start on day one, alongside life and disability insurance noted in current postings. These consistently advertised coverages indicate strong core healthcare support.
-
Leave & Time Off Breadth — Unlimited PTO is explicitly offered and paired with flexible parental leave in public job descriptions. Together, these policies signal broad time-away provisions.
-
Fair & Transparent Compensation — Compensation ranges are frequently included in job postings, and listings reference competitive salary and equity. This pay communication provides candidates clearer visibility into expected ranges.
unstructured.io Insights
What We Do
At Unstructured, we're on a mission to give organizations access to all their data. We know the world runs on documents—from research reports and memos, to quarterly filings and plans of action. And yet, 80% of this information is trapped in inaccessible formats leading to inefficient decision-making and repetitive work. Until now. Unstructured captures this unstructured data wherever it lives and transforms it into AI-friendly JSON files for companies who are eager to fold AI into their business.
.png)






