We're building the infrastructure layer for agentic web interaction at scale. Our API is designed from the ground up to power Retrieval-Augmented Generation (RAG) and real-time reasoning in AI systems. By connecting LLMs to high-quality, trustworthy web content, we help developers build agents that are not only intelligent - but also informed.
We work with some of the most innovative teams in AI - from small startups shaping the ecosystem to the largest enterprises deploying AI at scale. Whether it’s powering sales assistants, research copilots, or internal knowledge tools, we’re the missing link between LLMs and the real world.
The Role: Software EngineerWe’re looking for a Software Engineer to join our core engineering team and help build the infrastructure that powers real-time AI agents. You’ll work across the stack, ship fast, and take ownership of critical systems as we scale.
This is a great role for a generalist who loves building from scratch, thrives in low-process environments, and wants to work on technically ambitious problems.
What You'll DoBe the expert in building fast, reliable, and scalable systems for real-time LLM workflows
Design and implement backend infrastructure and API endpoints
Collaborate closely with product to iterate on features quickly and thoughtfully
Improve performance, monitoring, and reliability across the stack
Own core systems and contribute to key architectural decisions
Help shape a strong engineering culture focused on velocity and quality
2+ years of professional software engineering experience
Strong backend development skills (Python, Go , C++)
Proven experience designing and operating large-scale, distributed systems, with a solid understanding of API design, reliability, and performance at scale
Hands-on expertise with AWS infrastructure and cloud-native services, bringing practical knowledge of deploying and managing services in real-world environments
Comfortable in a fast-paced startup environment with lots of ownership
You have hands-on experience designing and operating high-throughput, low-latency infrastructure, including systems that handle massive concurrency, heavy query loads.
Curiosity about LLMs, retrieval and the future of AI systems, with a drive to stay at the forefront of new technology
Experience with performance optimization, load testing, and debugging production issues in large-scale systems.
Strong attention to system correctness, performance, and reliability, and a drive to continuously refine and improve production systems to perfection.
Familiarity with DevOps practices, including CI/CD pipelines, infrastructure as code, Kubernetes orchestration, and modern monitoring tools.
Top Skills
What We Do
Search. Extract. Crawl. The web access stack built for builders, by builders.
Tavily powers the next generation of agents with a suite of tools for real-time Search, structured data Extraction, and fully-rendered Crawling — everything agents need to access and reason over the live web.
Purpose-built for RAG, autonomy, and production-grade agent systems.






