We’re building the search engine for AI agents. Our API is designed from the ground up to power Retrieval-Augmented Generation (RAG) and real-time reasoning in AI systems. By connecting LLMs to high-quality, trustworthy web content, we help developers build agents that are not only intelligent but also informed.
We work with some of the most innovative teams in AI — from small startups shaping the ecosystem to the largest enterprises deploying AI at scale. Whether it’s powering sales assistants, research copilots, or internal knowledge tools, we’re the missing link between LLMs and the real world.
About the Role
This is a full-time, on-site role for a Forward Deployed Engineer based in our New York office.
We’re looking for a technically exceptional engineer who’s excited to work directly with our customers and partners to design, prototype, and deploy integrations of Tavily’s API into real-world agentic systems. As a Forward Deployed Engineer, you’ll be embedded with strategic partners and high-value customers—playing a critical role in both pre-sales and post-deployment success. You’ll help architect performant RAG pipelines, production-grade applications, and industry-specific GenAI use cases built on Tavily.
You’ll operate across the customer lifecycle—from early discovery and solution design to scaling usage and identifying new opportunities. You’ll bridge product, engineering, and customer success—translating feedback into scalable solutions and informing what we build next.
Responsibilities
Work directly with customers to understand their use cases and build custom integrations using Tavily’s API.
Participate in pre-sales efforts, including sales calls, technical discovery, and proof-of-concept scoping.
Design and implement prototypes, proofs of concept, and production-ready systems alongside client teams.
Provide hands-on guidance for RAG system design, LLM integration, and performance optimization.
Monitor API usage patterns and recommend improvements to increase reliability, efficiency, and coverage.
Collaborate with internal teams to influence roadmap priorities based on customer needs and recurring technical challenges.
Build reusable technical content including documentation, reference architectures, internal tools, and templates to scale future deployments.
Represent Tavily in customer conversations, architecture reviews, post-deployment tracking, and partner-facing technical engagements.
Contribute to strategic partnerships by supporting integrations, evangelizing Tavily in developer communities, and building joint solutions.
Qualifications
3+ years of software engineering experience, ideally in a customer-facing, consulting, or partner-focused role (e.g., Forward Deployed Engineer, Solutions Architect, Sales Engineer).
Strong backend skills and experience working with APIs, Python, and LLM toolchains (e.g., LangChain, LlamaIndex, vector DBs).
Deep understanding of Retrieval-Augmented Generation (RAG), prompt engineering, and agent architectures.
Strong interpersonal skills and comfort working directly with technical stakeholders at enterprises and partners.
Fast learner, highly autonomous, and excited to work in a fast-paced startup environment.
Based in New York City or willing to relocate.
Full-time employees at Tavily enjoy:
🤝 A young, open, and inclusive culture where everyone has real impact from day one
🧠 The chance to build alongside a fast-moving team at the forefront of agentic AI
🍽 Daily team lunches + fully stocked snacks to keep you energized
🦷 Full medical, dental, and vision insurance to keep you feeling your best
🌱 A deep-work culture that values curiosity, creativity, and continuous learning
🏙 Hybrid-friendly setup with offices in New York and Tel Aviv
🛫 Generous time off to rest, recharge, and explore
What We Do
Search. Extract. Crawl. The web access stack built for builders, by builders.
Tavily powers the next generation of agents with a suite of tools for real-time Search, structured data Extraction, and fully-rendered Crawling — everything agents need to access and reason over the live web.
Purpose-built for RAG, autonomy, and production-grade agent systems.