Software Engineer – AI Agents

Posted Yesterday
Be an Early Applicant
San Francisco, CA, USA
Hybrid
Mid level
Artificial Intelligence • Cloud • Generative AI • Infrastructure as a Service (IaaS)
The Role
Design, build, and maintain agent APIs and production agent applications for document understanding, advanced RAG, and customer support automation. Integrate open-source models, collaborate with backend and infra for deployment and monitoring, and ensure APIs are robust, scalable, and developer-friendly.
Summary Generated by Built In
About the Job

We’re seeking an Agent Engineer to design and build agentic features in our platform, including document understanding, advanced RAG, and customer support automation. In this role, you will develop not only the agent components themselves, but also the Friendli Agent API, which serves as the core developer interface for building and extending agent applications. You will also build agent applications as production-ready examples of how agents can solve real-world problems.

These applications will be primarily written in Python and will serve as reference implementations for our customers and community. We are looking for a hands-on engineer who is passionate about building agent systems and making AI easy for developers to adopt. The ideal candidate is comfortable creating agent applications that showcase what is possible, is curious about and experienced with open-source models, and enjoys turning them into reliable, high-impact features.

Key Responsibilities
  • Design, build, and maintain agent APIs and applications that deliver document understanding and other high-value features

  • Evaluate and integrate open-source models to power production-ready agent features where possible

  • Develop reference agent applications to showcase workflows and accelerate customer adoption

  • Collaborate with backend and infrastructure teams to integrate agents with deployment, orchestration, and monitoring systems

  • Ensure APIs are robust, developer-friendly, and enterprise-ready through strong design principles and documentation

  • Continuously improve the reliability, scalability, and performance of agent features in production

Qualifications
  • 3+ years of experience in software engineering, preferably in backend, ML systems, or API development

  • Bachelor’s or Master's degree in Computer Science, Computer Engineering, or equivalent

  • Strong programming skills in Python; experience with various Python frameworks

  • Solid understanding of LLM workflows, agent patterns, or tool invocation systems

  • Experience designing and delivering production APIs

  • Familiarity with open-source LLMs and multimodal models (HuggingFace, LangChain, LlamaIndex, etc.)

  • Strong foundations in cloud-native development

Preferred Experience
  • Experience with document understanding pipelines (e.g., OCR, RAG, summarization, structured extraction)

  • Familiarity with Kubernetes or container orchestration in production

  • Built or contributed to agent frameworks, SDKs, or CLIs

  • Have worked in a startup or fast-paced environments with ownership and ambiguity

  • Passion for developer experience and enabling AI adoption

Benefits
  • Flexible working hours

  • Daily lunch and dinner provided; unlimited snacks and beverages

  • Supportive and highly collaborative work environment

  • Health check-up support and top-tier equipment/hardware support

  • A front-row seat to the generative AI infrastructure revolution

  • Competitive compensation, startup equity, health insurance, and other benefits.

About FriendliAI

FriendliAI is building the world’s best AI inference platform that makes large language and multi-modal models fast, efficient, and deployable at scale. We power high-throughput, low-latency AI workloads for organizations worldwide and integrate directly with Hugging Face, giving developers instant access to over 500,000 open-source models.

We are a small, fast-moving team doing work that matters at one of the most exciting moments in the history of technology. With our world-class inference engine, we are building a platform that the AI industry can actually rely on.

Skills Required

  • 3+ years of experience in software engineering, preferably backend, ML systems, or API development
  • Bachelor's or Master's degree in Computer Science, Computer Engineering, or equivalent
  • Strong programming skills in Python; experience with various Python frameworks
  • Solid understanding of LLM workflows, agent patterns, or tool invocation systems
  • Experience designing and delivering production APIs
  • Familiarity with open-source LLMs and multimodal models (HuggingFace, LangChain, LlamaIndex, etc.)
  • Strong foundations in cloud-native development
  • Experience with document understanding pipelines (OCR, RAG, summarization, structured extraction)
  • Familiarity with Kubernetes or container orchestration in production
  • Built or contributed to agent frameworks, SDKs, or CLIs
  • Experience working in startup or fast-paced environments with ownership and ambiguity
  • Passion for developer experience and enabling AI adoption
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
34 Employees
Year Founded: 2021

What We Do

FriendliAI is The Frontier AI Inference Cloud: an AI infrastructure platform that deploys, scales, and monitors large language and multimodal models. Its inference engine maximizes GPU utilization to deliver faster performance and steep cost savings for open-weight and custom models, while offering enterprise-grade reliability, SLAs, and compliance to help teams run generative AI and agent workloads at production scale.

Similar Jobs

Traba Logo Traba

Senior Software Engineer

Information Technology • Logistics • Software • 3PL: Third Party Logistics • Industrial • Manufacturing
In-Office
San Francisco, CA, USA
100 Employees
200K-240K Annually

Traba Logo Traba

Staff Software Engineer

Information Technology • Logistics • Software • 3PL: Third Party Logistics • Industrial • Manufacturing
In-Office
San Francisco, CA, USA
100 Employees
240K-300K Annually

Pylon (usepylon.com) Logo Pylon (usepylon.com)

Software Engineer

Artificial Intelligence • Software
In-Office
San Francisco, CA, USA
43 Employees
180K-300K Annually

LiveFlow Logo LiveFlow

Software Engineer

Fintech • Software • Financial Services
Hybrid
San Francisco, CA, USA
52 Employees

Similar Companies Hiring

Hanover Park Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
42 Employees
LTX Thumbnail
Conversational AI • Generative AI
Jerusalem, Israel
360 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account