Software Engineer - Full Stack

Posted Yesterday
Be an Early Applicant
San Francisco, CA, USA
Hybrid
Senior level
Artificial Intelligence • Cloud • Generative AI • Infrastructure as a Service (IaaS)
The Role
Design, build, and maintain a scalable web platform and APIs for deploying and monitoring multimodal AI models and agent workflows. Collaborate with product, infrastructure, and design teams to optimize performance, ensure reliability, drive CI/CD and testing, and contribute to long-term architecture decisions for a cloud-native, multi-tenant SaaS system.
Summary Generated by Built In
About the Job

We’re seeking a Full-Stack Engineer to design, build, and scale our web platform, which serves as the core interface for deploying multimodal models, observing workloads, and building agent workflows. In this role, you’ll work closely with product, infrastructure, and design teams to create high-performance, developer-friendly, and enterprise-ready tools.

We are looking for a hands-on engineer who is eager to work at the intersection of infrastructure, developer experience, and AI applications. The ideal candidate is a talented full-stack developer, strong collaborator, and someone who enjoys working across the stack, cares deeply about developer workflows, and is excited to help define the future of AI adoption.

Key Responsibilities
  • Design, build, and maintain web applications and tools for AI model deployment, monitoring, and performance optimization

  • Develop clean, scalable, and robust APIs powering AI agents, workflows, and user-facing systems

  • Collaborate with infrastructure engineers to integrate backend systems with deployment and orchestration pipelines

  • Optimize the performance and usability of web interfaces

  • Drive code quality through automated testing, CI/CD, and code reviews

  • Contribute to architecture and design decisions that shape our platform’s long-term direction

  • Identify and resolve technical debt and improve system reliability in production systems

Qualifications
  • 5+ years of industry experience in full-stack or backend engineering

  • Bachelor’s or Master's degree in Computer Science, Computer Engineering, or equivalent

  • Fluent in TypeScript and Python, Expert with React/Next.js

  • Strong backend experience with FastAPI or similar Python frameworks

  • Proven expertise in delivering production-scale full-stack applications

  • Proficiency in designing data models, writing SQL, and working with PostgreSQL

  • Deep understanding of modern web frameworks and component-driven architecture

  • Strong API design experience across gRPC/REST/GraphQL in production systems

  • Solid foundation in cloud-native development

  • Familiarity with OpenTelemetry tracing, metrics, and structured logging

  • Knowledge of web security, authentication, RBAC, and multi-tenant SaaS systems

Preferred Experience
  • Familiarity with LLM-based workflows, tool invocation, or agentic systems

  • Familiarity with Kubernetes for container orchestration, including deploying, scaling, and managing containerized applications in production environments

  • Have worked in a startup or fast-paced environments with ownership and ambiguity

  • Built developer-facing SDKs/CLIs

  • Passion for developer experience and enabling AI adoption

Benefits
  • Flexible working hours

  • Daily lunch and dinner provided; unlimited snacks and beverages

  • Supportive and highly collaborative work environment

  • Health check-up support and top-tier equipment/hardware support

  • A front-row seat to the generative AI infrastructure revolution

  • Competitive compensation, startup equity, health insurance, and other benefits.

About FriendliAI

FriendliAI is building the world’s best AI inference platform that makes large language and multi-modal models fast, efficient, and deployable at scale. We power high-throughput, low-latency AI workloads for organizations worldwide and integrate directly with Hugging Face, giving developers instant access to over 500,000 open-source models.

We are a small, fast-moving team doing work that matters at one of the most exciting moments in the history of technology. With our world-class inference engine, we are building a platform that the AI industry can actually rely on.

Skills Required

  • 5+ years of industry experience in full-stack or backend engineering
  • Bachelor's or Master's degree in Computer Science, Computer Engineering, or equivalent
  • Fluent in TypeScript and Python
  • Expert with React and Next.js
  • Strong backend experience with FastAPI or similar Python frameworks
  • Proven expertise delivering production-scale full-stack applications
  • Proficiency in designing data models, writing SQL, and working with PostgreSQL
  • Strong API design experience across gRPC, REST, and GraphQL in production systems
  • Solid foundation in cloud-native development
  • Familiarity with OpenTelemetry tracing, metrics, and structured logging
  • Knowledge of web security, authentication, RBAC, and multi-tenant SaaS systems
  • Familiarity with LLM-based workflows, tool invocation, or agentic systems
  • Familiarity with Kubernetes for deploying and managing containerized applications
  • Experience working in startups or fast-paced environments with ownership
  • Experience building developer-facing SDKs or CLIs
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
34 Employees
Year Founded: 2021

What We Do

FriendliAI is The Frontier AI Inference Cloud: an AI infrastructure platform that deploys, scales, and monitors large language and multimodal models. Its inference engine maximizes GPU utilization to deliver faster performance and steep cost savings for open-weight and custom models, while offering enterprise-grade reliability, SLAs, and compliance to help teams run generative AI and agent workloads at production scale.

Similar Jobs

Adyen Logo Adyen

Software Engineer

Fintech • Payments • Financial Services
Easy Apply
Hybrid
San Francisco, CA, USA
4771 Employees
198K-293K Annually

Gusto Logo Gusto

Staff Software Engineer

Fintech • HR Tech
Easy Apply
Hybrid
3 Locations
4405 Employees
163K-247K Annually

Eve Logo Eve

Software Engineer

Legal Tech • Software • Generative AI
Easy Apply
Hybrid
San Mateo, CA, USA
180 Employees
175K-240K Annually

Capital One Logo Capital One

Lead Software Engineer

Fintech • Machine Learning • Payments • Software • Financial Services
Hybrid
3 Locations
55000 Employees
230K-286K Annually

Similar Companies Hiring

Hanover Park Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
42 Employees
LTX Thumbnail
Conversational AI • Generative AI
Jerusalem, Israel
360 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account