Senior Software Engineer - API Gateway

Posted 5 Days Ago
Hiring Remotely in Canada
Remote
Senior level
Artificial Intelligence • Information Technology • Software
The Role
Develop and enhance the API gateway for an AI inference platform, focusing on feature implementation, bug fixes, infrastructure management, and reliability improvements.

About the Role

Featherless.ai is building the world’s most reliable and comprehensive open-model inference platform — the infrastructure powering the next generation of AI creators, startups, and enterprises. Our serverless approach to inference unlocks the best GPU utilization in AI infrastructure.

We’re hiring Senior Software Engineers to support and evolve the API gateway to our inference cloud, which is responsible for

  • authentication and inference to all models

  • subscription management and subscription entitlement (e.g. context-length, concurrency limits)

  • and providing the necessary API surface for applications and builders

The API gateway is constantly evolving in response to the unending stream of new models, modalities, clients, and inference load.

What you'll do

The API gateway is managed by the Platform Team, which aims to make Featherless the best place to find and use models. As a member of the Platform Team, you will

  • undertake feature development and bug fixes to keep up with clients, resolve user issues, and onboard new models

  • improve the reliability of the existing API (increasing instrumentation and monitoring, right-sizing infrastructure)

  • respond to availability incidents

  • triage and resolve issues of inference quality and reliability

  • manage the infrastructure on which our gateway runs

What you'll bring

  • first-hand experience of the users we’re building for (familiarity with popular open LLMs and common clients, and experience building with LLMs)

  • experience with the technologies and paradigms of the web (REST, WebSockets, DNS, networking, OpenTelemetry)

  • experience with significant components of our stack (k8s, node, mikro-orm, fastify, redis, mongodb, python, elastic cloud, cloudflare, sentry, otel)

  • ability to debug complex issues across a wide stack and build instrumentation as necessary

  • desire to work collaboratively as part of a skilled team

  • alignment with team and company values, including

    • bias to action

    • responsiveness to users (bug-fixes over features)

    • instinct to iterate

    • a belief that "done" means proven by usage data

Other

This team operates on Eastern Time. We are remote, with a preference for hiring in Toronto, Canada.

Top Skills

Cloudflare
DNS
Elastic Cloud
Fastify
K8S
Mikro-Orm
MongoDB
Networking
Node.js
OpenTelemetry
Otel
Python
Redis
REST
Sentry
Websockets

The Company
HQ: San Francisco, California
20 Employees
Year Founded: 2023

What We Do

We enable serverless inference via our GPU orchestration and model load-balancing system. We unlock fine-tuning by enabling organizations to size their server fleet to throughput needs, not to the number of models in the catalogue.

See it in action on our public cloud, which offers inference for 10k+ open weight models.
