Best Tech Jobs & Startup Jobs 2026

FriendliAI

Software Engineer – GPU Kernel

7 Days AgoSaved

Hybrid

San Francisco, CA, USA

Mid level

Artificial Intelligence • Cloud • Generative AI • Infrastructure as a Service (IaaS)

Design and optimize high-performance GPU kernels (GEMM, attention, routing) for AI inference across NVIDIA and AMD GPUs. Implement CUDA/C++ and low-level assembly code, build reduced-precision/quantized (FP8/FP4) kernels, benchmark cross-vendor performance, contribute to internal GPU libraries, accelerate multi-modal pipelines, and integrate next-generation GPU features into production.

Top Skills: AmdC++CudaCutlassFp4Fp8GemmGpu AssemblyHipNvidiaRocmTriton

FriendliAI

QA Engineer

7 Days AgoSaved

Hybrid

San Francisco, CA, USA

Mid level

Artificial Intelligence • Cloud • Generative AI • Infrastructure as a Service (IaaS)

Own quality for FriendliAI's full SaaS stack, including backend microservices, frontend, model deployments, and inference. Build pytest automated suites, Locust performance tests, Playwright end-to-end tests, and design strategies for validating LLM inference and model deployment workflows.

Top Skills: Hugging FaceLlm ServingLocustMicroservicesMulti-CloudPlaywrightPytestPython

FriendliAI

Software Engineer – AI Inference Engine

7 Days AgoSaved

Hybrid

San Francisco, CA, USA

Senior level

Artificial Intelligence • Cloud • Generative AI • Infrastructure as a Service (IaaS)

Design, implement, and optimize GPU kernels, kernel compiler, memory planner, and runtime for low-latency generative AI inference. Analyze performance bottlenecks across hardware and software, collaborate with infrastructure teams, and maintain production profiling, benchmarking, and validation tooling while supporting new model architectures and multi-GPU strategies.

Top Skills: BenchmarkingC++Compiler InfrastructureDiffusion ModelsDistributed InferenceGpu KernelsKernel CompilerMulti-GpuProfilingPythonRuntime SystemsTransformer Models

FriendliAI

Software Engineer – GPU Kernel

7 Days AgoSaved

In-Office

Seoul, KOR

Mid level

Artificial Intelligence • Cloud • Generative AI • Infrastructure as a Service (IaaS)

Design, implement, and optimize high-performance GPU kernels (GEMM, attention, routing), develop CUDA/ROCm C++ code including low-level assembly, implement reduced-precision/quantized kernels (FP8/FP4), benchmark and ensure parity across NVIDIA and AMD, contribute to GPU libraries, accelerate multi-modal pipelines, and integrate next-generation GPU features into production inference engine.

Top Skills: AmdC++CudaCutlassFp4Fp8GemmGpu AssemblyHipNvidiaRocmTriton

FriendliAI

Software Engineer – Senior Backend

7 Days AgoSaved

In-Office

Seoul, KOR

Senior level

Artificial Intelligence • Cloud • Generative AI • Infrastructure as a Service (IaaS)

Own and evolve core backend microservices for an AI inference platform, building production-grade APIs and multi-tenant SaaS capabilities (authentication, RBAC, billing). Design data models and pipelines across PostgreSQL and ClickHouse, collaborate on multi-cloud orchestration, ensure reliability and performance, and drive engineering quality through testing and CI/CD.

Top Skills: CliClickhouseFastapiGraphQLGrpcKubernetesLlm ServingMulti-CloudNext.JsOlapOltpOpentelemetryPostgresPythonReactRestSdk

Top Tech Jobs & Startup Jobs

Software Engineer – GPU Kernel

QA Engineer

Software Engineer – AI Inference Engine

Software Engineer – GPU Kernel

Software Engineer – Senior Backend

Cut your apply time in half.

Senior Product Manager

Solutions Architect - AI Inference Specialist

Software Engineer – Python Developer Tools

Software Engineer - Senior Backend

Software Engineer – AI Agents

Top Companies Hiring

FriendliAI

Popular Job Searches

US Jobs

International Jobs

Total selected ()