Staff AI Engineer - LLM expert

Reposted 8 Days Ago
7 Locations
In-Office
200K-240K Annually
Senior level
Artificial Intelligence • Fintech
The Role
Lead the development and optimization of LLM-powered multi-agent systems for financial institutions, focusing on safe data interactions and improved AI models, while driving innovation in reinforcement learning and inference methods.
Summary Generated by Built In

Banking is being reimagined, and customers expect every interaction to be easy, personal, and instant.

We are building a universal banking assistant that millions of U.S. consumers can use to transact across all financial institutions and, over time, autonomously drive their financial goals. Powered by our proprietary BankGPT platform, this assistant is positioned to displace age-old legacy systems within financial institutions and own the end-to-end CX stack, unlocking a $200B opportunity and potentially replacing multiple publicly traded companies.

Ultimately, our mission is to drive financial well-being for millions of consumers.

With over two-thirds of Americans living paycheck to paycheck, 50% holding less than $500 in savings, and only 17% financially literate, we aim to put financial well-being on autopilot to help solve this problem.


About the Role

We’re hiring a Staff Engineer – Core AI to design, experiment, and scale the next generation of LLM-powered multi-agent systems that enable intelligent, secure, and compliant automation for financial institutions. This role goes beyond integrating third-party APIs — it’s about building differentiated intelligence: training, tuning, and evolving models that reason, plan, and act autonomously in high-stakes environments. You’ll work at the intersection of LLM research, applied reinforcement learning, and AI systems engineering, driving innovation in model fine-tuning, prompt optimization, encryption for inference, and speech-to-speech AI.

Your mission: create the AI runtime layer that powers adaptive, explainable, and policy-aligned agents — at scale.

What You’ll Own 

As the lead for LLM engineering, you’ll define how models learn, optimize, and safely interact with sensitive financial data. You’ll be responsible for:

  • Model Evolution: Building fine-tuning pipelines, exploring open-weight models, and benchmarking their performance against proprietary LLMs.
  • Inference Optimization: Driving high-throughput, low-latency inference strategies across GPUs, TPUs, and distributed inference clusters.
  • Safety & Guardrails: Designing data-safe pipelines with encryption for model I/O, and implementing automated PII detection and masking at both prompt and response layers.
  • RL-Based Learning: Applying Reinforcement Learning (RLHF/RLAIF), reward modeling, and policy optimization to continuously improve model performance.
  • Speech-to-Speech and Multimodal AI: Exploring speech model architectures (ASR/TTS) and building adaptive pipelines for natural, real-time conversational intelligence.
  • POCs & Experimentation: Rapidly prototyping emerging models, toolchains, and optimization methods to maintain a competitive edge.
  • Framework Leadership: Collaborating with research and backend teams to evolve our custom AI orchestration layer — combining multiple specialized models, memory systems, and evaluation tools.

What You’ll Do

  • Lead Fine-Tuning and Experimentation: Create fine-tuning workflows using LoRA, PEFT, and instruction-tuning pipelines; manage large-scale training datasets.
  • Drive Auto-Prompt Optimization: Build self-evolving prompt evaluation loops using reinforcement learning, reward modeling, and continuous evaluation frameworks.
  • Accelerate Inference Throughput: Optimize model inference through quantization, batching, caching, and high-performance serving strategies.
  • Implement Encrypted Inference: Develop novel encryption and key management techniques for model-level data protection during inference.
  • Design Guardrail Systems: Implement policy layers that enforce safety, prevent hallucinations, and ensure compliance (SOC 2, GDPR).
  • Integrate Speech Models: Develop and optimize speech-to-speech pipelines, managing end-to-end latency, transcription accuracy, and model adaptation.
  • Run Advanced Evals: Establish evaluation harnesses that measure factual accuracy, latency, cost-efficiency, and safety compliance in production environments.
  • Research and Publish: Explore the latest advancements in open-source LLMs and reinforcement learning for agents, driving our internal AI innovation roadmap.

What We’re Looking For

Required Qualifications

  • Strong LLM Expertise: 2+ years of experience working directly with transformer architectures and LLM fine-tuning (e.g., Llama, Mistral, GPT, Mixtral, Gemma, Falcon, Claude).
  • Applied Reinforcement Learning: Hands-on experience with RLHF/RLAIF, reward modeling, and multi-objective optimization for generative models.
  • Prompt Optimization & Evaluation: Deep knowledge of auto-prompting, chain-of-thought evaluation, and self-improving agent loops.
  • Inference Engineering: Experience improving throughput, quantization, and token efficiency on GPUs or specialized inference hardware.
  • Data Security in AI: Knowledge of PII masking, data encryption, and secure model pipelines in production settings.
  • Modern AI Tooling: Experience with frameworks such as PyTorch, Transformers, DeepSpeed, Hugging Face, LangChain, or vLLM.

Preferred Qualifications

  • Experience with speech-to-speech or multimodal models (ASR, TTS, embeddings).
  • Understanding of AI evaluation frameworks (e.g., Evals, LlamaIndex benchmarks, or custom metrics).
  • Familiarity with financial data compliance and AI observability tools.
  • Exposure to low-level inference optimization (CUDA kernels, model parallelism).
  • Contributions to open-source LLM or RL research projects.

What Makes This Role Special?

  • You’ll shape the core AI that powers agentic intelligence for financial systems serving millions of users.
  • You’ll own a research-meets-engineering mandate — from exploring new models to bringing them to life in production.
  • You’ll define how autonomous AI systems learn, adapt, and remain safe in a regulated environment.
  • You’ll work with a team combining AI research, applied data science, and product engineering, moving fast with purpose and rigor.

Compensation

  • Compensation is expected to be between $200,000 and $240,000. Exact compensation may vary based on skills and location.

What We Offer

  • 💡 100% paid health, dental & vision care
  • 💰 401(k) match & financial wellness perks
  • 🌴 Discretionary PTO + paid parental leave
  • 🏡 Remote-first flexibility
  • 🧠 Mental health, wellness & family benefits
  • 🚀 A mission-driven team shaping the future of banking

At interface.ai, we are committed to providing an inclusive and welcoming environment for all employees and applicants. We celebrate diversity and believe it is critical to our success as a company. We do not discriminate on the basis of race, color, religion, national origin, age, sex, gender identity, gender expression, sexual orientation, marital status, veteran status, disability status, or any other legally protected status. All employment decisions at interface.ai are based on business needs, job requirements, and individual qualifications. We strive to create a culture that values and respects each person's unique perspective and contributions. We encourage all qualified individuals to apply for employment opportunities with interface.ai and are committed to ensuring that our hiring process is inclusive and accessible.

Top Skills

DeepSpeed
Hugging Face
LangChain
LLM Fine-Tuning
PyTorch
RLAIF
RLHF
Transformer Architectures
Transformers
vLLM

The Company
Covina, California
135 Employees
Year Founded: 2019

What We Do

interface.ai is an AI leader specializing in Intelligent Virtual Assistants for the financial sector. By integrating its deep banking domain understanding with AI, interface.ai is transforming interactions between banks, credit unions, their customers, and employees, leading the charge into the era of Interactive Intelligence for Banking.

Unveiling interface.ai's New Product Suite Powered By Generative AI:

Discover Sphere for Customers - an industry-first, ChatGPT-like universal channel revolutionizing banking through intelligent guidance, innovative plugins, and personalized AI assistance.

Discover Sphere for Employees - an industry-first, ChatGPT-like universal channel that replaces 14-15 applications traditionally juggled by frontline staff, thereby enhancing frontline operations' efficiency by 10x.

Learn more: https://interface.ai/solutions/sphere-generative-ai-assistant-chatgpt-for-creditunions-and-banks
