Senior Staff AI Engineer

Reposted 15 Days Ago
7 Locations
In-Office or Remote
207K-290K Annually
Senior level
Artificial Intelligence • Information Technology
JazzX AI leverages advanced AI to transform enterprise operations, boosting both productivity and employee satisfaction.
The Role
The Senior Staff AI Engineer will lead the design and optimization of reinforcement learning systems for enterprise applications, focusing on scalable architectures and production deployment. Responsibilities include mentoring, collaboration, and ensuring compliance with AI ethics and safety standards.
Summary Generated by Built In
About JazzX AI: 
 
Vision: Enterprises operating on institutional intelligence—governed, self-improving, and scalable beyond individual expertise.
JazzX AI is defining the future of enterprise work—by building AI-native digital workers that actually get the job done.
 
We believe enterprises don't scale expertise—they lose it. Knowledge stays trapped in individuals, judgment gets applied inconsistently, and the best talent spends time on work that should run itself. We're changing that. 
JazzX AI transforms messy enterprise reality into institutional intelligence: governed digital workers that capture expert judgment, make every decision explainable, and continuously improve through real-world execution. The result is faster decisions, higher-quality outcomes, and reliable execution at scale—in domains where getting it wrong isn't an option.
 
We're starting with lending and due-diligence workflows—complex, regulated, high-stakes. From here, we're building the backbone for enterprise intelligence across industries.
 
This is early-stage, hard, and consequential work. If you want to be part of bringing AI systems to market that actually run in production, handle real complexity, and deliver real outcomes—not demos, not chatbots—JazzX AI is the place.

Headquartered in Los Altos, CA. Backed by SAIGroup.
 
About SAIGroup :
 
SAIGroup a private investment firm that has committed $1B to build and scale next-generation, AI-powered enterprise software companies. SAIGroup’s portfolio serves 2,000+ global enterprise customers, generates nearly $800M in annual revenue, and employs 4,000+ people worldwide — providing JazzX AI with long-term capital, deep operating expertise, and access to real-world enterprise scale from day one.

Learn more about JazzX AI:
Website: https://jazzx.ai
LinkedIn: https://www.linkedin.com/company/jazzx-ai

Learn more about SAIGroup:
Website: https://saigroup.ai

 

About the Role

We are seeking an experienced AI Engineer with deep expertise in Reinforcement Learning (RL) to join our team as a Senior Staff Architect. In this role, you will be responsible for shaping the vision, architecture, and technical execution of RL-driven AI reasoning models and systems that power next-generation enterprise AGI platform.

You will lead the design, development, and optimization of cutting-edge RL solutions, from experimentation and simulation through production deployment. This includes building scalable training architectures, architecting multi-agent and hierarchical RL frameworks, and ensuring that the RL systems are resilient, efficient, explainable and safe.

As a senior technical leader, you will partner with cross-functional teams—including product, core platform engineering, and research—to define architectural best practices, establish governance standards, and enable seamless integration of RL into our broader AGI platform. You will also drive innovation by exploring novel RL techniques, mentoring engineers and researchers, and ensuring the RL infrastructure can scale to support high-throughput training and real-world scenarios and enterprise use cases end to end.

Ultimately, your work will be critical in bridging research and production, ensuring that the latest RL advancements translate into reliable, impactful, and enterprise-ready AI solutions.

Key Responsibilities

  • Architecture & Design: Define and drive the end-to-end architecture for reinforcement learning–based systems, including training pipelines, simulation environments, reward shaping, and model serving.
  • Research & Development: Apply cutting-edge RL techniques (policy optimization, model-based RL, hierarchical RL, multi-agent RL, etc) to solve complex enterprise problems.
  • Scalability & Infrastructure: Design distributed training systems, leverage cloud-native infrastructure, and optimize for performance, reproducibility, and cost-efficiency.
  • Leadership & Mentorship: Provide technical leadership to AI engineers and researchers; mentor junior team members; review designs and code with a focus on scalability, robustness, and clarity.
  • Collaboration: Partner with product, data, and platform teams to align RL solutions with strategic business goals and integrate them into production systems.
  • Evaluation & Monitoring: Define frameworks for benchmarking, continuous evaluation, and feedback-driven improvements in deployed RL models.
  • Compliance & Safety: Ensure RL systems align with ethical AI practices, safety constraints, and regulatory standards for enterprises.

Required Qualifications

  • 10+ years of experience in AI/ML engineering, including at least 5 years specializing in reinforcement learning research and production systems.
  • Demonstrated success in designing and deploying large-scale RL architectures in enterprise environments.
  • Deep expertise in reinforcement learning algorithms, including on-policy (PPO, A3C) and off-policy (SAC, DDPG) methods, along with hands-on work in simulation frameworks (e.g., OpenAI Gym, Isaac Gym, PettingZoo, MuJoCo).
  • Practical experience with multi-agent reinforcement learning (MARL), including coordination strategies for complex environments.
  • Strong proficiency in Reinforcement Learning with Verifiable Rewards (RLVR) and GRPO-like policy optimization approaches, applying reinforcement learning principles both rigorously and pragmatically.
  • Experience with test-time compute optimization techniques, including inference-time search, chain-of-thought reasoning, and adaptive computation strategies for improving model performance during deployment.
  • Proven ability in large language model (LLM) training and fine-tuning, across both supervised and reinforcement learning–driven techniques.
  • Advanced software engineering skills in Python, C++, or Java, with deep expertise in ML frameworks such as TensorFlow, PyTorch, JAX, or Ray RLlib.
  • Hands-on experience with distributed training infrastructure (Kubernetes, GPU/TPU clusters, and cloud ML platforms).
  • Excellent communication, collaboration, and leadership skills, with experience working across multidisciplinary teams.

Preferred Qualifications

  • PhD in Computer Science, Machine Learning, Robotics, or related field.
  • Experience leading enterprise AI adoption and guiding organizational strategy for RL powered systems.
  • Contributions to open-source RL frameworks or publications in top-tier conferences (NeurIPS, AISTATS, ICML, ICLR, AAAI).
  • Background in safety, alignment, or explainability of RL agents.

We offer a competitive salary ranging from USD 207,000 to USD 290,000, along with equity options and an attractive benefits package, including health, dental, and vision insurance, flexible working arrangements, and more.

We are an equal opportunity employer and celebrate diversity at our company. We do not discriminate based on race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

 

Why Join JazzX AI:

What draws people to JazzX AI is the opportunity to build something truly foundational.

We are at the beginning of a new era — one where enterprise systems will no longer be static tools, but dynamic collaborators that learn, reason, and evolve alongside the people who use them. At JazzX AI, you’ll help build a platform that redefines how work happens in the AGI era.

JazzX AI brings together vision, deep technology, and purpose. This is not about experimenting with AI features; it’s about turning the promise of AGI into practical, auditable, and human-aligned systems that enterprises can trust and rely on at scale. The work here directly impacts productivity, efficiency, and decision-making in complex, real-world environments.

If you’re motivated by building systems that matter — systems that combine intelligence with responsibility — JazzX AI offers the chance to do that work at a deeper level, with real ownership and lasting impact.

Skills Required

  • 10+ years of experience in AI/ML engineering
  • 5 years specializing in reinforcement learning
  • Experience with large-scale RL architectures
  • Deep expertise in reinforcement learning algorithms
  • Practical experience with multi-agent reinforcement learning
  • Strong proficiency in Reinforcement Learning with Verifiable Rewards
  • Experience with test-time compute optimization techniques
  • Proven ability in LLM training and fine-tuning
  • Advanced software engineering skills in Python, C++, or Java
  • Hands-on experience with distributed training infrastructure
  • Excellent communication and leadership skills
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Los Altos, CA
33 Employees
Year Founded: 2024

What We Do

We are technologists, designers, builders, and innovators who are revolutionizing traditional business processes and practices with advanced AI technology. Our vision is a future where novel digital solutions support employees in their complex, knowledge-driven tasks while automating the routine ones. This not only improves work productivity and enjoyment, but also enables companies to become pacesetters within their industries. JazzX AI is owned by SAIGroup, one of the largest, fastest-growing investment firms in enterprise AI. SAIGroup is a private investment firm investing in business with the potential to become leaders in enterprise AI solutions. SAIGroup’s investments enable our portfolio companies to accelerate their innovation and growth. Principal owner and investor Dr. Romesh Wadhwani has made a commitment to invest up to $1 billion in SAIGroup. Visit saigroup.ai to learn more.

Similar Jobs

Arc (joinarc.com) Logo Arc (joinarc.com)

Staff Engineer

Fintech • Payments • Software • Financial Services
Remote or Hybrid
8 Locations
242 Employees
180K-300K Annually

Webflow Logo Webflow

Staff Engineer

Artificial Intelligence • Enterprise Web • Software • Design • Generative AI
Easy Apply
Remote
3 Locations
800 Employees
194K-285K Annually

Forward Financing Logo Forward Financing

Senior Data Scientist

Fintech • Financial Services
Remote
Alberta, AB, CAN
529 Employees
153K-195K Annually

Cencora Logo Cencora

Seasonal Associate/ Student/Coop

Healthtech • Logistics • Pharmaceutical
Remote
Yukon, YT, CAN
51000 Employees
40K-59K Annually

Similar Companies Hiring

Bellagent Thumbnail
Artificial Intelligence • Machine Learning • Business Intelligence • Generative AI
Chicago, IL
20 Employees
Golden Pet Brands Thumbnail
Digital Media • eCommerce • Information Technology • Marketing Tech • Pet • Retail • Social Media
El Segundo, California
178 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York City, NY
100 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account