JazzX AI Jobs

Senior Staff AI Engineer

JazzX AI

Senior Staff AI Engineer

Reposted 16 Days Ago

7 Locations

In-Office or Remote

207K-290K Annually

Senior level

Artificial Intelligence • Information Technology

JazzX AI leverages advanced AI to transform enterprise operations, boosting both productivity and employee satisfaction.

The Role

The Senior Staff AI Engineer will lead the design and optimization of reinforcement learning systems for enterprise applications, focusing on scalable architectures and production deployment. Responsibilities include mentoring, collaboration, and ensuring compliance with AI ethics and safety standards.

Summary Generated by Built In

About JazzX AI:

Vision: Enterprises operating on institutional intelligence—governed, self-improving, and scalable beyond individual expertise.

JazzX AI is defining the future of enterprise work—by building AI-native digital workers that actually get the job done.

We believe enterprises don't scale expertise—they lose it. Knowledge stays trapped in individuals, judgment gets applied inconsistently, and the best talent spends time on work that should run itself. We're changing that.

JazzX AI transforms messy enterprise reality into institutional intelligence: governed digital workers that capture expert judgment, make every decision explainable, and continuously improve through real-world execution. The result is faster decisions, higher-quality outcomes, and reliable execution at scale—in domains where getting it wrong isn't an option.

We're starting with lending and due-diligence workflows—complex, regulated, high-stakes. From here, we're building the backbone for enterprise intelligence across industries.

This is early-stage, hard, and consequential work. If you want to be part of bringing AI systems to market that actually run in production, handle real complexity, and deliver real outcomes—not demos, not chatbots—JazzX AI is the place.

About SAIGroup :

SAIGroup a private investment firm that has committed $1B to build and scale next-generation, AI-powered enterprise software companies. SAIGroup’s portfolio serves 2,000+ global enterprise customers, generates nearly $800M in annual revenue, and employs 4,000+ people worldwide — providing JazzX AI with long-term capital, deep operating expertise, and access to real-world enterprise scale from day one.

About the Role

We are seeking an experienced AI Engineer with deep expertise in Reinforcement Learning (RL) to join our team as a Senior Staff Architect. In this role, you will be responsible for shaping the vision, architecture, and technical execution of RL-driven AI reasoning models and systems that power next-generation enterprise AGI platform.

You will lead the design, development, and optimization of cutting-edge RL solutions, from experimentation and simulation through production deployment. This includes building scalable training architectures, architecting multi-agent and hierarchical RL frameworks, and ensuring that the RL systems are resilient, efficient, explainable and safe.

As a senior technical leader, you will partner with cross-functional teams—including product, core platform engineering, and research—to define architectural best practices, establish governance standards, and enable seamless integration of RL into our broader AGI platform. You will also drive innovation by exploring novel RL techniques, mentoring engineers and researchers, and ensuring the RL infrastructure can scale to support high-throughput training and real-world scenarios and enterprise use cases end to end.

Ultimately, your work will be critical in bridging research and production, ensuring that the latest RL advancements translate into reliable, impactful, and enterprise-ready AI solutions.

Key Responsibilities

Architecture & Design: Define and drive the end-to-end architecture for reinforcement learning–based systems, including training pipelines, simulation environments, reward shaping, and model serving.
Research & Development: Apply cutting-edge RL techniques (policy optimization, model-based RL, hierarchical RL, multi-agent RL, etc) to solve complex enterprise problems.
Scalability & Infrastructure: Design distributed training systems, leverage cloud-native infrastructure, and optimize for performance, reproducibility, and cost-efficiency.
Leadership & Mentorship: Provide technical leadership to AI engineers and researchers; mentor junior team members; review designs and code with a focus on scalability, robustness, and clarity.
Collaboration: Partner with product, data, and platform teams to align RL solutions with strategic business goals and integrate them into production systems.
Evaluation & Monitoring: Define frameworks for benchmarking, continuous evaluation, and feedback-driven improvements in deployed RL models.
Compliance & Safety: Ensure RL systems align with ethical AI practices, safety constraints, and regulatory standards for enterprises.

Required Qualifications

10+ years of experience in AI/ML engineering, including at least 5 years specializing in reinforcement learning research and production systems.
Demonstrated success in designing and deploying large-scale RL architectures in enterprise environments.
Deep expertise in reinforcement learning algorithms, including on-policy (PPO, A3C) and off-policy (SAC, DDPG) methods, along with hands-on work in simulation frameworks (e.g., OpenAI Gym, Isaac Gym, PettingZoo, MuJoCo).
Practical experience with multi-agent reinforcement learning (MARL), including coordination strategies for complex environments.
Strong proficiency in Reinforcement Learning with Verifiable Rewards (RLVR) and GRPO-like policy optimization approaches, applying reinforcement learning principles both rigorously and pragmatically.
Experience with test-time compute optimization techniques, including inference-time search, chain-of-thought reasoning, and adaptive computation strategies for improving model performance during deployment.
Proven ability in large language model (LLM) training and fine-tuning, across both supervised and reinforcement learning–driven techniques.
Advanced software engineering skills in Python, C++, or Java, with deep expertise in ML frameworks such as TensorFlow, PyTorch, JAX, or Ray RLlib.
Hands-on experience with distributed training infrastructure (Kubernetes, GPU/TPU clusters, and cloud ML platforms).
Excellent communication, collaboration, and leadership skills, with experience working across multidisciplinary teams.

Preferred Qualifications

PhD in Computer Science, Machine Learning, Robotics, or related field.
Experience leading enterprise AI adoption and guiding organizational strategy for RL powered systems.
Contributions to open-source RL frameworks or publications in top-tier conferences (NeurIPS, AISTATS, ICML, ICLR, AAAI).
Background in safety, alignment, or explainability of RL agents.

We offer a competitive salary ranging from USD 207,000 to USD 290,000, along with equity options and an attractive benefits package, including health, dental, and vision insurance, flexible working arrangements, and more.

We are an equal opportunity employer and celebrate diversity at our company. We do not discriminate based on race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

Why Join JazzX AI:

What draws people to JazzX AI is the opportunity to build something truly foundational.

We are at the beginning of a new era — one where enterprise systems will no longer be static tools, but dynamic collaborators that learn, reason, and evolve alongside the people who use them. At JazzX AI, you’ll help build a platform that redefines how work happens in the AGI era.

JazzX AI brings together vision, deep technology, and purpose. This is not about experimenting with AI features; it’s about turning the promise of AGI into practical, auditable, and human-aligned systems that enterprises can trust and rely on at scale. The work here directly impacts productivity, efficiency, and decision-making in complex, real-world environments.

If you’re motivated by building systems that matter — systems that combine intelligence with responsibility — JazzX AI offers the chance to do that work at a deeper level, with real ownership and lasting impact.

Headquartered in Los Altos, CA. Backed by SAIGroup.

Learn more about JazzX AI:
Website: https://jazzx.ai
LinkedIn: https://www.linkedin.com/company/jazzx-ai

Learn more about SAIGroup:
Website: https://saigroup.ai

Skills Required

10+ years of experience in AI/ML engineering
5 years specializing in reinforcement learning
Experience with large-scale RL architectures
Deep expertise in reinforcement learning algorithms
Practical experience with multi-agent reinforcement learning
Strong proficiency in Reinforcement Learning with Verifiable Rewards
Experience with test-time compute optimization techniques
Proven ability in LLM training and fine-tuning
Advanced software engineering skills in Python, C++, or Java
Hands-on experience with distributed training infrastructure
Excellent communication and leadership skills

View all jobs at JazzX AI

View JazzX AI Profile

Report Job

Am I A Good Fit?

beta

Get Personalized Job Insights.

Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company

HQ: Los Altos, CA

33 Employees

Year Founded: 2024

What We Do

We are technologists, designers, builders, and innovators who are revolutionizing traditional business processes and practices with advanced AI technology. Our vision is a future where novel digital solutions support employees in their complex, knowledge-driven tasks while automating the routine ones. This not only improves work productivity and enjoyment, but also enables companies to become pacesetters within their industries. JazzX AI is owned by SAIGroup, one of the largest, fastest-growing investment firms in enterprise AI. SAIGroup is a private investment firm investing in business with the potential to become leaders in enterprise AI solutions. SAIGroup’s investments enable our portfolio companies to accelerate their innovation and growth. Principal owner and investor Dr. Romesh Wadhwani has made a commitment to invest up to $1 billion in SAIGroup. Visit saigroup.ai to learn more.