Senior Machine Learning Engineer

Posted 2 Days Ago
Chicago, IL
Hybrid
140K-180K Annually
Senior level
Artificial Intelligence • Healthtech • Natural Language Processing • Software • Conversational AI
We are transforming the way businesses interact with their customers.
The Role
The Senior Machine Learning Engineer will develop and optimize voice-centric AI systems, manage ML lifecycle, and ensure high-quality conversational experiences in production environments.
Summary Generated by Built In

Job Summary

We’re looking for a hands-on, entrepreneurial Senior Machine Learning Engineer who has already taken voice-centric AI systems (TTS, STT, LLM-driven dialog) from prototype to planet-scale production. You will own the full lifecycle of our ML stack—research, data pipelines, training, evaluation, deployment, and relentless optimisation—so that millions of patients can have natural, sub-second conversations with our Agentic Voice platform. You’ll collaborate tightly with product, infra, and compliance teams, set a high technical bar for ML excellence

What sets this role apart: You'll specialize in creating highly optimized, domain-specific conversational AI models by fine-tuning and compressing existing LLMs and specialized conversational architectures for specific use cases. We need someone who can rapidly research, prototype, and deploy smaller, faster, cheaper models that outperform general-purpose solutions in conversational settings - achieving 10x speed improvements and 90% cost reductions while building efficient pipelines for intent classification, dialogue management, and text-based optimization systems that improve conversational quality of our dialogue systems.


Key Responsibilities

Advanced Model Optimization & Fine Tuning

Apply LoRA, QLoRA, DPO, RLHF and parameter-efficient methods to create smaller, faster models optimized for conversational contexts

Implement quantization, pruning, knowledge distillation to significantly reduce model size while preserving quality

Work with modern conversational architectures: DeBERTa, SetFit, sentence transformers, lightweight decoder models for domain-specific use cases

Rapidly evaluate and adapt latest research for conversational applications

End-to-End ML Engineering

Design, build, and maintain high-performing STT, TTS, and LLM pipelines that operate at < 800 ms end-to-end latency and thousands of concurrent calls.

Train and fine-tune smaller, task specific LLMs optimized for accuracy, latency and cost efficiency in real time applications.

Inference at Scale

Optimise GPU- and CPU-based serving on EKS / Kubernetes using techniques such as dynamic batching, quantisation, speculative decoding, and streaming gRPC / WebSockets.

Dialogue Management

Implement and extend LangGraph / LangChain flows and Model Context Protocol (MCP) schemas to orchestrate complex multi-turn healthcare conversations safely and compliantly.

Data & Evaluation

Build robust data pipelines (Kafka → Snowflake / S3) for conversation logs; design offline and online evaluation frameworks for ASR/WER, TTS MOS, and task-completion metrics.

Technical Leadership

Establish ML best practices—versioning, monitoring, A/B gating, CICD for models—and mentor engineers on ML ops, audio processing, and prompt engineering.

Cross-Functional Collaboration

Work daily with product managers, designers, compliance leads, and customer teams to translate business goals into scalable voice experiences.

Innovation & Research

Stay on the cutting edge of open-source speech and LLM research; run rapid POCs (e.g., Whisper-v3, Bark), explore efficient fine tuning techniques(e.g. LORA, DPO), and continuously improve model performance in production environments.

Reliability & Compliance

Ensure HIPAA-grade security, auditable PHI handling, guardrails, and fallback strategies to keep conversations safe and reliable 24 × 7.


Qualifications

Education

  • B.S. or M.S. in Computer Science, Machine Learning, or related field.

Experience

  • 7+ years building production ML systems, 2+ specifically in speech / conversational AI.
  • Proven track record shipping voice AI or large-scale LLM products to tens-of-millions of users or thousands of concurrent sessions.


Technical Expertise

  • Advanced Fine-tuning & Model Compression:

Proven experience with parameter-efficient fine-tuning techniques (LoRA, QLoRA, adapters) for conversational applications

Knowledge of few-shot learning frameworks for conversational tasks with limited data

Experience with model compression techniques (quantization GPTQ/AWQ, pruning, knowledge distillation) for real-time inference

  • Speech: Deep understanding of ASR (e.g., Whisper, NeMo, Kaldi) and TTS (e.g., Tacotron, FastSpeech, VITS) model internals and evaluation.
  • LLMs & Dialogue: Experience with GPT-class models, fine-tuning, RAG, LangGraph, LangChain, MCP, prompt-engineering and safety guardrails.
  • Languages: Expert in Python; proficiency in TypeScript / Node and/or Java is a plus.
  • MLOps & Infra: Kubernetes (AWS EKS), Helm, Terraform, MLflow / SageMaker, model-aware CI/CD, feature stores, GPU scheduling, autoscaling.
  • Data: Kafka, Redis, Postgres, Snowflake; designing real-time and batch pipelines for audio and text.
  • Protocols: gRPC, WebSockets, HTTP/2 streaming, RTP/WebRTC.
  • Security & Compliance: Experience securing PHI/PII, HIPAA/HITRUST controls, and SOC2 processes.

Soft Skills:

  • Product Mindset: Proven ability to make strategic product decisions with a focus on user needs and business impact.
  • Entrepreneurial: Experience taking ideas all the way from ideation to execution. Instead of waiting for tasks, you’re proactively identifying areas of opportunity and building them out.
  • Leadership Skills: Demonstrated experience in leading technical teams and mentoring engineers.
  • Problem-Solving: Excellent analytical and problem-solving abilities.
  • Communication: Strong verbal and written communication skills; ability to articulate complex technical concepts to non-technical stakeholders.


How we work:

  • Small, cross‑functional pods with a tech lead (Data Science/Engineering owns sprint tickets; product owns the what/why and outcomes).
  • Bias to prototype → validate → build; instrument everything; learn fast.
  • High autonomy, high bar, candid feedback, low politics.
  • Hybrid: 3 days/week in our West Loop office; occasional travel for customer meetings and team onsites.


Compensation & benefits:

Competitive base + equity; comprehensive health benefits; flexible PTO.


XSELL is committed to a culture of teamwork; where everyone works together to plan, do, learn, and continuously improve. We accomplish that by staying true to our core values.

  • Best Chapter: Every XSELLer is plugged in and focused on writing their “best chapter yet”, both personally and professionally. We believe in working hard to achieve success, but that success only comes if we are doing it together. We do this with a high level of humility, integrity, and compassion towards our coworkers. We celebrate and recognize each other and have a lot of fun along the way.
  • Know Us By Our Results: We do what we say and say what we do. Our coworkers and clients will “know us by our results” – we welcome that and embrace transparency and measurement.
  • Do It The XSELL Way: Together we are building an inclusive culture full of top-performing, talented people that are striving towards common goals with resilience. People will admire not only the work we do but also that we “do it the XSELL way” – as one team. We do this through strong communication, collaboration, and accountability to each other.
  • Open for Business: We are always “open for business” – fiercely committed to improving ourselves, our team, and our company. We stay curious and approach every situation as an opportunity to learn and grow.
  • Conversations Happen In the Room: Feedback is imperative to our collective success. We approach the “conversation in the room” with respect, empathy, and candor. Our dialogue with each other is always open and honest.


XSELL Technologies is an Equal Employment Opportunity Employer and all employment decisions shall be made without regard to age, race, creed, color, religion, sex, national origin, ancestry, disability status, veteran status, sexual orientation, gender identity or expression, genetic information, marital status, citizenship status or any other basis as protected by federal, state, or local law.

We are committed to the full inclusion of all qualified individuals. As part of this commitment, we will ensure that persons with disabilities are provided reasonable accommodations. If reasonable accommodation is needed to participate in the job application or interview process, perform essential job functions, and/or receive other benefits and privileges of employment, please contact us.

Top Skills

Aws Eks
Grpc
Helm
Http/2
Java
Kafka
Kubernetes
Mlflow
Node.js
Postgres
Python
Redis
Sagemaker
Snowflake
Terraform
Typescript
Websockets
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Chicago, IL
60 Employees
Year Founded: 2014

What We Do

At XSELL we leverage augmented intelligence to deliver digital personalization at scale. Our unique business model brings together proprietary cloud-based natural language processing with deep human expertise to empower the best real-time responses to exponentially increase sales and conversion rates alongside an unmatched customer service experience.

Why Work With Us

At XSELL, you will be poised to write the best chapter of your career. The time is now!
We have spent the last four years envisioning and creating a platform that today is transformative, but tomorrow will be mainstream. The next chapter of our company's growth is right around the corner.

Gallery

Gallery

Similar Jobs

In-Office or Remote
Chicago, IL, USA
2676 Employees
In-Office
4 Locations
20002 Employees
117K-176K Annually
In-Office
4 Locations
1001 Employees
124K-198K Annually

NVIDIA Logo NVIDIA

Senior AI and ML HPC Cluster Engineer

Artificial Intelligence • Computer Vision • Hardware • Robotics • Metaverse
In-Office or Remote
6 Locations
21960 Employees
136K-265K Annually

Similar Companies Hiring

Standard Template Labs Thumbnail
Software • Information Technology • Artificial Intelligence
New York, NY
10 Employees
PRIMA Thumbnail
Travel • Software • Marketing Tech • Hospitality • eCommerce
US
15 Employees
Scotch Thumbnail
Software • Retail • Payments • Fintech • eCommerce • Artificial Intelligence • Analytics
US
25 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account