Machine Learning Operations Engineer

Posted 3 Days Ago
Somerville, MA, USA
Hybrid
150K-200K Annually
Mid level
Artificial Intelligence • Cloud • Gaming • Machine Learning • Software • Virtual Reality • Cybersecurity
Reimagining voice online—so every conversation feels safe, respectful, and empowering for everyone.
The Role
The Machine Learning Operations Engineer will ensure reliability and performance of ML model inference systems, overseeing deployments, monitoring, and incident response, while collaborating on infrastructure and optimization.
Summary Generated by Built In

Modulate is the leader in conversational voice intelligence. We enable enterprises to deeply understand how people communicate and take timely action based on those insights. Our products help detect harm, prevent fraud, and build safer, more trusted online and real-world voice environments. We are building a Conversation Intelligence Platform — APIs, workflows, and applications that bring voice understanding to customers at enterprise scale.

We’re looking for a Machine Learning Operations Engineer to own and scale the production inference systems behind Modulate’s machine learning models. This role will focus on ensuring high availability, reliability, and efficiency of deployed models across our APIs and enterprise products as we rapidly grow in customer usage and model demand.

Your Impact

  • Own the reliability and performance of ML model inference systems in production

  • Ensure high availability of deployed models across APIs and enterprise products

  • Build systems to handle scaling, load variability, and production traffic growth

  • Reduce operational burden through better tooling, automation, and processes

  • Help define how Modulate runs ML systems at scale with reliability and efficiency

What You Will Do

  • Deploy, monitor, and maintain production machine learning inference systems

  • Oversee fleets of inference machines and ensure system health and performance

  • Design monitoring, alerting, and incident response systems for ML workloads

  • Participate in on-call rotations and lead incident response and debugging

  • Build systems and processes for scaling inference infrastructure under variable load

  • Improve reliability and observability of production ML services

  • Collaborate on infrastructure-as-code for production deployments

  • Support or contribute to GPU-based training and inference infrastructure

  • Work closely with ML and engineering teams to ensure smooth model deployments

  • (Optional growth area) Optimize model inference performance and latency

What We Are Looking For

  • Experience deploying and maintaining production software systems

  • Experience building monitoring and alerting systems for production environments

  • Experience with on-call rotations and incident response

  • Strong experience with AWS, Python, and Linux

  • Exposure to PyTorch or similar ML frameworks

  • Experience working with GPU-based applications and basic GPU tooling (drivers, runtime, monitoring)

  • Strong debugging and systems thinking skills

  • Ability to operate calmly in production incident environments

Nice to Have

  • Experience with ML model serving systems or dedicated model servers

  • Experience monitoring GPU performance for inference workloads

  • Experience optimizing machine learning model inference

  • Familiarity with audio or multimedia data (codecs, streaming, real-time systems)

  • Experience with infrastructure-as-code (e.g., Terraform, CloudFormation)

Benefits

  • Competitive salary + equity

  • Full health, dental, and vision coverage

  • Flexible PTO with strong culture of taking it

  • Weekly team lunches with dietary accommodations

  • Hybrid work with core in-office days and flexible remote options

  • Leadership and technical learning sessions

  • Career development and continued learning support

  • Up to 8 weeks work-from-anywhere policy

  • A deeply inclusive, human-centered culture

Pay Transparency
Modulate believes in transparency as a cornerstone of equity and trust. Compensation for this role is based on seniority, skills, and experience.

  • Salary: $150-$200K

  • Equity: Offered

  • Other perks: HSA, FSA, 15 holidays, professional growth resources

About Modulate
Modulate is on a mission to make voice a force for good online. Our tools help communities thrive by proactively detecting toxic behavior, protecting user identity, and empowering safety teams. We’re trusted by leaders in gaming and beyond—and we’re growing fast. We believe that great cultures don’t just happen. That’s why we’ve built a foundation of intentional systems: from bias-reducing hiring practices to transparent pay to tools that help teams collaborate across communication styles. At Modulate, we treat people like people—and we’re building technology that does the same.

Ready to join us? Apply here or reach out directly—we’re excited to meet you.

A quick note as you apply

  • Please apply through the website rather than emailing [email protected]

  • For application questions (“Your fit for the role,” “Your values/goals,” “Why Modulate?”), focus on relevant experience and motivations

  • Avoid including protected demographic information

  • Keep responses authentic and in your own voice

Skills Required

  • Experience deploying and maintaining production software systems
  • Experience building monitoring and alerting systems for production environments
  • Experience with on-call rotations and incident response
  • Strong experience with AWS, Python, and Linux
  • Exposure to PyTorch or similar ML frameworks
  • Experience working with GPU-based applications and basic GPU tooling
  • Strong debugging and systems thinking skills
  • Ability to operate calmly in production incident environments
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Somerville, MA
41 Employees
Year Founded: 2017

What We Do

Modulate designs and builds prosocial voice intelligence, empowering platforms to transform voice interactions into safe, trustworthy experiences. Our flagship product, ToxMod, is a proactive voice moderation system that goes beyond keyword scanning. Utilizing layered machine learning models—designed to detect emotional tone, speech patterns, conversational context, and intent—ToxMod identifies toxic behaviors as they occur. This nuanced approach allows moderators and trust-and-safety teams to intervene before harm escalates, not after, significantly reducing negative interactions in real time. As of October 2024, ToxMod has moderated over 160 million hours of voice data, resulting in more than 80 million moderator interventions that make online environments safer and more welcoming for users. While gaming was our launchpad, ToxMod now extends its protection into gig economy and customer service platforms. With Confidential Connect, platforms can safeguard privacy by anonymizing phone interactions and surfacing actionable insights when harm or fraud is detected in voice communication. Our modular model architecture—with specialized detectors for emotion, deception, harassment, phishing, and more—is both highly accurate and explainable. This allows for rapid adaptation: when collaborating with a gig platform in early 2024, we stood up a fraud-detection module within a month using the same system. Beyond voice safety, Modulate empowers platforms with compliance support and community health strategies. Our tools drive insights for transparency reporting, code-of-conduct alignment, and evolving regulatory needs—like those shaped by AI policy, trust and safety frameworks, and user privacy laws. In summary: Modulate transforms voice—from chat, phone, or AI agent—to an asset for trust, safety, and user well-being. We're building tools that don’t just respond to harm—they anticipate it. Whether it’s protecting gamers, customer-care agents, gig workers, or social communities, Modulate empowers platforms to safeguard human connection in the voice-first era.

Why Work With Us

Modulate stands out by combining real-time voice‑native AI with a mission-driven focus on safety, inclusion, and explainability. Working here means building cutting-edge, trust-first technology that protects real communities—while growing in a transparent, people-centered culture that values impact just as much as innovation.

Gallery

Gallery

Similar Jobs

Kensho Technologies Logo Kensho Technologies

Machine Learning Operations Engineer II

Artificial Intelligence • Fintech • Machine Learning • Natural Language Processing • Software • Generative AI
Hybrid
Cambridge, MA, USA
175 Employees
130K-175K Annually

Foundation EGI Logo Foundation EGI

ML Ops Engineer (Boston, MA)

Artificial Intelligence • Design • Generative AI • Manufacturing
In-Office or Remote
Boston, MA, USA
27 Employees
In-Office
Somerville, MA, USA
296 Employees
121K-157K Annually
In-Office
2 Locations
26747 Employees
130K-175K Annually

Similar Companies Hiring

Bellagent Thumbnail
Artificial Intelligence • Machine Learning • Business Intelligence • Generative AI
Chicago, IL
20 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York City, NY
100 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account