Senior ML Ops Developer

Posted 18 Hours Ago
Be an Early Applicant
Québec, QC
Senior level
Software
The Role
As a Senior Machine Learning Developer on the ML Platform team, you will create and maintain tools that facilitate the development, deployment, and testing of ML models. You will collaborate with Applied Scientists, improve model performance, and contribute to MLOps best practices for large-scale ML solutions.
Summary Generated by Built In

*Please note that we have 3 Senior Machine Learning Developer roles open hybrid, in Montreal and Quebec city locations, as well as remote across the Quebec and Ontario provinces.

Play a key role in shaping LLMOps best practices 

As a Senior Machine Learning Developer on the ML Platform team, you will play a key role in supporting the teams of applied scientists responsible for creating and training Large Language Models (LLMs) and other ML models at scale. 

Your primary responsibility will be to create and maintain a suite of tools and workflows that enable the efficient development of robust, scalable, and maintainable ML models. You will also work closely with Applied Scientists to accelerate the iteration and experimentation process, ensuring faster and more effective model development.

Here is what makes this opportunity exciting:

The ML unit at Coveo focuses on finding ways to apply the latest advances in Recommender Systems, Ranking Optimization, LLMs and NLP to build innovative solutions in e-commerce, self-service and other business verticals.

We solve real problems with real data, for hundreds of large enterprise clients all around the world, on a modern platform that serves over 100M requests and automatically trains thousands of ML models on a daily basis.

Here is a glimpse at your responsibilities:

  • Provide end-to-end ML tooling from data exploration to production deployment tooling.
  • Facilitate development, deployment, automated testing, monitoring and debugging of ML models
  • Analyze and improve the performance of our models and ML Platform to help meet critical SLOs for training models at scale and low-latency inference.
  • Facilitate the adoption and usage of ML platform and observability resources and provide guidelines to improve operational efficiency and service reliability.
  • Engage with your community of peers to challenge the status quo, improve our shared ways of working, and influence overall architecture decisions.
  • Learn and evolve our modern tech stack which includes Python, AWS, Kubernetes, Pytorch, Terraform, Snowflake, Honeycomb and others

Here is what will qualify you for the role: 

  • You have 5+ years of Machine Learning industry experience.
  • You operationalized, instrumented and supported ML models in production at a non-trivial scale and you're ready to take on new challenges and deploy LLMs (if you haven't already!).
  • You are fluent in good data and software engineering practices, and you are able to develop the tools and culture which enable ML teams to deliver reliable production code in an efficient manner.
  • You enjoy collaborating with scientists on a daily basis to understand their pain points and figure out how to improve their tools and increase their efficiency. You also have experience working in cross-functional teams.

Here is what will make you stand out:

  • You master best practices in MLOps, ML engineering, and large-scale deployment of ML models.
  • You have experience maintaining and evangelizing internal resources and libraries.
  • You have acquired considerable MLOps experience hosting models at scale, by previously building tooling to facilitate data exploration and experimentation as well as automating and orchestrating complex and efficient training pipelines
  • You are recognized for your communication skills and presenting complex technical subjects to audiences with different levels of technical proficiency. 

Do you think you can bring this role to life? 

You don’t need to check every single box; passion goes a long way and we appreciate that skillsets are transferable.
Send us your CV, we want to get to know you! Join the Coveolife!
We encourage all qualified candidates to apply regardless of, for example, age, gender, disability, gaps in CV, national or ethnic background. We know that applying for a new role is a lot of work and we really appreciate your time.

Top Skills

Python
The Company
HQ: Québec
763 Employees
On-site Workplace

What We Do

Coveo powers the digital experiences of the world’s most innovative brands serving millions of people and billions of interactions across every digital experience. After a decade of enriching our market-leading platform with forward-thinking global enterprises, we know what it takes to gain a trusted AI-experience advantage.

We strongly believe that the future is business-to-person, that experience is today’s competitive front line, a make or break for every business.

For enterprises to achieve this AI-experience advantage at scale, it is imperative to have an Enterprise Spinal and composable ability to deliver AI semantic search and generative experiences at each customer and employee interaction.

Similar Jobs

Motorola Solutions Logo Motorola Solutions

Entry Level DevOps Developer

Artificial Intelligence • Hardware • Information Technology • Security • Software • Cybersecurity • Big Data Analytics
Hybrid
Gatineau, QC, CAN
21000 Employees

Bounteous Logo Bounteous

Graduate Software Developer

Agency • Digital Media • eCommerce • Professional Services • Software • Analytics • Consulting
Hybrid
Montréal, QC, CAN
5000 Employees

Arrow Electronics, Inc. Logo Arrow Electronics, Inc.

Field Applications Engineer

Cloud • Enterprise Web • Hardware • Information Technology • Internet of Things • Robotics • Semiconductor
Remote
Québec, QC, CAN
22000 Employees

ServiceNow Logo ServiceNow

Senior Backend Python Developer

Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Hybrid
Montréal, QC, CAN
26000 Employees

Similar Companies Hiring

Jobba Trade Technologies, Inc. Thumbnail
Software • Professional Services • Productivity • Information Technology • Cloud
Chicago, IL
45 Employees
RunPod Thumbnail
Software • Infrastructure as a Service (IaaS) • Cloud • Artificial Intelligence
Charlotte, North Carolina
53 Employees
Hedra Thumbnail
Software • News + Entertainment • Marketing Tech • Generative AI • Enterprise Web • Digital Media • Consumer Web
San Francisco, CA
14 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account