ML Engineer II (Inference Platform)

Posted 6 Days Ago
Be an Early Applicant
Vancouver, BC, CAN
In-Office
Mid level
Music • Software
The Role
The role involves building machine learning inference systems, optimizing cloud infrastructure, and managing data pipelines, focusing on audio content detection and model deployment.
Summary Generated by Built In
About Beatdapp

Beatdapp is a company delivering the most advanced streaming integrity technology in the world. One of our ventures is building machine learning inference systems for audio at scale. The work spans music, podcasts, and speech, with a particular focus on AI-generated sound. As generative audio gets cheaper and faster, the platforms we serve need accurate signal about what's real in their content, and they depend on us to provide it.

About the role

This role sits at the intersection of ML engineering, platform / infrastructure work, and inference systems. You will bridge the gap between raw audio and the clean signals our detection models depend on. You will partner with data scientists on bringing those models into production, carrying the production lens (latency, cost, customer-facing edges) into design conversations early so trade-offs are made together rather than discovered late.

In practice, the work cuts across the GPU-bound inference containers, the multi-cloud infrastructure that runs them, the API layer in front, the data and observability around them, and the CI that ships it all. The architectural challenge running through all of it is containing drift and scaling with minimal code.

Roadmaps here are weeks, not quarters, and your scope grows and shifts as the team and systems do. So we are hiring for engineering judgment first: a strong feel for clean, scalable design, an eye for code hygiene, and the courage to advocate for a better approach rather than default to consensus.

What you'll do
  • Container Engineering and Orchestration: Build, tune, and ship our inference containers. Building and maintaining Dockerfile and dependencies, image size and cold-starts, GPU access patterns, the multi-cloud orchestration shape that runs it (ECS, Cloud Run, GKE, EKS), test coverage for the container surface, and the storage abstraction it depends on.
  • In-Container Performance and Resource Optimization: Squeeze more out of each GPU instance: concurrency tuning, VRAM accounting, request timeouts and queueing, rate limiting, multi-GPU distribution on instances that have more than one, and the right-sizing decisions that follow.
  • Scale and Stress Testing: Build and run scale and stress scenarios across mock deployments that mirror real customer environments. Characterize the latency-vs-throughput curves, find the breaking points, and turn the results into autoscaling and instance-sizing decisions.
  • Cloud Infrastructure: Operate the Terraform stack across multiple clouds (GCP, AWS). Networking, identity, GPU nodes, autoscaling, per-tenant account configurations.
  • API Layer: Build and extend the customer-facing API layer that fronts the inference service: client authentication, rate limiting, per-client data isolation, and request metering.
  • Maintain Data Pipelines: Maintain and extend the data orchestration pipelines that feed model evaluation, customer reporting, and operational dashboards.
  • Observability: Build and tune the metrics, dashboards, logging, and alarms across three layers: the inference service, the running instances, and the deployed models themselves.
What we're looking for
  • Related STEM degree (BSc, MSc, or higher) and 3+ years of work experience in platform / infra / backend / ML / applied-ML / data engineering.
  • Strong engineering skills: The ability to write clean, scalable, production-grade code in Python or more performance-oriented language(s) (Go, Rust, C++).
  • Architectural fluency across data stores, distributed systems, caching, and data transfer protocols.
  • Data engineering skills: Comfort building data processing pipelines and using SQL (Airflow, BigQuery, Postgres).
  • Deep cloud infrastructure and networking experience across one or more platforms (GCP, AWS).
  • ML platform tooling: comfort with MLflow or similar tooling and model lifecycle processes (model versioning, artifact storage, promotion workflows).
  • Terraform: write and modify modules, understand state and backends, IaC over console.
  • CI/CD discipline: cloud OIDC, image signing, pinned versions, an instinct for cheap and reproducible CI.
  • Observability instincts: comfortable instrumenting across hardware, application, and model layers (latency, throughput, score distributions, drift). You know which metric to look at first when latency spikes.
  • Inference performance tuning: comfort with the levers of a high-throughput GPU service (micro-batching, concurrency, request queueing, in-container resource management).
  • Strong written communication: runbooks, design docs, PR descriptions, postmortems, and ticket hygiene (Jira).
Nice-to-haves

Not required, but a strong plus if you bring hands-on work experience with at least one of the following:

  • Audio or media systems
  • Signal processing
  • Speech detection (synthetic / artificial)
  • Computer vision
  • GPU work beyond running inference (CUDA, kernels, drivers, cluster operations)
  • Streaming systems (Kafka, Pub/Sub, Kinesis, or similar)

Skills Required

  • 3+ years of work experience in platform, infra, backend, ML, applied-ML, or data engineering
  • Ability to write clean, scalable, production-grade code in Python or performance-oriented languages
  • Experience with data processing pipelines and SQL
  • Deep cloud infrastructure and networking experience across GCP or AWS
  • Experience with ML platform tooling like MLflow
  • Experience with Terraform and Infrastructure as Code
  • Experience in CI/CD processes
  • Strong written communication skills
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Vancouver, British Columbia
43 Employees
Year Founded: 2018

What We Do

We help music labels and artists track their songs to collect royalties. Beatdapp is a tracking system that authenticates, verifies, and validates media streamed in real time. It reduces the tracking discrepancies between Digital Service Providers (DSPs) and rights holders, increasing royalty revenue for rights holders while limiting legal exposure, expensive audit costs, and lawsuits for DSPs. Think of it like a PWC for music play count! Accelerator Alumni: -Creative Destruction Labs, Prime Stream (CDL-West) -500 Startups (San Francisco) -Project Music Portfolio (Nashville, TN) Awards: -#2 Seed Stage Company in Canada by Crunchbase (2019) -Winner of Inventures "Exploring Blockchain Breakthroughs" Competition (2019) -NACO Top 5 - Most Promising Startup of the Year (2019) -New Ventures BC "People's Choice" Winner (2019 - by over 4,000 votes) -New Ventures BC Top 10 (2019) -Selected Top 25 Canadian Companies for "48 Hours in The Valley" by C100 (2020 Cohort) -Selected for "Emerging Rockets List" (2020)

Similar Jobs

Rapid7 Logo Rapid7

Account Executive

Artificial Intelligence • Cloud • Information Technology • Sales • Security • Software • Cybersecurity
Remote or Hybrid
Canada
2400 Employees

Remitly Logo Remitly

Development Engineer

eCommerce • Fintech • Payments • Software • Financial Services
In-Office
Vancouver, BC, CAN
2800 Employees
124K-155K Annually

Remitly Logo Remitly

Senior Full-stack Engineer

eCommerce • Fintech • Payments • Software • Financial Services
In-Office
Burnaby, BC, CAN
2800 Employees
152K-190K Annually

Remitly Logo Remitly

Deputy Chief Compliance Officer, Canada

eCommerce • Fintech • Payments • Software • Financial Services
In-Office
Burnaby, BC, CAN
2800 Employees
112K-140K Annually

Similar Companies Hiring

Hanover Park Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
31 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account