About the Job: We are
seeking a seasoned Machine Learning Engineer – Computer Vision to
design, optimise, and deploy deep learning models for large-scale, real-time
edge inference. In this role, you will work on the end-to-end lifecycle of
computer vision models—from training and evaluation to optimisation, automated
governance, and edge deployment—while advancing MLOps capabilities on Google
Cloud. You will work at the intersection of deep learning, cloud
infrastructure, and edge AI, building reliable, high-performance solutions that
scale across devices and continuously improve through automation and data
driven evaluation.
Office Location: Toronto
Employment Type: Permanent
Role Type: New position –
current requirement
Work Arrangement: Hybrid (2
days in office per week)
Position Responsibilities:
- Computer Vision Development: Design, train,
evaluate, and fine-tune state-of-the-art deep learning models for image
classification and object detection tasks.
- Pipeline Enhancement: Maintain, optimize and add
advanced MLOps capabilities to existing Vertex AI Kubeflow Pipelines
(KFP).
- Model Optimization & Conversion: Manage the
complex conversion of models from frameworks like TensorFlow into highly
optimized TensorFlow Lite (TFLite) artifacts for edge inference (e.g.,
handling Int8 full integer quantization and hardware-specific acceleration).
- Edge Artifact Management: Architect the deployment
flow to save optimized edge models to Google Cloud Storage (GCS) and
manage model versioning for seamless edge-device retrieval, bypassing
traditional Vertex AI Endpoints.
- Automation & Reliability: Implement automated
evaluation gates to ensure newly trained models outperform existing
production models before edge deployment.
Requirements
Required Qualifications:
- Experience: 3- 6 years in Machine Learning
Engineering, preferably Computer Vision.
- Deep Learning Foundation: Strong mathematical and
architectural understanding of deep learning concepts, specifically
Convolutional Neural Networks (CNNs) and standard object detection
architectures.
- Framework Mastery: Deep, hands-on expertise with
TensorFlow 2.x and/or PyTorch.
- Edge ML: Proven experience optimizing deep learning
models for edge devices using TFLite (e.g., post-training quantization,
pruning, handling custom ops).
- GCP MLOps: Strong proficiency in Google Cloud
Platform, specifically building and running custom components in Vertex AI
Pipelines (KFP).
- Programming: Advanced programming skills in Python,
with experience containerizing ML workloads using Docker.
- Cloud Infrastructure: Solid understanding of Google
Cloud Storage (GCS) for managing massive datasets and handling model
artifact hand-offs.
- Critical thinking, Effective communication skills –
verbal and written, Problem solving, and Dealing with complexity
Preferred Qualifications:
- YOLO Expertise: Hands-on experience with the
Ultralytics YOLOv8 ecosystem, specifically bridging PyTorch YOLO weights
to TensorFlow/TFLite edge deployments.
- Data Orchestration: Experience using Google Cloud
Composer (Apache Airflow) to schedule and trigger complex ML training
pipelines based on data arrival or model drift.
- Scalable Data Processing: Familiarity with Google
Cloud Dataflow (Apache Beam) for large-scale, parallelized image
preprocessing, augmentation, and dataset formatting (e.g., generating
TFRecords).
- CI/CD for ML: Experience with continuous
integration and continuous deployment practices specifically tailored for
machine learning models.
- Generative AI: Knowledge or experience in
Generative AI architectures, with experience building Retrieval-Augmented
Generation (RAG) pipelines and developing multi-agent systems.
Benefits
Salary Range: CAD $100,000
- $110,000/ year
The final compensation offered
will depend on local market conditions and geographic location, as well as
job-related factors such as the candidate’s knowledge, skills, qualifications,
relevant experience, and education/training. Compensation may also include
additional components such as benefits, and/or other incentives, where
applicable. In accordance with new employment standards requirements, we retain
copies of this job posting and applicant information for three (3) years after
the posting is removed. We do not use AI technology; all applications are also
reviewed by our recruitment team.
Infoya is an equal opportunity
employer committed to diversity and inclusion. We welcome applications from all
qualified individuals, regardless of race, color, religion, sex, sexual
orientation, gender identity, national origin, age, disability, protected veteran
status, aboriginal status, or any other legally protected factors.
Skills Required
- 3-6 years in Machine Learning Engineering, preferably Computer Vision
- Strong understanding of deep learning, CNNs, and object detection architectures
- Hands-on expertise with TensorFlow 2.x and/or PyTorch
- Experience optimizing models for edge using TensorFlow Lite (post-training quantization, pruning, custom ops)
- Experience building and running custom components in Vertex AI Pipelines (KFP) on GCP
- Advanced programming skills in Python and containerizing ML workloads with Docker
- Solid understanding of Google Cloud Storage (GCS) for dataset and model artifact management
- Critical thinking, effective verbal and written communication, and problem-solving skills
What We Do
Infoya is a global IT solutions and consulting firm specializing in business transformation, digital innovation, and advanced engineering services, including AI and cloud enablement.

.png)







