Senior Applied AI Engineer (Computer Vision)

Posted 14 Hours Ago
Be an Early Applicant
Bengaluru North, Yelahanka, Bengaluru Urban, Karnataka, IND
In-Office
Senior level
Artificial Intelligence • Software • Consulting • Generative AI
The Role
Design and deploy production-grade computer vision systems. Focus on building visual intelligence systems using deep learning and classical techniques. Responsibilities include developing models for object detection, scene understanding, multimodal systems, and optimizing for performance and scalability.
Summary Generated by Built In

AuxoAI is hiring a Senior Applied AI Engineer to design and deploy production-grade computer vision systems that operate reliably in real-world environments.

This role focuses on building end-to-end visual intelligence systems, combining deep learning, classical computer vision techniques, and multimodal models. It is not limited to model training and requires strong ownership of system design, deployment, and real-world performance.

You will work on systems that perform perception, understanding, and reasoning over visual data, and integrate these capabilities into larger AI platforms and agent-based workflows.

You will also work on problems where existing approaches may not be sufficient, and will be expected to combine deep learning, geometric methods, and multimodal reasoning to build robust, production-grade systems.

Location – Mumbai / Bangalore / Hyderabad / Gurgaon (Hybrid – 3 days per week in office)

Responsibilitiesp:
  • Design and deploy computer vision systems for tasks such as:
    • Object detection, segmentation, and tracking
    • Scene understanding and structured perception
    • Video understanding and temporal reasoning
  • Build and optimize models using architectures such as:
    • CNNs (ResNet, EfficientNet)
    • Vision Transformers (ViT, Swin, DeiT)
    • Detection/segmentation models (YOLO, DETR, Mask R-CNN)
  • Develop multimodal systems combining vision and language:
    • CLIP-style models
    • Vision-language models (VLMs)
    • Visual grounding and captioning systems
  • Implement algorithms for:
    • Multi-object tracking (SORT, DeepSORT, ByteTrack)
    • Feature matching and representation learning
    • Temporal modeling (RNNs, Transformers for video)
  • Apply geometric and classical computer vision methods where relevant:
    • Camera calibration
    • Epipolar geometry
    • Pose estimation
    • 3D reconstruction or depth estimation
  • Optimize systems for:
    • Low-latency, real-time inference
    • Throughput and scalability
    • Edge and distributed deployment
  • Design and build data pipelines for:
    • Annotation workflows
    • Dataset curation
    • Synthetic data generation
  • Integrate vision systems into:
    • Multimodal AI pipelines
    • Agent-based systems
    • Decision-making workflows


Requirements
  • 5+ years of experience building computer vision systems in production environments
  • Strong experience with deep learning frameworks (PyTorch / TensorFlow)
  • Hands-on experience with:
    • Detection, segmentation, or tracking systems
    • Model training, fine-tuning, and evaluation
  • Strong understanding of:
    • Representation learning
    • Loss functions (contrastive loss, focal loss, etc.)
    • Evaluation metrics (mAP, IoU, precision/recall)
  • Experience building and deploying end-to-end vision systems, not just training models

Candidates whose primary experience is limited to academic projects or model experimentation without real-world deployment may not be a fit for this role.

Nice to Have:
  • Experience with multimodal systems (vision + language)
  • Familiarity with models such as:
    • CLIP, BLIP, Flamingo, or similar
  • Experience with 3D vision:
    • NeRFs
    • SLAM
    • Point clouds
  • Experience with video understanding:
    • Action recognition
    • Event detection
  • Experience building data engines:
    • Active learning
    • Hard negative mining
  • Experience working with large-scale datasets and distributed training pipelines


Skills Required

  • 5+ years of experience building computer vision systems in production environments
  • Strong experience with deep learning frameworks (PyTorch / TensorFlow)
  • Hands-on experience with detection, segmentation, or tracking systems
  • Strong understanding of representation learning and loss functions
  • Experience building and deploying end-to-end vision systems
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
0 Employees
Year Founded: 2022

What We Do

AuxoAI partners with enterprise leaders to build AI systems, enabling the creation of AI-first enterprises by moving from AI strategy to production-grade deployed systems in weeks.

Similar Jobs

Ericsson Logo Ericsson

Technical Authority Expert

Cloud • Information Technology • Internet of Things • Machine Learning • Software • Cybersecurity • Infrastructure as a Service (IaaS)
Hybrid
2 Locations
88000 Employees

Ericsson Logo Ericsson

Devops Engineer

Cloud • Information Technology • Internet of Things • Machine Learning • Software • Cybersecurity • Infrastructure as a Service (IaaS)
Hybrid
Bangalore, Bengaluru Urban, Karnataka, IND
88000 Employees
2-2 Annually

Ericsson Logo Ericsson

Devops Engineer

Cloud • Information Technology • Internet of Things • Machine Learning • Software • Cybersecurity • Infrastructure as a Service (IaaS)
Hybrid
Bangalore, Bengaluru Urban, Karnataka, IND
88000 Employees
In-Office
Bengaluru, Bengaluru Urban, Karnataka, IND
2449 Employees

Similar Companies Hiring

Bellagent Thumbnail
Artificial Intelligence • Machine Learning • Business Intelligence • Generative AI
Chicago, IL
20 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
Onshore Thumbnail
Software
US
100 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account