AI Engineer

Posted 4 Days Ago
2 Locations
In-Office or Remote
$100K–$130K Annually
Mid level
Artificial Intelligence • Computer Vision • Software • PropTech
The Role
Fine-tune, deploy, and maintain vision-language and large language models for production. Build end-to-end multi-modal training, evaluation, and inference pipelines in Python, implement prompt engineering and RAG, quantize transformers for efficiency, and create feedback loops to continuously improve model performance in real-world environments.
Summary Generated by Built In
We're seeking an AI Engineer with deep experience in transformers, generative models, and vision-language models (VLMs) to push City Detect's products beyond traditional object detection. You'll fine-tune, deploy, and maintain multi-modal models that combine visual and language understanding to deliver intelligent, scalable solutions across heterogeneous real-world environments.

What You'll Do
  • Fine-tune and deploy vision-language models (VLMs) and large language models for production use cases
  • Design and maintain end-to-end pipelines for multi-modal model training, evaluation, and inference in Python
  • Develop prompt engineering strategies, RAG architectures, and other techniques to maximize model performance
  • Evaluate model outputs systematically and build feedback loops for continuous improvement
  • Quantize large transformer models to improve inference efficiency
  • Stay current with rapid advances in transformer architectures, fine-tuning methods, and multi-modal research

Requirements
  • 3+ years of professional experience working with transformer-based architectures
  • 2+ years of hands-on experience fine-tuning and deploying multi-modal models (VLMs)
  • 2+ years of proven computer vision experience, with a strong preference for object detection
  • Strong experience with LLMs, including fine-tuning, inference optimization, and production deployment
  • Proficiency in Python for model development, training, and deployment (2+ years)
  • Experience with deep learning frameworks such as PyTorch or TensorFlow
  • Solid understanding of attention mechanisms, tokenization, transfer learning, and generative model fundamentals
  • Proven experience taking models from experimentation through production-ready deployment

Nice to Have
  • SQL proficiency for querying detection results, labeling metrics, or model performance data
  • Experience with roadside or infrastructure object detection (signs, signals, debris, pavement markings)
  • Background in GovTech, public sector, or smart city projects
  • Experience in automated driving, ADAS, or autonomous vehicle perception systems
  • Familiarity with model-assisted labeling, active learning, or human-in-the-loop workflows
  • Experience with edge deployment or model optimization (TensorRT, ONNX, quantization)

Compensation
The base pay range for this role is $100,000 – $130,000 per year.


The Company
26 Employees

What We Do

City Detect is an AI-powered platform that helps local governments and non-profits identify and combat urban blight and property degradation. Utilizing vehicle-mounted cameras and computer vision technology, the company automatically assesses the exterior condition of houses across a city. This provides municipalities with actionable data and reports to prioritize urban redevelopment and create cleaner, safer communities.
