Crop.photo by Evolphin

AI Engineer - Computer Vision (Crop.Photo)

Sorry, this job was removed at 10:12 a.m. (CST) on Monday, Jan 12, 2026

Be an Early Applicant

Hiring Remotely in India

Remote

Artificial Intelligence • eCommerce • Software

The Role

We’re Crop.photo — a high-velocity AI startup powering creative automation for global brands like Lacoste, UrbanOutfitters, and AP News. We help brands produce high-quality visuals — images, ads, banners — at scale, and we’re building the core visual intelligence engine that makes that possible.

Our engineers don’t just write code. They frame product logic, shape UX behavior, and ship features. No PMs handing down tickets. No design handoffs. If you think like an owner and love combining deep ML logic with hard product edges — this role is for you. You’ll be working on systems focused on the transformation and generation of millions of visual assets for small-to-large enterprises at scale.

What You’ll Do

Build and own AI-backed features end to end, from ideation to production — including layout logic, smart cropping, visual enhancement, out-painting and GenAI workflows for background fills
Design scalable APIs that wrap vision models like BiRefNet, YOLOv8, Grounding DINO, SAM, CLIP, ControlNet, etc., into batch and real-time pipelines.
Write production-grade Python code to manipulate and transform image data using NumPy, OpenCV (cv2), PIL, and PyTorch.
Handle pixel-level transformations — from custom masks and color space conversions to geometric warps and contour ops — with speed and precision.
Integrate your models into our production web app (AWS based Python/Java backend) and optimize them for latency, memory, and throughput
Frame problems when specs are vague — you’ll help define what “good” looks like, and then build it
Collaborate with product, UX, and other engineers without relying on formal handoffs — you own your domain

What You’ll Need

4–6 years of hands-on experience with vision and image generation models such as YOLO, Grounding DINO, SAM, CLIP, Stable Diffusion, VITON, or TryOnGAN — including experience with inpainting and outpainting workflows using Stable Diffusion pipelines (e.g., Diffusers, InvokeAI, or custom-built solutions)
Strong hands-on knowledge of NumPy, OpenCV, PIL, PyTorch, and image visualization/debugging techniques.
2–3 years of experience working with popular LLM APIs such as OpenAI, Anthropic, Gemini and how to compose multi-modal pipelines
Solid grasp of production model integration — model loading, GPU/CPU optimization, async inference, caching, and batch processing.
Experience solving real-world visual problems like object detection, segmentation, composition, or enhancement.
Ability to debug and diagnose visual output errors — e.g., weird segmentation artifacts, off-center crops, broken masks.
Deep understanding of image processing in Python: array slicing, color formats, augmentation, geometric transforms, contour detection, etc.
Experience building and deploying FastAPI services and containerizing them with Docker for AWS-based infra (ECS, EC2/GPU, Lambda).
Solid grasp of production model integration — model loading, GPU/CPU optimization, async inference, caching, and batch processing.
A customer-centric approach — you think about how your work affects end users and product experience, not just model performance
A quest for high-quality deliverables — you write clean, tested code and debug edge cases until they’re truly fixed
The ability to frame problems from scratch and work without strict handoffs — you build from a goal, not a ticket

Who You Are

You’ve built systems — not just prototypes
You care about both ML results and the system’s behavior in production
You’re comfortable taking a rough business goal and shaping the technical path to get there
You’re energized by product-focused AI work — things that users feel and rely on
You’ve worked in or want to work in a startup-grade environment: messy, fast, and impactful

What You Get

Full autonomy over your problem space
A builder-first, no-handoff culture
Remote-first flexibility (India preferred)
Base + Variable + meaningful equity
A product shipping to some of the world’s most recognizable brands

View all jobs at Crop.photo by Evolphin

View Crop.photo by Evolphin Profile

Report Job

Similar Jobs

Motive

Executive Assistant

Artificial Intelligence • Fintech • Hardware • Information Technology • Sales • Software • Transportation

Easy Apply

Remote

India

4000 Employees

Coinbase

Staff Accountant

Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3

Easy Apply

Remote

India

4700 Employees

3M-3M Annually

GitLab

Sales Manager

Cloud • Security • Software • Cybersecurity • Automation

Easy Apply

Remote

India

2500 Employees

QuillBot

Senior Internal Communications Manager

Artificial Intelligence • Edtech • Mobile • Natural Language Processing • Productivity • Software

Easy Apply

Remote

India

232 Employees

Get Personalized Job Insights.

Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company

HQ: San Ramon, California

14 Employees

What We Do

Crop.photo is an AI-powered service for bulk image editing & retouching, offering powerful tools for automating image cropping, resizing, background removal, and listing image analysis. The service is powered by advanced AI algorithms that streamline the image processing workflow for businesses of all sizes. For more information, visit https://crop.photo/about-us Crop.photo is the brainchild of Evolphin Software, Inc, a leading Silicon Valley based provider of digital & media asset management solutions. Evolphin has been serving creative operations teams across various industries for over a decade, helping them streamline their digital workflows, optimize media asset management, and automate their digital workflows. Evolphin's expertise in digital asset management, combined with cutting-edge AI technology, has resulted in the development of Crop.photo, a cloud-based service that simplifies image retouching for businesses of all sizes.