Multimodal AI Systems Architect (AI Engineering)

Posted 8 Days Ago
Be an Early Applicant
8 Locations
In-Office or Remote
Senior level
Agency • Artificial Intelligence • Blockchain • Web3
The Role
Design and integrate vision and audio models into agent reasoning loops, optimize streaming latency for voice-to-voice interactions, and architect multimodal RAG systems to retrieve insights from videos and PDFs.
Summary Generated by Built In

We are seeking a talented Multimodal AI Systems Architect to develop and optimize AI systems that seamlessly integrate vision and audio models. This role focuses on enhancing our voice-to-voice interactions and multimodal retrieval capabilities, ensuring our systems are efficient and innovative.


Responsibilities:

  • Integrate vision encoders and audio-native models into core agent reasoning loops.
  • Optimize streaming latency for voice-to-voice AI interactions.
  • Architect multimodal RAG systems capable of retrieving insights from videos and PDFs.

Qualifications:

  • Experience with Whisper, CLIP, and multimodal LLM integration.
  • Knowledge of streaming architectures and WebRTC.
  • Expertise in cross-modal alignment.

Skills Required

  • Experience with Whisper
  • Experience with CLIP
  • Multimodal LLM integration experience
  • Knowledge of streaming architectures
  • Experience with WebRTC
  • Expertise in cross-modal alignment
  • Experience integrating vision encoders and audio-native models into agent systems
  • Experience optimizing streaming latency for voice-to-voice interactions
  • Experience architecting multimodal RAG systems for video and PDF retrieval
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
7 Employees
Year Founded: 2024

What We Do

Hyphen Connect is a Web3 and AI talent agency and crypto-integrated software solutions provider that connects blockchain, DeFi, NFT, and AI companies with specialized technical and go-to-market talent globally and remotely. They deliver headhunting, data-driven research, and recruitment services across infrastructure, exchanges, gaming, and DeFi projects, plus industry analysis and hiring insights to help clients build engineering and product teams.

Similar Jobs

PwC Logo PwC

Oracle PMO - Senior Associate

Artificial Intelligence • Professional Services • Business Intelligence • Consulting • Cybersecurity • Generative AI
Remote or Hybrid
66 Locations
370000 Employees
77K-202K Annually

PwC Logo PwC

Oracle PMO - Manager

Artificial Intelligence • Professional Services • Business Intelligence • Consulting • Cybersecurity • Generative AI
Remote or Hybrid
67 Locations
370000 Employees
99K-232K Annually

PwC Logo PwC

Oracle PMO - Senior Manager

Artificial Intelligence • Professional Services • Business Intelligence • Consulting • Cybersecurity • Generative AI
Remote or Hybrid
67 Locations
370000 Employees
124K-280K Annually

NBCUniversal Logo NBCUniversal

Quality Assurance Lead

AdTech • Cloud • Digital Media • Information Technology • News + Entertainment • App development
Remote or Hybrid
Montréal, QC, CAN
68000 Employees

Similar Companies Hiring

Idler Thumbnail
Artificial Intelligence
San Francisco, California
6 Employees
Hanover Park Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
42 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account