Back to Search






Mid-Level
Multimodal AI Systems Architect (AI Engineering)
Confirmed live in the last 24 hours
Hyphen Connect
San Francisco Bay Area, USA
On-site
Posted April 24, 2026
Job Description
We are seeking a talented Multimodal AI Systems Architect to develop and optimize AI systems that seamlessly integrate vision and audio models. This role focuses on enhancing our voice-to-voice interactions and multimodal retrieval capabilities, ensuring our systems are efficient and innovative.
Responsibilities:
- Integrate vision encoders and audio-native models into core agent reasoning loops.
- Optimize streaming latency for voice-to-voice AI interactions.
- Architect multimodal RAG systems capable of retrieving insights from videos and PDFs.
Qualifications:
- Experience with Whisper, CLIP, and multimodal LLM integration.
- Knowledge of streaming architectures and WebRTC.
- Expertise in cross-modal alignment.
ai
Similar Jobs
Dexcom
SW Development Engineer 2
Mid-LevelBengaluru, India
S&P Global
Software Developer
Mid-LevelGurugram, Haryana
Citigroup
Java Backend Application Developer
Mid-Level2 Locations
Citigroup
Senior Java Backend Application Developer
Senior2 Locations
Citigroup
Quantitative Developer, VP
Lead / ManagerLondon United Kingd...
Citigroup
Lead Java Developer (VP)
Lead / ManagerLondon United Kingd...