Skip to yearly menu bar Skip to main content


Workshop

The Second Workshop on Multimodal Representation and Retrieval

Xinliang Zhu, Arnab Dhua, Shengsheng Qian, Xin (Eric) Wang, Rene Vidal, Douglas Gray

Mon 20 Oct, 11:30 a.m. PDT

Multimodal representation learning is central to modern AI, enabling applications across retrieval, generation, RAG, reasoning, agentic AI, and embodied intelligence. With the growing ubiquity of multimodal data—from e-commerce listings to social media and video content—new challenges arise in multimodal retrieval, where both queries and indexed content span multiple modalities. This task requires deeper semantic understanding and reasoning, especially at scale, where data complexity and noise become significant hurdles. The half-day event will feature keynote talks, oral and poster presentations.

Live content is unavailable. Log in and register to view live content