Workshop
The Second Workshop on Multimodal Representation and Retrieval
Xinliang Zhu, Arnab Dhua, Shengsheng Qian, Xin (Eric) Wang, Rene Vidal, Douglas Gray
Mon 20 Oct, 11:30 a.m. PDT
Multimodal representation learning is central to modern AI, enabling applications across retrieval, generation, RAG, reasoning, agentic AI, and embodied intelligence. With the growing ubiquity of multimodal data, from e-commerce listings to social media and video content, new challenges arise in multimodal retrieval, where both queries and indexed content span multiple modalities. This task requires deeper semantic understanding and reasoning, especially at scale, where data complexity and noise become significant hurdles. The half-day event will feature keynote talks as well as oral and poster presentations.