Skip to yearly menu bar Skip to main content


Workshop

Binocular Egocentric-360 Multi-modal Scene Understanding in the Wild

Jianbo Jiao, Shangzhe Wu, Dylan Campbell, Yunchao Wei, Lu Qi, Yasmine Mellah, Aleš Leonardis, Chenyuan Qu, Han Hu, Qiming Huang, Hao Chen

306 B

Sun 19 Oct noon PDT — 3:30 p.m. PDT

This workshop mainly looks at multi-modal scene understanding and perception in a human-like manner. Specifically, we will focus on binocular/stereo egocentric and 360° panoramic perspectives, which measure both first-person views and third-person panoptic views, mimicking a human in the scene, by combining with multi‑modal cues such as spatial audio, textual descriptions, and geo‑metadata. This workshop will cover but not be limited to the following topics: Embodied 360° scene understanding & egocentric visual reasoning; Multi-modal scene understanding; Stereo Vision; Open‑world learning & domain adaptation.

Live content is unavailable. Log in and register to view live content