Workshop
Binocular Egocentric-360 Multi-modal Scene Understanding in the Wild
Jianbo Jiao, Shangzhe Wu, Dylan Campbell, Yunchao Wei, Lu Qi, Yasmine Mellah, Aleš Leonardis, Chenyuan Qu, Han Hu, Qiming Huang, Hao Chen
306 B
Sun 19 Oct noon PDT — 3:30 p.m. PDT
This workshop mainly looks at multi-modal scene understanding and perception in a human-like manner. Specifically, we will focus on binocular/stereo egocentric and 360° panoramic perspectives, which measure both first-person views and third-person panoptic views, mimicking a human in the scene, by combining with multi‑modal cues such as spatial audio, textual descriptions, and geo‑metadata. This workshop will cover but not be limited to the following topics: Embodied 360° scene understanding & egocentric visual reasoning; Multi-modal scene understanding; Stereo Vision; Open‑world learning & domain adaptation.
Live content is unavailable. Log in and register to view live content