Workshop
Story-Level Movie Understanding and Audio Description
Junyu Xie, Ridouane Ghermi, Tengda Han, Max Bain, Arsha Nagrani, Gül Varol, Weidi Xie, Vicky Kalogeiton, Ivan Laptev, Andrew Zisserman
Sun 19 Oct, 11:50 a.m. PDT
The SLoMO workshop brings together researchers focused on the understanding of long-form, edited videos—such as movies and TV episodes. We spotlight two central research directions: (i) Audio Description (AD) Generation: This track explores the generation of concise and coherent descriptions that complement the original audio for blind and visually impaired (BVI) audiences. We have invited four leading experts in movie understanding and AD generation to share their insights and recent advancements in the field. (ii) Movie Question Answering: This track evaluates models’ capabilities in narrative comprehension, emphasizing story-level understanding. As part of this effort, we host the Short-Films 20K (SF20K) Competition, which aims to drive progress in story-level video understanding using the newly introduced SF20K dataset.
Live content is unavailable. Log in and register to view live content