Workshop
The Third Perception Test Challenge
Joe Heyward, Nikhil Parthasarathy, Joao Carreira, Dima Damen, Andrew Zisserman, Viorica Patraucean, Eunice Yiu, Shiry Ginosar, Saman Motamed, Priyank Jaini
Ballroom B
Sun 19 Oct noon PDT — 8 p.m. PDT
The 3rd Perception Test challenge comprehensively evaluates the perception capabilities of large multimodal models using the Perception Test benchmark. This year, novel tracks unify diverse tasks under common interfaces: joint object/point tracking, joint action/sound localisation, and unified multiple-choice videoQA (integrating non-semantic tasks via inpainted queries). A new VLM interpretability track investigates model strengths and failures. Guest tracks cover image understanding (KiVA) and video generation (Physics-IQ). The workshop provides a venue for evaluating all foundation vision models, whether discriminative or generative, image-based or video-based. Prizes up to 50k EUR are available.