Skip to yearly menu bar Skip to main content


Workshop

The Third Perception Test Challenge

Joe Heyward, Nikhil Parthasarathy, Joao Carreira, Dima Damen, Andrew Zisserman, Viorica Patraucean, Eunice Yiu, Shiry Ginosar, Saman Motamed, Priyank Jaini

Sun 19 Oct, noon PDT

The 3rd Perception Test challenge comprehensively evaluates the perception capabilities of large multimodal models using the Perception Test benchmark. This year, novel tracks unify diverse tasks under common interfaces: joint object/point tracking, joint action/sound localisation, and unified multiple-choice videoQA (integrating non-semantic tasks via inpainted queries). A new VLM interpretability track is included to investigate model strengths and failures. Guest tracks cover image understanding (KiVA) and video generation (Physics-IQ). Our workshop provides a venue to evaluate all foundation vision models—discriminative, generative, image- or video-based. Prizes up to 50k EUR are available.

Live content is unavailable. Log in and register to view live content