Skip to yearly menu bar Skip to main content


Poster Thu, Oct 23, 2025 • 2:15 PM – 4:15 PM PDT Exhibit Hall I #119

V2PE: Improving Multimodal Long-Context Capability of Vision-Language Models with Variable Visual Position Encoding

Junqi Ge ⋅ Ziyi Chen ⋅ Jintao Lin ⋅ Jinguo Zhu ⋅ Xihui Liu ⋅ Jifeng Dai ⋅ Xizhou Zhu

Abstract

Chat is not available.