Poster Thu, Oct 23, 2025 • 2:15 PM – 4:15 PM PDT Exhibit Hall I #119

V2PE: Improving Multimodal Long-Context Capability of Vision-Language Models with Variable Visual Position Encoding

Junqi Ge · Ziyi Chen · Jintao Lin · Jinguo Zhu · Xihui Liu · Jifeng Dai · Xizhou Zhu

Abstract
