Skip to yearly menu bar Skip to main content


Poster Exhibit Hall I #119

V2PE: Improving Multimodal Long-Context Capability of Vision-Language Models with Variable Visual Position Encoding

Junqi Ge ⋅ Ziyi Chen ⋅ Jintao Lin ⋅ Jinguo Zhu ⋅ Xihui Liu ⋅ Jifeng Dai ⋅ Xizhou Zhu
2025 Poster

Abstract

Chat is not available.