Skip to yearly menu bar Skip to main content


ICCV 2025 Accepted Papers

Real3D: Towards Scaling Large Reconstruction Models with Real Images Poster Session 2 & Exhibit Hall with Coffee Break
Hanwen Jiang ⋅ Qixing Huang ⋅ Georgios Pavlakos
Exhibit Hall I #76
Enhancing Few-Shot Vision-Language Classification with Large Multimodal Model Features Poster Session 1 & Exhibit Hall
Chancharik Mitra ⋅ Brandon Huang ⋅ Tianning Chai ⋅ Zhiqiu Lin ⋅ Assaf Arbelle ⋅ Rogerio Feris ⋅ Leonid Karlinsky ⋅ Trevor Darrell ⋅ Deva Ramanan ⋅ Roei Herzig
Exhibit Hall I #254
ContraGS: Codebook-Condensed and Trainable Gaussian Splatting for Fast, Memory-Efficient Reconstruction Poster Session 6 & Exhibit Hall with Coffee Break
Sankeerth Durvasula ⋅ Sharanshangar Muhunthan ⋅ Zain Moustafa ⋅ Richard Chen ⋅ Ruofan Liang ⋅ Yushi Guan ⋅ Nilesh Ahuja ⋅ Nilesh Jain ⋅ Selvakumar Panneer ⋅ Nandita Vijaykumar
Exhibit Hall I #406
ARIG: Autoregressive Interactive Head Generation for Real-time Conversations Poster Session 3 & Exhibit Hall
Ying Guo ⋅ Xi Liu ⋅ Cheng Zhen ⋅ Pengfei Yan ⋅ Xiaoming Wei
Exhibit Hall I #278
VALLR: Visual ASR Language Model for Lip Reading Poster Session 1 & Exhibit Hall
Marshall Thomas ⋅ Edward Fish ⋅ Richard Bowden
Exhibit Hall I #262
FREE-Merging: Fourier Transform for Efficient Model Merging Poster Session 1 & Exhibit Hall
Shenghe Zheng ⋅ Hongzhi Wang
Exhibit Hall I #359
Chimera: Improving Generalist Model with Domain-Specific Experts Poster Session 1 & Exhibit Hall
Tianshuo Peng ⋅ Mingsheng Li ⋅ Jiakang Yuan ⋅ Hongbin Zhou ⋅ Renqiu Xia ⋅ Renrui Zhang ⋅ LEI BAI ⋅ Song Mao ⋅ Bin Wang ⋅ Aojun Zhou ⋅ Botian Shi ⋅ Tao Chen ⋅ Bo Zhang ⋅ Xiangyu Yue
Exhibit Hall I #278
Why LVLMs Are More Prone to Hallucinations in Longer Responses: The Role of Context Poster Session 1 & Exhibit Hall
Ge Zheng ⋅ Jiaye Qian ⋅ Jiajin Tang ⋅ Sibei Yang
Exhibit Hall I #384
Any-SSR: How Recursive Least Squares Works in Continual Learning of Large Language Model Poster Session 1 & Exhibit Hall
Kai Tong ⋅ Kang Pan ⋅ Xiao Zhang ⋅ Erli Meng ⋅ Run He ⋅ Yawen Cui ⋅ Nuoyan Guo ⋅ Huiping Zhuang
Exhibit Hall I #281
Audio-visual Controlled Video Diffusion with Masked Selective State Spaces Modeling for Natural Talking Head Generation Poster Session 3 & Exhibit Hall
Fating Hong ⋅ Zunnan Xu ⋅ Zixiang Zhou ⋅ Jun Zhou ⋅ Xiu Li ⋅ Qin Lin ⋅ Qinglin Lu ⋅ Dan Xu
Exhibit Hall I #240
ImHead: A Large-scale Implicit Morphable Model for Localized Head Modeling Poster Session 3 & Exhibit Hall
Rolandos Alexandros Potamias ⋅ Stathis Galanakis ⋅ Jiankang Deng ⋅ Athanasios Papaioannou ⋅ Stefanos Zafeiriou
Exhibit Hall I #18
PathFinder: A Multi-Modal Multi-Agent System for Medical Diagnostic Decision-Making Applied to Histopathology Poster Session 5 & Exhibit Hall
Fatemeh Ghezloo ⋅ Saygin Seyfioglu ⋅ Rustin Soraki ⋅ Wisdom Ikezogwo ⋅ Beibin Li ⋅ Tejoram Vivekanandan ⋅ Joann Elmore ⋅ Ranjay Krishna ⋅ Linda Shapiro
Exhibit Hall I #342
SAS: Segment Any 3D Scene with Integrated 2D Priors Poster Session 2 & Exhibit Hall with Coffee Break
Zhuoyuan Li ⋅ Jiahao Lu ⋅ Jiacheng Deng ⋅ Hanzhi Chang ⋅ Lifan Wu ⋅ Yanzhe Liang ⋅ Tianzhu Zhang
Exhibit Hall I #310
GloPER: Unsupervised Animal Pattern Extraction from Local Reconstruction Poster Session 2 & Exhibit Hall with Coffee Break
Bowen Chen ⋅ Yun Sing Koh ⋅ Gillian Dobbie
Exhibit Hall I #140
StrandHead: Text to Hair-Disentangled 3D Head Avatars Using Human-Centric Priors Poster Session 3 & Exhibit Hall
Xiaokun Sun ⋅ Zeyu Cai ⋅ Ying Tai ⋅ Jian Yang ⋅ Zhenyu Zhang
Exhibit Hall I #320
Creation-MMBench: Assessing Context-Aware Creative Intelligence in MLLMs Poster Session 1 & Exhibit Hall
Xinyu Fang ⋅ Zhijian Chen ⋅ Kai Lan ⋅ Lixin Ma ⋅ Shengyuan Ding ⋅ Yingji Liang ⋅ Xiangyu Zhao ⋅ Farong Wen ⋅ Zicheng Zhang ⋅ Guofeng Zhang ⋅ Haodong Duan ⋅ Kai Chen ⋅ Dahua Lin
Exhibit Hall I #32
Can Knowledge be Transferred from Unimodal to Multimodal? Investigating the Transitivity of Multimodal Knowledge Editing Poster Session 1 & Exhibit Hall
Lingyong Fang ⋅ Xinzhong Wang ⋅ Depeng depeng wang ⋅ Zongru Wu ⋅ Ya Guo ⋅ Huijia Zhu ⋅ Zhuosheng Zhang ⋅ Gongshen Liu
Exhibit Hall I #226
Token Activation Map to Visually Explain Multimodal LLMs Poster Session 1 & Exhibit Hall
Yi Li ⋅ Hualiang Wang ⋅ Xinpeng Ding ⋅ Haonan Wang ⋅ Xiaomeng Li
Exhibit Hall I #380
Probabilistic Prototype Calibration of Vision-language Models for Generalized Few-shot Semantic Segmentation Poster Session 5 & Exhibit Hall
Jie Liu ⋅ Jiayi Shen ⋅ Pan Zhou ⋅ Jan-Jakob Sonke ⋅ Stratis Gavves
Exhibit Hall I #126
Learning Interpretable Queries for Explainable Image Classification with Information Pursuit Poster Session 1 & Exhibit Hall
Stefan Kolek ⋅ Aditya Chattopadhyay ⋅ Kwan Ho Ryan Chan ⋅ Hector Andrade Loarca ⋅ Gitta Kutyniok ⋅ Rene Vidal
Exhibit Hall I #367
Long-Tailed Classification with Multi-Granularity Semantics Poster Session 1 & Exhibit Hall
Yuting Liu ⋅ Liu Yang ⋅ Yu Wang
Exhibit Hall I #401
VideoLLaMB: Long Streaming Video Understanding with Recurrent Memory Bridges Poster Session 5 & Exhibit Hall
Yuxuan Wang ⋅ Yiqi Song ⋅ Cihang Xie ⋅ Yang Liu ⋅ Zilong Zheng
Exhibit Hall I #409
Auto-Regressive Transformation for Image Alignment Poster Session 3 & Exhibit Hall
Kanggeon Lee ⋅ Soochahn Lee ⋅ Kyoung Mu Lee
Exhibit Hall I #336
LMM-Det: Make Large Multimodal Models Excel in Object Detection Poster Session 1 & Exhibit Hall
Jincheng Li ⋅ Chunyu Xie ⋅ Ji Ao ⋅ Dawei Leng ⋅ Yuhui Yin
Exhibit Hall I #19
Attention to the Burtiness in Visual Prompt Tuning! Poster Session 1 & Exhibit Hall
Yuzhu Wang ⋅ Manni Duan ⋅ Shu Kong
Exhibit Hall I #398
Diffusion-Based Imaginative Coordination for Bimanual Manipulation Poster Session 3 & Exhibit Hall
Huilin Xu ⋅ Jian Ding ⋅ Jiakun Xu ⋅ Ruixiang Wang ⋅ Jun Chen ⋅ Jinjie Mai ⋅ Yanwei Fu ⋅ Bernard Ghanem ⋅ Feng Xu ⋅ Mohamed Elhoseiny
Exhibit Hall I #137
Learning Neural Scene Representation from iToF Imaging Poster Session 6 & Exhibit Hall with Coffee Break
Wenjie Chang ⋅ Hanzhi Chang ⋅ Yueyi Zhang ⋅ Wenfei Yang ⋅ Tianzhu Zhang
Exhibit Hall I #310
ChartCap: Mitigating Hallucination of Dense Chart Captioning Poster Session 3 & Exhibit Hall
Junyoung Lim ⋅ Jaewoo Ahn ⋅ Gunhee Kim
Exhibit Hall I #298
MultiVerse: A Multi-Turn Conversation Benchmark for Evaluating Large Vision and Language Models Poster Session 1 & Exhibit Hall
Young-Jun Lee ⋅ Byung-Kwan Lee ⋅ Jianshu Zhang ⋅ Yechan Hwang ⋅ Byungsoo Ko ⋅ Han-Gyu Kim ⋅ Dongyu Yao ⋅ Xuankun Rong ⋅ Eojin Joo ⋅ Seung-Ho Han ⋅ Bowon Ko ⋅ Ho-Jin Choi
Exhibit Hall I #57
Causal Disentanglement and Cross-Modal Alignment for Enhanced Few-Shot Learning Poster Session 1 & Exhibit Hall
Tianjiao Jiang ⋅ Zhen Zhang ⋅ Yuhang Liu ⋅ Javen Qinfeng Shi
Exhibit Hall I #74
Weakly-Supervised Learning of Dense Functional Correspondences Poster Session 2 & Exhibit Hall with Coffee Break
Stefan Stojanov ⋅ Linan Zhao ⋅ Yunzhi Zhang ⋅ Daniel Yamins ⋅ Jiajun Wu
Exhibit Hall I #184
PERSONA: Personalized Whole-Body 3D Avatar with Pose-Driven Deformations from a Single Image Poster Session 3 & Exhibit Hall
Geonhee Sim ⋅ Gyeongsik Moon
Exhibit Hall I #251
Unsupervised Visual Chain-of-Thought Reasoning via Preference Optimization Poster Session 1 & Exhibit Hall
Kesen Zhao ⋅ Beier Zhu ⋅ Qianru Sun ⋅ Hanwang Zhang
Exhibit Hall I #209
Staining and Locking Computer Vision Models Without Retraining Poster Session 1 & Exhibit Hall
Oliver Sutton ⋅ Qinghua Zhou ⋅ George Leete ⋅ Alexander Gorban ⋅ Ivan Tyukin
Exhibit Hall I #213
Test-Time Prompt Tuning for Zero-Shot Depth Completion Poster Session 2 & Exhibit Hall with Coffee Break
Chanhwi Jeong ⋅ Inhwan Bae ⋅ Jin-Hwi Park ⋅ Hae-Gon Jeon
Exhibit Hall I #415
TITAN: Query-Token based Domain Adaptive Adversarial Learning Poster Session 1 & Exhibit Hall
Tajamul Ashraf ⋅ Janibul Bashir
Exhibit Hall I #14
Dual Reciprocal Learning of Language-based Human Motion Understanding and Generation Poster Session 2 & Exhibit Hall with Coffee Break
CHEN LIANG ⋅ Zhicheng Shi ⋅ Wenguan Wang ⋅ Yi Yang
Exhibit Hall I #115
StolenLoRA: Exploring LoRA Extraction Attacks via Synthetic Data Poster Session 1 & Exhibit Hall
Yixu Wang ⋅ Yan Teng ⋅ Yingchun Wang ⋅ Xingjun Ma
Exhibit Hall I #15
MMReason: An Open-Ended Multi-Modal Multi-Step Reasoning Benchmark for MLLMs Toward AGI Poster Session 1 & Exhibit Hall
Huanjin Yao ⋅ Jiaxing Huang ⋅ Yawen Qiu ⋅ Michael K. Chen ⋅ Wenzheng Liu ⋅ Wei Zhang ⋅ wenjie zeng ⋅ Xikun ZHANG ⋅ Jingyi Zhang ⋅ YuXin Song ⋅ Wenhao Wu ⋅ Dacheng Tao
Exhibit Hall I #16
PlaceIt3D: Language-Guided Object Placement in Real 3D Scenes Poster Session 2 & Exhibit Hall with Coffee Break
Ahmed Abdelreheem ⋅ Filippo Aleotti ⋅ Jamie Watson ⋅ Zawar Qureshi ⋅ Abdelrahman Eldesokey ⋅ Peter Wonka ⋅ Gabriel Brostow ⋅ Sara Vicente ⋅ Guillermo Garcia-Hernando
Exhibit Hall I #154
Know "No" Better: A Data-Driven Approach for Enhancing Negation Awareness in CLIP Poster Session 1 & Exhibit Hall
Junsung Park ⋅ Jungbeom Lee ⋅ Jongyoon Song ⋅ Sangwon Yu ⋅ Dahuin Jung ⋅ Sungroh Yoon
Exhibit Hall I #260
Motion Synthesis with Sparse and Flexible Keyjoint Control Poster Session 3 & Exhibit Hall
Inwoo Hwang ⋅ Jinseok Bae ⋅ Donggeun Lim ⋅ Young Min Kim
Exhibit Hall I #303
StruMamba3D: Exploring Structural Mamba for Self-supervised Point Cloud Representation Learning Poster Session 6 & Exhibit Hall with Coffee Break
Chuxin Wang ⋅ Yixin Zha ⋅ Wenfei Yang ⋅ Tianzhu Zhang
Exhibit Hall I #370
TRACE: Learning 3D Gaussian Physical Dynamics from Multi-view Videos Poster Session 2 & Exhibit Hall with Coffee Break
Jinxi Li ⋅ Ziyang Song ⋅ Bo Yang
Exhibit Hall I #357
D-Attn: Decomposed Attention for Large Vision-and-Language Model Poster Session 5 & Exhibit Hall
Chia-Wen Kuo ⋅ Sijie Zhu ⋅ Fan Chen ⋅ Xiaohui Shen ⋅ Longyin Wen
Exhibit Hall I #388
InfoBridge: Balanced Multimodal Integration through Conditional Dependency Modeling Poster Session 1 & Exhibit Hall
Chenxin Li ⋅ Yifan Liu ⋅ Panwang Pan ⋅ Hengyu Liu ⋅ Xinyu Liu ⋅ Wuyang Li ⋅ Cheng Wang ⋅ Weihao Yu ⋅ Yiyang LIN ⋅ Yixuan Yuan
Exhibit Hall I #27
Closed-Loop Transfer for Weakly-supervised Affordance Grounding Poster Session 2 & Exhibit Hall with Coffee Break
Jiajin Tang ⋅ Zhengxuan Wei ⋅ Ge Zheng ⋅ Sibei Yang
Exhibit Hall I #423
Exploring View Consistency for Scene-Adaptive Low-Light Light Field Image Enhancement Poster Session 2 & Exhibit Hall with Coffee Break
Shuo Zhang ⋅ Chen Gao ⋅ Youfang Lin
Exhibit Hall I #217
Neuromanifold-Regularized KANs for Shape-fair Feature Representations Poster Session 3 & Exhibit Hall
Mazlum Arslan ⋅ Weihong Guo ⋅ Shuo Li
Exhibit Hall I #262
Training-Free Class Purification for Open-Vocabulary Semantic Segmentation Poster Session 5 & Exhibit Hall
Qi Chen ⋅ Lingxiao Yang ⋅ Yun Chen ⋅ Nailong Zhao ⋅ Jianhuang Lai ⋅ Jie Shao ⋅ Xiaohua Xie
Exhibit Hall I #314
Perspective-Aware Reasoning in Vision-Language Models via Mental Imagery Simulation Poster Session 2 & Exhibit Hall with Coffee Break
Yuseung Lee ⋅ Jihyeon Je ⋅ Chanho Park ⋅ Mikaela Uy ⋅ Leonidas Guibas ⋅ Minhyuk Sung
Exhibit Hall I #397
Erasing More Than Intended? How Concept Erasure Degrades the Generation of Non-Target Concepts Poster Session 4 & Exhibit Hall with Coffee Break
Ibtihel Amara ⋅ Ahmed Imtiaz Humayun ⋅ Ivana Kajic ⋅ Zarana Parekh ⋅ Natalie Harris ⋅ Sarah Young ⋅ Chirag Nagpal ⋅ Najoung Kim ⋅ Junfeng He ⋅ Cristina Vasconcelos ⋅ Deepak Ramachandran ⋅ Golnoosh Farnadi ⋅ Katherine Heller ⋅ Mohammad Havaei ⋅ Negar Rostamzadeh
Exhibit Hall I #145
Colors See Colors Ignore: Clothes Changing ReID with Color Disentanglement Poster Session 4 & Exhibit Hall with Coffee Break
Priyank Pathak ⋅ Yogesh Rawat
Exhibit Hall I #183
MonoSOWA: Scalable monocular 3D Object detector Without human Annotations Poster Session 2 & Exhibit Hall with Coffee Break
Jan Skvrna ⋅ Lukas Neumann
Exhibit Hall I #244
MA-CIR: A Multimodal Arithmetic Benchmark for Composed Image Retrieval Poster Session 5 & Exhibit Hall
Jaeseok Byun ⋅ Young Kyun Jang ⋅ Seokhyeon Jeong ⋅ Donghyun Kim ⋅ Taesup Moon
Exhibit Hall I #143
Revisiting Point Cloud Completion: Are We Ready For The Real-World? Poster Session 6 & Exhibit Hall with Coffee Break
Stuti Pathak ⋅ Prashant Kumar ⋅ Dheeraj Baiju ⋅ Nicholus Mboga ⋅ Gunther Steenackers ⋅ Rudi Penne
Exhibit Hall I #63
Clink! Chop! Thud! - Learning Object Sounds from Real-World Interactions Poster Session 3 & Exhibit Hall
Mengyu Yang ⋅ Yiming Chen ⋅ Haozheng Pei ⋅ Siddhant Agarwal ⋅ Arun Vasudevan ⋅ James Hays
Exhibit Hall I #428
UnZipLoRA: Separating Content and Style from a Single Image Poster Session 4 & Exhibit Hall with Coffee Break
Chang Liu ⋅ Viraj Shah ⋅ Aiyu Cui ⋅ Svetlana Lazebnik
Exhibit Hall I #181
Learning Precise Affordances from Egocentric Videos for Robotic Manipulation Poster Session 3 & Exhibit Hall
Li ⋅ Nikolaos Tsagkas ⋅ Jifei Song ⋅ Ruaridh Mon-Williams ⋅ Sethu Vijayakumar ⋅ Kun Shao ⋅ Laura Sevilla-Lara
Exhibit Hall I #53
Learning Visual Proxy for Compositional Zero-Shot Learning Poster Session 1 & Exhibit Hall
Shiyu Zhang ⋅ Cheng Yan ⋅ Yang Liu ⋅ Chenchen Jing ⋅ Lei Zhou ⋅ Wenjun Wang
Exhibit Hall I #257
SC-Captioner: Improving Image Captioning with Self-Correction by Reinforcement Learning Poster Session 5 & Exhibit Hall
Lin Zhang ⋅ Xianfang Zeng ⋅ Kangcong Li ⋅ Gang YU ⋅ Tao Chen
Exhibit Hall I #316
Principles of Visual Tokens for Efficient Video Understanding Poster Session 5 & Exhibit Hall
Xinyue Hao ⋅ Li ⋅ Shreyank Gowda ⋅ Robert Fisher ⋅ Jonathan Huang ⋅ Anurag Arnab ⋅ Laura Sevilla-Lara
Exhibit Hall I #135
DuoCLR: Dual-Surrogate Contrastive Learning for Skeleton-based Human Action Segmentation Poster Session 3 & Exhibit Hall
Haitao Tian
Exhibit Hall I #354
NuiScene: Exploring Efficient Generation of Unbounded Outdoor Scenes Poster Session 6 & Exhibit Hall with Coffee Break
Han-Hung Lee ⋅ Qinghong Han ⋅ Angel Chang
Exhibit Hall I #173
Progressive Artwork Outpainting via Latent Diffusion Models Poster Session 4 & Exhibit Hall with Coffee Break
Dae-Young Song ⋅ Jung-Jae Yu ⋅ Donghyeon Cho
Exhibit Hall I #48
Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generation Poster Session 4 & Exhibit Hall with Coffee Break
Sucheng Ren ⋅ Qihang Yu ⋅ Ju He ⋅ Xiaohui Shen ⋅ Alan Yuille ⋅ Liang-Chieh (Jay) Chen
Exhibit Hall I #85
CObL: Toward Zero-Shot Ordinal Layering without User Prompting Poster Session 2 & Exhibit Hall with Coffee Break
Aneel Damaraju ⋅ Dean Hazineh ⋅ Todd Zickler
Exhibit Hall I #294
Hierarchical Material Recognition from Local Appearance Poster Session 2 & Exhibit Hall with Coffee Break
Matthew Beveridge ⋅ Shree Nayar
Exhibit Hall I #295
Event-guided Unified Framework for Low-light Video Enhancement, Frame Interpolation, and Deblurring Poster Session 2 & Exhibit Hall with Coffee Break
Taewoo Kim ⋅ Kuk-Jin Yoon
Exhibit Hall I #330
DisenQ: Disentangling Q-Former for Activity-Biometrics Poster Session 3 & Exhibit Hall
Shehreen Azad ⋅ Yogesh Rawat
Exhibit Hall I #330
PS-Mamba: Spatial-Temporal Graph Mamba for Pose Sequence Refinement Poster Session 2 & Exhibit Hall with Coffee Break
Haoye Dong ⋅ Gim Hee Lee
Exhibit Hall I #334
SliderSpace: Decomposing the Visual Capabilities of Diffusion Models Poster Session 4 & Exhibit Hall with Coffee Break
Rohit Gandikota ⋅ Zongze Wu ⋅ Richard Zhang ⋅ David Bau ⋅ Eli Shechtman ⋅ Nicholas Kolkin
Exhibit Hall I #105
GIViC: Generative Implicit Video Compression Poster Session 4 & Exhibit Hall with Coffee Break
Ge Gao ⋅ Siyue Teng ⋅ Tianhao Peng ⋅ Fan Zhang ⋅ David Bull
Exhibit Hall I #237
Aligning Moments in Time using Video Queries Poster Session 5 & Exhibit Hall
Yogesh Kumar ⋅ Uday Agarwal ⋅ Manish Gupta ⋅ Anand Mishra
Exhibit Hall I #39
ReFlex: Text-Guided Editing of Real Images in Rectified Flow via Mid-Step Feature Extraction and Attention Adaptation Poster Session 4 & Exhibit Hall with Coffee Break
Jimyeong Kim ⋅ Jungwon Park ⋅ Yeji Song ⋅ Nojun Kwak ⋅ Wonjong Rhee
Exhibit Hall I #100
Streamlining Image Editing with Layered Diffusion Brushes Poster Session 4 & Exhibit Hall with Coffee Break
Peyman Gholami ⋅ Robert Xiao
Exhibit Hall I #238
MDD: A Dataset for Text-and-Music Conditioned Duet Dance Generation Poster Session 3 & Exhibit Hall
Prerit Gupta ⋅ Jason Alexander Fotso-Puepi ⋅ Zhengyuan Li ⋅ Jay Mehta ⋅ Aniket Bera
Exhibit Hall I #369
GECO: Geometrically Consistent Embedding with Lightspeed Inference Poster Session 2 & Exhibit Hall with Coffee Break
Regine Hartwig ⋅ Dominik Muhle ⋅ Riccardo Marin ⋅ Daniel Cremers
Exhibit Hall I #403
Removing Cost Volumes from Optical Flow Estimators Poster Session 1 & Exhibit Hall
Simon Kiefhaber ⋅ Stefan Roth ⋅ Simone Schaub-Meyer
Exhibit Hall I #76
HouseTour: A Virtual Real Estate A(I)gent Poster Session 4 & Exhibit Hall with Coffee Break
Ata Çelen ⋅ Iro Armeni ⋅ Daniel Barath ⋅ Marc Pollefeys
Exhibit Hall I #275
Scheduling Weight Transitions for Quantization-Aware Training Poster Session 5 & Exhibit Hall
Junghyup Lee ⋅ Jeimin Jeon ⋅ Dohyung Kim ⋅ Bumsub Ham
Exhibit Hall I #345
p-MoD: Building Mixture-of-Depths MLLMs via Progressive Ratio Decay Poster Session 1 & Exhibit Hall
Jun Zhang ⋅ Desen Meng ⋅ Zhengming Zhang ⋅ Zhenpeng Huang ⋅ Tao Wu ⋅ Limin Wang
Exhibit Hall I #344
Event-based Visual Vibrometry Poster Session 6 & Exhibit Hall with Coffee Break
Xinyu Zhou ⋅ Peiqi Duan ⋅ Yeliduosi Xiaokaiti ⋅ Chao Xu ⋅ Boxin Shi
Exhibit Hall I #75
Mobile Video Diffusion Poster Session 4 & Exhibit Hall with Coffee Break
Haitam Ben Yahia ⋅ Denis Korzhenkov ⋅ Ioannis Lelekas ⋅ Amir Ghodrati ⋅ Amir Habibian
Exhibit Hall I #437
LeGrad: An Explainability Method for Vision Transformers via Feature Formation Sensitivity Poster Session 5 & Exhibit Hall
Walid Bousselham ⋅ Angie Boggust ⋅ Sofian Chaybouti ⋅ Hendrik Strobelt ⋅ Hilde Kuehne
Exhibit Hall I #50
ConsNoTrainLoRA: Data-driven Weight Initialization of Low-rank Adapters using Constraints Poster Session 1 & Exhibit Hall
Debasmit Das ⋅ Hyoungwoo Park ⋅ Munawar Hayat ⋅ Seokeon Choi ⋅ Sungrack Yun ⋅ Fatih Porikli
Exhibit Hall I #37
Self-Calibrated Variance-Stabilizing Transformations for Real-World Image Denoising Poster Session 3 & Exhibit Hall
Sébastien Herbreteau ⋅ Michael Unser
Exhibit Hall I #45
Multi-modal Identity Extraction Poster Session 3 & Exhibit Hall
Ryan Webster ⋅ Teddy Furon
Exhibit Hall I #73
ViT-Linearizer: Distilling Quadratic Knowledge into Linear-Time Vision Models Poster Session 5 & Exhibit Hall
Guoyizhe Wei ⋅ Rama Chellappa
Exhibit Hall I #89
XTrack: Multimodal Training Boosts RGB-X Video Object Trackers Poster Session 2 & Exhibit Hall with Coffee Break
Yuedong Tan ⋅ Zongwei Wu ⋅ Yuqian Fu ⋅ Zhuyun Zhou ⋅ Guolei Sun ⋅ Eduard Zamfir ⋅ Chao Ma ⋅ Danda Pani Paudel ⋅ Luc Gool ⋅ Radu Timofte
Exhibit Hall I #66
FaceXFormer: A Unified Transformer for Facial Analysis Poster Session 3 & Exhibit Hall
Kartik Narayan ⋅ Vibashan VS ⋅ Rama Chellappa ⋅ Vishal Patel
Exhibit Hall I #128
Laboring on less labors: RPCA Paradigm for Pan-sharpening Poster Session 3 & Exhibit Hall
honghui xu ⋅ Chuangjie Fang ⋅ Yibin Wang ⋅ Jie Wu ⋅ Jianwei Zheng
Exhibit Hall I #130
Riemannian-Geometric Fingerprints of Generative Models Poster Session 3 & Exhibit Hall
Hae Jin Song ⋅ Laurent Itti
Exhibit Hall I #133
LDIP: Long Distance Information Propagation for Video Super-Resolution Poster Session 3 & Exhibit Hall
Michael Bernasconi ⋅ Abdelaziz Djelouah ⋅ Yang Zhang ⋅ Markus Gross ⋅ Christopher Schroers
Exhibit Hall I #145
Multi-identity Human Image Animation with Structural Video Diffusion Poster Session 3 & Exhibit Hall
Zhenzhi Wang ⋅ Yixuan Li ⋅ yanhong zeng ⋅ Yuwei Guo ⋅ Dahua Lin ⋅ Tianfan Xue ⋅ Bo Dai
Exhibit Hall I #182
FedDifRC: Unlocking the Potential of Text-to-Image Diffusion Models in Heterogeneous Federated Learning Poster Session 1 & Exhibit Hall
Huan Wang ⋅ Haoran Li ⋅ Huaming Chen ⋅ Jun Yan ⋅ Jiahua Shi ⋅ Jun Shen
Exhibit Hall I #346
LoRA-FAIR: Federated LoRA Fine-Tuning with Aggregation and Initialization Refinement Poster Session 1 & Exhibit Hall
Jieming Bian ⋅ Lei Wang ⋅ Letian Zhang ⋅ Jie Xu
Exhibit Hall I #347
Category-Specific Selective Feature Enhancement for Long-Tailed Multi-Label Image Classification Poster Session 1 & Exhibit Hall
Ruiqi Du ⋅ Xu Tang ⋅ Xiangrong Zhang ⋅ Jingjing Ma
Exhibit Hall I #349
Meta-Learning Dynamic Center Distance: Hard Sample Mining for Learning with Noisy Labels Poster Session 1 & Exhibit Hall
Chenyu Mu ⋅ Yijun Qu ⋅ Jiexi Yan ⋅ Erkun Yang ⋅ Cheng Deng
Exhibit Hall I #29
FreeMorph: Tuning-Free Generalized Image Morphing with Diffusion Model Poster Session 4 & Exhibit Hall with Coffee Break
Yukang Cao ⋅ Chenyang Si ⋅ Jinghao Wang ⋅ Ziwei Liu
Exhibit Hall I #310
Registration beyond Points: General Affine Subspace Alignment via Geodesic Distance on Grassmann Manifold Poster Session 1 & Exhibit Hall
Jaeho Shin ⋅ Hyeonjae Gil ⋅ Junwoo Jang ⋅ Maani Ghaffari ⋅ Ayoung Kim
Exhibit Hall I #350
DM-EFS: Dynamically Multiplexed Expanded Features Set Form for Robust and Efficient Small Object Detection Poster Session 5 & Exhibit Hall
Aashish Sharma
Exhibit Hall I #447
Inverse Image-Based Rendering for Light Field Generation from Single Images Poster Session 6 & Exhibit Hall with Coffee Break
Hyunjun Jung ⋅ Hae-Gon Jeon
Exhibit Hall I #2
PossLoss: A Reliable and Sensitive Facial Landmark Detection Loss Function Poster Session 6 & Exhibit Hall with Coffee Break
Qikui Zhu
Exhibit Hall I #13
DAA*: Deep Angular A Star for Image-based Path Planning Poster Session 6 & Exhibit Hall with Coffee Break
Zhiwei Xu
Exhibit Hall I #53
Pseudo-SD: Pseudo Controlled Stable Diffusion for Semi-Supervised and Cross-Domain Semantic Segmentation Poster Session 5 & Exhibit Hall
Dong Zhao ⋅ Qi Zang ⋅ Shuang Wang ⋅ Nicu Sebe ⋅ Zhun Zhong
Exhibit Hall I #244
Frequency-Dynamic Attention Modulation For Dense Prediction Poster Session 5 & Exhibit Hall
Linwei Chen ⋅ Lin Gu ⋅ Ying Fu
Exhibit Hall I #265
Memory-Efficient 4-bit Preconditioned Stochastic Optimization Poster Session 5 & Exhibit Hall
Jingyang Li ⋅ Kuangyu Ding ⋅ Kim-chuan Toh ⋅ Pan Zhou
Exhibit Hall I #266
Hierarchical Cross-modal Prompt Learning for Vision-Language Models Poster Session 1 & Exhibit Hall
Hao Zheng ⋅ Shunzhi Yang ⋅ Zhuoxin He ⋅ Jinfeng Yang ⋅ Zhenhua Huang
Exhibit Hall I #171
Boosting Domain Generalized and Adaptive Detection with Diffusion Models: Fitness, Generalization, and Transferability Poster Session 1 & Exhibit Hall
Boyong He ⋅ Yuxiang Ji ⋅ Zhuoyue Tan ⋅ Liaoni Wu
Exhibit Hall I #173
ObjectRelator: Enabling Cross-View Object Relation Understanding Across Ego-Centric and Exo-Centric Perspectives Poster Session 2 & Exhibit Hall with Coffee Break
Yuqian Fu ⋅ Runze Wang ⋅ Bin Ren ⋅ Guolei Sun ⋅ Biao Gong ⋅ Yanwei Fu ⋅ Danda Pani Paudel ⋅ Xuanjing Huang ⋅ Luc Gool
Exhibit Hall I #141
Target Bias Is All You Need: Zero-Shot Debiasing of Vision-Language Models with Bias Corpus Poster Session 1 & Exhibit Hall
Taeuk Jang ⋅ Hoin Jung ⋅ Xiaoqian Wang
Exhibit Hall I #175
Long-Context State-Space Video World Models Poster Session 2 & Exhibit Hall with Coffee Break
Ryan Po ⋅ Yotam Nitzan ⋅ Richard Zhang ⋅ Berlin Chen ⋅ Tri Dao ⋅ Eli Shechtman ⋅ Gordon Wetzstein ⋅ Xun Huang
Exhibit Hall I #349
PASG: A Closed-Loop Framework for Automated Geometric Primitive Extraction and Semantic Anchoring in Robotic Manipulation Poster Session 2 & Exhibit Hall with Coffee Break
Zhihao ZHU ⋅ Yifan Zheng ⋅ Siyu Pan ⋅ Yaohui Jin ⋅ Yao Mu
Exhibit Hall I #369
DiST-4D: Disentangled Spatiotemporal Diffusion with Metric Depth for 4D Driving Scene Generation Poster Session 6 & Exhibit Hall with Coffee Break
Jiazhe Guo ⋅ Yikang Ding ⋅ Xiwu Chen ⋅ Shuo Chen ⋅ Bohan Li ⋅ Yingshuang Zou ⋅ Xiaoyang Lyu ⋅ Feiyang Tan ⋅ Xiaojuan Qi ⋅ Zhiheng Li ⋅ Hao Zhao
Exhibit Hall I #243
Fine-grained Spatiotemporal Grounding on Egocentric Videos Poster Session 2 & Exhibit Hall with Coffee Break
Shuo LIANG ⋅ Yiwu Zhong ⋅ Zi-Yuan Hu ⋅ Yeyao Tao ⋅ Liwei Wang
Exhibit Hall I #410
VRBench: A Benchmark for Multi-Step Reasoning in Long Narrative Videos Poster Session 5 & Exhibit Hall
Jiashuo Yu ⋅ Yue Wu ⋅ Meng Chu ⋅ Zhifei Ren ⋅ Zizheng Huang ⋅ Pei Chu ⋅ Ruijie Zhang ⋅ Yinan He ⋅ Qirui Li ⋅ Songze Li ⋅ Zhenxiang Li ⋅ Zhongying Tu ⋅ Conghui He ⋅ Yu Qiao ⋅ Yali Wang ⋅ Yi Wang ⋅ Limin Wang
Exhibit Hall I #174
Improving SAM for Camouflaged Object Detection via Dual Stream Adapters Poster Session 5 & Exhibit Hall
Jiaming Liu ⋅ Linghe Kong ⋅ Guihai Chen
Exhibit Hall I #197
FedMeNF: Privacy-Preserving Federated Meta-Learning for Neural Fields Poster Session 1 & Exhibit Hall
Junhyeog Yun ⋅ Minui Hong ⋅ Gunhee Kim
Exhibit Hall I #196
Sparsity Outperforms Low-Rank Projections in Few-Shot Adaptation Poster Session 1 & Exhibit Hall
Nairouz Mrabah ⋅ Nicolas Richet ⋅ Ismail Ayed ⋅ Eric Granger
Exhibit Hall I #290
DWIM: Towards Tool-aware Visual Reasoning via Discrepancy-aware Workflow Generation & Instruct-Masking Tuning Poster Session 1 & Exhibit Hall
Fucai Ke ⋅ Vijay Kumar b g ⋅ Xingjian Leng ⋅ Zhixi Cai ⋅ Zaid Khan ⋅ Weiqing Wang ⋅ Pari Delir Haghighi ⋅ Hamid Rezatofighi ⋅ Manmohan Chandraker
Exhibit Hall I #314
GT-Mean Loss: A Simple Yet Effective Solution for Brightness Mismatch in Low-Light Image Enhancement Poster Session 2 & Exhibit Hall with Coffee Break
Jingxi Liao ⋅ Shijie Hao ⋅ Richang Hong ⋅ Meng Wang
Exhibit Hall I #102
Trust but Verify: Programmatic VLM Evaluation in the Wild Poster Session 1 & Exhibit Hall
Viraj Prabhu ⋅ Senthil Purushwalkam ⋅ An Yan ⋅ Caiming Xiong ⋅ Ran Xu
Exhibit Hall I #301
MemDistill: Distilling LiDAR Knowledge into Memory for Camera-Only 3D Object Detection Poster Session 2 & Exhibit Hall with Coffee Break
Donghyeon Kwon ⋅ Youngseok Yoon ⋅ Hyeongseok Son ⋅ Suha Kwak
Exhibit Hall I #170
HRScene: How Far Are VLMs from Effective High-Resolution Image Understanding? Poster Session 5 & Exhibit Hall
Yusen Zhang ⋅ Wenliang Zheng ⋅ Aashrith Madasu ⋅ Peng Shi ⋅ Ryo Kamoi ⋅ Hao Zhou ⋅ Zhuoyang Zou ⋅ Shu Zhao ⋅ Sarkar Snigdha Sarathi Das ⋅ Vipul Gupta ⋅ Xiaoxin Lu ⋅ Nan Zhang ⋅ Ranran Zhang ⋅ Avitej Iyer ⋅ Renze Lou ⋅ Wenpeng Yin ⋅ Rui Zhang
Exhibit Hall I #293
DrivingGPT: Unifying Driving World Modeling and Planning with Multi-modal Autoregressive Transformers Poster Session 6 & Exhibit Hall with Coffee Break
Yuntao Chen ⋅ Yuqi Wang ⋅ Zhaoxiang Zhang
Exhibit Hall I #209
GenieBlue: Integrating both Linguistic and Multimodal Capabilities for Large Language Models on Mobile Devices Poster Session 1 & Exhibit Hall
Xudong LU ⋅ Yinghao Chen ⋅ Renshou Wu ⋅ Haohao Gao ⋅ Xi Chen ⋅ Xue Yang ⋅ Xiangyu Zhao ⋅ Aojun Zhou ⋅ Fangyuan Li ⋅ Yafei Wen ⋅ Xiaoxin Chen ⋅ shuai ren ⋅ Hongsheng Li
Exhibit Hall I #393
Task-Aware Prompt Gradient Projection for Parameter-Efficient Tuning Federated Class-Incremental Learning Poster Session 1 & Exhibit Hall
Hualong Ke ⋅ Yachao Zhang ⋅ Jiangming Shi ⋅ FangyongWang FangyongWang ⋅ Yuan Xie ⋅ Yanyun Qu
Exhibit Hall I #242
Parameter-Efficient Adaptation of Geospatial Foundation Models through Embedding Deflection Poster Session 2 & Exhibit Hall with Coffee Break
Romain Thoreau ⋅ Valerio Marsocci ⋅ Dawa Derksen
Exhibit Hall I #429
Multimodal LLM Guided Exploration and Active Mapping using Fisher Information Poster Session 2 & Exhibit Hall with Coffee Break
Wen Jiang ⋅ BOSHU LEI ⋅ Katrina Ashton ⋅ Kostas Daniilidis
Exhibit Hall I #35
Orchid: Image Latent Diffusion for Joint Appearance and Geometry Generation Poster Session 6 & Exhibit Hall with Coffee Break
Akshay Krishnan ⋅ Xinchen Yan ⋅ Vincent Casser ⋅ Abhijit Kundu
Exhibit Hall I #337
CLIPSym: Delving into Symmetry Detection with CLIP Poster Session 5 & Exhibit Hall
Tinghan Yang ⋅ Md Ashiqur Rahman ⋅ Raymond A. Yeh
Exhibit Hall I #113
UDC-VIT: A Real-World Video Dataset for Under-Display Cameras Poster Session 3 & Exhibit Hall
Kyusu Ahn ⋅ JiSoo Kim ⋅ Sangik Lee ⋅ HyunGyu Lee ⋅ Byeonghyun Ko ⋅ Chanwoo Park ⋅ Jaejin Lee
Exhibit Hall I #89
ScenePainter: Semantically Consistent Perpetual 3D Scene Generation with Concept Relation Alignment Poster Session 6 & Exhibit Hall with Coffee Break
Chong Xia ⋅ Shengjun Zhang ⋅ Fangfu Liu ⋅ Chang Liu ⋅ Khodchaphun Hirunyaratsameewong ⋅ Yueqi Duan
Exhibit Hall I #394
Temperature in Cosine-based Softmax Loss Poster Session 5 & Exhibit Hall
Takumi Kobayashi
Exhibit Hall I #224
SceneSplat: Gaussian Splatting-based Scene Understanding with Vision-Language Pretraining Poster Session 2 & Exhibit Hall with Coffee Break
Yue Li ⋅ Qi Ma ⋅ Runyi Yang ⋅ Huapeng Li ⋅ Mengjiao Ma ⋅ Bin Ren ⋅ Nikola Popovic ⋅ Nicu Sebe ⋅ Ender Konukoglu ⋅ Theo Gevers ⋅ Luc Gool ⋅ Martin R. Oswald ⋅ Danda Pani Paudel
Exhibit Hall I #146
Understanding Museum Exhibits using Vision-Language Reasoning Poster Session 1 & Exhibit Hall
Ada-Astrid Balauca ⋅ Sanjana Garai ⋅ Stefan Balauca ⋅ Rasesh Shetty ⋅ Naitik Agrawal ⋅ Dhwanil Shah ⋅ Yuqian Fu ⋅ Xi Wang ⋅ Kristina Toutanova ⋅ Danda Pani Paudel ⋅ Luc Gool
Exhibit Hall I #202
Neural Solver of Dichromatic Reflection Model for Specular Highlight Removal Poster Session 2 & Exhibit Hall with Coffee Break
Gang Fu
Exhibit Hall I #208
Correspondence-Free Fast and Robust Spherical Point Pattern Registration Poster Session 6 & Exhibit Hall with Coffee Break
Anik Sarker ⋅ Alan Asbeck
Exhibit Hall I #331
SILO: Solving Inverse Problems with Latent Operators Poster Session 3 & Exhibit Hall
Ron Raphaeli ⋅ Sean Man ⋅ Michael Elad
Exhibit Hall I #52
Geminio: Language-Guided Gradient Inversion Attacks in Federated Learning Poster Session 1 & Exhibit Hall
Junjie Shan ⋅ Ziqi Zhao ⋅ Jialin Lu ⋅ Rui Zhang ⋅ SM Yiu ⋅ Ka-Ho Chow
Exhibit Hall I #250
SD2Actor: Continuous State Decomposition via Diffusion Embeddings for Robotic Manipulation Poster Session 3 & Exhibit Hall
lijiayi jiayi
Exhibit Hall I #352
Imbalance in Balance: Online Concept Balancing in Generation Models Poster Session 4 & Exhibit Hall with Coffee Break
Yukai Shi ⋅ Jiarong Ou ⋅ Rui Chen ⋅ Haotian Yang ⋅ Jiahao Wang ⋅ Xin Tao ⋅ Pengfei Wan ⋅ Di ZHANG ⋅ Kun Gai
Exhibit Hall I #244
Progressive Distribution Bridging: Unsupervised Adaptation for Large-scale Pre-trained Models via Adaptive Auxiliary Data Poster Session 1 & Exhibit Hall
Weinan He ⋅ Yixin Zhang ⋅ Zilei Wang
Exhibit Hall I #305
Efficient Concertormer for Image Deblurring and Beyond Poster Session 3 & Exhibit Hall
Pin-Hung Kuo ⋅ Jinshan Pan ⋅ Shao-Yi Chien ⋅ Ming-Hsuan Yang
Exhibit Hall I #439
TryOn-Refiner: Conditional Rectified-flow-based TryOn Refiner for More Accurate Detail Reconstruction Poster Session 4 & Exhibit Hall with Coffee Break
Wen Qian
Exhibit Hall I #73
ASCENT: Annotation-free Self-supervised Contrastive Embeddings for 3D Neuron Tracking in Fluorescence Microscopy Poster Session 3 & Exhibit Hall
Haejun Han ⋅ Hang Lu
Exhibit Hall I #440
IntroStyle: Training-Free Introspective Style Attribution using Diffusion Features Poster Session 4 & Exhibit Hall with Coffee Break
Anand Kumar ⋅ Jiteng Mu ⋅ Nuno Vasconcelos
Exhibit Hall I #2
From Sharp to Blur: Unsupervised Domain Adaptation for 2D Human Pose Estimation Under Extreme Motion Blur Using Event Cameras Poster Session 2 & Exhibit Hall with Coffee Break
Youngho Kim ⋅ Hoonhee Cho ⋅ Kuk-Jin Yoon
Exhibit Hall I #412
One Object, Multiple Lies: A Benchmark for Cross-task Adversarial Attack on Unified Vision-Language Models Poster Session 1 & Exhibit Hall
Jiale Zhao ⋅ XINYANG JIANG ⋅ Junyao Gao ⋅ Yuhao Xue ⋅ Cairong Zhao
Exhibit Hall I #8
ObjectMate: A Recurrence Prior for Object Insertion and Subject-Driven Generation Poster Session 4 & Exhibit Hall with Coffee Break
Daniel Winter ⋅ Asaf Shul ⋅ Matan Cohen ⋅ Dana Berman ⋅ Yael Pritch ⋅ Alex Rav-Acha ⋅ Yedid Hoshen
Exhibit Hall I #132
LLM Thought Divergence and Convergence for Dialogue-Based Image Generation Control Poster Session 4 & Exhibit Hall with Coffee Break
Hui Li
Exhibit Hall I #309
RALoc: Enhancing Outdoor LiDAR Localization via Rotation Awareness Poster Session 1 & Exhibit Hall
Yuyang Yang ⋅ Wen Li ⋅ Sheng Ao ⋅ Qingshan Xu ⋅ Shangshu Yu ⋅ guo yu ⋅ Yin Zhou ⋅ Siqi Shen ⋅ Cheng Wang
Exhibit Hall I #307
Soft Separation and Distillation: Toward Global Uniformity in Federated Unsupervised Learning Poster Session 1 & Exhibit Hall
Hung-Chieh Fang ⋅ Hsuan-Tien Lin ⋅ Irwin King ⋅ Yifei Zhang
Exhibit Hall I #274
HOLa: Zero-Shot HOI Detection with Low-Rank Decomposed VLM Feature Adaptation Poster Session 1 & Exhibit Hall
Qinqian Lei ⋅ Bo Wang ⋅ Robby Tan
Exhibit Hall I #165
Instruction-Grounded Visual Projectors for Continual Learning of Generative Vision-Language Models Poster Session 1 & Exhibit Hall
Hyundong Jin ⋅ Hyung Jin Chang ⋅ Eunwoo Kim
Exhibit Hall I #322
Spatial Preference Rewarding for MLLMs Spatial Understanding Poster Session 1 & Exhibit Hall
Han Qiu ⋅ Peng Gao ⋅ Lewei Lu ⋅ Xiaoqin Zhang ⋅ Ling Shao ⋅ Shijian Lu
Exhibit Hall I #58
Generative Zoo Poster Session 2 & Exhibit Hall with Coffee Break
Tomasz Niewiadomski ⋅ Anastasios Yiannakidis ⋅ Hanz Cuevas Velasquez ⋅ Soubhik Sanyal ⋅ Michael Black ⋅ Silvia Zuffi ⋅ Peter Kulits
Exhibit Hall I #327
Towards Adversarial Robustness via Debiased High-Confidence Logit Alignment Poster Session 1 & Exhibit Hall
Kejia Zhang ⋅ Juanjuan Weng ⋅ Zhiming Luo ⋅ Shaozi Li
Exhibit Hall I #256
Learning Null Geodesics for Gravitational Lensing Rendering in General Relativity Poster Session 6 & Exhibit Hall with Coffee Break
Mingyuan Sun ⋅ Zheng Fang ⋅ Jiaxu Wang ⋅ Kun-Yi Zhang ⋅ Qiang Zhang ⋅ Renjing Xu
Exhibit Hall I #363
St4RTrack: Simultaneous 4D Reconstruction and Tracking in the World Poster Session 2 & Exhibit Hall with Coffee Break
Haiwen Feng ⋅ Junyi Zhang ⋅ Qianqian Wang ⋅ Yufei Ye ⋅ Pengcheng Yu ⋅ Michael Black ⋅ Trevor Darrell ⋅ Angjoo Kanazawa
Exhibit Hall I #328
Describe Anything: Detailed Localized Image and Video Captioning Poster Session 5 & Exhibit Hall
Long Lian ⋅ Yifan Ding ⋅ Yunhao Ge ⋅ Sifei Liu ⋅ Hanzi Mao ⋅ Boyi Li ⋅ Marco Pavone ⋅ Ming-Yu Liu ⋅ Trevor Darrell ⋅ Adam Yala ⋅ Yin Cui
Exhibit Hall I #184
Stable-Sim2Real: Exploring Simulation of Real-Captured 3D Data with Two-Stage Depth Diffusion Poster Session 1 & Exhibit Hall
Mutian Xu ⋅ Chongjie Ye ⋅ Haolin Liu ⋅ Yushuang Wu ⋅ Jiahao Chang ⋅ Xiaoguang Han
Exhibit Hall I #240
Multi-Modal Multi-Task Unified Embedding Model (M3T-UEM): A Task-Adaptive Representation Learning Framework Poster Session 5 & Exhibit Hall
Rohan Sharma ⋅ Changyou Chen ⋅ Feng-Ju Chang ⋅ Seongjun Yun ⋅ Xiaohu Xie ⋅ Rui Meng ⋅ Dehong Xu ⋅ Alejandro Mottini ⋅ qingjun cui
Exhibit Hall I #280
AHCPTQ: Accurate and Hardware-Compatible Post-Training Quantization for Segment Anything Model Poster Session 5 & Exhibit Hall
Wenlun Zhang ⋅ Yunshan Zhong ⋅ Shimpei Ando ⋅ Kentaro Yoshioka
Exhibit Hall I #243
MVTrajecter: Multi-View Pedestrian Tracking with Trajectory Motion Cost and Trajectory Appearance Cost Poster Session 3 & Exhibit Hall
Taiga Yamane ⋅ Ryo Masumura ⋅ Satoshi Suzuki ⋅ Shota Orihashi
Exhibit Hall I #309
InvRGB+L: Inverse Rendering of Complex Scenes with Unified Color and LiDAR Reflectance Modeling Poster Session 6 & Exhibit Hall with Coffee Break
Xiaoxue Chen ⋅ Bhargav Chandaka ⋅ Chih-Hao Lin ⋅ Ya-Qin Zhang ⋅ David Forsyth ⋅ Hao Zhao ⋅ Shenlong Wang
Exhibit Hall I #238
Is Visual in-Context Learning for Compositional Medical Tasks within Reach? Poster Session 1 & Exhibit Hall
Simon Reiß ⋅ Zdravko Marinov ⋅ Alexander Jaus ⋅ Constantin Seibold ⋅ M. Sarfraz ⋅ Erik Rodner ⋅ Rainer Stiefelhagen
Exhibit Hall I #243
Moto: Latent Motion Token as the Bridging Language for Learning Robot Manipulation from Videos Poster Session 5 & Exhibit Hall
Yi Chen ⋅ Yuying Ge ⋅ Weiliang Tang ⋅ Yizhuo Li ⋅ Yixiao Ge ⋅ Mingyu Ding ⋅ Ying Shan ⋅ Xihui Liu
Exhibit Hall I #229
Effective Training Data Synthesis for Improving MLLM Chart Understanding Poster Session 1 & Exhibit Hall
Yuwei Yang ⋅ Zeyu Zhang ⋅ Yunzhong Hou ⋅ Zhuowan Li ⋅ Gaowen Liu ⋅ Ali Payani ⋅ Yuan-Sen Ting ⋅ Liang Zheng
Exhibit Hall I #244
SpinMeRound: Consistent Multi-View Identity Generation Using Diffusion Models Poster Session 3 & Exhibit Hall
Stathis Galanakis ⋅ Alexandros Lattas ⋅ Stylianos Moschoglou ⋅ Bernhard Kainz ⋅ Stefanos Zafeiriou
Exhibit Hall I #409
Heuristic-Induced Multimodal Risk Distribution Jailbreak Attack for Multimodal Large Language Models Poster Session 1 & Exhibit Hall
Ma Teng ⋅ Xiaojun Jia ⋅ Ranjie Duan ⋅ Xinfeng Li ⋅ Yihao Huang ⋅ Xiaoshuang Jia ⋅ Zhixuan Chu ⋅ Wenqi Ren
Exhibit Hall I #247
Improving Rectified Flow with Boundary Conditions Poster Session 4 & Exhibit Hall with Coffee Break
Xixi Hu ⋅ Runlong Liao ⋅ Bo Liu ⋅ Keyang Xu ⋅ Yeqing Li ⋅ Eugene Ie ⋅ Hongliang Fei ⋅ qiang liu
Exhibit Hall I #316
Active Learning Meets Foundation Models: Fast Remote Sensing Data Annotation for Object Detection Poster Session 2 & Exhibit Hall with Coffee Break
Marvin Burges ⋅ Philipe Dias ⋅ Dalton Lunga ⋅ Carson Woody ⋅ Sarah Walters
Exhibit Hall I #97
Safeguarding Vision-Language Models: Mitigating Vulnerabilities to Gaussian Noise in Perturbation-based Attacks Poster Session 1 & Exhibit Hall
Jiawei Wang ⋅ Yushen Zuo ⋅ Yuanjun Chai ⋅ Zhendong Liu ⋅ Yicheng Fu ⋅ Yichun Feng ⋅ Kin Man Lam
Exhibit Hall I #255
Optimal Transport for Brain-Image Alignment: Unveiling Redundancy and Synergy in Neural Information Processing Poster Session 5 & Exhibit Hall
Yang Xiao ⋅ Wang Lu ⋅ Jie Ji ⋅ Ruimeng Ye ⋅ Li ⋅ Xiaolong Ma ⋅ Bo Hui
Exhibit Hall I #60
TrafficLoc: Localizing Traffic Surveillance Cameras in 3D Scenes Poster Session 6 & Exhibit Hall with Coffee Break
Yan Xia ⋅ Yunxiang Lu ⋅ Rui Song ⋅ Oussema Dhaouadi ⋅ Joao F. Henriques ⋅ Daniel Cremers
Exhibit Hall I #383
Intervening in Black Box: Concept Bottleneck Model for Enhancing Human Neural Network Mutual Understanding Poster Session 1 & Exhibit Hall
Nuoye Xiong ⋅ Anqi Dong ⋅ Ning Wang ⋅ Cong Hua ⋅ Guangming Zhu ⋅ Lin Mei ⋅ peiyi shen ⋅ zhang liang
Exhibit Hall I #261
Resolving Token-Space Gradient Conflicts: Token Space Manipulation for Transformer-Based Multi-Task Learning Poster Session 1 & Exhibit Hall
Wooseong Jeong ⋅ Kuk-Jin Yoon
Exhibit Hall I #266
DisCoPatch: Taming Adversarially-driven Batch Statistics for Improved Out-of-Distribution Detection Poster Session 1 & Exhibit Hall
Francisco Caetano ⋅ Christiaan Viviers ⋅ Luis Zavala-Mondragón ⋅ Peter H.N. De With ⋅ Fons van der Sommen
Exhibit Hall I #267
Scaling and Taming Adversarial Training with Synthetic Data Poster Session 1 & Exhibit Hall
Juntao Wu ⋅ Xianting Huang ⋅ Yu Chen ⋅ Shuai Pang ⋅ Ke Wang
Exhibit Hall I #272
DOLLAR: Few-Step Video Generation via Distillation and Latent Reward Optimization Poster Session 4 & Exhibit Hall with Coffee Break
Zihan Ding ⋅ Chi Jin ⋅ Difan Liu ⋅ Haitian Zheng ⋅ Krishna Kumar Singh ⋅ Qiang Zhang ⋅ Yan Kang ⋅ Zhe Lin ⋅ Yuchen Liu
Exhibit Hall I #294
Generative Adversarial Diffusion Poster Session 4 & Exhibit Hall with Coffee Break
U-Chae Jun ⋅ Jaeeun Ko ⋅ Jiwoo Kang
Exhibit Hall I #182
Music Grounding by Short Video Poster Session 5 & Exhibit Hall
Zijie Xin ⋅ Minquan Wang ⋅ Jingyu Liu ⋅ Quan Chen ⋅ Ye Ma ⋅ Peng Jiang ⋅ Xirong Li
Exhibit Hall I #234
Fewer Denoising Steps or Cheaper Per-Step Inference: Towards Compute-Optimal Diffusion Model Deployment Poster Session 1 & Exhibit Hall
Zhenbang Du ⋅ Yonggan Fu ⋅ Lifu Wang ⋅ Jiayi Qian ⋅ Xiao Luo ⋅ Yingyan Celine Lin
Exhibit Hall I #277
Your Text Encoder Can Be An Object-Level Watermarking Controller Poster Session 4 & Exhibit Hall with Coffee Break
Naresh Kumar Devulapally ⋅ Mingzhen Huang ⋅ Vishal Asnani ⋅ Shruti Agarwal ⋅ Siwei Lyu ⋅ Vishnu Lokhande
Exhibit Hall I #162
Enhanced Event-based Dense Stereo via Cross-Sensor Knowledge Distillation Poster Session 2 & Exhibit Hall with Coffee Break
Haihao Zhang ⋅ Yunjian Zhang ⋅ Jianing Li ⋅ Lin Zhu ⋅ Meng Lv ⋅ Yao Zhu ⋅ Yanwei Liu ⋅ Xiangyang Ji
Exhibit Hall I #39
PlugMark: A Plug-in Zero-Watermarking Framework for Diffusion Models Poster Session 4 & Exhibit Hall with Coffee Break
Pengzhen Chen ⋅ Yanwei Liu ⋅ Xiaoyan Gu ⋅ Enci Liu ⋅ Zhuoyi Shang ⋅ Xiangyang Ji ⋅ Wu Liu
Exhibit Hall I #234
GauUpdate: New Object Insertion in 3D Gaussian Fields with Consistent Global Illumination Poster Session 6 & Exhibit Hall with Coffee Break
Chengwei REN ⋅ Fan Zhang ⋅ Liangchao Xu ⋅ Liang Pan ⋅ Ziwei Liu ⋅ Wenping Wang ⋅ Xiao-Ping Zhang ⋅ Yuan Liu
Exhibit Hall I #380
Diff2I2P: Differentiable Image-to-Point Cloud Registration with Diffusion Prior Poster Session 6 & Exhibit Hall with Coffee Break
Juncheng Mu ⋅ Chengwei REN ⋅ Weixiang Zhang ⋅ Liang Pan ⋅ Xiao-Ping Zhang ⋅ Yue Gao
Exhibit Hall I #101
EgoMusic-driven Human Dance Motion Estimation with Skeleton Mamba Poster Session 3 & Exhibit Hall
Quang Nguyen ⋅ Nhat Le ⋅ Baoru Huang ⋅ Minh VU ⋅ Chengcheng Tang ⋅ Van Nguyen ⋅ Ngan Le ⋅ Thieu Vo ⋅ Anh Nguyen
Exhibit Hall I #190
CT-ScanGaze: A Dataset and Baselines for 3D Volumetric Scanpath Modeling Poster Session 5 & Exhibit Hall
Trong-Thang Pham ⋅ AKASH AWASTHI ⋅ Saba Khan ⋅ Esteban Marti ⋅ Tien-Phat Nguyen ⋅ Khoa Vo ⋅ Minh Tran ⋅ Ngoc Son Nguyen ⋅ Cuong Van ⋅ Yuki Ikebe ⋅ Anh Nguyen ⋅ Anh Nguyen ⋅ Zhigang Deng ⋅ Carol Wu ⋅ Hien Nguyen ⋅ Ngan Le
Exhibit Hall I #181
Not Only Vision: Evolve Visual Speech Recognition via Peripheral Information Poster Session 1 & Exhibit Hall
Zhaoxin Yuan ⋅ Shuang Yang ⋅ Shiguang Shan ⋅ Xilin Chen
Exhibit Hall I #285
KOEnsAttack: Towards Efficient Data-Free Black-Box Adversarial Attacks via Knowledge-Orthogonalized Substitute Ensembles Poster Session 1 & Exhibit Hall
Chaoyong Yang ⋅ Jia-Li Yin ⋅ Bin Chen ⋅ Zhaozhe Hu ⋅ Xiaolei Liu ⋅ Wei Lin
Exhibit Hall I #286
CLIP-Adapted Region-to-Text Learning for Generative Open-Vocabulary Semantic Segmentation Poster Session 5 & Exhibit Hall
Jiannan Ge ⋅ Lingxi Xie ⋅ Hongtao Xie ⋅ Pandeng Li ⋅ Sun-Ao Liu ⋅ XIAOPENG ZHANG ⋅ Qi Tian ⋅ Yongdong Zhang
Exhibit Hall I #397
LV-MAE: Learning Long Video Representations through Masked-Embedding Autoencoders Poster Session 5 & Exhibit Hall
Ilan Naiman ⋅ Emanuel Baruch Baruch ⋅ Oron Anschel ⋅ Alon Shoshan ⋅ Igor Kviatkovsky ⋅ Manoj Aggarwal ⋅ Gerard Medioni
Exhibit Hall I #148
PanSt3R: Multi-view Consistent Panoptic Segmentation Poster Session 2 & Exhibit Hall with Coffee Break
Lojze Zust ⋅ Yohann Cabon ⋅ Juliette Marrie ⋅ Leonid Antsfeld ⋅ Boris Chidlovskii ⋅ Jerome Revaud ⋅ Gabriela Csurka
Exhibit Hall I #79
Guiding Diffusion-Based Articulated Object Generation by Partial Point Cloud Alignment and Physical Plausibility Constraints Poster Session 1 & Exhibit Hall
Jens U. Kreber ⋅ Joerg Stueckler
Exhibit Hall I #296
GARF: Learning Generalizable 3D Reassembly for Real-World Fractures Poster Session 2 & Exhibit Hall with Coffee Break
Sihang Li ⋅ Zeyu Jiang ⋅ Grace Chen ⋅ Chenyang Xu ⋅ Siqi Tan ⋅ Xue Wang ⋅ Irving Fang ⋅ Kristof Zyskowski ⋅ Shannon McPherron ⋅ Radu Iovita ⋅ Chen Feng ⋅ Jing Zhang
Exhibit Hall I #64
PhysSplat: Efficient Physics Simulation for 3D Scenes via MLLM-Guided Gaussian Splatting Poster Session 2 & Exhibit Hall with Coffee Break
Haoyu Zhao ⋅ Hao Wang ⋅ Xingyue Zhao ⋅ Hao Fei ⋅ Hongqiu Wang ⋅ Chengjiang Long ⋅ Hua Zou
Exhibit Hall I #21
Derm1M: A Million-scale Vision-Language Dataset Aligned with Clinical Ontology Knowledge for Dermatology Poster Session 3 & Exhibit Hall
Siyuan Yan ⋅ Ming Hu ⋅ Yiwen Jiang ⋅ Xieji Li ⋅ Hao Fei ⋅ Philipp Tschandl ⋅ Harald Kittler ⋅ Zongyuan Ge
Exhibit Hall I #252
Where, What, Why: Towards Explainable Driver Attention Prediction Poster Session 1 & Exhibit Hall
Yuchen Zhou ⋅ Jiayu Tang ⋅ Xiaoyan Xiao ⋅ Yueyao Lin ⋅ Linkai Liu ⋅ Zipeng Guo ⋅ Hao Fei ⋅ Xiaobo Xia ⋅ Chao Gou
Exhibit Hall I #246
TRNAS: A Training-Free Robust Neural Architecture Search Poster Session 1 & Exhibit Hall
Yeming Yang ⋅ Qingling Zhu ⋅ Jianping Luo ⋅ Ka-Chun Wong ⋅ Qiuzhen Lin ⋅ Jianqiang Li
Exhibit Hall I #212
Dynamic Multimodal Prototype Learning in Vision-Language Models Poster Session 1 & Exhibit Hall
Xingyu Zhu ⋅ Shuo Wang ⋅ Beier Zhu ⋅ Miaoge Li ⋅ Yunfan Li ⋅ Junfeng Fang ⋅ Zhicai Wang ⋅ Dongsheng Wang ⋅ Hanwang Zhang
Exhibit Hall I #230
CAP: Evaluation of Persuasive and Creative Image Generation Poster Session 4 & Exhibit Hall with Coffee Break
Aysan Aghazadeh ⋅ Adriana Kovashka
Exhibit Hall I #199
SummDiff: Generative Modeling of Video Summarization with Diffusion Poster Session 4 & Exhibit Hall with Coffee Break
Kwanseok Kim ⋅ Jaehoon Hahm ⋅ Sumin Kim ⋅ Jinhwan Sul ⋅ Byung-Hak Kim ⋅ Joonseok Lee
Exhibit Hall I #20
PersonalVideo: High ID-Fidelity Video Customization without Dynamic and Semantic Degradation Poster Session 4 & Exhibit Hall with Coffee Break
Hengjia Li ⋅ Haonan Qiu ⋅ Shiwei Zhang ⋅ Xiang Wang ⋅ Yujie Wei ⋅ Zekun Li ⋅ Yingya Zhang ⋅ Boxi Wu ⋅ Deng Cai
Exhibit Hall I #433
StreamGS: Online Generalizable Gaussian Splatting Reconstruction for Unposed Image Streams Poster Session 6 & Exhibit Hall with Coffee Break
Yang LI ⋅ Jinglu Wang ⋅ Lei Chu ⋅ Xiao Li ⋅ Shiu-hong Kao ⋅ Ying-Cong Chen ⋅ Yan Lu
Exhibit Hall I #107
Unleashing Vecset Diffusion Model for Fast Shape Generation Poster Session 1 & Exhibit Hall
Zeqiang Lai ⋅ Zhao Yunfei ⋅ Zibo Zhao ⋅ Haolin Liu ⋅ Fu-Yun Wang ⋅ Huiwen Shi ⋅ Xianghui Yang ⋅ Qingxiang Lin ⋅ Jingwei Huang ⋅ Lliu Yuhong ⋅ Jie Jiang ⋅ Chunchao Guo ⋅ Xiangyu Yue
Exhibit Hall I #232
Auto-Regressively Generating Multi-View Consistent Images Poster Session 1 & Exhibit Hall
JiaKui Hu ⋅ Yuxiao Yang ⋅ Jialun Liu ⋅ Jinbo Wu ⋅ Chen Zhao ⋅ Yanye Lu
Exhibit Hall I #235
MagicID: Hybrid Preference Optimization for ID-Consistent and Dynamic-Preserved Video Customization Poster Session 3 & Exhibit Hall
Hengjia Li ⋅ Lifan Jiang ⋅ Xi Xiao ⋅ Tianyang Wang ⋅ Hongwei Yi ⋅ Boxi Wu ⋅ Deng Cai
Exhibit Hall I #257
CA-I2P: Channel-Adaptive Registration Network with Global Optimal Selection Poster Session 6 & Exhibit Hall with Coffee Break
Zhixin Cheng ⋅ Jiacheng Deng ⋅ Xinjun Li ⋅ Xiaotian Yin ⋅ Bohao Liao ⋅ Baoqun Yin ⋅ Wenfei Yang ⋅ Tianzhu Zhang
Exhibit Hall I #292
Towards Performance Consistency in Multi-Level Model Collaboration Poster Session 1 & Exhibit Hall
Qi Li ⋅ Runpeng Yu ⋅ Xinchao Wang
Exhibit Hall I #236
Visual Interestingness Decoded: How GPT-4o Mirrors Human Interests Poster Session 4 & Exhibit Hall with Coffee Break
Fitim Abdullahu ⋅ Helmut Grabner
Exhibit Hall I #43
FuXi-RTM: A Physics-Guided Prediction Framework with Radiative Transfer Modeling Poster Session 2 & Exhibit Hall with Coffee Break
qiusheng huang ⋅ Xiaohui Zhong ⋅ Xu Fan ⋅ Hao Li
Exhibit Hall I #360
DMQ: Dissecting Outliers of Diffusion Models for Post-Training Quantization Poster Session 4 & Exhibit Hall with Coffee Break
Dongyeun Lee ⋅ jiwan hur ⋅ Hyounguk Shon ⋅ Jae Young Lee ⋅ Junmo Kim
Exhibit Hall I #347
Democratizing Text-to-Image Masked Generative Models with Compact Text-Aware One-Dimensional Tokens Poster Session 4 & Exhibit Hall with Coffee Break
Dongwon Kim ⋅ Ju He ⋅ Qihang Yu ⋅ Chenglin Yang ⋅ Xiaohui Shen ⋅ Suha Kwak ⋅ Liang-Chieh (Jay) Chen
Exhibit Hall I #341
Understanding Personal Concept in Open-Vocabulary Semantic Segmentation Poster Session 5 & Exhibit Hall
Sunghyun Park ⋅ Jungsoo Lee ⋅ Shubhankar Borse ⋅ Munawar Hayat ⋅ Sungha Choi ⋅ Kyuwoong Hwang ⋅ Fatih Porikli
Exhibit Hall I #15
DuoLoRA : Cycle-consistent and Rank-disentangled Content-Style Personalization Poster Session 4 & Exhibit Hall with Coffee Break
Aniket Roy ⋅ Shubhankar Borse ⋅ Shreya Kadambi ⋅ Debasmit Das ⋅ Shweta Mahajan ⋅ Risheek Garrepalli ⋅ Hyojin Park ⋅ Ankita Nayak ⋅ Rama Chellappa ⋅ Munawar Hayat ⋅ Fatih Porikli
Exhibit Hall I #47
ObjectGS: Object-aware Scene Reconstruction and Scene Understanding via Gaussian Splatting Poster Session 2 & Exhibit Hall with Coffee Break
Ruijie Zhu ⋅ Mulin Yu ⋅ Linning Xu ⋅ Lihan Jiang ⋅ Yixuan Li ⋅ Tianzhu Zhang ⋅ Jiangmiao Pang ⋅ Bo Dai
Exhibit Hall I #314
Video-T1: Test-time Scaling for Video Generation Poster Session 4 & Exhibit Hall with Coffee Break
Fangfu Liu ⋅ Hanyang Wang ⋅ Yimo Cai ⋅ Kaiyan Zhang ⋅ Xiaohang Zhan ⋅ Yueqi Duan
Exhibit Hall I #362
Reflect-DiT: Inference-Time Scaling for Text-to-Image Diffusion Transformers via In-Context Reflection Poster Session 4 & Exhibit Hall with Coffee Break
Shufan Li ⋅ Konstantinos Kallidromitis ⋅ Akash Gokul ⋅ Arsh Koneru ⋅ Yusuke Kato ⋅ Kazuki Kozuka ⋅ Aditya Grover
Exhibit Hall I #72
Discovering Divergent Representations between Text-to-Image Models Poster Session 4 & Exhibit Hall with Coffee Break
Lisa Dunlap ⋅ Trevor Darrell ⋅ Joseph Gonzalez ⋅ Fabian Caba Heilbron ⋅ Josef Sivic ⋅ Bryan Russell
Exhibit Hall I #252
VRM: Knowledge Distillation via Virtual Relation Matching Poster Session 1 & Exhibit Hall
Weijia Zhang ⋅ Fei Xie ⋅ Weidong Cai ⋅ Chao Ma
Exhibit Hall I #249
SKALD: Learning-Based Shot Assembly for Coherent Multi-Shot Video Creation Poster Session 4 & Exhibit Hall with Coffee Break
Chen Yi Lu ⋅ Mehrab Tanjim ⋅ Ishita Dasgupta ⋅ Somdeb Sarkhel ⋅ Gang Wu ⋅ Saayan Mitra ⋅ Somali Chaterji
Exhibit Hall I #284
CoDa-4DGS: Dynamic Gaussian Splatting with Context and Deformation Awareness for Autonomous Driving Poster Session 6 & Exhibit Hall with Coffee Break
Rui Song ⋅ Chenwei Liang ⋅ Yan Xia ⋅ Walter Zimmer ⋅ Hu Cao ⋅ Holger Caesar ⋅ Andreas Festag ⋅ Alois Knoll
Exhibit Hall I #319
GDKVM: Echocardiography Video Segmentation via Spatiotemporal Key-Value Memory with Gated Delta Rule Poster Session 3 & Exhibit Hall
Rui Wang ⋅ Yimu Sun ⋅ Jingxing Guo ⋅ Huisi Wu ⋅ Jing Qin
Exhibit Hall I #205
ImageGem: In-the-wild Generative Image Interaction Dataset for Generative Model Personalization Poster Session 4 & Exhibit Hall with Coffee Break
Yuanhe Guo ⋅ Linxi Xie ⋅ Zhuoran Chen ⋅ Kangrui Yu ⋅ Ryan Po ⋅ Guandao Yang ⋅ Gordon Wetzstein ⋅ Hongyi Wen
Exhibit Hall I #449
One Trajectory, One Token: Grounded Video Tokenization via Panoptic Sub-object Trajectory Poster Session 5 & Exhibit Hall
Chenhao Zheng ⋅ Jieyu Zhang ⋅ Mohammadreza Salehi ⋅ Ziqi Gao ⋅ Vishnu Iyengar ⋅ Norimasa Kobori ⋅ Quan Kong ⋅ Ranjay Krishna
Exhibit Hall I #317
LGA-Net: Learning Local and Global Affinities for Sparse Scribble based Image Colorization Poster Session 2 & Exhibit Hall with Coffee Break
Hongjin Lyu ⋅ Bo Li ⋅ Paul Rosin ⋅ Yu-Kun Lai
Exhibit Hall I #293
Backdoor Attacks on Neural Networks via One-Bit Flip Poster Session 1 & Exhibit Hall
Xiang Li ⋅ Lannan Luo ⋅ Qiang Zeng
Exhibit Hall I #406
Gaze-Language Alignment for Zero-Shot Prediction of Visual Search Targets from Human Gaze Scanpaths Poster Session 1 & Exhibit Hall
Sounak Mondal ⋅ Naveen Sendhilnathan ⋅ Ting Zhang ⋅ Yue Liu ⋅ Michael Proulx ⋅ Michael Iuzzolino ⋅ Chuan Qin ⋅ Tanya Jonker
Exhibit Hall I #252
O-MaMa: Learning Object Mask Matching between Egocentric and Exocentric Views Poster Session 2 & Exhibit Hall with Coffee Break
Lorenzo Mur-Labadia ⋅ Maria Santos-Villafranca ⋅ Jesus Bermudez-cameo ⋅ Alejandro Perez-Yus ⋅ Ruben Martinez-Cantin ⋅ Jose Guerrero
Exhibit Hall I #176
SAM Encoder Breach by Adversarial Simplicial Complex Triggers Downstream Model Failures Poster Session 3 & Exhibit Hall
Yi Qin ⋅ Rui Wang ⋅ Tao Huang ⋅ Tong Xiao ⋅ Liping Jing
Exhibit Hall I #57
HiMTok: Learning Hierarchical Mask Tokens for Image Segmentation with Large Multimodal Model Poster Session 5 & Exhibit Hall
Tao Wang ⋅ Changxu Cheng ⋅ Lingfeng Wang ⋅ Senda Chen ⋅ Wuyue Zhao
Exhibit Hall I #327
Semi-supervised Concept Bottleneck Models Poster Session 1 & Exhibit Hall
Lijie Hu ⋅ Tianhao Huang ⋅ Huanyi Xie ⋅ Xilin Gong ⋅ Chenyang Ren ⋅ Zhengyu Hu ⋅ Lu Yu ⋅ Ping Ma ⋅ Di Wang
Exhibit Hall I #191
Normal and Abnormal Pathology Knowledge-Augmented Vision-Language Model for Anomaly Detection in Pathology Images Poster Session 5 & Exhibit Hall
Jinsol Song ⋅ Jiamu Wang ⋅ Anh Nguyen ⋅ Keunho Byeon ⋅ Sangjeong Ahn ⋅ Sung Hak Lee ⋅ Jin Tae Kwak
Exhibit Hall I #212
DuET: Dual Incremental Object Detection via Exemplar-Free Task Arithmetic Poster Session 1 & Exhibit Hall
Munish Monga ⋅ Vishal Chudasama ⋅ Pankaj Wasnik ⋅ Biplab Banerjee
Exhibit Hall I #288
WINS: Winograd Structured Pruning for Fast Winograd Convolution Poster Session 5 & Exhibit Hall
Cheonjun Park ⋅ Hyunjae Oh ⋅ Mincheol Park ⋅ Hyunchan Moon ⋅ Minsik Kim ⋅ Suhyun Kim ⋅ Myung Kuk Yoon ⋅ Won Woo Ro
Exhibit Hall I #252
ART: Adaptive Relation Tuning for Generalized Relation Prediction Poster Session 4 & Exhibit Hall with Coffee Break
Gopika Sudhakaran ⋅ Hikaru Shindo ⋅ Patrick Schramowski ⋅ Simone Schaub-Meyer ⋅ Kristian Kersting ⋅ Stefan Roth
Exhibit Hall I #136
Feed-Forward SceneDINO for Unsupervised Semantic Scene Completion Poster Session 2 & Exhibit Hall with Coffee Break
Aleksandar Jevtić ⋅ Christoph Reich ⋅ Felix Wimbauer ⋅ Oliver Hahn ⋅ Christian Rupprecht ⋅ Stefan Roth ⋅ Daniel Cremers
Exhibit Hall I #166
DISTIL: Data-Free Inversion of Suspicious Trojan Inputs via Latent Diffusion Poster Session 1 & Exhibit Hall
Hossein Mirzaei ⋅ Zeinab Taghavi ⋅ Sepehr Rezaee ⋅ Masoud Hadi ⋅ Moein Madadi ⋅ Mackenzie Mathis
Exhibit Hall I #295
Factorized Learning for Temporally Grounded Video-Language Models Poster Session 5 & Exhibit Hall
Wenzheng Zeng ⋅ Difei Gao ⋅ Mike Zheng Shou ⋅ Hwee Tou Ng
Exhibit Hall I #84
FedPall: Prototype-based Adversarial and Collaborative Learning for Federated Learning with Feature Drift Poster Session 1 & Exhibit Hall
yong zhang ⋅ Feng Liang ⋅ Guanghu Yuan ⋅ Min Yang ⋅ Chengming Li ⋅ Xiping Hu
Exhibit Hall I #287
Multimodal LLMs as Customized Reward Models for Text-to-Image Generation Poster Session 4 & Exhibit Hall with Coffee Break
Shijie Zhou ⋅ Ruiyi Zhang ⋅ Huaisheng Zhu ⋅ Branislav Kveton ⋅ Yufan Zhou ⋅ Jiuxiang Gu ⋅ Jian Chen ⋅ Changyou Chen
Exhibit Hall I #455
MissRAG: Addressing the Missing Modality Challenge in Multimodal Large Language Models Poster Session 1 & Exhibit Hall
Vittorio Pipoli ⋅ Alessia Saporita ⋅ Federico Bolelli ⋅ Marcella Cornia ⋅ Lorenzo Baraldi ⋅ Costantino Grana ⋅ Rita Cucchiara ⋅ Elisa Ficarra
Exhibit Hall I #297
FinMMR: Make Financial Numerical Reasoning More Multimodal, Comprehensive, and Challenging Poster Session 1 & Exhibit Hall
Zichen Tang ⋅ Haihong E ⋅ Jiacheng Liu ⋅ Zhongjun Yang ⋅ Rongjin Li ⋅ Zihua Rong ⋅ Haoyang He ⋅ Zhuodi Hao ⋅ Xinyang Hu ⋅ Kun Ji ⋅ Ziyan Ma ⋅ Mengyuan Ji ⋅ Jun Zhang ⋅ Chenghao Ma ⋅ Qianhe Zheng ⋅ Yang Liu ⋅ Yiling Huang ⋅ Xinyi Hu ⋅ Qing Huang ⋅ Zijian Xie ⋅ Shiyao Peng
Exhibit Hall I #300
Iris: Breaking GUI Complexity with Adaptive Focus and Self-Refining Poster Session 5 & Exhibit Hall
Zhiqi Ge ⋅ Juncheng Li ⋅ Xinglei Pang ⋅ Minghe Gao ⋅ Kaihang Pan ⋅ Wang Lin ⋅ Hao Fei ⋅ Wenqiao Zhang ⋅ Siliang Tang ⋅ Yueting Zhuang
Exhibit Hall I #446
No Pose at All: Self-Supervised Pose-Free 3D Gaussian Splatting from Sparse Views Poster Session 6 & Exhibit Hall with Coffee Break
Ranran Huang ⋅ Krystian Mikolajczyk
Exhibit Hall I #311
External Knowledge Injection for CLIP-Based Class-Incremental Learning Poster Session 1 & Exhibit Hall
Da-Wei Zhou ⋅ Kai-Wen Li ⋅ Jingyi Ning ⋅ Han-Jia Ye ⋅ Lijun Zhang ⋅ De-Chuan Zhan
Exhibit Hall I #308
Cooperative Pseudo Labeling for Unsupervised Federated Classification Poster Session 1 & Exhibit Hall
Kuangpu Guo ⋅ Lijun Sheng ⋅ Yongcan Yu ⋅ Jian Liang ⋅ Zilei Wang ⋅ Ran He
Exhibit Hall I #309
DreamLayer: Simultaneous Multi-Layer Generation via Diffusion Model Poster Session 1 & Exhibit Hall
Junjia Huang ⋅ Pengxiang Yan ⋅ Jinhang Cai ⋅ Jiyang Liu ⋅ Zhao Wang ⋅ Yitong Wang ⋅ Xinglong Wu ⋅ Guanbin Li
Exhibit Hall I #312
AutoOcc: Automatic Open-Ended Semantic Occupancy Annotation via Vision-Language Guided Gaussian Splatting Poster Session 1 & Exhibit Hall
Xiaoyu Zhou ⋅ Jingqi Wang ⋅ Yongtao Wang ⋅ Yufei Wei ⋅ Nan Dong ⋅ Ming-Hsuan Yang
Exhibit Hall I #313
Augmenting Moment Retrieval: Zero-Dependency Two-Stage Learning Poster Session 1 & Exhibit Hall
Zhengxuan Wei ⋅ Jiajin Tang ⋅ Sibei Yang
Exhibit Hall I #316
Rep-MTL: Unleashing the Power of Representation-level Task Saliency for Multi-Task Learning Poster Session 1 & Exhibit Hall
Zedong Wang ⋅ Siyuan Li ⋅ Dan Xu
Exhibit Hall I #317
Fine-Grained Evaluation of Large Vision-Language Models in Autonomous Driving Poster Session 2 & Exhibit Hall with Coffee Break
Yue Li ⋅ Meng Tian ⋅ Zhenyu Lin ⋅ Jiangtong Zhu ⋅ Dechang Zhu ⋅ Haiqiang Liu ⋅ Yueyi Zhang ⋅ Zhiwei Xiong ⋅ Xinhai Zhao
Exhibit Hall I #414
Activation Subspaces for Out-of-Distribution Detection Poster Session 1 & Exhibit Hall
Barış Zöngür ⋅ Robin Hesse ⋅ Stefan Roth
Exhibit Hall I #326
PAN-Crafter: Learning Modality-Consistent Alignment for PAN-Sharpening Poster Session 1 & Exhibit Hall
Jeonghyeok Do ⋅ Sungpyo Kim ⋅ Geunhyuk Youk ⋅ Jaehyup Lee ⋅ Munchurl Kim
Exhibit Hall I #397
Continual Adaptation: Environment-Conditional Parameter Generation for Object Detection in Dynamic Scenarios Poster Session 1 & Exhibit Hall
Deng Li ⋅ Aming WU ⋅ Yang Li ⋅ Yaowei Wang ⋅ Yahong Han
Exhibit Hall I #416
Differentially Private Fine-Tuning of Diffusion Models Poster Session 1 & Exhibit Hall
Yu-Lin Tsai ⋅ Yizhe Li ⋅ Zekai Chen ⋅ Po-Yu Chen ⋅ Francois Buet-Golfouse ⋅ Chia-Mu Yu ⋅ Xuebin Ren
Exhibit Hall I #428
IRGPT: Understanding Real-world Infrared Image with Bi-cross-modal Curriculum on Large-scale Benchmark Poster Session 1 & Exhibit Hall
Zhe Cao ⋅ Jin Zhang ⋅ Ruiheng Zhang
Exhibit Hall I #6
Multi-turn Consistent Image Editing Poster Session 4 & Exhibit Hall with Coffee Break
Zijun Zhou ⋅ Yingying Deng ⋅ Xiangyu He ⋅ Weiming Dong ⋅ Fan Tang
Exhibit Hall I #86
A Hidden Stumbling Block in Generalized Category Discovery: Distracted Attention Poster Session 1 & Exhibit Hall
Qiyu Xu ⋅ Zhanxuan Hu ⋅ Yu Duan ⋅ Ercheng Pei ⋅ Yonghang Tai
Exhibit Hall I #28
CAFA: a Controllable Automatic Foley Artist Poster Session 4 & Exhibit Hall with Coffee Break
Roi Benita ⋅ Michael Finkelson ⋅ Tavi Halperin ⋅ Gleb Sterkin ⋅ Yossi Adi
Exhibit Hall I #98
Unknown Text Learning for CLIP-based Few-Shot Open-set Recognition Poster Session 1 & Exhibit Hall
Rui Ma ⋅ Qilong Wang ⋅ Bing Cao ⋅ Qinghua Hu ⋅ Yahong Han
Exhibit Hall I #52
Reducing Unimodal Bias in Multi-Modal Semantic Segmentation with Multi-Scale Functional Entropy Regularization Poster Session 5 & Exhibit Hall
Xu Zheng ⋅ Yuanhuiyi Lyu ⋅ Lutao Jiang ⋅ Danda Pani Paudel ⋅ Luc Gool ⋅ Xuming Hu
Exhibit Hall I #127
Personalized Federated Learning under Local Supervision Poster Session 1 & Exhibit Hall
Qiqi Liu ⋅ Jiaqiang Li ⋅ Yuchen Liu ⋅ Yaochu Jin ⋅ Lingjuan Lyu ⋅ Xiaohu Wu ⋅ Han Yu
Exhibit Hall I #379
Multi-View 3D Point Tracking Poster Session 1 & Exhibit Hall
Frano Rajič ⋅ Haofei Xu ⋅ Marko Mihajlovic ⋅ Siyuan Li ⋅ Irem Demir ⋅ Emircan Gündoğdu ⋅ Lei Ke ⋅ Sergey Prokudin ⋅ Marc Pollefeys ⋅ Siyu Tang
Exhibit Hall I #75
Hyper-Depth: Hypergraph-based Multi-Scale Representation Fusion for Monocular Depth Estimation Poster Session 2 & Exhibit Hall with Coffee Break
Lin Bie ⋅ Siqi Li ⋅ Yifan Feng ⋅ Yue Gao
Exhibit Hall I #6
Learning Separable Fine-Grained Representation via Dendrogram Construction from Coarse Labels for Fine-grained Visual Recognition Poster Session 1 & Exhibit Hall
Guanghui Shi ⋅ Xuefeng liang ⋅ Wenjie Li ⋅ Xiaoyu Lin
Exhibit Hall I #72
PRVQL: Progressive Knowledge-guided Refinement for Robust Egocentric Visual Query Localization Poster Session 2 & Exhibit Hall with Coffee Break
Bing Fan ⋅ Yunhe Feng ⋅ Yapeng Tian ⋅ James Liang ⋅ Yuewei Lin ⋅ Yan Huang ⋅ Heng Fan
Exhibit Hall I #13
PRO-VPT: Distribution-Adaptive Visual Prompt Tuning via Prompt Relocation Poster Session 1 & Exhibit Hall
Chikai Shang ⋅ Mengke Li ⋅ Yiqun Zhang ⋅ Zhen Chen ⋅ Jinlin Wu ⋅ Fangqing Gu ⋅ Yang Lu ⋅ Yiu-ming Cheung
Exhibit Hall I #138
Language-Driven Multi-Label Zero-Shot Learning with Semantic Granularity Poster Session 1 & Exhibit Hall
Shouwen Wang ⋅ Qian Wan ⋅ Junbin Gao ⋅ Zhigang Zeng
Exhibit Hall I #178
Generalized Deep Multi-view Clustering via Causal Learning with Partially Aligned Cross-view Correspondence Poster Session 1 & Exhibit Hall
Xihong Yang ⋅ Siwei Wang ⋅ Jiaqi Jin ⋅ Fangdi Wang ⋅ Tianrui Liu ⋅ Yueming Jin ⋅ Xinwang Liu ⋅ En Zhu ⋅ Kunlun He
Exhibit Hall I #180
Granular Concept Circuits: Toward a Fine-Grained Circuit Discovery for Concept Representations Poster Session 1 & Exhibit Hall
Dahee Kwon ⋅ Sehyun Lee ⋅ Jaesik Choi
Exhibit Hall I #214
Learning an Implicit Physics Model for Image-based Fluid Simulation Poster Session 2 & Exhibit Hall with Coffee Break
Emily Jia ⋅ Jiageng Mao ⋅ Zhiyuan Gao ⋅ Yajie Zhao ⋅ Yue Wang
Exhibit Hall I #190
Less is More: Empowering GUI Agent with Context-Aware Simplification Poster Session 2 & Exhibit Hall with Coffee Break
Gongwei Chen ⋅ Xurui Zhou ⋅ Rui Shao ⋅ Yibo Lyu ⋅ Kaiwen Zhou ⋅ Shuai Wang ⋅ WenTao Li ⋅ Yinchuan Li ⋅ Zhongang Qi ⋅ Liqiang Nie
Exhibit Hall I #83
Trace3D: Consistent Segmentation Lifting via Gaussian Instance Tracing Poster Session 2 & Exhibit Hall with Coffee Break
Hongyu Shen ⋅ Junfeng Ni ⋅ Weishuo Li ⋅ Mingtao Pei ⋅ Yixin Chen ⋅ Siyuan Huang
Exhibit Hall I #155
IM360: Large-scale Indoor Mapping with 360 Cameras Poster Session 6 & Exhibit Hall with Coffee Break
Dongki Jung ⋅ Jaehoon Choi ⋅ Yonghan Lee ⋅ Dinesh Manocha
Exhibit Hall I #416
EventUPS: Uncalibrated Photometric Stereo Using an Event Camera Poster Session 2 & Exhibit Hall with Coffee Break
Jinxiu Liang ⋅ Bohan Yu ⋅ Siqi Yang ⋅ Haotian Zhuang ⋅ Jieji Ren ⋅ Peiqi Duan ⋅ Boxin Shi
Exhibit Hall I #235
Noise-Modeled Diffusion Models for Low-Light Spike Image Restoration Poster Session 1 & Exhibit Hall
Ruonan Liu ⋅ Lin Zhu ⋅ Xijie Xiang ⋅ Lizhi Wang ⋅ Hua Huang
Exhibit Hall I #382
Rethinking the Upsampling Process in Light Field Super-Resolution with Spatial-Epipolar Implicit Image Function Poster Session 2 & Exhibit Hall with Coffee Break
Ruixuan Cong ⋅ Yu Wang ⋅ Mingyuan Zhao ⋅ Da Yang ⋅ Rongshan Chen ⋅ Hao Sheng
Exhibit Hall I #239
Harnessing Input-Adaptive Inference for Efficient VLN Poster Session 2 & Exhibit Hall with Coffee Break
Dongwoo Kang ⋅ Akhil Perincherry ⋅ Zachary Coalson ⋅ Aiden Gabriel ⋅ Stefan Lee ⋅ Sanghyun Hong
Exhibit Hall I #300
When Lighting Deceives: Exposing Vision-Language Models' Illumination Vulnerability Through Illumination Transformation Attack Poster Session 3 & Exhibit Hall
Hanqing Liu ⋅ Shouwei Ruan ⋅ Yao Huang ⋅ Shiji Zhao ⋅ Xingxing Wei
Exhibit Hall I #44
SemTalk: Holistic Co-speech Motion Generation with Frame-level Semantic Emphasis Poster Session 3 & Exhibit Hall
Xiangyue Zhang ⋅ Jianfang Li ⋅ Jiaxu Zhang ⋅ Ziqiang Dang ⋅ Jianqiang Ren ⋅ Liefeng Bo ⋅ Zhigang Tu
Exhibit Hall I #353
PersonaCraft: Personalized and Controllable Full-Body Multi-Human Scene Generation Using Occlusion-Aware 3D-Conditioned Diffusion Poster Session 3 & Exhibit Hall
Gwanghyun Kim ⋅ Suh Jeon Jeon ⋅ Seunggyu Lee ⋅ Se Young Chun
Exhibit Hall I #191
MotionFollower: Editing Video Motion via Score-Guided Diffusion Poster Session 3 & Exhibit Hall
Shuyuan Tu ⋅ Qi Dai ⋅ Zihao Zhang ⋅ Sicheng Xie ⋅ Zhi-Qi Cheng ⋅ Chong Luo ⋅ Xintong Han ⋅ Zuxuan Wu ⋅ Yu-Gang Jiang
Exhibit Hall I #265
Online Generic Event Boundary Detection Poster Session 3 & Exhibit Hall
Hyung Rok Jung ⋅ Daneul Kim ⋅ Seunggyun Lim ⋅ Jeany Son ⋅ Jonghyun Choi
Exhibit Hall I #351
A Recipe for Generating 3D Worlds from a Single Image Poster Session 1 & Exhibit Hall
Katja Schwarz ⋅ Denis Rozumny ⋅ Samuel Rota Bulò ⋅ Lorenzo Porzi ⋅ Peter Kontschieder
Exhibit Hall I #327
Guiding Diffusion Models with Adaptive Negative Sampling Without External Resources Poster Session 4 & Exhibit Hall with Coffee Break
Alakh Desai ⋅ Nuno Vasconcelos
Exhibit Hall I #117
Transformed Low-rank Adaptation via Tensor Decomposition and Its Applications to Text-to-image Models Poster Session 4 & Exhibit Hall with Coffee Break
Zerui Tao ⋅ Yuhta Takida ⋅ Naoki Murata ⋅ Qibin Zhao ⋅ Yuki Mitsufuji
Exhibit Hall I #137
DLFR-Gen: Diffusion-based Video Generation with Dynamic Latent Frame Rate Poster Session 4 & Exhibit Hall with Coffee Break
Zhihang Yuan ⋅ Rui Xie ⋅ Yuzhang Shang ⋅ Hanling Zhang ⋅ Siyuan Wang ⋅ Shengen Yan ⋅ Guohao Dai ⋅ Yu Wang
Exhibit Hall I #144
DiffDoctor: Diagnosing Image Diffusion Models Before Treating Poster Session 4 & Exhibit Hall with Coffee Break
Yiyang Wang ⋅ Xi Chen ⋅ Xiaogang Xu ⋅ Sihui Ji ⋅ Yu Liu ⋅ Yujun Shen ⋅ Hengshuang Zhao
Exhibit Hall I #387
GroundingSuite: Measuring Complex Multi-Granular Pixel Grounding Poster Session 5 & Exhibit Hall
Rui Hu ⋅ Yuxuan Zhang ⋅ Lianghui Zhu ⋅ Tianheng Cheng ⋅ Lei Liu ⋅ Heng Liu ⋅ Longjin Ran ⋅ Xiaoxin Chen ⋅ Wenyu Liu ⋅ Xinggang Wang
Exhibit Hall I #312
Video Motion Graphs Poster Session 3 & Exhibit Hall
Haiyang Liu ⋅ Zhan Xu ⋅ Fating Hong ⋅ Hsin-Ping Huang ⋅ Yi Zhou ⋅ Yang Zhou
Exhibit Hall I #350
Adaptive Learning of High-Value Regions for Semi-Supervised Medical Image Segmentation Poster Session 5 & Exhibit Hall
Tao Lei ⋅ Ziyao Yang ⋅ Xingwu wang ⋅ Yi Wang ⋅ Xuan Wang ⋅ FeimanSun FeimanSun ⋅ Asoke Nandi
Exhibit Hall I #153
Constructing Ophthalmic MLLM for Positioning-diagnosis Collaboration Through Clinical Cognitive Chain Reasoning Poster Session 5 & Exhibit Hall
Xinyao Liu ⋅ Diping Song
Exhibit Hall I #164
Hallucinatory Image Tokens: A Training-free EAZY Approach to Detecting and Mitigating Object Hallucinations in LVLMs Poster Session 5 & Exhibit Hall
Liwei Che ⋅ Qingze T Liu ⋅ Jing Jia ⋅ Weiyi Qin ⋅ Ruixiang Tang ⋅ Vladimir Pavlovic
Exhibit Hall I #172
Keep Your Friends Close, and Your Enemies Farther: Distance-aware Voxel-wise Contrastive Learning for Semi-supervised Multi-organ Segmentation Poster Session 5 & Exhibit Hall
Haochen Zhao ⋅ Jianwei Niu ⋅ Xuefeng Liu ⋅ Xiaozheng Xie ⋅ Li Kuang ⋅ Haotian Yang ⋅ Bin Dai ⋅ Hui Meng ⋅ Yong Wang
Exhibit Hall I #190
Integrating Biological Knowledge for Robust Microscopy Image Profiling on De Novo Cell Lines Poster Session 5 & Exhibit Hall
Jiayuan Chen ⋅ Thai-Hoang Pham ⋅ Yuanlong Wang ⋅ Ping Zhang
Exhibit Hall I #286
Spectral Sensitivity Estimation with an Uncalibrated Diffraction Grating Poster Session 6 & Exhibit Hall with Coffee Break
Lilika Makabe ⋅ Hiroaki Santo ⋅ Fumio Okura ⋅ Michael Brown ⋅ Yasuyuki Matsushita
Exhibit Hall I #245
TransiT: Transient Transformer for Non-line-of-sight Videography Poster Session 6 & Exhibit Hall with Coffee Break
Ruiqian Li ⋅ Siyuan Shen ⋅ Suan Xia ⋅ Ziheng Wang ⋅ Xingyue Peng ⋅ Chengxuan Song ⋅ Yingsheng Zhu ⋅ Tao Wu ⋅ Shiying Li ⋅ Jingyi Yu
Exhibit Hall I #272
LA-MOTR: End-to-End Multi-Object Tracking by Learnable Association Poster Session 3 & Exhibit Hall
Peng Wang ⋅ Yongcai Wang ⋅ Hualong Cao ⋅ Wang Chen ⋅ Deying Li
Exhibit Hall I #230
Contact-Aware Amodal Completion for Human-Object Interaction via Multi-Regional Inpainting Poster Session 2 & Exhibit Hall with Coffee Break
Seunggeun Chi ⋅ Pin-Hao Huang ⋅ Enna Sachdeva ⋅ Kwonjoon Lee
Exhibit Hall I #419
On the Complexity-Faithfulness Trade-off of Gradient-Based Explanations Poster Session 1 & Exhibit Hall
Amir Mehrpanah ⋅ Matteo Gamba ⋅ Kevin Smith ⋅ Hossein Azizpour
Exhibit Hall I #328
TrajectoryCrafter: Redirecting Camera Trajectory for Monocular Videos via Diffusion Models Poster Session 1 & Exhibit Hall
Mark YU ⋅ Wenbo Hu ⋅ Jinbo Xing ⋅ Ying Shan
Exhibit Hall I #154
CSD-VAR: Content-Style Decomposition in Visual Autoregressive Models Poster Session 4 & Exhibit Hall with Coffee Break
Quang-Binh Nguyen ⋅ Minh Luu ⋅ Quang Nguyen ⋅ Anh Tran ⋅ Khoi Nguyen
Exhibit Hall I #203
Oasis: One Image is All You Need for Multimodal Instruction Data Synthesis Poster Session 1 & Exhibit Hall
Letian Zhang ⋅ Quan Cui ⋅ Bingchen Zhao ⋅ Cheng Yang
Exhibit Hall I #329
Learning to Inference Adaptively for Multimodal Large Language Models Poster Session 1 & Exhibit Hall
Zhuoyan Xu ⋅ Khoi Nguyen ⋅ Preeti Mukherjee ⋅ Saurabh Bagchi ⋅ Somali Chaterji ⋅ Yingyu Liang ⋅ Yin Li
Exhibit Hall I #330
Self-Reinforcing Prototype Evolution with Dual-Knowledge Cooperation for Semi-Supervised Lifelong Person Re-Identification Poster Session 1 & Exhibit Hall
Kunlun Xu ⋅ Fan Zhuo ⋅ Jiangmeng Li ⋅ Xu Zou ⋅ Jiahuan Zhou
Exhibit Hall I #331
Hierarchical Divide-and-Conquer Grouping for Classification Adaptation of Pre-Trained Models Poster Session 1 & Exhibit Hall
Ziqian Lu ⋅ Yunlong Yu ⋅ Qinyue Tong ⋅ Jun Liu
Exhibit Hall I #332
Lark: Low-Rank Updates After Knowledge Localization for Few-shot Class-Incremental Learning Poster Session 1 & Exhibit Hall
Jinxin Shi ⋅ Jiabao Zhao ⋅ Yifan Yang ⋅ Xingjiao Wu ⋅ Jiawen Li ⋅ Liang He
Exhibit Hall I #335
Large Multi-modal Models Can Interpret Features in Large Multi-modal Models Poster Session 1 & Exhibit Hall
Kaichen Zhang ⋅ Yifei Shen ⋅ Bo Li ⋅ Ziwei Liu
Exhibit Hall I #339
A Conditional Probability Framework for Compositional Zero-shot Learning Poster Session 1 & Exhibit Hall
Peng Wu ⋅ Qiuxia Lai ⋅ Hao Fang ⋅ Guo-Sen Xie ⋅ Yilong Yin ⋅ Xiankai Lu ⋅ Wenguan Wang
Exhibit Hall I #341
Mind the Gap: Preserving and Compensating for the Modality Gap in CLIP-Based Continual Learning Poster Session 1 & Exhibit Hall
Linlan Huang ⋅ Xusheng Cao ⋅ Haori Lu ⋅ Yifan Meng ⋅ Fei Yang ⋅ Xialei Liu
Exhibit Hall I #351
BUFFER-X: Towards Zero-Shot Point Cloud Registration in Diverse Scenes Poster Session 1 & Exhibit Hall
Minkyun Seo ⋅ Hyungtae Lim ⋅ Kanghee Lee ⋅ Luca Carlone ⋅ Jaesik Park
Exhibit Hall I #358
RANKCLIP: Ranking-Consistent Language-Image Pretraining Poster Session 1 & Exhibit Hall
Yiming Zhang ⋅ Zhuokai Zhao ⋅ Zhaorun Chen ⋅ Zhili Feng ⋅ Zenghui Ding ⋅ Yining Sun
Exhibit Hall I #360
An Efficient Post-hoc Framework for Reducing Task Discrepancy of Text Encoders for Composed Image Retrieval Poster Session 1 & Exhibit Hall
Jaeseok Byun ⋅ Seokhyeon Jeong ⋅ Wonjae Kim ⋅ Sanghyuk Chun ⋅ Taesup Moon
Exhibit Hall I #362
ZIUM: Zero-Shot Intent-Aware Adversarial Attack on Unlearned Models Poster Session 1 & Exhibit Hall
Hyun Jun Yook ⋅ Ga San Jhun ⋅ Cho Hyun ⋅ Min Jeon ⋅ Donghyun Kim ⋅ Tae Hyung Kim ⋅ Youn Lee
Exhibit Hall I #365
Federated Prompt-Tuning with Heterogeneous and Incomplete Multimodal Client Data Poster Session 1 & Exhibit Hall
Hang Phung ⋅ Manh Nguyen ⋅ Thanh Huynh ⋅ Quoc Viet Hung Nguyen ⋅ Trong Nghia Hoang ⋅ Phi Le Nguyen
Exhibit Hall I #366
Find a Scapegoat: Poisoning Membership Inference Attack and Defense to Federated Learning Poster Session 1 & Exhibit Hall
Wenjin Mo ⋅ Zhiyuan Li ⋅ Minghong Fang ⋅ Mingwei Fang
Exhibit Hall I #369
OpenVision: A Fully-Open, Cost-Effective Family of Advanced Vision Encoders for Multimodal Learning Poster Session 1 & Exhibit Hall
Xianhang Li ⋅ Yanqing Liu ⋅ Haoqin Tu ⋅ Cihang Xie
Exhibit Hall I #370
Integrating Visual Interpretation and Linguistic Reasoning for Geometric Problem Solving Poster Session 1 & Exhibit Hall
Zixian Guo ⋅ Ming Liu ⋅ Qilong Wang ⋅ Zhilong Ji ⋅ Jinfeng Bai ⋅ Lei Zhang ⋅ Wangmeng Zuo
Exhibit Hall I #371
SAFER: Sharpness Aware layer-selective Finetuning for Enhanced Robustness in vision transformers Poster Session 1 & Exhibit Hall
Bhavna Gopal ⋅ Huanrui Yang ⋅ Mark Horton ⋅ Yiran Chen
Exhibit Hall I #372
To Label or Not to Label: PALM – A Predictive Model for Evaluating Sample Efficiency in Active Learning Models Poster Session 1 & Exhibit Hall
Julia Machnio ⋅ Mads Nielsen ⋅ Mostafa Mehdipour Ghazi
Exhibit Hall I #376
Uncalibrated Structure from Motion on a Sphere Poster Session 1 & Exhibit Hall
Jonathan Ventura ⋅ Viktor Larsson ⋅ Fredrik Kahl
Exhibit Hall I #303
Prototype-based Contrastive Learning with Stage-wise Progressive Augmentation for Self-Supervised Fine-Grained Learning Poster Session 1 & Exhibit Hall
BaoFeng Tan ⋅ Xiu-Shen Wei ⋅ Lin Zhao
Exhibit Hall I #386
Radiant Foam: Real-Time Differentiable Ray Tracing Poster Session 1 & Exhibit Hall
Shrisudhan Govindarajan ⋅ Daniel Rebain ⋅ Kwang Moo Yi ⋅ Andrea Tagliasacchi
Exhibit Hall I #387
COSTARR: Consolidated Open Set Technique with Attenuation for Robust Recognition Poster Session 1 & Exhibit Hall
Ryan Rabinowitz ⋅ Steve Cruz ⋅ Walter Scheirer ⋅ Terrance Boult
Exhibit Hall I #388
Information Density Principle for MLLM Benchmarks Poster Session 1 & Exhibit Hall
Chunyi Li ⋅ Xiaozhe Li ⋅ Zicheng Zhang ⋅ Yuan Tian ⋅ Ziheng Jia ⋅ Xiaohong Liu ⋅ Xiongkuo Min ⋅ Jia Wang ⋅ Haodong Duan ⋅ Kai Chen ⋅ Guangtao Zhai
Exhibit Hall I #390
ReTracker: Exploring Image Matching for Robust Online Any Point Tracking Poster Session 1 & Exhibit Hall
Dongli Tan ⋅ Xingyi He ⋅ Sida Peng ⋅ Yiqing Gong ⋅ Xing Zhu ⋅ Jiaming Sun ⋅ Ruizhen Hu ⋅ Yujun Shen ⋅ Hujun Bao ⋅ Xiaowei Zhou
Exhibit Hall I #404
Perspective-Aware Teaching: Adapting Knowledge for Heterogeneous Distillation Poster Session 1 & Exhibit Hall
Jhe-Hao Lin ⋅ Yi Yao ⋅ Chan-Feng Hsu ⋅ Hongxia Xie ⋅ Hong-Han Shuai ⋅ Wen-Huang Cheng
Exhibit Hall I #391
Is Meta-Learning Out? Rethinking Unsupervised Few-Shot Classification with Limited Entropy Poster Session 1 & Exhibit Hall
Yunchuan Guan ⋅ Yu Liu ⋅ Ke Zhou ⋅ Zhiqi Shen ⋅ Jenq-Newng Hwang ⋅ Serge Belongie ⋅ Lei Li
Exhibit Hall I #392
Learning to Unlearn while Retaining: Combating Gradient Conflicts in Machine Unlearning Poster Session 1 & Exhibit Hall
Gaurav Patel ⋅ Qiang Qiu
Exhibit Hall I #394
Robust Multi-View Learning via Representation Fusion of Sample-Level Attention and Alignment of Simulated Perturbation Poster Session 1 & Exhibit Hall
Jie Xu ⋅ Na Zhao ⋅ Gang Niu ⋅ Masashi Sugiyama ⋅ Xiaofeng Zhu
Exhibit Hall I #396
Beyond the Frame: Generating 360° Panoramic Videos from Perspective Videos Poster Session 3 & Exhibit Hall
Rundong Luo ⋅ Matthew Wallingford ⋅ Ali Farhadi ⋅ Noah Snavely ⋅ Wei-Chiu Ma
Exhibit Hall I #408
A Linear N-Point Solver for Structure and Motion from Asynchronous Tracks Poster Session 1 & Exhibit Hall
Hang Su ⋅ Yunlong Feng ⋅ Daniel Gehrig ⋅ Panfeng Jiang ⋅ Ling Gao ⋅ Xavier Lagorce ⋅ Laurent Kneip
Exhibit Hall I #407
Differentiable Room Acoustic Rendering with Multi-View Vision Priors Poster Session 1 & Exhibit Hall
Derong Jin ⋅ Ruohan Gao
Exhibit Hall I #381
Long-LRM: Long-sequence Large Reconstruction Model for Wide-coverage Gaussian Splats Poster Session 1 & Exhibit Hall
Chen Ziwen ⋅ Hao Tan ⋅ Kai Zhang ⋅ Sai Bi ⋅ Fujun Luan ⋅ Yicong Hong ⋅ Li Fuxin ⋅ Zexiang Xu
Exhibit Hall I #408
SplatTalk: 3D VQA with Gaussian Splatting Poster Session 1 & Exhibit Hall
Anh Thai ⋅ Kyle Genova ⋅ Songyou Peng ⋅ Leonidas Guibas ⋅ Thomas Funkhouser
Exhibit Hall I #442
Joint Diffusion Models in Continual Learning Poster Session 1 & Exhibit Hall
Paweł Skierś ⋅ Kamil Deja
Exhibit Hall I #411
GT-Loc: Unifying When and Where in Images through a Joint Embedding Space Poster Session 1 & Exhibit Hall
David G. Shatwell ⋅ Ishan Rajendrakumar Dave ⋅ Swetha Sirnam ⋅ Mubarak Shah
Exhibit Hall I #153
TurboTrain: Towards Efficient and Balanced Multi-Task Learning for Multi-Agent Perception and Prediction Poster Session 1 & Exhibit Hall
Zewei Zhou ⋅ Zhihao Zhao ⋅ Tianhui Cai ⋅ Zhiyu Huang ⋅ Bolei Zhou ⋅ Jiaqi Ma
Exhibit Hall I #412
InfiniDreamer: Arbitrarily Long Human Motion Generation via Segment Score Distillation Poster Session 3 & Exhibit Hall
Wenjie Zhuo ⋅ Fan Ma ⋅ Hehe Fan
Exhibit Hall I #441
Multimodal Large Language Model-Guided ISP Hyperparameter Optimization with Dynamic Preference Learning Poster Session 1 & Exhibit Hall
Xinyu Sun ⋅ Zhikun Zhao ⋅ congyan lang ⋅ Bing Li ⋅ Juan Wang
Exhibit Hall I #31
VITAL: More Understandable Feature Visualization through Distribution Alignment and Relevant Information Flow Poster Session 1 & Exhibit Hall
Ada Görgün ⋅ Bernt Schiele ⋅ Jonas Fischer
Exhibit Hall I #413
ToolVQA: A Dataset for Multi-step Reasoning VQA with External Tools Poster Session 1 & Exhibit Hall
Shaofeng Yin ⋅ Ting Lei ⋅ Yang Liu
Exhibit Hall I #415
MMCR: Benchmarking Cross-Source Reasoning in Scientific Papers Poster Session 1 & Exhibit Hall
Yang Tian ⋅ Zheng Lu ⋅ Mingqi Gao ⋅ Zheng Liu ⋅ Bo Zhao
Exhibit Hall I #36
MosaicDiff: Training-free Structural Pruning for Diffusion Model Acceleration Reflecting Pretraining Dynamics Poster Session 1 & Exhibit Hall
Bowei Guo ⋅ Shengkun Tang ⋅ Cong Zeng ⋅ Zhiqiang Shen
Exhibit Hall I #147
FEVER-OOD: Free Energy Vulnerability Elimination for Robust Out-of-Distribution Detection Poster Session 1 & Exhibit Hall
Brian Isaac-Medina ⋅ Mauricio Che ⋅ Yona Falinie A. Gaus ⋅ Samet Akcay ⋅ Toby Breckon
Exhibit Hall I #425
CAVIS: Context-Aware Video Instance Segmentation Poster Session 1 & Exhibit Hall
Seunghun Lee ⋅ Jiwan Seo ⋅ Kiljoon Han ⋅ Minwoo Choi ⋅ Sunghoon Im
Exhibit Hall I #423
Adversarial Purification via Super-Resolution and Diffusion Poster Session 1 & Exhibit Hall
Mincheol Park ⋅ Cheonjun Park ⋅ Seungseop Lim ⋅ Mijin Koo ⋅ Hyunwuk Lee ⋅ Won Woo Ro ⋅ Suhyun Kim
Exhibit Hall I #432
FedWSQ: Efficient Federated Learning with Weight Standardization and Distribution-Aware Non-Uniform Quantization Poster Session 1 & Exhibit Hall
Seung-Wook Kim ⋅ Seongyeol Kim ⋅ Jiah Kim ⋅ Seowon Ji ⋅ Se-Ho Lee
Exhibit Hall I #433
CMAD: Correlation-Aware and Modalities-Aware Distillation for Multimodal Sentiment Analysis with Missing Modalities Poster Session 1 & Exhibit Hall
Yan Zhuang ⋅ Minhao Liu ⋅ Wei Bai ⋅ Yanru Zhang ⋅ Xiaoyue Zhang ⋅ Jiawen Deng ⋅ Fuji Ren
Exhibit Hall I #434
SimpleVQA: Multimodal Factuality Evaluation for Multimodal Large Language Models Poster Session 1 & Exhibit Hall
Xianfu Cheng ⋅ Wei Zhang ⋅ Shiwei Zhang ⋅ Jian Yang ⋅ Xiangyuan Guan ⋅ Xianjie Wu ⋅ Xiang Li ⋅ Ge Zhang ⋅ Jiaheng Liu ⋅ Yuying Mai ⋅ Yutao Zeng ⋅ Zhoufutu Wen ⋅ JinKe JinKe ⋅ Baorui Wang ⋅ Weixiao Zhou ⋅ Lu Yunhong ⋅ Hangyuan Ji ⋅ Tongliang Li ⋅ Wenhao Huang ⋅ Zhoujun Li
Exhibit Hall I #435
2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining Poster Session 1 & Exhibit Hall
Wenqi Zhang ⋅ Hang Zhang ⋅ Xin Li ⋅ Jiashuo Sun ⋅ Yongliang Shen ⋅ Weiming Lu ⋅ Deli Zhao ⋅ Yueting Zhuang ⋅ Lidong Bing
Exhibit Hall I #436
Revelio: Interpreting and leveraging semantic information in diffusion models Poster Session 1 & Exhibit Hall
Dahye Kim ⋅ Xavier Thomas ⋅ Deepti Ghadiyaram
Exhibit Hall I #437
CLIP-GS: Unifying Vision-Language Representation with 3D Gaussian Splatting Poster Session 1 & Exhibit Hall
Siyu Jiao ⋅ Haoye Dong ⋅ Yuyang Yin ⋅ ZEQUN JIE ⋅ Yinlong Qian ⋅ Yao Zhao ⋅ Humphrey Shi ⋅ Yunchao Wei
Exhibit Hall I #438
ProJudge: A Multi-Modal Multi-Discipline Benchmark and Instruction-Tuning Dataset for MLLM-based Process Judges Poster Session 1 & Exhibit Hall
Jiaxin Ai ⋅ Pengfei Zhou ⋅ xu Pan ⋅ Ming Li ⋅ Fanrui Zhang ⋅ Zizhen Li ⋅ Jianwen Sun ⋅ Yukang Feng ⋅ Baojin Huang ⋅ Zhongyuan Wang ⋅ Kaipeng Zhang
Exhibit Hall I #439
Failure Cases Are Better Learned But Boundary Says Sorry: Facilitating Smooth Perception Change for Accuracy-Robustness Trade-Off in Adversarial Training Poster Session 1 & Exhibit Hall
Yanyun Wang ⋅ Li Liu
Exhibit Hall I #440
Taming the Untamed: Graph-Based Knowledge Retrieval and Reasoning for MLLMs to Conquer the Unknown Poster Session 1 & Exhibit Hall
Bowen Wang ⋅ Zhouqiang Jiang ⋅ Yasuaki Susumu ⋅ Shotaro Miwa ⋅ Tianwei Chen ⋅ Yuta Nakashima
Exhibit Hall I #444
Causality-guided Prompt Learning for Vision-language Models via Visual Granulation Poster Session 1 & Exhibit Hall
Mengyu Gao ⋅ Qiulei Dong
Exhibit Hall I #99
MUNBa: Machine Unlearning via Nash Bargaining Poster Session 1 & Exhibit Hall
Jing Wu ⋅ Mehrtash Harandi
Exhibit Hall I #446
Auxiliary Prompt Tuning of Vision-Language Models for Few-Shot Out-of-Distribution Detection Poster Session 1 & Exhibit Hall
Wenjun Miao ⋅ Guansong Pang ⋅ Zihan Wang ⋅ Jin Zheng ⋅ Xiao Bai
Exhibit Hall I #448
Improved Noise Schedule for Diffusion Training Poster Session 1 & Exhibit Hall
Tiankai Hang ⋅ Shuyang Gu ⋅ Jianmin Bao ⋅ Fangyun Wei ⋅ Dong Chen ⋅ Xin Geng ⋅ Baining Guo
Exhibit Hall I #450
Secure On-Device Video OOD Detection Without Backpropagation Poster Session 1 & Exhibit Hall
Li Li ⋅ Peilin Cai ⋅ Yuxiao Zhou ⋅ Zhiyu Ni ⋅ Renjie Liang ⋅ QIN YOU ⋅ Yi Nian ⋅ Zhengzhong Tu ⋅ Xiyang Hu ⋅ Yue Zhao
Exhibit Hall I #1
Learning Counterfactually Decoupled Attention for Open-World Model Attribution Poster Session 1 & Exhibit Hall
Yu Zheng ⋅ Boyang Gong ⋅ Fanye Kong ⋅ Yueqi Duan ⋅ Bingyao Yu ⋅ Wenzhao Zheng ⋅ Lei Chen ⋅ Jiwen Lu ⋅ Jie Zhou
Exhibit Hall I #2
Latte: Collaborative Test-Time Adaptation of Vision-Language Models in Federated Learning Poster Session 1 & Exhibit Hall
Wenxuan Bao ⋅ Ruxi Deng ⋅ Ruizhong Qiu ⋅ Tianxin Wei ⋅ Hanghang Tong ⋅ Jingrui He
Exhibit Hall I #3
Is Less More? Exploring Token Condensation as Training-free Test-time Adaptation Poster Session 1 & Exhibit Hall
Zixin Wang ⋅ Dong Gong ⋅ Sen Wang ⋅ Zi Huang ⋅ Yadan Luo
Exhibit Hall I #4
WIPES: Wavelet-based Visual Primitives Poster Session 6 & Exhibit Hall with Coffee Break
Wenhao Zhang ⋅ Hao Zhu ⋅ Delong Wu ⋅ Di Kang ⋅ Linchao Bao ⋅ Xun Cao ⋅ Zhan Ma
Exhibit Hall I #253
Mastering Collaborative Multi-modal Data Selection: A Focus on Informativeness, Uniqueness, and Representativeness Poster Session 1 & Exhibit Hall
Qifan Yu ⋅ Zhebei Shen ⋅ Zhongqi Yue ⋅ Yang Wu ⋅ Bosheng Qin ⋅ Wenqiao Zhang ⋅ Yunfei Li ⋅ Juncheng Li ⋅ Siliang Tang ⋅ Yueting Zhuang
Exhibit Hall I #5
SMoLoRA: Exploring and Defying Dual Catastrophic Forgetting in Continual Visual Instruction Tuning Poster Session 1 & Exhibit Hall
Ziqi Wang ⋅ Chang Che ⋅ Qi Wang ⋅ Yangyang Li ⋅ Zenglin Shi ⋅ Meng Wang
Exhibit Hall I #7
Generalized Tensor-based Parameter-Efficient Fine-Tuning via Lie Group Transformations Poster Session 1 & Exhibit Hall
Chongjie Si ⋅ Zhiyi Shi ⋅ Xuehui Wang ⋅ Yichen Xiao ⋅ Xiaokang Yang ⋅ Wei Shen
Exhibit Hall I #9
Bootstrapping Grounded Chain-of-Thought in Multimodal LLMs for Data-Efficient Model Adaptation Poster Session 1 & Exhibit Hall
Jiaer Xia ⋅ Bingkui Tong ⋅ Yuhang Zang ⋅ Rui Shao ⋅ Kaiyang Zhou
Exhibit Hall I #10
One Encoder to Rule them All: Representation Learning for Model-free Visual Reinforcement Learning using Fourier Neural Operators Poster Session 1 & Exhibit Hall
Parag Dutta ⋅ Mohd Ayyoob ⋅ Shalabh Bhatnagar ⋅ Ambedkar Dukkipati
Exhibit Hall I #452
Deciphering Cross-Modal Alignment in Large Vision-Language Models via Modality Integration Rate Poster Session 1 & Exhibit Hall
Qidong Huang ⋅ Xiaoyi Dong ⋅ Pan Zhang ⋅ Yuhang Zang ⋅ Yuhang Cao ⋅ Jiaqi Wang ⋅ Weiming Zhang ⋅ Nenghai Yu
Exhibit Hall I #11
X-Fusion: Introducing New Modality to Frozen Large Language Models Poster Session 1 & Exhibit Hall
Sicheng Mo ⋅ Thao Nguyen ⋅ Xun Huang ⋅ Siddharth Iyer ⋅ Yijun Li ⋅ Yuchen Liu ⋅ Abhishek Tandon ⋅ Eli Shechtman ⋅ Krishna Kumar Singh ⋅ Yong Jae Lee ⋅ Bolei Zhou ⋅ Yuheng Li
Exhibit Hall I #12
LLaVA-KD: A Framework of Distilling Multimodal Large Language Models Poster Session 1 & Exhibit Hall
Yuxuan Cai ⋅ Jiangning Zhang ⋅ Haoyang He ⋅ Xinwei He ⋅ Ao Tong ⋅ Zhenye Gan ⋅ Chengjie Wang ⋅ Zhucun Xue ⋅ Yong Liu ⋅ Xiang Bai
Exhibit Hall I #13
Doodle Your Keypoints: Sketch-Based Few-Shot Keypoint Detection Poster Session 1 & Exhibit Hall
Subhajit Maity ⋅ Ayan Bhunia ⋅ Subhadeep Koley ⋅ Pinaki Chowdhury ⋅ Aneeshan Sain ⋅ Yi-Zhe Song
Exhibit Hall I #17
Dissecting Generalized Category Discovery: Multiplex Consensus under Self-Deconstruction Poster Session 1 & Exhibit Hall
Luyao Tang ⋅ Kunze Huang ⋅ Yuxuan Yuan ⋅ Chenxin Li ⋅ Xiaotong Tu ⋅ Xinghao Ding ⋅ Chaoqi Chen ⋅ Yue Huang
Exhibit Hall I #18
Partial Forward Blocking: A Novel Data Pruning Paradigm for Lossless Training Acceleration Poster Session 1 & Exhibit Hall
Dongyue Wu ⋅ Zilin Guo ⋅ Jialong Zuo ⋅ Nong Sang ⋅ Changxin Gao
Exhibit Hall I #20
LIFT: Latent Implicit Functions for Task- and Data-Agnostic Encoding Poster Session 1 & Exhibit Hall
Amirhossein Kazerouni ⋅ Soroush Mehraban ⋅ Michael Brudno ⋅ Babak Taati
Exhibit Hall I #453
ChartPoint: Guiding MLLMs with Grounding Reflection for Chart Reasoning Poster Session 1 & Exhibit Hall
Zhengzhuo Xu ⋅ Sinan Du ⋅ Yiyan Qi ⋅ Siwen Lu ⋅ Chengjin Xu ⋅ Chun Yuan ⋅ Jian Guo
Exhibit Hall I #30
ShortV: Efficient Multimodal Large Language Models by Freezing Visual Tokens in Ineffective Layers Poster Session 1 & Exhibit Hall
Qianhao Yuan ⋅ Qingyu Zhang ⋅ yanjiang liu ⋅ Jiawei Chen ⋅ Yaojie Lu ⋅ Hongyu Lin ⋅ Jia Zheng ⋅ Xianpei Han ⋅ Le Sun
Exhibit Hall I #21
Achieving More with Less: Additive Prompt Tuning for Rehearsal-Free Class-Incremental Learning Poster Session 1 & Exhibit Hall
Haoran Chen ⋅ Ping Wang ⋅ Zihan Zhou ⋅ Xu Zhang ⋅ Zuxuan Wu ⋅ Yu-Gang Jiang
Exhibit Hall I #22
CIARD: Cyclic Iterative Adversarial Robustness Distillation Poster Session 1 & Exhibit Hall
Liming Lu ⋅ Shuchao Pang ⋅ Xu Zheng ⋅ Xiang GU ⋅ Anan Du ⋅ Yunhuai Liu ⋅ Yongbin Zhou
Exhibit Hall I #23
MOBIUS: Big-to-Mobile Universal Instance Segmentation via Multi-modal Bottleneck Fusion and Calibrated Decoder Pruning Poster Session 5 & Exhibit Hall
Mattia Segu ⋅ Marta Tintore Gazulla ⋅ Yongqin Xian ⋅ Luc Gool ⋅ Federico Tombari
Exhibit Hall I #88
MambaML: Exploring State Space Models for Multi-Label Image Classification Poster Session 1 & Exhibit Hall
Xuelin Zhu ⋅ Jian liu ⋅ Jiuxin Cao ⋅ Bing WANG
Exhibit Hall I #445
Moderating the Generalization of Score-based Generative Model Poster Session 1 & Exhibit Hall
Wan Jiang ⋅ He Wang ⋅ Xin Zhang ⋅ Dan Guo ⋅ Zhaoxin Fan ⋅ Yunfeng Diao ⋅ Richang Hong
Exhibit Hall I #24
Scaling Language-Free Visual Representation Learning Poster Session 1 & Exhibit Hall
David Fan ⋅ Shengbang Tong ⋅ Jiachen Zhu ⋅ Koustuv Sinha ⋅ Zhuang Liu ⋅ Xinlei Chen ⋅ Michael Rabbat ⋅ Nicolas Ballas ⋅ Yann LeCun ⋅ Amir Bar ⋅ Saining Xie
Exhibit Hall I #25
Improving Noise Efficiency in Privacy-preserving Dataset Distillation Poster Session 1 & Exhibit Hall
Runkai Zheng ⋅ Vishnu Dasu ⋅ Yinong Wang ⋅ Haohan Wang ⋅ Fernando De la Torre
Exhibit Hall I #454
LLM-assisted Entropy-based Adaptive Distillation for Unsupervised Fine-grained Visual Representation Learning Poster Session 1 & Exhibit Hall
Jianfeng Dong ⋅ Danfeng Luo ⋅ Daizong Liu ⋅ Jie Sun ⋅ Xiaoye Qu ⋅ Xun Yang ⋅ Dongsheng Liu ⋅ Xun Wang
Exhibit Hall I #26
DiffRefine: Diffusion-based Proposal Specific Point Cloud Densification for Cross-Domain Object Detection Poster Session 1 & Exhibit Hall
Sangyun Shin ⋅ Yuhang He ⋅ Xinyu Hou ⋅ Samuel Hodgson ⋅ Andrew Markham ⋅ Niki Trigoni
Exhibit Hall I #459
On the Robustness Tradeoff in Fine-Tuning Poster Session 1 & Exhibit Hall
Kunyang Li ⋅ Jean-Charles Noirot Ferrand ⋅ Ryan Sheatsley ⋅ Blaine Hoak ⋅ Yohan Beugin ⋅ Eric Pauley ⋅ Patrick McDaniel
Exhibit Hall I #460
Gradient Short-Circuit: Efficient Out-of-Distribution Detection via Feature Intervention Poster Session 1 & Exhibit Hall
Jiawei Gu ⋅ Ziyue Qiao ⋅ Zechao Li
Exhibit Hall I #33
Boundary Probing for Input Privacy Protection When Using LMM Services Poster Session 1 & Exhibit Hall
Xiaofei Hui ⋅ Haoxuan Qu ⋅ Ping Hu ⋅ Hossein Rahmani ⋅ Jun Liu
Exhibit Hall I #34
Intrepretable Zero-Shot Learning with Locally-Aligned Vision-Language Model Poster Session 1 & Exhibit Hall
Shiming Chen ⋅ Bowen Duan ⋅ Salman Khan ⋅ Fahad Khan
Exhibit Hall I #35
UPRE: Zero-Shot Domain Adaptation for Object Detection via Unified Prompt and Representation Enhancement Poster Session 1 & Exhibit Hall
Xiao Zhang ⋅ Fei Wei ⋅ Yong Wang ⋅ Wenda Zhao ⋅ Feiyi Li ⋅ Xiangxiang Chu
Exhibit Hall I #38
HAMSt3R: Human-Aware Multi-view Stereo 3D Reconstruction Poster Session 2 & Exhibit Hall with Coffee Break
Sara Rojas Martinez ⋅ Matthieu Armando ⋅ Bernard Ghanem ⋅ Philippe Weinzaepfel ⋅ Vincent Leroy ⋅ Grégory Rogez
Exhibit Hall I #1
Dataset Distillation as Data Compression: A Rate-Utility Perspective Poster Session 1 & Exhibit Hall
Youneng Bao ⋅ Yiping Liu ⋅ Zhuo Chen ⋅ Yongsheng Liang ⋅ Mu Li ⋅ Kede Ma
Exhibit Hall I #39
Boosting Generative Adversarial Transferability with Self-supervised Vision Transformer Features Poster Session 1 & Exhibit Hall
Shangbo Wu ⋅ Yu-an Tan ⋅ Ruinan Ma ⋅ Wencong Ma ⋅ Dehua Zhu ⋅ Yuanzhang Li
Exhibit Hall I #40
TinyViM: Frequency Decoupling for Tiny Hybrid Vision Mamba Poster Session 5 & Exhibit Hall
Xiaowen Ma ⋅ Zhen-Liang Ni ⋅ Xinghao Chen
Exhibit Hall I #350
Open-set Cross Modal Generalization via Multimodal Unified Representation Poster Session 1 & Exhibit Hall
Hai Huang ⋅ Yan Xia ⋅ Shulei Wang ⋅ Hanting Wang ⋅ Minghui Fang ⋅ Shengpeng Ji ⋅ Sashuai Zhou ⋅ Tao Jin ⋅ Zhou Zhao
Exhibit Hall I #41
Adversarial Data Augmentation for Single Domain Generalization via Lyapunov Exponent-Guided Optimization Poster Session 1 & Exhibit Hall
ZUYU ZHANG ⋅ Ning Chen ⋅ Yongshan Liu ⋅ Qinghua Zhang ⋅ Xu Zhang
Exhibit Hall I #42
Adversarial Robust Memory-Based Continual Learner Poster Session 1 & Exhibit Hall
Xiaoyue Mi ⋅ Fan Tang ⋅ Zonghan Yang ⋅ Danding Wang ⋅ Juan Cao ⋅ Peng Li ⋅ Yang Liu
Exhibit Hall I #43
NegRefine: Refining Negative Label-Based Zero-Shot OOD Detection Poster Session 1 & Exhibit Hall
Amirhossein Ansari ⋅ Ke Wang ⋅ Pulei Xiong
Exhibit Hall I #44
Divide-and-Conquer for Enhancing Unlabeled Learning, Stability, and Plasticity in Semi-supervised Continual Learning Poster Session 1 & Exhibit Hall
Yue Duan ⋅ Taicai Chen ⋅ Lei Qi ⋅ Yinghuan Shi
Exhibit Hall I #45
A Unified Framework to BRIDGE Complete and Incomplete Deep Multi-View Clustering under Non-IID Missing Patterns Poster Session 1 & Exhibit Hall
Xiaorui Jiang ⋅ Buyun He ⋅ Peng Yuan Zhou ⋅ Xinyue Chen ⋅ Jingcai Guo ⋅ Jie Xu ⋅ Yong Liao
Exhibit Hall I #46
HumorDB: Can AI understand graphical humor? Poster Session 1 & Exhibit Hall
Vedaant V Jain ⋅ Gabriel Kreiman ⋅ Felipe Feitosa
Exhibit Hall I #47
GCAV: A Global Concept Activation Vector Framework for Cross-Layer Consistency in Interpretability Poster Session 1 & Exhibit Hall
Zhenghao He ⋅ Sanchit Sinha ⋅ Guangzhi Xiong ⋅ Aidong Zhang
Exhibit Hall I #48
Ensemble Foreground Management for Unsupervised Object Discovery Poster Session 5 & Exhibit Hall
Ziling Wu ⋅ Armaghan Moemeni ⋅ Praminda Caleb-Solly
Exhibit Hall I #44
Detect Anything 3D in the Wild Poster Session 2 & Exhibit Hall with Coffee Break
Hanxue Zhang ⋅ Haoran Jiang ⋅ Qingsong Yao ⋅ Yanan SUN ⋅ Renrui Zhang ⋅ Hao Zhao ⋅ Hongyang Li ⋅ Hongzi Zhu ⋅ Zetong Yang
Exhibit Hall I #3
Confound from All Sides, Distill with Resilience: Multi-Objective Adversarial Paths to Zero-Shot Robustness Poster Session 1 & Exhibit Hall
Junhao Dong ⋅ Jiao Liu ⋅ Xinghua Qu ⋅ YEW-SOON ONG
Exhibit Hall I #49
VolumetricSMPL: A Neural Volumetric Body Model for Efficient Interactions, Contacts, and Collisions Poster Session 2 & Exhibit Hall with Coffee Break
Marko Mihajlovic ⋅ Siwei Zhang ⋅ Gen Li ⋅ KAIFENG ZHAO ⋅ Lea Müller ⋅ Siyu Tang
Exhibit Hall I #4
Mitigating Object Hallucinations via Sentence-Level Early Intervention Poster Session 1 & Exhibit Hall
Shangpin Peng ⋅ Senqiao Yang ⋅ Li Jiang ⋅ Zhuotao Tian
Exhibit Hall I #50
Active Membership Inference Test (aMINT): Enhancing Model Auditability with Multi-Task Learning. Poster Session 1 & Exhibit Hall
Daniel DeAlcala ⋅ Aythami Morales ⋅ Julian Fierrez ⋅ Gonzalo Mancera ⋅ Ruben Tolosana ⋅ Javier Ortega-Garcia
Exhibit Hall I #51
One-Shot Knowledge Transfer for Scalable Person Re-Identification Poster Session 1 & Exhibit Hall
Longhua Li ⋅ Lei Qi ⋅ Xin Geng
Exhibit Hall I #53
ShortFT: Diffusion Model Alignment via Shortcut-based Fine-Tuning Poster Session 1 & Exhibit Hall
Xiefan Guo ⋅ Miaomiao Cui ⋅ Liefeng Bo ⋅ Di Huang
Exhibit Hall I #54
PRISM: Reducing Spurious Implicit Biases in Vision-Language Models with LLM-Guided Embedding Projection Poster Session 1 & Exhibit Hall
Mahdiyar Molahasani ⋅ Azadeh Motamedi ⋅ Michael Greenspan ⋅ Il-Min Kim ⋅ Ali Etemad
Exhibit Hall I #55
Open-Unfairness Adversarial Mitigation for Generalized Deepfake Detection Poster Session 1 & Exhibit Hall
Zhaoyang Li ⋅ Zhu Teng ⋅ Baopeng Zhang ⋅ Jianping Fan
Exhibit Hall I #56
EA-KD: Entropy-based Adaptive Knowledge Distillation Poster Session 1 & Exhibit Hall
Chi-Ping Su ⋅ Ching-Hsun Tseng ⋅ Bin Pu ⋅ Lei Zhao ⋅ Jiewen Yang ⋅ Zhuangzhuang Chen ⋅ Shin-Jye Lee
Exhibit Hall I #59
Structured Policy Optimization: Enhance Large Vision-Language Model via Self-referenced Dialogue Poster Session 1 & Exhibit Hall
Guohao Sun ⋅ Can Qin ⋅ Yihao Feng ⋅ Zeyuan Chen ⋅ Ran Xu ⋅ Sohail Dianat ⋅ MAJID RABBANI ⋅ Raghuveer Rao ⋅ Zhiqiang Tao
Exhibit Hall I #60
Seal Your Backdoor with Variational Defense Poster Session 1 & Exhibit Hall
Ivan Sabolic ⋅ Matej Grcic ⋅ Siniša Šegvić
Exhibit Hall I #61
Semi-ViM: Bidirectional State Space Model for Mitigating Label Imbalance in Semi-Supervised Learning Poster Session 1 & Exhibit Hall
Hongyang He ⋅ Hongyang Xie ⋅ Haochen You ⋅ Victor Sanchez
Exhibit Hall I #62
Integrating Task-Specific and Universal Adapters for Pre-Trained Model-based Class-Incremental Learning Poster Session 1 & Exhibit Hall
yan wang ⋅ Da-Wei Zhou ⋅ Han-Jia Ye
Exhibit Hall I #66
Contact-Aware Refinement of Human Pose Pseudo-Ground Truth via Bioimpedance Sensing Poster Session 2 & Exhibit Hall with Coffee Break
Maria-Paola Forte ⋅ Nikos Athanasiou ⋅ Giulia Ballardini ⋅ Jan Bartels ⋅ Katherine J. Kuchenbecker ⋅ Michael Black
Exhibit Hall I #5
CODE-CL: Conceptor-Based Gradient Projection for Deep Continual Learning Poster Session 1 & Exhibit Hall
Marco P. Apolinario ⋅ Sakshi Choudhary ⋅ Kaushik Roy
Exhibit Hall I #63
SAMO: A Lightweight Sharpness-Aware Approach for Multi-Task Optimization with Joint Global-Local Perturbation Poster Session 1 & Exhibit Hall
Hao Ban ⋅ Gokul Ram Subramani ⋅ Kaiyi Ji
Exhibit Hall I #64
Beyond the Limits: Overcoming Negative Correlation of Activation-Based Training-Free NAS Poster Session 1 & Exhibit Hall
Haidong Kang ⋅ Lianbo Ma ⋅ Pengjun Chen ⋅ Guo Yu ⋅ Xingwei Wang ⋅ Min Huang
Exhibit Hall I #65
Diffusion Guided Adaptive Augmentation for Generalization in Visual Reinforcement Learning Poster Session 1 & Exhibit Hall
Jeong Woon Lee ⋅ Hyoseok Hwang
Exhibit Hall I #73
I Am Big, You Are Little; I Am Right, You Are Wrong Poster Session 1 & Exhibit Hall
David A Kelly ⋅ Akchunya Chanchal ⋅ Nathan Blake
Exhibit Hall I #67
Semi-supervised Deep Transfer for Regression without Domain Alignment Poster Session 1 & Exhibit Hall
Mainak Biswas ⋅ Ambedkar Dukkipati ⋅ Devarajan Sridharan
Exhibit Hall I #68
DocThinker: Explainable Multimodal Large Language Models with Rule-based Reinforcement Learning for Document Understanding Poster Session 1 & Exhibit Hall
Wenwen Yu ⋅ Zhibo Yang ⋅ Yuliang Liu ⋅ Xiang Bai
Exhibit Hall I #69
From Easy to Hard: The MIR Benchmark for Progressive Interleaved Multi-Image Reasoning Poster Session 1 & Exhibit Hall
Hang Du ⋅ Jiayang Zhang ⋅ Guoshun Nan ⋅ Wendi Deng ⋅ Zhenyan Chen ⋅ Chenyang Zhang ⋅ Wang Xiao ⋅ Shan Huang ⋅ Yuqi Pan ⋅ Tao Qi ⋅ Sicong Leng
Exhibit Hall I #71
Fast Globally Optimal and Geometrically Consistent 3D Shape Matching Poster Session 1 & Exhibit Hall
Paul Roetzer ⋅ Florian Bernard
Exhibit Hall I #78
A Framework for Double-Blind Federated Adaptation of Foundation Models Poster Session 1 & Exhibit Hall
Nurbek Tastan ⋅ Karthik Nandakumar
Exhibit Hall I #79
VGGSounder: Audio-Visual Evaluations for Foundation Models Poster Session 1 & Exhibit Hall
Daniil Zverev ⋅ Thaddäus Wiedemer ⋅ Ameya Prabhu ⋅ Matthias Bethge ⋅ Wieland Brendel ⋅ A. Sophia Koepke
Exhibit Hall I #88
EA-Vit: Efficient Adaptation for Elastic Vision Transformer Poster Session 1 & Exhibit Hall
Chen Zhu ⋅ Wangbo Zhao ⋅ Huiwen Zhang ⋅ Yuhao Zhou ⋅ Weidong Tang ⋅ Shuo Wang ⋅ Zhihang Yuan ⋅ Yuzhang Shang ⋅ Xiaojiang Peng ⋅ Kai Wang ⋅ Dawei Yang
Exhibit Hall I #89
Web Artifact Attacks Disrupt Vision Language Models Poster Session 1 & Exhibit Hall
Maan Qraitem ⋅ Piotr Teterwak ⋅ Kate Saenko ⋅ Bryan Plummer
Exhibit Hall I #90
Feature Coding in the Era of Large Models: Dataset, Test Conditions, and Benchmark Poster Session 1 & Exhibit Hall
Changsheng Gao ⋅ Yifan Ma ⋅ Qiaoxi Chen ⋅ Xu yenan ⋅ Dong Liu ⋅ Weisi Lin
Exhibit Hall I #92
Generate, Refine, and Encode: Leveraging Synthesized Novel Samples for On-the-Fly Fine-Grained Category Discovery Poster Session 1 & Exhibit Hall
Xiao Liu ⋅ Nan Pu ⋅ Haiyang Zheng ⋅ Wenjing Li ⋅ Nicu Sebe ⋅ Zhun Zhong
Exhibit Hall I #93
MMOne: Representing Multiple Modalities in One Scene Poster Session 1 & Exhibit Hall
Zhifeng Gu ⋅ Bing WANG
Exhibit Hall I #94
MM-IFEngine: Towards Multimodal Instruction Following Poster Session 1 & Exhibit Hall
Shengyuan Ding ⋅ Wu Shenxi ⋅ Xiangyu Zhao ⋅ Yuhang Zang ⋅ Haodong Duan ⋅ Xiaoyi Dong ⋅ Pan Zhang ⋅ Yuhang Cao ⋅ Dahua Lin ⋅ Jiaqi Wang
Exhibit Hall I #95
RainbowPrompt: Diversity-Enhanced Prompt-Evolving for Continual Learning Poster Session 1 & Exhibit Hall
Kiseong Hong ⋅ Gyeong-Hyeon Kim ⋅ Eunwoo Kim
Exhibit Hall I #98
VisionMath: Vision-Form Mathematical Problem-Solving Poster Session 1 & Exhibit Hall
Zongyang Ma ⋅ Yuxin Chen ⋅ Ziqi Zhang ⋅ Zhongang Qi ⋅ Chunfeng Yuan ⋅ Shaojie Zhu ⋅ Chengxiang Zhuo ⋅ Bing Li ⋅ Ye Liu ⋅ Zang Li ⋅ Ying Shan ⋅ Weiming Hu
Exhibit Hall I #101
Dataset Distillation via the Wasserstein Metric Poster Session 1 & Exhibit Hall
Haoyang Liu ⋅ Peiran Wang ⋅ Yijiang Li ⋅ Tiancheng Xing ⋅ Vibhu Dalal ⋅ Luwei LI ⋅ Jingrui He ⋅ Haohan Wang
Exhibit Hall I #105
A Good Teacher Adapts Their Knowledge for Distillation Poster Session 1 & Exhibit Hall
Chengyao Qian ⋅ Trung Le ⋅ Mehrtash Harandi
Exhibit Hall I #108
Quanta Neural Networks: From Photons to Perception Poster Session 2 & Exhibit Hall with Coffee Break
Varun Sundar ⋅ Tianyi Zhang ⋅ Sacha Jungerman ⋅ Mohit Gupta
Exhibit Hall I #7
AdaDrive: Self-Adaptive Slow-Fast System for Language-Grounded Autonomous Driving Poster Session 2 & Exhibit Hall with Coffee Break
Ruifei Zhang ⋅ Junlin Xie ⋅ Wei Zhang ⋅ Weikai Chen ⋅ Xiao Tan ⋅ Xiang Wan ⋅ Guanbin Li
Exhibit Hall I #9
Consistent Time-of-Flight Depth Denoising via Graph-Informed Geometric Attention Poster Session 2 & Exhibit Hall with Coffee Break
Weida Wang ⋅ Changyong He ⋅ Jin Zeng ⋅ Di Qiu
Exhibit Hall I #16
SIGMAN: Scaling 3D Human Gaussian Generation with Millions of Assets Poster Session 2 & Exhibit Hall with Coffee Break
Yuhang Yang ⋅ Fengqi Liu ⋅ Yixing Lu ⋅ Qin Zhao ⋅ Pingyu Wu ⋅ Wei Zhai ⋅ Ran Yi ⋅ Yang Cao ⋅ Lizhuang Ma ⋅ Zheng-Jun Zha ⋅ Junting Dong
Exhibit Hall I #10
Depth Any Event Stream: Enhancing Event-based Monocular Depth Estimation via Dense-to-Sparse Distillation Poster Session 2 & Exhibit Hall with Coffee Break
Jinjing Zhu ⋅ Tianbo Pan ⋅ Zidong Cao ⋅ Yexin Liu ⋅ James Kwok ⋅ Hui Xiong
Exhibit Hall I #12
Evading Data Provenance in Deep Neural Networks Poster Session 1 & Exhibit Hall
Hongyu Zhu ⋅ Sichu Liang ⋅ Wenwen Wang ⋅ Zhuomeng Zhang ⋅ Fangqi Li ⋅ Shi-Lin Wang
Exhibit Hall I #109
WildSeg3D: Segment Any 3D Objects in the Wild from 2D Images Poster Session 2 & Exhibit Hall with Coffee Break
Yansong Guo ⋅ Jie Hu ⋅ Yansong Qu ⋅ Liujuan Cao
Exhibit Hall I #14
AllTracker: Efficient Dense Point Tracking at High Resolution Poster Session 2 & Exhibit Hall with Coffee Break
Adam Harley ⋅ Yang You ⋅ Yang Zheng ⋅ Xinglong Sun ⋅ Nikhil Raghuraman ⋅ Sheldon Liang ⋅ Yunqi Gu ⋅ Wen-Hsuan Chu ⋅ Suya You ⋅ Achal Dave ⋅ Rares Ambrus ⋅ Katerina Fragkiadaki ⋅ Leonidas Guibas
Exhibit Hall I #22
Extending Foundational Monocular Depth Estimators to Fisheye Cameras with Calibration Tokens Poster Session 2 & Exhibit Hall with Coffee Break
Suchisrit Gangopadhyay ⋅ Jung Hee Kim ⋅ Xien Chen ⋅ Patrick Rim ⋅ Hyoungseob Park ⋅ Alex Wong
Exhibit Hall I #17
MPBR: Multimodal Progressive Bidirectional Reasoning for Open-Set Fine-Grained Recognition Poster Session 1 & Exhibit Hall
Junfu Tan ⋅ Peiguang Jing ⋅ Yu Zhu ⋅ YU LIU
Exhibit Hall I #112
MAVias: Mitigate any Visual Bias Poster Session 1 & Exhibit Hall
Ioannis Sarridis ⋅ Christos Koutlis ⋅ Symeon Papadopoulos ⋅ Christos Diou
Exhibit Hall I #111
OpenSubstance: A High-quality Measured Dataset of Multi-View and -Lighting Images and Shapes Poster Session 2 & Exhibit Hall with Coffee Break
Fan Pei ⋅ jinchen bai ⋅ Xiang Feng ⋅ Zoubin Bi ⋅ Kun Zhou ⋅ Hongzhi Wu
Exhibit Hall I #19
DIP: Unsupervised Dense In-Context Post-training of Visual Representations Poster Session 1 & Exhibit Hall
Sophia Sirko-Galouchenko ⋅ Spyros Gidaris ⋅ Antonin Vobecky ⋅ Andrei Bursuc ⋅ Nicolas THOME
Exhibit Hall I #399
Towards Higher Effective Rank in Parameter-Efficient Fine-tuning using Khatri-Rao Product Poster Session 1 & Exhibit Hall
Paul Albert ⋅ Frederic Zhang ⋅ Hemanth Saratchandran ⋅ Anton Hengel ⋅ Ehsan Abbasnejad
Exhibit Hall I #113
PseudoMapTrainer: Learning Online Mapping without HD Maps Poster Session 2 & Exhibit Hall with Coffee Break
Christian Löwens ⋅ Thorben Funke ⋅ Jingchao Xie ⋅ Alexandru Condurache
Exhibit Hall I #23
LONG3R: Long Sequence Streaming 3D Reconstruction Poster Session 2 & Exhibit Hall with Coffee Break
Zhuoguang Chen ⋅ Minghui Qin ⋅ Tianyuan Yuan ⋅ Zhe Liu ⋅ Hang Zhao
Exhibit Hall I #24
VGMamba: Attribute-to-Location Clue Reasoning for Quantity-Agnostic 3D Visual Grounding Poster Session 2 & Exhibit Hall with Coffee Break
Zhu Yihang ⋅ Jinhao Zhang ⋅ Yuxuan Wang ⋅ Aming WU ⋅ Cheng Deng
Exhibit Hall I #26
AnnofreeOD: Detecting All Classes at Low Frame Rates Without Human Annotations Poster Session 2 & Exhibit Hall with Coffee Break
Boyi Sun ⋅ Yuhang Liu ⋅ Houxin He ⋅ Yonglin Tian ⋅ Fei-Yue Wang
Exhibit Hall I #28
Federated Continual Instruction Tuning Poster Session 1 & Exhibit Hall
Haiyang Guo ⋅ Fanhu Zeng ⋅ Fei Zhu ⋅ Wenzhuo Liu ⋅ Da-Han Wang ⋅ Jian Xu ⋅ Xu-Yao Zhang ⋅ Cheng-Lin Liu
Exhibit Hall I #116
TWIST & SCOUT: Grounding Multimodal LLM-Experts by Forget-Free Tuning Poster Session 1 & Exhibit Hall
Aritra Bhowmik ⋅ Mohammad Mahdi Derakhshani ⋅ Dennis Koelma ⋅ Yuki Asano ⋅ Martin R. Oswald ⋅ Cees Snoek
Exhibit Hall I #119
Generate, Transduct, Adapt: Iterative Transduction with VLMs Poster Session 1 & Exhibit Hall
Oindrila Saha ⋅ Logan Lawrence ⋅ Grant Horn ⋅ Subhransu Maji
Exhibit Hall I #120
BabyVLM: Data-Efficient Pretraining of VLMs Inspired by Infant Learning Poster Session 1 & Exhibit Hall
Shengao Wang ⋅ Arjun Chandra ⋅ Aoming Liu ⋅ Boqing Gong ⋅ Venkatesh Saligrama
Exhibit Hall I #121
Controlling Multimodal LLMs via Reward-guided Decoding Poster Session 1 & Exhibit Hall
Oscar Mañas ⋅ Pierluca D'Oro ⋅ Koustuv Sinha ⋅ Adriana Romero-Soriano ⋅ Michal Drozdzal ⋅ Aishwarya Agrawal
Exhibit Hall I #122
Improving Large Vision and Language Models by Learning from a Panel of Peers Poster Session 1 & Exhibit Hall
Jefferson Hernandez ⋅ Jing Shi ⋅ Simon Jenni ⋅ Vicente Ordonez ⋅ Kushal Kafle
Exhibit Hall I #123
CE-FAM: Concept-Based Explanation via Fusion of Activation Maps Poster Session 1 & Exhibit Hall
Michihiro Kuroki ⋅ Toshihiko Yamasaki
Exhibit Hall I #124
PEFTDiff: Diffusion-Guided Transferability Estimation for Parameter-Efficient Fine-Tuning Poster Session 1 & Exhibit Hall
PRAFFUL KHOBA ⋅ Zijian Wang ⋅ Chetan Arora ⋅ Mahsa Baktashmotlagh
Exhibit Hall I #128
Towards Privacy-preserved Pre-training of Remote Sensing Foundation Models with Federated Mutual-guidance Learning Poster Session 1 & Exhibit Hall
Jieyi Tan ⋅ Chengwei Zhang ⋅ Bo Dang ⋅ Yansheng Li
Exhibit Hall I #163
AVTrustBench: Assessing and Enhancing Reliability and Robustness in Audio-Visual LLMs Poster Session 1 & Exhibit Hall
Sanjoy Chowdhury ⋅ Sayan Nag ⋅ Subhrajyoti Dasgupta ⋅ Yaoting Wang ⋅ Mohamed Elhoseiny ⋅ Ruohan Gao ⋅ Dinesh Manocha
Exhibit Hall I #141
Verbalized Representation Learning for Interpretable Few-Shot Generalization Poster Session 1 & Exhibit Hall
Cheng-Fu Yang ⋅ Da Yin ⋅ Wenbo Hu ⋅ Heng Ji ⋅ Nanyun Peng ⋅ Bolei Zhou ⋅ Kai-Wei Chang
Exhibit Hall I #142
RMultiplex200K: Toward Reliable Multimodal Process Supervision for Visual Language Models on Telecommunications Poster Session 1 & Exhibit Hall
Sijia Chen ⋅ Bin Song
Exhibit Hall I #150
Equipping Vision Foundation Model with Mixture of Experts for Out-of-Distribution Detection Poster Session 1 & Exhibit Hall
Shizhen Zhao ⋅ Jiahui Liu ⋅ Xin Wen ⋅ Haoru Tan ⋅ Xiaojuan Qi
Exhibit Hall I #158
Class-Wise Federated Averaging for Efficient Personalization Poster Session 1 & Exhibit Hall
Gyuejeong Lee ⋅ Daeyoung Choi
Exhibit Hall I #160
Multi-view Gaze Target Estimation Poster Session 2 & Exhibit Hall with Coffee Break
Qiaomu Miao ⋅ Vivek Golani ⋅ Jingyi Xu ⋅ Progga Paromita Dutta ⋅ Minh Hoai ⋅ Dimitris Samaras
Exhibit Hall I #33
EFTViT: Efficient Federated Training of Vision Transformers with Masked Images on Resource-Constrained Clients Poster Session 1 & Exhibit Hall
meihan wu ⋅ Tao Chang ⋅ Cui Miao ⋅ Jie Zhou ⋅ Chun Li ⋅ Xiangyu Xu ⋅ Ming Li ⋅ Xiaodong Wang
Exhibit Hall I #164
ODP-Bench: Benchmarking Out-of-Distribution Performance Prediction Poster Session 1 & Exhibit Hall
Han Yu ⋅ Kehan Li ⋅ Dongbai Li ⋅ Yue He ⋅ Xingxuan Zhang ⋅ Peng Cui
Exhibit Hall I #167
R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization Poster Session 1 & Exhibit Hall
Jingyi Zhang ⋅ Jiaxing Huang ⋅ Huanjin Yao ⋅ Shunyu Liu ⋅ Xikun ZHANG ⋅ Shijian Lu ⋅ Dacheng Tao
Exhibit Hall I #168
Human-Object Interaction from Human-Level Instructions Poster Session 3 & Exhibit Hall
Zhen Wu ⋅ Jiaman Li ⋅ Pei Xu ⋅ Karen Liu
Exhibit Hall I #110
FG-OrIU: Towards Better Forgetting via Feature-Gradient Orthogonality for Incremental Unlearning Poster Session 1 & Exhibit Hall
qian feng ⋅ Jiahang Tu ⋅ Mintong Kang ⋅ Hanbin Zhao ⋅ Chao Zhang ⋅ Hui Qian
Exhibit Hall I #177
ViT-EnsembleAttack: Augmenting Ensemble Models for Stronger Adversarial Transferability in Vision Transformers Poster Session 1 & Exhibit Hall
Hanwen Cao ⋅ Haobo Lu ⋅ Xiaosen Wang ⋅ Kun He
Exhibit Hall I #181
Instruction-Oriented Preference Alignment for Enhancing Multi-Modal Comprehension Capability of MLLMs Poster Session 1 & Exhibit Hall
Zitian Wang ⋅ Yue Liao ⋅ RONG KANG ⋅ Fengyun Rao ⋅ Yibo Yang ⋅ Si Liu
Exhibit Hall I #182
Visual-RFT: Visual Reinforcement Fine-Tuning Poster Session 1 & Exhibit Hall
Ziyu Liu ⋅ Zeyi Sun ⋅ Yuhang Zang ⋅ Xiaoyi Dong ⋅ Yuhang Cao ⋅ Haodong Duan ⋅ Dahua Lin ⋅ Jiaqi Wang
Exhibit Hall I #184
Enhancing Transformers Through Conditioned Embedded Tokens Poster Session 1 & Exhibit Hall
Hemanth Saratchandran ⋅ Simon Lucey
Exhibit Hall I #449
Jailbreaking Multimodal Large Language Models via Shuffle Inconsistency Poster Session 1 & Exhibit Hall
Shiji Zhao ⋅ Ranjie Duan ⋅ Fengxiang Wang ⋅ Chi Chen ⋅ Caixin KANG ⋅ Shouwei Ruan ⋅ Jialing Tao ⋅ YueFeng Chen ⋅ Hui Xue ⋅ Xingxing Wei
Exhibit Hall I #185
Large Learning Rates Simultaneously Achieve Robustness to Spurious Correlations and Compressibility Poster Session 1 & Exhibit Hall
Melih Barsbey ⋅ Lucas Prieto ⋅ Stefanos Zafeiriou ⋅ Tolga Birdal
Exhibit Hall I #186
Dynamic Multi-Layer Null Space Projection for Vision-Language Continual Learning Poster Session 1 & Exhibit Hall
Borui Kang ⋅ Lei Wang ⋅ Zhiping Wu ⋅ Tao Feng ⋅ Yawen Li ⋅ Yang Gao ⋅ Wenbin Li
Exhibit Hall I #188
LLaVA-CoT: Let Vision Language Models Reason Step-by-Step Poster Session 1 & Exhibit Hall
Guowei Xu ⋅ Peng Jin ⋅ ZiangWu ZiangWu ⋅ Li Hao ⋅ Yibing Song ⋅ Lichao Sun ⋅ Li Yuan
Exhibit Hall I #189
Visual Modality Prompt for Adapting Vision-Language Object Detectors Poster Session 1 & Exhibit Hall
Heitor Rapela Medeiros ⋅ Atif Belal ⋅ Srikanth Muralidharan ⋅ Eric Granger ⋅ Marco Pedersoli
Exhibit Hall I #197
What's in a Latent? Leveraging Diffusion Latent Space for Domain Generalization Poster Session 1 & Exhibit Hall
Xavier Thomas ⋅ Deepti Ghadiyaram
Exhibit Hall I #198
Prototype Guided Backdoor Defense via Activation Space Manipulation Poster Session 1 & Exhibit Hall
Venkat Adithya Amula ⋅ Sunayana Samavedam ⋅ Saurabh Saini ⋅ Avani Gupta ⋅ P J Narayanan
Exhibit Hall I #199
RIPE: Reinforcement Learning on Unlabeled Image Pairs for Robust Keypoint Extraction Poster Session 1 & Exhibit Hall
Johannes Künzel ⋅ Anna Hilsmann ⋅ Peter Eisert
Exhibit Hall I #457
Analyzing Finetuning Representation Shift for Multimodal LLMs Steering Poster Session 1 & Exhibit Hall
Pegah KHAYATAN ⋅ Mustafa Shukor ⋅ Jayneel Parekh ⋅ Arnaud Dapogny ⋅ Matthieu Cord
Exhibit Hall I #200
Efficient Unsupervised Shortcut Learning Detection and Mitigation in Transformers Poster Session 1 & Exhibit Hall
Lukas Kuhn ⋅ sari sadiya ⋅ Jörg Schlötterer ⋅ Florian Buettner ⋅ Christin Seifert ⋅ Gemma Roig
Exhibit Hall I #201
VSP: Diagnosing the Dual Challenges of Perception and Reasoning in Spatial Planning Tasks for MLLMs Poster Session 1 & Exhibit Hall
Qiucheng Wu ⋅ Handong Zhao ⋅ Michael Saxon ⋅ Trung Bui ⋅ William Yang Wang ⋅ Yang Zhang ⋅ Shiyu Chang
Exhibit Hall I #206
Multi-Cache Enhanced Prototype Learning for Test-Time Generalization of Vision-Language Models Poster Session 1 & Exhibit Hall
Xinyu Chen ⋅ Haotian Zhai ⋅ Can Zhang ⋅ XIUPENG SHI ⋅ Ruirui Li
Exhibit Hall I #207
AVAM: a Universal Training-free Adaptive Visual Anchoring Embedded into Multimodal Large Language Model for Multi-image Question Answering Poster Session 1 & Exhibit Hall
Kang Zeng ⋅ Guojin Zhong ⋅ Jintao Cheng ⋅ Jin Yuan ⋅ Zhiyong Li
Exhibit Hall I #208
R1-Onevision: Advancing Generalized Multimodal Reasoning through Cross-Modal Formalization Poster Session 1 & Exhibit Hall
yi yang ⋅ Xiaoxuan He ⋅ Hongkun Pan ⋅ Xiyan Jiang ⋅ Yan Deng ⋅ Xingtao Yang ⋅ Haoyu Lu ⋅ Dacheng Yin ⋅ Fengyun Rao ⋅ Minfeng Zhu ⋅ Bo Zhang ⋅ Wei Chen
Exhibit Hall I #216
The Inter-Intra Modal Measure: A Predictive Lens on Fine-Tuning Outcomes in Vision-Language Models Poster Session 1 & Exhibit Hall
Laura Niss ⋅ Kevin Vogt-Lowell ⋅ Theodoros Tsiligkaridis
Exhibit Hall I #218
What to Distill? Fast Knowledge Distillation with Adaptive Sampling Poster Session 1 & Exhibit Hall
Byungchul Chae ⋅ Seonyeong Heo
Exhibit Hall I #219
Flexi-FSCIL: Adaptive Knowledge Retention for Breaking the Stability-Plasticity Dilemma in Few-Shot Class-Incremental Learning Poster Session 1 & Exhibit Hall
Wufei Xie ⋅ Yalin Wang ⋅ Chenliang Liu ⋅ Zhaohui Jiang ⋅ Xue Yang
Exhibit Hall I #223
Multispectral Demosaicing via Dual Cameras Poster Session 2 & Exhibit Hall with Coffee Break
SaiKiran Tedla ⋅ Junyong Lee ⋅ Beixuan Yang ⋅ Mahmoud Afifi ⋅ Michael Brown
Exhibit Hall I #36
Generative Modeling of Shape-Dependent Self-Contact Human Poses Poster Session 2 & Exhibit Hall with Coffee Break
Takehiko Ohkawa ⋅ Jihyun Lee ⋅ Shunsuke Saito ⋅ Jason Saragih ⋅ Fabian Prada ⋅ Yichen Xu ⋅ Shoou-I Yu ⋅ Ryosuke Furuta ⋅ Yoichi Sato ⋅ Takaaki Shiratori
Exhibit Hall I #38
Met2Net: A Decoupled Two-Stage Spatio-Temporal Forecasting Model for Complex Meteorological Systems Poster Session 2 & Exhibit Hall with Coffee Break
Shaohan Li ⋅ Hao Yang ⋅ Min Chen ⋅ Xiaolin Qin
Exhibit Hall I #41
TriDi: Trilateral Diffusion of 3D Humans, Objects, and Interactions Poster Session 2 & Exhibit Hall with Coffee Break
Ilya A. Petrov ⋅ Riccardo Marin ⋅ Julian Chibane ⋅ Gerard Pons-Moll
Exhibit Hall I #47
Beyond RGB: Adaptive Parallel Processing for RAW Object Detection Poster Session 2 & Exhibit Hall with Coffee Break
Shani Gamrian ⋅ Hila Barel ⋅ Feiran Li ⋅ Masakazu Yoshimura ⋅ Daisuke Iso
Exhibit Hall I #49
egoPPG: Heart Rate Estimation from Eye-Tracking Cameras in Egocentric Systems to Benefit Downstream Vision Tasks Poster Session 2 & Exhibit Hall with Coffee Break
Björn Braun ⋅ Rayan Armani ⋅ Manuel Meier ⋅ Max Moebus ⋅ Christian Holz
Exhibit Hall I #52
PoseSyn: Synthesizing Diverse 3D Pose Data from In-the-Wild 2D Data Poster Session 2 & Exhibit Hall with Coffee Break
CHANGHEE YANG ⋅ Hyeonseop Song ⋅ Seokhun Choi ⋅ Seungwoo Lee ⋅ Jaechul Kim ⋅ Hoseok Do
Exhibit Hall I #55
Diffusion-Based Extreme High-speed Scenes Reconstruction with the Complementary Vision Sensor Poster Session 2 & Exhibit Hall with Coffee Break
Yapeng Meng ⋅ Yihan Lin ⋅ Taoyi Wang ⋅ Yuguo Chen ⋅ Lijian Wang ⋅ Rong Zhao
Exhibit Hall I #63
TorchAdapt: Towards Light-Agnostic Real-Time Visual Perception Poster Session 2 & Exhibit Hall with Coffee Break
Khurram Azeem Hashmi ⋅ Karthik Suresh ⋅ Didier Stricker ⋅ Muhammad Zeshan Afzal
Exhibit Hall I #58
Human-in-the-Loop Local Corrections of 3D Scene Layouts via Infilling Poster Session 2 & Exhibit Hall with Coffee Break
Christopher Xie ⋅ Armen Avetisyan ⋅ Henry Howard-Jenkins ⋅ Yawar Siddiqui ⋅ Julian Straub ⋅ Richard Newcombe ⋅ Vasileios Balntas ⋅ Jakob Engel
Exhibit Hall I #59
POMATO: Marrying Pointmap Matching with Temporal Motions for Dynamic 3D Reconstruction Poster Session 2 & Exhibit Hall with Coffee Break
Songyan Zhang ⋅ Yongtao Ge ⋅ Jinyuan Tian ⋅ Guangkai Xu ⋅ Hao Chen ⋅ Chen Lv ⋅ Chunhua Shen
Exhibit Hall I #61
Boosting Class Representation via Semantically Related Instances for Robust Long-Tailed Learning with Noisy Labels Poster Session 1 & Exhibit Hall
Yuhang Li ⋅ Zhuying Li ⋅ Yuheng Jia
Exhibit Hall I #134
CAT: A Unified Click-and-Track Framework for Realistic Tracking Poster Session 2 & Exhibit Hall with Coffee Break
Yongsheng Yuan ⋅ Jie Zhao ⋅ Dong Wang ⋅ Huchuan Lu
Exhibit Hall I #62
DepR: Depth Guided Single-view Scene Reconstruction with Instance-level Diffusion Poster Session 2 & Exhibit Hall with Coffee Break
Qingcheng Zhao ⋅ Xiang Zhang ⋅ Haiyang Xu ⋅ Zeyuan Chen ⋅ Jianwen Xie ⋅ Yuan Gao ⋅ Zhuowen Tu
Exhibit Hall I #65
Invisible Watermarks, Visible Gains: Steering Machine Unlearning with Bi-Level Watermarking Design Poster Session 1 & Exhibit Hall
Yuhao Sun ⋅ Yihua Zhang ⋅ Gaowen Liu ⋅ Hongtao Xie ⋅ Sijia Liu
Exhibit Hall I #220
DiffuMatch: Category-Agnostic Spectral Diffusion Priors for Robust Non-rigid Shape Matching Poster Session 2 & Exhibit Hall with Coffee Break
Emery Pierson ⋅ Lei Li ⋅ Angela Dai ⋅ Maks Ovsjanikov
Exhibit Hall I #67
SAC-GNC: SAmple Consensus for adaptive Graduated Non-Convexity Poster Session 2 & Exhibit Hall with Coffee Break
Valter Piedade ⋅ Chitturi Sidhartha ⋅ José Gaspar ⋅ Venu Madhav Govindu ⋅ Pedro Miraldo
Exhibit Hall I #70
AstroLoc: Robust Space to Ground Image Localizer Poster Session 2 & Exhibit Hall with Coffee Break
Gabriele Berton ⋅ Alex Stoken ⋅ Carlo Masone
Exhibit Hall I #73
Do It Yourself: Learning Semantic Correspondence from Pseudo-Labels Poster Session 2 & Exhibit Hall with Coffee Break
Olaf Dünkel ⋅ Thomas Wimmer ⋅ Christian Theobalt ⋅ Christian Rupprecht ⋅ Adam Kortylewski
Exhibit Hall I #77
Stochastic Interpolants for Revealing Stylistic Flows across the History of Art Poster Session 2 & Exhibit Hall with Coffee Break
Pingchuan Ma ⋅ Ming Gui ⋅ Johannes Schusterbauer ⋅ Xiaopei Yang ⋅ Olga Grebenkova ⋅ Vincent Tao Hu ⋅ Björn Ommer
Exhibit Hall I #80
Is Tracking really more challenging in First Person Egocentric Vision? Poster Session 2 & Exhibit Hall with Coffee Break
Matteo Dunnhofer ⋅ Zaira Manigrasso ⋅ Christian Micheloni
Exhibit Hall I #81
VLDrive: Vision-Augmented Lightweight MLLMs for Efficient Language-grounded Autonomous Driving Poster Session 2 & Exhibit Hall with Coffee Break
Ruifei Zhang ⋅ Wei Zhang ⋅ Xiao Tan ⋅ Sibei Yang ⋅ Xiang Wan ⋅ Xiaonan Luo ⋅ Guanbin Li
Exhibit Hall I #85
Rethinking Cross-Modal Interaction in Multimodal Diffusion Transformers Poster Session 2 & Exhibit Hall with Coffee Break
Zhengyao Lyu ⋅ Tianlin Pan ⋅ Chenyang Si ⋅ Zhaoxi Chen ⋅ Wangmeng Zuo ⋅ Ziwei Liu ⋅ Kwan-Yee K. Wong
Exhibit Hall I #86
Toward Material-Agnostic System Identification from Videos Poster Session 2 & Exhibit Hall with Coffee Break
Yizhou Zhao ⋅ Haoyu Chen ⋅ Chunjiang Liu ⋅ Zhenyang Li ⋅ Charles Herrmann ⋅ Junhwa Hur ⋅ Yinxiao Li ⋅ Ming-Hsuan Yang ⋅ Bhiksha Raj ⋅ Min Xu
Exhibit Hall I #87
MagicHOI: Leveraging 3D Priors for Accurate Hand-object Reconstruction from Short Monocular Video Clips Poster Session 2 & Exhibit Hall with Coffee Break
SHIBO WANG ⋅ Haonan He ⋅ Maria Parelli ⋅ Christoph Gebhardt ⋅ Zicong Fan ⋅ Jie Song
Exhibit Hall I #88
Boosting Multi-View Indoor 3D Object Detection via Adaptive 3D Volume Construction Poster Session 2 & Exhibit Hall with Coffee Break
Runmin Zhang ⋅ Zhu Yu ⋅ Si-Yuan Cao ⋅ Lingyu Zhu ⋅ Guangyi Zhang ⋅ Xiaokai Bai ⋅ Hui-liang Shen
Exhibit Hall I #90
ETA: Energy-based Test-time Adaptation for Depth Completion Poster Session 2 & Exhibit Hall with Coffee Break
Younjoon Chung ⋅ Hyoungseob Park ⋅ Patrick Rim ⋅ Xiaoran Zhang ⋅ Jihe He ⋅ Ziyao Zeng ⋅ Safa Cicek ⋅ Byung-Woo Hong ⋅ James Duncan ⋅ Alex Wong
Exhibit Hall I #92
CoTracker3: Simpler and Better Point Tracking by Pseudo-Labelling Real Videos Poster Session 2 & Exhibit Hall with Coffee Break
Nikita Karaev ⋅ Iurii Makarov ⋅ Jianyuan Wang ⋅ Natalia Neverova ⋅ Andrea Vedaldi ⋅ Christian Rupprecht
Exhibit Hall I #93
SceneMI: Motion In-betweening for Modeling Human-Scene Interaction Poster Session 2 & Exhibit Hall with Coffee Break
Inwoo Hwang ⋅ Bing Zhou ⋅ Young Min Kim ⋅ Jian Wang ⋅ chuan guo
Exhibit Hall I #95
DEPTHOR: Depth Enhancement from a Practical Light-Weight dToF Sensor and RGB Image Poster Session 2 & Exhibit Hall with Coffee Break
Jijun Xiang ⋅ Xuan Zhu ⋅ Xianqi Wang ⋅ Yu Wang ⋅ Hong Zhang ⋅ Fei Guo ⋅ Xin Yang
Exhibit Hall I #101
GeoExplorer: Active Geo-localization with Curiosity-Driven Exploration Poster Session 2 & Exhibit Hall with Coffee Break
Li Mi ⋅ Manon Béchaz ⋅ Zeming Chen ⋅ Antoine Bosselut ⋅ Devis Tuia
Exhibit Hall I #103
ROADWork: A Dataset and Benchmark for Learning to Recognize, Observe, Analyze and Drive Through Work Zones Poster Session 2 & Exhibit Hall with Coffee Break
Anurag Ghosh ⋅ Shen Zheng ⋅ Robert Tamburo ⋅ Khiem Vuong ⋅ Juan Alvarez-Padilla ⋅ Hailiang Zhu ⋅ Nicholas Dunn ⋅ Michael Cardei ⋅ Christoph Mertz ⋅ Srinivasa Narasimhan
Exhibit Hall I #104
RoMo: Robust Motion Segmentation Improves Structure from Motion Poster Session 2 & Exhibit Hall with Coffee Break
Lily Goli ⋅ Sara Sabour ⋅ Mark Matthews ⋅ Marcus Brubaker ⋅ Dmitry Lagun ⋅ Alec Jacobson ⋅ David Fleet ⋅ Saurabh Saxena ⋅ Andrea Tagliasacchi
Exhibit Hall I #106
Hints of Prompt: Enhancing Visual Representation for Multimodal LLMs in Autonomous Driving Poster Session 2 & Exhibit Hall with Coffee Break
Hao Zhou ⋅ Zhanning Gao ⋅ Zhili Chen ⋅ Maosheng Ye ⋅ Qifeng Chen ⋅ Tongyi Cao ⋅ Honggang Qi
Exhibit Hall I #107
Learning Large Motion Estimation from Intermediate Representations with a High-Resolution Optical Flow Dataset Featuring Long-Range Dynamic Motion Poster Session 2 & Exhibit Hall with Coffee Break
Hoonhee Cho ⋅ Yuhwan Jeong ⋅ Kuk-Jin Yoon
Exhibit Hall I #108
Robust Low-light Scene Restoration via Illumination Transition Poster Session 2 & Exhibit Hall with Coffee Break
Ze Li ⋅ Feng Zhang ⋅ Xiatian Zhu ⋅ Zhang Meng ⋅ Yanghong Zhou ⋅ P.Y. Mok
Exhibit Hall I #109
Towards Real Unsupervised Anomaly Detection Via Confident Meta-Learning Poster Session 1 & Exhibit Hall
Muhammad Aqeel ⋅ Shakiba Sharifi ⋅ Marco Cristani ⋅ Francesco Setti
Exhibit Hall I #456
CCMNet: Leveraging Calibrated Color Correction Matrices for Cross-Camera Color Constancy Poster Session 2 & Exhibit Hall with Coffee Break
Dongyoung Kim ⋅ Mahmoud Afifi ⋅ Dongyun Kim ⋅ Michael Brown ⋅ Seon Joo Kim
Exhibit Hall I #110
MGSfM: Multi-Camera Geometry Driven Global Structure-from-Motion Poster Session 2 & Exhibit Hall with Coffee Break
peilin Tao ⋅ Hainan Cui ⋅ Diantao Tu ⋅ Shuhan Shen
Exhibit Hall I #20
UrbanLLaVA: A Multi-modal Large Language Model for Urban Intelligence Poster Session 2 & Exhibit Hall with Coffee Break
Jie Feng ⋅ Shengyuan Wang ⋅ Tianhui Liu ⋅ Yanxin Xi ⋅ Yong Li
Exhibit Hall I #111
Zero-shot Inexact CAD Model Alignment from a Single Image Poster Session 2 & Exhibit Hall with Coffee Break
Pattaramanee Arsomngern ⋅ Sasikarn Khwanmuang ⋅ Matthias Nießner ⋅ Supasorn Suwajanakorn
Exhibit Hall I #113
HazeFlow: Revisit Haze Physical Model as ODE and Non-Homogeneous Haze Generation for Real-World Dehazing Poster Session 2 & Exhibit Hall with Coffee Break
Junseong Shin ⋅ Seungwoo Chung ⋅ Yunjeong Yang ⋅ Tae Hyun Kim
Exhibit Hall I #116
Motal: Unsupervised 3D Object Detection by Modality and Task-specific Knowledge Transfer Poster Session 2 & Exhibit Hall with Coffee Break
Hai Wu ⋅ Hongwei Lin ⋅ Xusheng Guo ⋅ Xin Li ⋅ Mingming Wang ⋅ Cheng Wang ⋅ Chenglu Wen
Exhibit Hall I #118
Dual-Rate Dynamic Teacher for Source-Free Domain Adaptive Object Detection Poster Session 1 & Exhibit Hall
Qi He ⋅ Xiao Wu ⋅ Jun-Yan He ⋅ Shuai Li
Exhibit Hall I #187
DeGauss: Dynamic-Static Decomposition with Gaussian Splatting for Distractor-free 3D Reconstruction Poster Session 2 & Exhibit Hall with Coffee Break
Rui Wang ⋅ Quentin Lohmeyer ⋅ Mirko Meboldt ⋅ Siyu Tang
Exhibit Hall I #119
Scaling Inference-Time Search with Vision Value Model for Improved Visual Comprehension Poster Session 1 & Exhibit Hall
Xiyao Wang ⋅ Zhengyuan Yang ⋅ Linjie Li ⋅ Hongjin Lu ⋅ Yuancheng Xu ⋅ Chung-Ching Lin ⋅ Kevin Lin ⋅ Furong Huang ⋅ Lijuan Wang
Exhibit Hall I #102
Manual-PA: Learning 3D Part Assembly from Instruction Diagrams Poster Session 2 & Exhibit Hall with Coffee Break
Jiahao Zhang ⋅ Anoop Cherian ⋅ Cristian Rodriguez-Opazo ⋅ Weijian Deng ⋅ Stephen Gould
Exhibit Hall I #120
MoMa-Kitchen: A 100K+ Benchmark for Affordance-Grounded Last-Mile Navigation in Mobile Manipulation Poster Session 2 & Exhibit Hall with Coffee Break
Pingrui Zhang ⋅ Xianqiang Gao ⋅ Yuhan Wu ⋅ Kehui Liu ⋅ Dong Wang ⋅ Zhigang Wang ⋅ Bin Zhao ⋅ Yan Ding ⋅ Xuelong Li
Exhibit Hall I #121
NavQ: Learning a Q-Model for Foresighted Vision-and-Language Navigation Poster Session 2 & Exhibit Hall with Coffee Break
Peiran Xu ⋅ Xicheng Gong ⋅ Yadong Mu
Exhibit Hall I #122
Embodied VideoAgent: Persistent Memory from Egocentric Videos and Embodied Sensors Enables Dynamic Scene Understanding Poster Session 2 & Exhibit Hall with Coffee Break
Yue Fan ⋅ Xiaojian Ma ⋅ Rongpeng Su ⋅ Jun Guo ⋅ Rujie Wu ⋅ Xi Chen ⋅ Qing Li
Exhibit Hall I #123
LightsOut: Diffusion-based Outpainting for Enhanced Lens Flare Removal Poster Session 2 & Exhibit Hall with Coffee Break
Shr-Ruei Tsai ⋅ Wei-Cheng Chang ⋅ Jie-Ying Lee ⋅ Chih-Hai Su ⋅ Yu-Lun Liu
Exhibit Hall I #124
Rethinking Multi-modal Object Detection from the Perspective of Mono-Modality Feature Learning Poster Session 2 & Exhibit Hall with Coffee Break
Tianyi Zhao ⋅ Boyang Liu ⋅ Yanglei Gao ⋅ Yiming Sun ⋅ Maoxun Yuan ⋅ Xingxing Wei
Exhibit Hall I #125
GeoDiffusion: A Training-Free Framework for Accurate 3D Geometric Conditioning in Image Generation Poster Session 2 & Exhibit Hall with Coffee Break
Phillip Mueller ⋅ Talip Ünlü ⋅ Sebastian Schmidt ⋅ Marcel Kollovieh ⋅ Jiajie Fan ⋅ Stephan Günnemann ⋅ Lars Mikelsons
Exhibit Hall I #126
OVA-Fields: Weakly Supervised Open-Vocabulary Affordance Fields for Robot Operational Part Detection Poster Session 2 & Exhibit Hall with Coffee Break
Heng Su ⋅ Mengying Xie ⋅ Nieqing Cao ⋅ Yan Ding ⋅ Beichen Shao ⋅ Xianlei Long ⋅ Fuqiang Gu ⋅ Chao Chen
Exhibit Hall I #127
Arti-PG: A Toolbox for Procedurally Synthesizing Large-Scale and Diverse Articulated Objects with Rich Annotations Poster Session 2 & Exhibit Hall with Coffee Break
Jianhua Sun ⋅ Yuxuan Li ⋅ Jiude Wei ⋅ Xu Longfei ⋅ Wang Nange ⋅ Yining Zhang ⋅ Cewu Lu
Exhibit Hall I #128
Scaling 3D Compositional Models for Robust Classification and Pose Estimation Poster Session 2 & Exhibit Hall with Coffee Break
Xiaoding Yuan ⋅ Prakhar Kaushik ⋅ Guofeng Zhang ⋅ Artur Jesslen ⋅ Adam Kortylewski ⋅ Alan Yuille
Exhibit Hall I #129
RoboTron-Nav: A Unified Framework for Embodied Navigation Integrating Perception, Planning, and Prediction Poster Session 2 & Exhibit Hall with Coffee Break
Yufeng Zhong ⋅ Chengjian Feng ⋅ Feng yan ⋅ Fanfan Liu ⋅ Liming Zheng ⋅ Lin Ma
Exhibit Hall I #130
Corvid: Improving Multimodal Large Language Models Towards Chain-of-Thought Reasoning Poster Session 1 & Exhibit Hall
Jingjing Jiang ⋅ Chao Ma ⋅ Xurui Song ⋅ Hanwang Zhang ⋅ Jun Luo
Exhibit Hall I #280
DAMap: Distance-aware MapNet for High Quality HD Map Construction Poster Session 2 & Exhibit Hall with Coffee Break
JINPENG DONG ⋅ Chen Li ⋅ Yutong Lin ⋅ Jingwen Fu ⋅ Sanping Zhou ⋅ Nanning Zheng
Exhibit Hall I #25
X-Capture: An Open-Source Portable Device for Multi-Sensory Learning Poster Session 2 & Exhibit Hall with Coffee Break
Samuel Clarke ⋅ Suzannah Wistreich ⋅ Yanjie Ze ⋅ Jiajun Wu
Exhibit Hall I #132
DRaM-LHM: A Quaternion Framework for Iterative Camera Pose Estimation Poster Session 2 & Exhibit Hall with Coffee Break
Chen Lin ⋅ Weizhi Du ⋅ Zhixiang Min ⋅ Baochen She ⋅ Enrique Dunn ⋅ Sonya Hanson
Exhibit Hall I #133
Focal Plane Visual Feature Generation and Matching on a Pixel Processor Array Poster Session 6 & Exhibit Hall with Coffee Break
Hongyi Zhang ⋅ Laurie Bose ⋅ Jianing Chen ⋅ Piotr Dudek ⋅ Walterio Mayol-Cuevas
Exhibit Hall I #415
VoteSplat: Hough Voting Gaussian Splatting for 3D Scene Understanding Poster Session 2 & Exhibit Hall with Coffee Break
Minchao Jiang ⋅ Shunyu Jia ⋅ Jiaming Gu ⋅ Xiaoyuan Lu ⋅ Guangming Zhu ⋅ Anqi Dong ⋅ zhang liang
Exhibit Hall I #134
Prior-aware Dynamic Temporal Modeling Framework for Sequential 3D Hand Pose Estimation Poster Session 2 & Exhibit Hall with Coffee Break
Pengfei Ren ⋅ Jingyu Wang ⋅ Haifeng Sun ⋅ Qi Qi ⋅ Xingyu Liu ⋅ Menghao Zhang ⋅ Lei Zhang ⋅ Jing Wang ⋅ Jianxin Liao
Exhibit Hall I #136
Epipolar Consistent Attention Aggregation Network for Unsupervised Light Field Disparity Estimation Poster Session 2 & Exhibit Hall with Coffee Break
Chen Gao ⋅ Shuo Zhang ⋅ Youfang Lin
Exhibit Hall I #137
ATLAS: Decoupling Skeletal and Shape Parameters for Expressive Parametric Human Modeling Poster Session 2 & Exhibit Hall with Coffee Break
Jinhyung Park ⋅ Javier Romero ⋅ Shunsuke Saito ⋅ Fabian Prada ⋅ Takaaki Shiratori ⋅ Yichen Xu ⋅ Federica Bogo ⋅ Shoou-I Yu ⋅ Kris Kitani ⋅ Rawal Khirodkar
Exhibit Hall I #139
On the Generalization of Representation Uncertainty in Earth Observation Poster Session 2 & Exhibit Hall with Coffee Break
Spyros Kondylatos ⋅ Nikolaos Ioannis Bountos ⋅ Dimitrios Michail ⋅ Xiao Xiang Zhu ⋅ Gustau Camps-Valls ⋅ Ioannis Papoutsis
Exhibit Hall I #143
Predict-Optimize-Distill: A Self-Improving Cycle for 4D Object Understanding Poster Session 2 & Exhibit Hall with Coffee Break
Mingxuan Wu ⋅ Huang Huang ⋅ Justin Kerr ⋅ Chung Min Kim ⋅ Anthony Zhang ⋅ Brent Yi ⋅ Angjoo Kanazawa
Exhibit Hall I #145
Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data and Metric Perspectives Poster Session 2 & Exhibit Hall with Coffee Break
Shaoyuan Xie ⋅ Lingdong Kong ⋅ Yuhao Dong ⋅ Chonghao Sima ⋅ Wenwei Zhang ⋅ Qi Alfred Chen ⋅ Ziwei Liu ⋅ Liang Pan
Exhibit Hall I #148
Humans as a Calibration Pattern: Dynamic 3D Scene Reconstruction from Unsynchronized and Uncalibrated Videos Poster Session 2 & Exhibit Hall with Coffee Break
Changwoon Choi ⋅ Jeongjun Kim ⋅ Geonho Cha ⋅ Minkwan Kim ⋅ Dongyoon Wee ⋅ Young Min Kim
Exhibit Hall I #149
GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors Poster Session 2 & Exhibit Hall with Coffee Break
Tian-Xing Xu ⋅ Xiangjun Gao ⋅ Wenbo Hu ⋅ Xiaoyu Li ⋅ Song-Hai Zhang ⋅ Ying Shan
Exhibit Hall I #153
Hybrid-grained Feature Aggregation with Coare-to-fine Language Guidance for Self-supervised Monocular Depth Estimation Poster Session 2 & Exhibit Hall with Coffee Break
Wenyao Zhang ⋅ Hongsi Liu ⋅ Bohan Li ⋅ Jiawei He ⋅ Zekun Qi ⋅ Yunnan Wang ⋅ Eastern Institute of Technology Shengyang ⋅ Ningbo Institute Of Digital Twin XinQiang ⋅ Galbot Wenjun ⋅ Eastern Institute for Advanced Study Xin
Exhibit Hall I #157
Zero-AVSR: Zero-Shot Audio-Visual Speech Recognition with LLMs by Learning Language-Agnostic Speech Representations Poster Session 2 & Exhibit Hall with Coffee Break
Jeong Hun Yeo ⋅ Minsu Kim ⋅ Chae Won Kim ⋅ Stavros Petridis ⋅ Yong Man Ro
Exhibit Hall I #158
Jigsaw++: Imagining Complete Shape Priors for Object Reassembly Poster Session 2 & Exhibit Hall with Coffee Break
Jiaxin Lu ⋅ Gang Hua ⋅ Qixing Huang
Exhibit Hall I #159
Seeing and Seeing Through the Glass: Real and Synthetic Data for Multi-Layer Depth Estimation Poster Session 2 & Exhibit Hall with Coffee Break
Hongyu Wen ⋅ Yiming Zuo ⋅ Venkat Subramanian ⋅ Patrick Chen ⋅ Jia Deng
Exhibit Hall I #160
SpatialTrackerV2: Advancing 3D Point Tracking with Explicit Camera Motion Poster Session 2 & Exhibit Hall with Coffee Break
Yuxi Xiao ⋅ Jianyuan Wang ⋅ Nan Xue ⋅ Nikita Karaev ⋅ Iurii Makarov ⋅ Bingyi Kang ⋅ Xing Zhu ⋅ Hujun Bao ⋅ Yujun Shen ⋅ Xiaowei Zhou
Exhibit Hall I #161
A Simple yet Mighty Hartley Diffusion Versatilist for Generalizable Dense Vision Tasks Poster Session 2 & Exhibit Hall with Coffee Break
Qi Bi ⋅ Jingjun Yi ⋅ Huimin Huang ⋅ Hao Zheng ⋅ Haolan Zhan ⋅ Wei Ji ⋅ Yawen Huang ⋅ Yuexiang Li ⋅ Yefeng Zheng
Exhibit Hall I #163
IGL-Nav: Incremental 3D Gaussian Localization for Image-goal Navigation Poster Session 2 & Exhibit Hall with Coffee Break
Wenxuan Guo ⋅ Xiuwei Xu ⋅ Hang Yin ⋅ Ziwei Wang ⋅ Jianjiang Feng ⋅ Jie Zhou ⋅ Jiwen Lu
Exhibit Hall I #168
AR-VRM: Imitating Human Motions for Visual Robot Manipulation with Analogical Reasoning Poster Session 2 & Exhibit Hall with Coffee Break
Dejie Yang ⋅ Zijing Zhao ⋅ Yang Liu
Exhibit Hall I #169
Unleashing the Temporal Potential of Stereo Event Cameras for Continuous-Time 3D Object Detection Poster Session 2 & Exhibit Hall with Coffee Break
Jae Young Kang ⋅ Hoonhee Cho ⋅ Kuk-Jin Yoon
Exhibit Hall I #174
PlaneRAS: Learning Planar Primitives for 3D Plane Recovery Poster Session 2 & Exhibit Hall with Coffee Break
Fang Zhang ⋅ Wenzhao Zheng ⋅ Linqing Zhao ⋅ Zelan Zhu ⋅ Jiwen Lu ⋅ Xiuzhuang Zhou
Exhibit Hall I #175
FedVLA: Federated Vision-Language-Action Learning with Dual Gating Mixture-of-Experts for Robotic Manipulation Poster Session 2 & Exhibit Hall with Coffee Break
Cui Miao ⋅ Tao Chang ⋅ meihan wu ⋅ Hongbin Xu ⋅ Chun Li ⋅ Ming Li ⋅ Xiaodong Wang
Exhibit Hall I #177
3DSRBench: A Comprehensive 3D Spatial Reasoning Benchmark Poster Session 2 & Exhibit Hall with Coffee Break
Wufei Ma ⋅ Haoyu Chen ⋅ Guofeng Zhang ⋅ Yu-Cheng Chou ⋅ Celso de Melo ⋅ Alan Yuille ⋅ Jieneng Chen
Exhibit Hall I #179
3DGraphLLM: Combining Semantic Graphs and Large Language Models for 3D Scene Understanding Poster Session 2 & Exhibit Hall with Coffee Break
Tatiana Zemskova ⋅ Dmitry Yudin
Exhibit Hall I #363
TAR3D: Creating High-Quality 3D Assets via Next-Part Prediction Poster Session 2 & Exhibit Hall with Coffee Break
Xuying Zhang ⋅ Yutong Liu ⋅ Yangguang Li ⋅ Renrui Zhang ⋅ Yufei Liu ⋅ Kai Wang ⋅ Wanli Ouyang ⋅ Zhiwei Xiong ⋅ Peng Gao ⋅ Qibin Hou ⋅ Ming-Ming Cheng
Exhibit Hall I #11
Exploring the Adversarial Vulnerabilities of Vision-Language-Action Models in Robotics Poster Session 2 & Exhibit Hall with Coffee Break
Taowen Wang ⋅ Cheng Han ⋅ James Liang ⋅ Wenhao Yang ⋅ Dongfang Liu ⋅ Luna Zhang ⋅ Qifan Wang ⋅ Jiebo Luo ⋅ Ruixiang Tang
Exhibit Hall I #181
Simultaneous Motion And Noise Estimation with Event Cameras Poster Session 2 & Exhibit Hall with Coffee Break
Shintaro Shiba ⋅ Yoshimitsu Aoki ⋅ Guillermo Gallego
Exhibit Hall I #182
Layer-wise Vision Injection with Disentangled Attention for Efficient LVLMs Poster Session 2 & Exhibit Hall with Coffee Break
Xuange Zhang ⋅ Dengjie Li ⋅ Bo Liu ⋅ Zenghao Bao ⋅ Yao Zhou ⋅ Baisong Yang ⋅ liuzhongying liuzhongying ⋅ Yujie Zhong ⋅ Tongtong Yuan
Exhibit Hall I #186
CMT: A Cascade MAR with Topology Predictor for Multimodal Conditional CAD Generation Poster Session 2 & Exhibit Hall with Coffee Break
Jianyu Wu ⋅ Yizhou Wang ⋅ Xiangyu Yue ⋅ Xinzhu Ma ⋅ Jinyang Guo ⋅ Dongzhan Zhou ⋅ Wanli Ouyang ⋅ SHIXIANG TANG
Exhibit Hall I #187
StableDepth: Scene-Consistent and Scale-Invariant Monocular Depth Poster Session 2 & Exhibit Hall with Coffee Break
Zheng Zhang ⋅ Lihe Yang ⋅ Tianyu Yang ⋅ Chaohui Yu ⋅ Xiaoyang Guo ⋅ Yixing Lao ⋅ Hengshuang Zhao
Exhibit Hall I #192
4DSegStreamer: Streaming 4D Panoptic Segmentation via Dual Threads Poster Session 2 & Exhibit Hall with Coffee Break
Ling Liu ⋅ Jun Tian ⋅ Li Yi
Exhibit Hall I #194
Color Matching Using Hypernetwork-Based Kolmogorov-Arnold Networks Poster Session 2 & Exhibit Hall with Coffee Break
Artem Nikonorov ⋅ Georgy Perevozchikov ⋅ Andrei Korepanov ⋅ Nancy Mehta ⋅ Mahmoud Afifi ⋅ Egor Ershov ⋅ Radu Timofte
Exhibit Hall I #195
HccePose (BF): Predicting Front & Back Surfaces to Construct Ultra-Dense 2D-3D Correspondences for Pose Estimation Poster Session 2 & Exhibit Hall with Coffee Break
Yulin Wang ⋅ Mengting Hu ⋅ Hongli Li ⋅ Chen LUO
Exhibit Hall I #201
GaussianVideo: Efficient Video Representation via Hierarchical Gaussian Splatting Poster Session 2 & Exhibit Hall with Coffee Break
Andrew Bond ⋅ Jui-Hsien Wang ⋅ Long Mai ⋅ Erkut Erdem ⋅ Aykut Erdem
Exhibit Hall I #203
PhysTwin: Physics-Informed Reconstruction and Simulation of Deformable Objects from Videos Poster Session 2 & Exhibit Hall with Coffee Break
Hanxiao Jiang ⋅ Hao-Yu Hsu ⋅ Kaifeng Zhang ⋅ Hsin-Ni Yu ⋅ Shenlong Wang ⋅ Yunzhu Li
Exhibit Hall I #206
GaussianProperty: Integrating Physical Properties to 3D Gaussians with LMMs Poster Session 2 & Exhibit Hall with Coffee Break
Xinli Xu ⋅ Wenhang Ge ⋅ Dicong Qiu ⋅ ZhiFei Chen ⋅ Dongyu Yan ⋅ Zhuoyun LIU ⋅ Haoyu Zhao ⋅ hanfeng Zhao ⋅ Shunsi Zhang ⋅ Junwei Liang ⋅ Ying-Cong Chen
Exhibit Hall I #207
Enhancing Image Restoration Transformer via Adaptive Translation Equivariance Poster Session 4 & Exhibit Hall with Coffee Break
JiaKui Hu ⋅ Zhengjian Yao ⋅ Lujia Jin ⋅ Hangzhou He ⋅ Yanye Lu
Exhibit Hall I #110
Frequency-Aligned Knowledge Distillation for Lightweight Spatiotemporal Forecasting Poster Session 2 & Exhibit Hall with Coffee Break
Yuqi Li ⋅ Chuanguang Yang ⋅ Hansheng Zeng ⋅ Zeyu Dong ⋅ Zhulin An ⋅ Yongjun Xu ⋅ Yingli Tian ⋅ Hao Wu
Exhibit Hall I #210
CAD-Assistant: Tool-Augmented VLLMs as Generic CAD Task Solvers Poster Session 2 & Exhibit Hall with Coffee Break
Dimitrios Mallis ⋅ Ahmet Karadeniz ⋅ Sebastian Cavada ⋅ Danila Rukhovich ⋅ Niki Foteinopoulou ⋅ Kseniya Cherenkova ⋅ Anis Kacem ⋅ Djamila Aouada
Exhibit Hall I #212
Dynamic Point Maps: A Versatile Representation for Dynamic 3D Reconstruction Poster Session 2 & Exhibit Hall with Coffee Break
Edgar Sucar ⋅ Zihang Lai ⋅ Eldar Insafutdinov ⋅ Andrea Vedaldi
Exhibit Hall I #213
Physics Context Builders: A Modular Framework for Physical Reasoning in Vision-Language Models Poster Session 2 & Exhibit Hall with Coffee Break
Vahid Balazadeh ⋅ Mohammadmehdi Ataei ⋅ Hyunmin Cheong ⋅ Amir Khasahmadi ⋅ Rahul Krishnan
Exhibit Hall I #215
VOccl3D: A Video Benchmark Dataset for 3D Human Pose and Shape Estimation under real Occlusions Poster Session 2 & Exhibit Hall with Coffee Break
Yash Garg ⋅ Saketh Bachu ⋅ Arindam Dutta ⋅ Rohit Lal ⋅ Sarosij Bose ⋅ Calvin-Khang Ta ⋅ M. Salman Asif ⋅ Amit Roy-Chowdhury
Exhibit Hall I #218
Tracking Tiny Drones against Clutter: Large-Scale Infrared Benchmark with Motion-Centric Adaptive Algorithm Poster Session 2 & Exhibit Hall with Coffee Break
Jiahao Zhang ⋅ Zongli Jiang ⋅ Gang Wang ⋅ Jinli Zhang ⋅ Yixin Wei ⋅ Liang Li ⋅ Yizheng Wang
Exhibit Hall I #219
MM-Spatial: Exploring 3D Spatial Understanding in Multimodal LLMs Poster Session 2 & Exhibit Hall with Coffee Break
Erik Daxberger ⋅ Nina Wenzel ⋅ David Griffiths ⋅ Haiming Gang ⋅ Justin Lazarow ⋅ Gefen Kohavi ⋅ Kai Kang ⋅ Marcin Eichner ⋅ Yinfei Yang ⋅ Afshin Dehghan ⋅ Peter Grasch
Exhibit Hall I #222
AutoComPose: Automatic Generation of Pose Transition Descriptions for Composed Pose Retrieval Using Multimodal LLMs Poster Session 2 & Exhibit Hall with Coffee Break
Yi-Ting Shen ⋅ Sungmin Eum ⋅ Doheon Lee ⋅ Rohit Shete ⋅ Chiao-Yi Wang ⋅ Heesung Kwon ⋅ Shuvra Bhattacharyya
Exhibit Hall I #223
Understanding Flatness in Generative Models: Its Role and Benefits Poster Session 1 & Exhibit Hall
Taehwan Lee ⋅ Kyeongkook Seo ⋅ Jaejun Yoo ⋅ Sung Whan Yoon
Exhibit Hall I #461
Image-Guided Shape-from-Template Using Mesh Inextensibility Constraints Poster Session 2 & Exhibit Hall with Coffee Break
Dinh-Vinh-Thuy Tran ⋅ Ruochen Chen ⋅ Shaifali Parashar
Exhibit Hall I #224
3D-MOOD: Lifting 2D to 3D for Monocular Open-Set Object Detection Poster Session 2 & Exhibit Hall with Coffee Break
Yung-Hsu Yang ⋅ Luigi Piccinelli ⋅ Mattia Segu ⋅ Siyuan Li ⋅ Rui Huang ⋅ Yuqian Fu ⋅ Marc Pollefeys ⋅ Hermann Blum ⋅ Zuria Bauer
Exhibit Hall I #227
LUDVIG: Learning-Free Uplifting of 2D Visual Features to Gaussian Splatting Scenes Poster Session 2 & Exhibit Hall with Coffee Break
Juliette Marrie ⋅ Romain Menegaux ⋅ Michael Arbel ⋅ Diane Larlus ⋅ Julien Mairal
Exhibit Hall I #228
VOVTrack: Exploring the Potentiality in Raw Videos for Open-Vocabulary Multi-Object Tracking Poster Session 2 & Exhibit Hall with Coffee Break
Zekun Qian ⋅ Ruize Han ⋅ Junhui Hou ⋅ Linqi Song ⋅ Wei Feng
Exhibit Hall I #231
PHD: Personalized 3D Human Body Fitting with Point Diffusion Poster Session 2 & Exhibit Hall with Coffee Break
Hsuan-I Ho ⋅ Chen Guo ⋅ Po-Chen Wu ⋅ Ivan Shugurov ⋅ Chengcheng Tang ⋅ Abhay Mittal ⋅ Sizhe An ⋅ Manuel Kaufmann ⋅ Linguang Zhang
Exhibit Hall I #236
Frequency Domain-Based Diffusion Model for Unpaired Image Dehazing Poster Session 2 & Exhibit Hall with Coffee Break
Chengxu Liu ⋅ Lu Qi ⋅ Jinshan Pan ⋅ Xueming Qian ⋅ Ming-Hsuan Yang
Exhibit Hall I #237
Language Driven Occupancy Prediction Poster Session 2 & Exhibit Hall with Coffee Break
Zhu Yu ⋅ Bowen Pang ⋅ Lizhe Liu ⋅ Runmin Zhang ⋅ Qiang Li ⋅ Si-Yuan Cao ⋅ Maochun Luo ⋅ Mingxia Chen ⋅ Sheng Yang ⋅ Hui-liang Shen
Exhibit Hall I #238
C4D: 4D Made from 3D through Dual Correspondences Poster Session 2 & Exhibit Hall with Coffee Break
Shizun Wang ⋅ Zhenxiang Jiang ⋅ Xingyi Yang ⋅ Xinchao Wang
Exhibit Hall I #240
ScoreHOI: Physically Plausible Reconstruction of Human-Object Interaction via Score-Guided Diffusion Poster Session 2 & Exhibit Hall with Coffee Break
AO LI ⋅ Jinpeng Liu ⋅ Yixuan Zhu ⋅ Yansong Tang
Exhibit Hall I #242
Estimating 2D Camera Motion with Hybrid Motion Basis Poster Session 2 & Exhibit Hall with Coffee Break
Haipeng Li ⋅ Tianhao Zhou ⋅ Zhanglei Yang ⋅ WuYi WuYi ⋅ Chen Yan ⋅ Zijing Mao ⋅ Shen Cheng ⋅ Bing Zeng ⋅ Shuaicheng Liu
Exhibit Hall I #245
AgroBench: Vision-Language Model Benchmark in Agriculture Poster Session 2 & Exhibit Hall with Coffee Break
Risa Shinoda ⋅ Nakamasa Inoue ⋅ Hirokatsu Kataoka ⋅ Masaki Onishi ⋅ Yoshitaka Ushiku
Exhibit Hall I #246
Princeton365: A Diverse Dataset with Accurate Camera Pose Poster Session 2 & Exhibit Hall with Coffee Break
Karhan Kayan ⋅ Stamatis Alexandropoulos ⋅ Rishabh Jain ⋅ Yiming Zuo ⋅ Erich Liang ⋅ Jia Deng
Exhibit Hall I #247
H3R: Hybrid Multi-view Correspondence for Generalizable 3D Reconstruction Poster Session 2 & Exhibit Hall with Coffee Break
Heng Jia ⋅ Na Zhao ⋅ Linchao Zhu
Exhibit Hall I #248
After the Party: Navigating the Mapping From Color to Ambient Lighting Poster Session 2 & Exhibit Hall with Coffee Break
Florin-Alexandru Vasluianu ⋅ Tim Seizinger ⋅ Zongwei Wu ⋅ Radu Timofte
Exhibit Hall I #395
From Abyssal Darkness to Blinding Glare: A Benchmark on Extreme Exposure Correction in Real World Poster Session 2 & Exhibit Hall with Coffee Break
Bo Wang ⋅ Huiyuan Fu ⋅ Zhiye Huang ⋅ Siru Zhang ⋅ Xin Wang ⋅ Huadong Ma
Exhibit Hall I #249
Dita: Scaling Diffusion Transformer for Generalist Vision-Language-Action Policy Poster Session 2 & Exhibit Hall with Coffee Break
Zhi Hou ⋅ Tianyi Zhang ⋅ Yuwen Xiong ⋅ Haonan Duan ⋅ Hengjun Pu ⋅ Ronglei Tong ⋅ Chengyang Zhao ⋅ Xizhou Zhu ⋅ Yu Qiao ⋅ Jifeng Dai ⋅ Yuntao Chen
Exhibit Hall I #251
Voyaging into Perpetual Dynamic Scenes from a Single View Poster Session 2 & Exhibit Hall with Coffee Break
Fengrui Tian ⋅ Tianjiao Ding ⋅ Jinqi Luo ⋅ Hancheng Min ⋅ Rene Vidal
Exhibit Hall I #252
Learnable Feature Patches and Vectors for Boosting Low-light Image Enhancement without External Knowledge Poster Session 2 & Exhibit Hall with Coffee Break
Xiaogang Xu ⋅ Jiafei Wu ⋅ Qingsen Yan ⋅ Jiequan Cui ⋅ Richang Hong ⋅ Bei Yu
Exhibit Hall I #258
TESPEC: Temporally-Enhanced Self-Supervised Pretraining for Event Cameras Poster Session 2 & Exhibit Hall with Coffee Break
Mohammad Mohammadi ⋅ Ziyi Wu ⋅ Igor Gilitschenski
Exhibit Hall I #260
CL-Splats: Continual Learning of Gaussian Splatting with Local Optimization Poster Session 2 & Exhibit Hall with Coffee Break
Jan Ackermann ⋅ Jonas Kulhanek ⋅ Shengqu Cai ⋅ Haofei Xu ⋅ Marc Pollefeys ⋅ Gordon Wetzstein ⋅ Leonidas Guibas ⋅ Songyou Peng
Exhibit Hall I #262
Find Any Part in 3D Poster Session 2 & Exhibit Hall with Coffee Break
Ziqi Ma ⋅ Yisong Yue ⋅ Georgia Gkioxari
Exhibit Hall I #263
Learning 3D Scene Analogies with Neural Contextual Scene Maps Poster Session 2 & Exhibit Hall with Coffee Break
Junho Kim ⋅ Gwangtak Bae ⋅ Eun Sun Lee ⋅ Young Min Kim
Exhibit Hall I #264
GausSim: Foreseeing Reality by Gaussian Simulator for Elastic Objects Poster Session 2 & Exhibit Hall with Coffee Break
Yidi Shao ⋅ Mu Huang ⋅ Chen Change Loy ⋅ Bo Dai
Exhibit Hall I #265
Global Motion Corresponder for 3D Point-Based Scene Interpolation under Large Motion Poster Session 2 & Exhibit Hall with Coffee Break
Junru Lin ⋅ Chirag Vashist ⋅ Mikaela Uy ⋅ Colton Stearns ⋅ Xuan Luo ⋅ Leonidas Guibas ⋅ Ke Li
Exhibit Hall I #269
AdvDreamer Unveils: Are Vision-Language Models Truly Ready for Real-World 3D Variations? Poster Session 2 & Exhibit Hall with Coffee Break
Shouwei Ruan ⋅ Hanqing Liu ⋅ Yao Huang ⋅ XIaoqi Wang ⋅ Caixin KANG ⋅ Hang Su ⋅ Yinpeng Dong ⋅ Xingxing Wei
Exhibit Hall I #270
SpikeDiff: Zero-shot High-Quality Video Reconstruction from Chromatic Spike Camera and Sub-millisecond Spike Streams Poster Session 2 & Exhibit Hall with Coffee Break
Siqi Yang ⋅ Jinxiu Liang ⋅ Zhaojun Huang ⋅ Yeliduosi Xiaokaiti ⋅ Yakun Chang ⋅ Zhaofei Yu ⋅ Boxin Shi
Exhibit Hall I #271
VA-MoE: Variables-Adaptive Mixture of Experts for Incremental Weather Forecasting Poster Session 2 & Exhibit Hall with Coffee Break
Hao Chen ⋅ Tao Han ⋅ Song Guo ⋅ Jie ZHANG ⋅ Yonghan Dong ⋅ Yunlong Yu ⋅ LEI BAI
Exhibit Hall I #272
AJAHR: Amputated Joint Aware 3D Human Mesh Recovery Poster Session 2 & Exhibit Hall with Coffee Break
hyunjin cho ⋅ Giyun choi ⋅ Jongwon Choi
Exhibit Hall I #273
EquiCaps: Predictor-Free Pose-Aware Pre-Trained Capsule Networks Poster Session 2 & Exhibit Hall with Coffee Break
Athinoulla Konstantinou ⋅ Georgios Leontidis ⋅ Mamatha Thota ⋅ Aiden Durrant
Exhibit Hall I #275
A Structure-aware and Motion-adaptive Framework for 3D Human Pose Estimation with Mamba Poster Session 2 & Exhibit Hall with Coffee Break
Ye Lu ⋅ Jie Wang ⋅ Jianjun Gao ⋅ Rui Gong ⋅ Chen Cai ⋅ Kim-Hui Yap
Exhibit Hall I #276
Unsupervised Joint Learning of Optical Flow and Intensity with Event Cameras Poster Session 2 & Exhibit Hall with Coffee Break
Shuang Guo ⋅ Friedhelm Hamann ⋅ Guillermo Gallego
Exhibit Hall I #278
CAPTURE: Evaluating Spatial Reasoning in Vision Language Models via Occluded Object Counting Poster Session 2 & Exhibit Hall with Coffee Break
Atin Pothiraj ⋅ Jaemin Cho ⋅ Elias Stengel-Eskin ⋅ Mohit Bansal
Exhibit Hall I #280
RoboTron-Drive: All-in-One Large Multimodal Model for Autonomous Driving Poster Session 2 & Exhibit Hall with Coffee Break
Zhijian Huang ⋅ Chengjian Feng ⋅ Baihui Xiao ⋅ Feng yan ⋅ ZEQUN JIE ⋅ Yujie Zhong ⋅ Xiaodan Liang ⋅ Lin Ma
Exhibit Hall I #281
6DOPE-GS: Online 6D Object Pose Estimation using Gaussian Splatting Poster Session 2 & Exhibit Hall with Coffee Break
Yufeng Jin ⋅ Vignesh Prasad ⋅ Snehal Jauhri ⋅ Mathias Franzius ⋅ Georgia Chalvatzaki
Exhibit Hall I #283
AnyCalib: On-Manifold Learning for Model-Agnostic Single-View Camera Calibration Poster Session 2 & Exhibit Hall with Coffee Break
Javier Tirado-Garín ⋅ Javier Civera
Exhibit Hall I #284
Background Invariance Testing According to Semantic Proximity Poster Session 2 & Exhibit Hall with Coffee Break
Zukang Liao ⋅ Min Chen
Exhibit Hall I #285
NuPlanQA: A Large-Scale Dataset and Benchmark for Multi-View Driving Scene Understanding in Multi-Modal Large Language Models Poster Session 2 & Exhibit Hall with Coffee Break
Sung-Yeon Park ⋅ Can Cui ⋅ Yunsheng Ma ⋅ Ahmadreza Moradipari ⋅ Rohit Gupta ⋅ Kyungtae Han ⋅ Ziran Wang
Exhibit Hall I #286
One Look is Enough: Seamless Patchwise Refinement for Zero-Shot Monocular Depth Estimation on High-Resolution Images Poster Session 2 & Exhibit Hall with Coffee Break
Byeongjun Kwon ⋅ Munchurl Kim
Exhibit Hall I #287
Adapting Vehicle Detectors for Aerial Imagery to Unseen Domains with Weak Supervision Poster Session 2 & Exhibit Hall with Coffee Break
Xiao Fang ⋅ Minhyek Jeon ⋅ Zheyang Qin ⋅ Stanislav Panev ⋅ Celso de Melo ⋅ Shuowen Hu ⋅ Shayok Chakraborty ⋅ Fernando De la Torre
Exhibit Hall I #288
RegGS: Unposed Sparse Views Gaussian Splatting with 3DGS Registration Poster Session 2 & Exhibit Hall with Coffee Break
Chong Cheng ⋅ Yu Hu ⋅ Sicheng Yu ⋅ Beizhen ZHAO ⋅ Zijian Wang ⋅ Hao Wang
Exhibit Hall I #289
PersPose: 3D Human Pose Estimation with Perspective Encoding and Perspective Rotation Poster Session 2 & Exhibit Hall with Coffee Break
Xiaoyang Hao ⋅ Han Li
Exhibit Hall I #290
Training-Free Generation of Temporally Consistent Rewards from VLMs Poster Session 2 & Exhibit Hall with Coffee Break
Yinuo Zhao ⋅ Jiale Yuan ⋅ Zhiyuan Xu ⋅ Xiaoshuai Hao ⋅ Xinyi Zhang ⋅ Kun Wu ⋅ Zhengping Che ⋅ Chi Liu ⋅ Jian Tang
Exhibit Hall I #292
MEMFOF: High-Resolution Training for Memory-Efficient Multi-Frame Optical Flow Estimation Poster Session 2 & Exhibit Hall with Coffee Break
Vladislav Bargatin ⋅ Egor Chistov ⋅ Alexander Yakovenko ⋅ Dmitriy Vatolin
Exhibit Hall I #297
Breaking Rectangular Shackles: Cross-View Object Segmentation for Fine-Grained Object Geo-Localization Poster Session 2 & Exhibit Hall with Coffee Break
Qingwang Zhang ⋅ Yingying Zhu
Exhibit Hall I #298
TopicGeo: An Efficient Unified Framework for Geolocation Poster Session 2 & Exhibit Hall with Coffee Break
Xin Wang ⋅ Xinlin Wang ⋅ Shuiping Gou
Exhibit Hall I #302
ETCH: Generalizing Body Fitting to Clothed Humans via Equivariant Tightness Poster Session 2 & Exhibit Hall with Coffee Break
Boqian Li ⋅ Zeyu Cai ⋅ Michael Black ⋅ Haiwen Feng ⋅ Yuliang Xiu
Exhibit Hall I #306
Revisiting Image Fusion for Multi-Illuminant White-Balance Correction Poster Session 2 & Exhibit Hall with Coffee Break
David Serrano ⋅ Aditya Arora ⋅ Luis Herranz ⋅ Kosta Derpanis ⋅ Michael Brown ⋅ Javier Vazquez-Corral
Exhibit Hall I #307
Partially Matching Submap Helps: Uncetainty Modeling and Propagation for Text to Point Cloud Localization Poster Session 2 & Exhibit Hall with Coffee Break
Mingtao Feng ⋅ Longlong Mei ⋅ Zijie Wu ⋅ Jianqiao Luo ⋅ Fenghao Tian ⋅ Jie Feng ⋅ Weisheng Dong ⋅ Yaonan Wang
Exhibit Hall I #309
Medical World Model Poster Session 2 & Exhibit Hall with Coffee Break
Yijun Yang ⋅ Zhao-Yang Wang ⋅ Qiuping Liu ⋅ Shu Wen Sun ⋅ Kang Wang ⋅ Rama Chellappa ⋅ Zongwei Zhou ⋅ Alan Yuille ⋅ Lei Zhu ⋅ Yu-Dong Zhang ⋅ Jieneng Chen
Exhibit Hall I #311
NormalCrafter: Learning Temporally Consistent Normals from Video Diffusion Priors Poster Session 2 & Exhibit Hall with Coffee Break
Yanrui Bin ⋅ Wenbo Hu ⋅ Haoyuan Wang ⋅ Xinya Chen ⋅ Bing WANG
Exhibit Hall I #312
DuCos: Duality Constrained Depth Super-Resolution via Foundation Model Poster Session 2 & Exhibit Hall with Coffee Break
Zhiqiang Yan ⋅ Zhengxue Wang ⋅ Haoye Dong ⋅ Jun Li ⋅ Jian Yang ⋅ Gim Hee Lee
Exhibit Hall I #315
MaskHand: Generative Masked Modeling for Robust Hand Mesh Reconstruction in the Wild Poster Session 2 & Exhibit Hall with Coffee Break
Muhammad Usama Saleem ⋅ Ekkasit Pinyoanuntapong ⋅ Mayur Patel ⋅ Hongfei Xue ⋅ Ahmed Helmy ⋅ Srijan Das ⋅ Pu Wang
Exhibit Hall I #316
Passing the Driving Knowledge Test Poster Session 2 & Exhibit Hall with Coffee Break
Maolin Wei ⋅ Wanzhou Liu ⋅ Eshed Ohn-Bar
Exhibit Hall I #318
Uncertainty-Aware Gradient Stabilization for Small Object Detection Poster Session 2 & Exhibit Hall with Coffee Break
Huixin Sun ⋅ Yanjing Li ⋅ Linlin Yang ⋅ Xianbin Cao ⋅ Baochang Zhang
Exhibit Hall I #319
Can Generative Geospatial Diffusion Models Excel as Discriminative Geospatial Foundation Models? Poster Session 2 & Exhibit Hall with Coffee Break
Yuru Jia ⋅ Valerio Marsocci ⋅ Ziyang Gong ⋅ Xue Yang ⋅ Maarten Vergauwen ⋅ Andrea Nascetti
Exhibit Hall I #321
4D Visual Pre-training for Robot Learning Poster Session 2 & Exhibit Hall with Coffee Break
Chengkai Hou ⋅ Yanjie Ze ⋅ Yankai Fu ⋅ Zeyu Gao ⋅ Songbo Hu ⋅ Yue Yu ⋅ Shanghang Zhang ⋅ Huazhe Xu
Exhibit Hall I #323
Fish2Mesh Transformer: 3D Human Mesh Recovery from Egocentric Vision Poster Session 2 & Exhibit Hall with Coffee Break
Tianma Shen ⋅ Aditya Shrish Puranik ⋅ James Vong ⋅ Vrushabh Deogirikar ⋅ Ryan Fell ⋅ Julianna Dietrich ⋅ Maria Kyrarini ⋅ Christopher Kitts ⋅ David Jeong
Exhibit Hall I #138
CryoFastAR: Fast Cryo-EM Ab initio Reconstruction Made Easy Poster Session 2 & Exhibit Hall with Coffee Break
Jiakai Zhang ⋅ Shouchen Zhou ⋅ Haizhao Dai ⋅ Xinhang Liu ⋅ Peihao Wang ⋅ Zhiwen Fan ⋅ Yuan Pei ⋅ Jingyi Yu
Exhibit Hall I #324
Beyond Pixel Uncertainty: Bounding the OoD Objects in Road Scenes Poster Session 2 & Exhibit Hall with Coffee Break
Huachao Zhu ⋅ Zelong Liu ⋅ Zhichao Sun ⋅ Yuda Zou ⋅ Gui-Song Xia ⋅ Yongchao Xu
Exhibit Hall I #325
HoliTracer: Holistic Vectorization of Geographic Objects from Large-Size Remote Sensing Imagery Poster Session 2 & Exhibit Hall with Coffee Break
Yu Wang ⋅ Bo Dang ⋅ Wanchun Li ⋅ Wei Chen ⋅ Yansheng Li
Exhibit Hall I #326
DialNav: Multi-turn Dialog Navigation with a Remote Guide Poster Session 2 & Exhibit Hall with Coffee Break
Leekyeung Han ⋅ Hyunji Min ⋅ Gyeom Hwangbo ⋅ Jonghyun Choi ⋅ Paul Hongsuck Seo
Exhibit Hall I #329
TaxaDiffusion: Progressively Trained Diffusion Model for Fine-Grained Species Generation Poster Session 2 & Exhibit Hall with Coffee Break
Amin Karimi Monsefi ⋅ Mridul Khurana ⋅ Rajiv Ramnath ⋅ Anuj Karpatne ⋅ Wei-Lun (Harry) Chao ⋅ Cheng Zhang
Exhibit Hall I #335
VLM4D: Towards Spatiotemporal Awareness in Vision Language Models Poster Session 2 & Exhibit Hall with Coffee Break
Shijie Zhou ⋅ Alexander Vilesov ⋅ Xuehai He ⋅ Ziyu Wan ⋅ Shuwang Zhang ⋅ Aditya Nagachandra ⋅ Di Chang ⋅ Dongdong Chen ⋅ Xin Wang ⋅ Achuta Kadambi
Exhibit Hall I #337
Spatial Alignment and Temporal Matching Adapter for Video-Radar Remote Physiological Measurement Poster Session 2 & Exhibit Hall with Coffee Break
Qian Liang ⋅ Ruixu Geng ⋅ Jinbo Chen ⋅ Haoyu Wang ⋅ Yan Chen ⋅ Yang Hu
Exhibit Hall I #339
Bias in Gender Bias Benchmarks: How Spurious Features Distort Evaluation Poster Session 2 & Exhibit Hall with Coffee Break
Yusuke Hirota ⋅ Ryo Hachiuma ⋅ Boyi Li ⋅ Ximing Lu ⋅ Michael Boone ⋅ Boris Ivanovic ⋅ Yejin Choi ⋅ Marco Pavone ⋅ Yu-Chiang Frank Wang ⋅ Noa Garcia ⋅ Yuta Nakashima ⋅ Chao-Han Yang
Exhibit Hall I #340
AGO: Adaptive Grounding for Open World 3D Occupancy Prediction Poster Session 2 & Exhibit Hall with Coffee Break
Peizheng Li ⋅ Shuxiao Ding ⋅ You Zhou ⋅ Qingwen Zhang ⋅ Onat Inak ⋅ Larissa Triess ⋅ Niklas Hanselmann ⋅ Marius Cordts ⋅ Andreas Zell
Exhibit Hall I #341
Environment-Agnostic Pose: Generating Environment-independent Object Representations for 6D Pose Estimation Poster Session 2 & Exhibit Hall with Coffee Break
Shaobo Zhang ⋅ Yuhang Huang ⋅ Wanqing Zhao ⋅ Wei Zhao ⋅ Ziyu Guan ⋅ Jinye Peng
Exhibit Hall I #344
OpenM3D: Open Vocabulary Multi-view Indoor 3D Object Detection without Human Annotations Poster Session 2 & Exhibit Hall with Coffee Break
Peng-Hao Hsu ⋅ Ke Zhang ⋅ Fu-En Wang ⋅ Tao Tu ⋅ Ming-Feng Li ⋅ Yu-Lun Liu ⋅ Albert Y. C. Chen ⋅ Min Sun ⋅ Cheng-Hao Kuo
Exhibit Hall I #345
Online Dense Point Tracking with Streaming Memory Poster Session 2 & Exhibit Hall with Coffee Break
Qiaole Dong ⋅ Yanwei Fu
Exhibit Hall I #347
MaGS: Reconstructing and Simulating Dynamic 3D Objects with Mesh-adsorbed Gaussian Splatting Poster Session 2 & Exhibit Hall with Coffee Break
Shaojie Ma ⋅ Yawei Luo ⋅ Wei Yang ⋅ Yi Yang
Exhibit Hall I #350
GM-MoE: Low-Light Enhancement with Gated-Mechanism Mixture-of-Experts Poster Session 2 & Exhibit Hall with Coffee Break
Minwen Liao ⋅ Hao Dong ⋅ Xinyi Wang ⋅ Kurban Ubul ⋅ Ziyang Yan ⋅ Yihua Shao
Exhibit Hall I #352
CHARM3R: Towards Unseen Camera Height Robust Monocular 3D Detector Poster Session 2 & Exhibit Hall with Coffee Break
Abhinav Kumar ⋅ Yuliang Guo ⋅ Zhihao Zhang ⋅ Xinyu Huang ⋅ Liu Ren ⋅ Xiaoming Liu
Exhibit Hall I #353
Test-Time Retrieval-Augmented Adaptation for Vision-Language Models Poster Session 2 & Exhibit Hall with Coffee Break
Xinqi Fan ⋅ Xueli CHEN ⋅ Luoxiao Yang ⋅ Chuin Hong Yap ⋅ Rizwan Qureshi ⋅ Qi Dou ⋅ Moi Hoon Yap ⋅ Mubarak Shah
Exhibit Hall I #356
RnGCam: High-speed video from rolling & global shutter measurements Poster Session 2 & Exhibit Hall with Coffee Break
Kevin Tandi ⋅ Xiang Dai ⋅ Chinmay Talegaonkar ⋅ Gal Mishne ⋅ Nicholas Antipa
Exhibit Hall I #358
Self-Supervised Monocular 4D Scene Reconstruction for Egocentric Videos Poster Session 2 & Exhibit Hall with Coffee Break
Chengbo Yuan ⋅ Geng Chen ⋅ Li Yi ⋅ Yang Gao
Exhibit Hall I #361
Diorama: Unleashing Zero-shot Single-view 3D Indoor Scene Modeling Poster Session 2 & Exhibit Hall with Coffee Break
Qirui Wu ⋅ Denys Iliash ⋅ Daniel Ritchie ⋅ Manolis Savva ⋅ Angel Chang
Exhibit Hall I #364
Bokehlicious: Photorealistic Bokeh Rendering with Controllable Apertures Poster Session 2 & Exhibit Hall with Coffee Break
Tim Seizinger ⋅ Florin-Alexandru Vasluianu ⋅ Marcos Conde ⋅ Zongwei Wu ⋅ Radu Timofte
Exhibit Hall I #365
Hybrid-TTA: Continual Test-time Adaptation via Dynamic Domain Shift Detection Poster Session 1 & Exhibit Hall
Hyewon Park ⋅ Hyejin Park ⋅ Jueun Ko ⋅ Dongbo Min
Exhibit Hall I #265
Learning on the Go: A Meta-learning Object Navigation Model Poster Session 2 & Exhibit Hall with Coffee Break
Xiaorong Qin ⋅ Xinhang Song ⋅ Sixian Zhang ⋅ Xinyao Yu ⋅ Xinmiao Zhang ⋅ Shuqiang Jiang
Exhibit Hall I #368
Unlocking Constraints: Source-Free Occlusion-Aware Seamless Segmentation Poster Session 2 & Exhibit Hall with Coffee Break
Yihong Cao ⋅ Jiaming Zhang ⋅ Xu Zheng ⋅ Hao Shi ⋅ Kunyu Peng ⋅ Hang Liu ⋅ Kailun Yang ⋅ Hui Zhang
Exhibit Hall I #370
3D Gaussian Map with Open-Set Semantic Grouping for Vision-Language Navigation Poster Session 2 & Exhibit Hall with Coffee Break
Jianzhe Gao ⋅ Rui Liu ⋅ Wenguan Wang
Exhibit Hall I #398
ProGait: A Multi-Purpose Video Dataset and Benchmark for Transfemoral Prosthesis Users Poster Session 2 & Exhibit Hall with Coffee Break
Xiangyu Yin ⋅ Boyuan Yang ⋅ Weichen Liu ⋅ Qiyao Xue ⋅ Abrar Alamri ⋅ Goeran Fiedler ⋅ Wei Gao
Exhibit Hall I #372
MixRI: Mixing Features of Reference Images for Novel Object Pose Estimation Poster Session 2 & Exhibit Hall with Coffee Break
Xinhang Liu ⋅ Jiawei Shi ⋅ Zheng Dang ⋅ Yuchao Dai
Exhibit Hall I #376
ReassembleNet: Learnable Keypoints and Diffusion for 2D Fresco Reconstruction Poster Session 2 & Exhibit Hall with Coffee Break
ADEELA ISLAM ⋅ Stefano Fiorini ⋅ Stuart James ⋅ Pietro Morerio ⋅ ALESSIO DEL BUE
Exhibit Hall I #378
WonderPlay: Dynamic 3D Scene Generation from a Single Image and Actions Poster Session 2 & Exhibit Hall with Coffee Break
Zizhang Li ⋅ Hong-Xing Yu ⋅ Wei Liu ⋅ Yin Yang ⋅ Charles Herrmann ⋅ Gordon Wetzstein ⋅ Jiajun Wu
Exhibit Hall I #383
OMNI-DC: Highly Robust Depth Completion with Multiresolution Depth Integration Poster Session 2 & Exhibit Hall with Coffee Break
Yiming Zuo ⋅ Willow Yang ⋅ Zeyu Ma ⋅ Jia Deng
Exhibit Hall I #401
Beyond the Destination: A Novel Benchmark for Exploration-Aware Embodied Question Answering Poster Session 2 & Exhibit Hall with Coffee Break
Kaixuan Jiang ⋅ Yang Liu ⋅ Weixing Chen ⋅ Jingzhou Luo ⋅ Ziliang Chen ⋅ Ling Pan ⋅ Guanbin Li ⋅ Liang Lin
Exhibit Hall I #384
Not all Views are Created Equal: Analyzing Viewpoint Instabilities in Vision Foundation Models Poster Session 2 & Exhibit Hall with Coffee Break
Mateusz Michalkiewicz ⋅ Xinyue Bai ⋅ Mahsa Baktashmotlagh ⋅ Varun Jampani ⋅ Guha Balakrishnan
Exhibit Hall I #386
CHROME: Clothed Human Reconstruction with Occlusion-Resilience and Multiview-Consistency from a Single Image Poster Session 2 & Exhibit Hall with Coffee Break
Arindam Dutta ⋅ Meng Zheng ⋅ Zhongpai Gao ⋅ Benjamin Planche ⋅ Anwesa Choudhuri ⋅ Terrence Chen ⋅ Amit Roy-Chowdhury ⋅ Ziyan Wu
Exhibit Hall I #387
ReCoT: Reflective Self-Correction Training for Mitigating Confirmation Bias in Large Vision-Language Models Poster Session 2 & Exhibit Hall with Coffee Break
Mengxue Qu ⋅ Yibo Hu ⋅ Kunyang Han ⋅ Yunchao Wei ⋅ Yao Zhao
Exhibit Hall I #389
Easi3R: Estimating Disentangled Motion from DUSt3R Without Training Poster Session 2 & Exhibit Hall with Coffee Break
Xingyu Chen ⋅ Yue Chen ⋅ Yuliang Xiu ⋅ Andreas Geiger ⋅ Anpei Chen
Exhibit Hall I #390
PRE-Mamba: A 4D State Space Model for Ultra-High-Frequent Event Camera Deraining Poster Session 2 & Exhibit Hall with Coffee Break
Ciyu Ruan ⋅ Ruishan Guo ⋅ Zihang GONG ⋅ Jingao Xu ⋅ Wenhan Yang ⋅ Xinlei Chen
Exhibit Hall I #391
GenHaze: Pioneering Controllable One-Step Realistic Haze Generation for Real-World Dehazing Poster Session 2 & Exhibit Hall with Coffee Break
Sixiang Chen ⋅ Tian Ye ⋅ Yunlong Lin ⋅ Yeying Jin ⋅ Yijun Yang ⋅ Haoyu Chen ⋅ Jianyu Lai ⋅ Song Fei ⋅ Zhaohu Xing ⋅ Fugee Tsung ⋅ Lei Zhu
Exhibit Hall I #393
When Large Vision-Language Model Meets Large Remote Sensing Imagery: Coarse-to-Fine Text-Guided Token Pruning Poster Session 2 & Exhibit Hall with Coffee Break
Junwei Luo ⋅ Yingying Zhang ⋅ Xue Yang ⋅ Kang Wu ⋅ Qi Zhu ⋅ Lei Liang ⋅ Jingdong Chen ⋅ Yansheng Li
Exhibit Hall I #394
Can3Tok: Canonical 3D Tokenization and Latent Modeling of Scene-Level 3D Gaussians Poster Session 2 & Exhibit Hall with Coffee Break
Quankai Gao ⋅ Iliyan Georgiev ⋅ Tuanfeng Wang ⋅ Krishna Kumar Singh ⋅ Ulrich Neumann ⋅ Jae Shin Yoon
Exhibit Hall I #404
GEOPARD: Geometric Pretraining for Articulation Prediction in 3D Shapes Poster Session 2 & Exhibit Hall with Coffee Break
Pradyumn Goyal ⋅ Dmitrii Petrov ⋅ Sheldon Andrews ⋅ Yizhak Ben-Shabat ⋅ Hsueh-Ti Derek Liu ⋅ Evangelos Kalogerakis
Exhibit Hall I #405
Dream-to-Recon: Monocular 3D Reconstruction with Diffusion-Depth Distillation from Single Images Poster Session 2 & Exhibit Hall with Coffee Break
Philipp Wulff ⋅ Felix Wimbauer ⋅ Dominik Muhle ⋅ Daniel Cremers
Exhibit Hall I #407
LocalDyGS: Multi-view Global Dynamic Scene Modeling via Adaptive Local Implicit Feature Decoupling Poster Session 2 & Exhibit Hall with Coffee Break
Jiahao Wu ⋅ Rui Peng ⋅ Jianbo Jiao ⋅ Jiayu Yang ⋅ Luyang Tang ⋅ Kaiqiang Xiong ⋅ Jie Liang ⋅ Jinbo Yan ⋅ runling liu ⋅ Ronggang Wang
Exhibit Hall I #422
Combinative Matching for Geometric Shape Assembly Poster Session 2 & Exhibit Hall with Coffee Break
Nahyuk Lee ⋅ Juhong Min ⋅ Junhong Lee ⋅ Chunghyun Park ⋅ Minsu Cho
Exhibit Hall I #424
CogNav: Cognitive Process Modeling for Object Goal Navigation with LLMs Poster Session 2 & Exhibit Hall with Coffee Break
Yihan Cao ⋅ Jiazhao Zhang ⋅ Zhinan Yu ⋅ Shuzhen Liu ⋅ Zheng Qin ⋅ Qin Zou ⋅ Bo Du ⋅ Kai Xu
Exhibit Hall I #425
LLaVA-3D: A Simple yet Effective Pathway to Empowering LMMs with 3D Capabilities Poster Session 1 & Exhibit Hall
CHENMING ZHU ⋅ Tai Wang ⋅ Wenwei Zhang ⋅ Jiangmiao Pang ⋅ Xihui Liu
Exhibit Hall I #402
DyGS-SLAM: Real-Time Accurate Localization and Gaussian Reconstruction for Dynamic Scenes Poster Session 2 & Exhibit Hall with Coffee Break
Xinggang Hu ⋅ Chenyangguang Zhang ⋅ Mingyuan Zhao ⋅ Yuanze Gui ⋅ Xiangkui Zhang ⋅ Xiangyang Ji
Exhibit Hall I #426
CAD-Recode: Reverse Engineering CAD Code from Point Clouds Poster Session 2 & Exhibit Hall with Coffee Break
Danila Rukhovich ⋅ Elona Dupont ⋅ Dimitrios Mallis ⋅ Kseniya Cherenkova ⋅ Anis Kacem ⋅ Djamila Aouada
Exhibit Hall I #448
Teaching VLMs to Localize Specific Objects from In-context Examples Poster Session 2 & Exhibit Hall with Coffee Break
Sivan Doveh ⋅ Nimrod Shabtay ⋅ Eli Schwartz ⋅ Leonid Karlinsky ⋅ Raja Giryes ⋅ Hilde Kuehne ⋅ Rogerio Feris ⋅ James Glass ⋅ Assaf Arbelle ⋅ Shimon Ullman ⋅ Muhammad Jehanzeb Mirza
Exhibit Hall I #427
SDFit: 3D Object Pose and Shape by Fitting a Morphable SDF to a Single Image Poster Session 2 & Exhibit Hall with Coffee Break
Dimitrije Antić ⋅ Georgios Paschalidis ⋅ Shashank Tripathi ⋅ Theo Gevers ⋅ Sai Kumar Dwivedi ⋅ Dimitrios Tzionas
Exhibit Hall I #431
Details Matter for Indoor Open-vocabulary 3D Instance Segmentation Poster Session 2 & Exhibit Hall with Coffee Break
Sanghun Jung ⋅ Jingjing Zheng ⋅ Ke Zhang ⋅ Nan Qiao ⋅ Albert Y. C. Chen ⋅ Lu Xia ⋅ Chi Liu ⋅ Yuyin Sun ⋅ Xiao Zeng ⋅ Hsiang-Wei Huang ⋅ Byron Boots ⋅ Min Sun ⋅ Cheng-Hao Kuo
Exhibit Hall I #432
FlashDepth: Real-time Streaming Video Depth Estimation at 2K Resolution Poster Session 2 & Exhibit Hall with Coffee Break
Gene Chou ⋅ Wenqi Xian ⋅ Guandao Yang ⋅ Mohamed Abdelfattah ⋅ Bharath Hariharan ⋅ Noah Snavely ⋅ Ning Yu ⋅ Paul Debevec
Exhibit Hall I #433
Self-supervised Learning of Hybrid Part-aware 3D Representations of 2D Gaussians and Superquadrics Poster Session 2 & Exhibit Hall with Coffee Break
Zhirui Gao ⋅ Renjiao Yi ⋅ Yuhang Huang ⋅ Wei Chen ⋅ Chenyang Zhu ⋅ Kai Xu
Exhibit Hall I #434
Training-Free Personalization via Retrieval and Reasoning on Fingerprints Poster Session 2 & Exhibit Hall with Coffee Break
Deepayan Das ⋅ Davide Talon ⋅ Yiming Wang ⋅ Massimiliano Mancini ⋅ Elisa Ricci
Exhibit Hall I #437
TAPNext: Tracking Any Point (TAP) as Next Token Prediction Poster Session 2 & Exhibit Hall with Coffee Break
Artem Zholus ⋅ Carl Doersch ⋅ Yi Yang ⋅ Skanda Koppula ⋅ Viorica Patraucean ⋅ Xu He ⋅ Ignacio Rocco ⋅ Mehdi S. M. Sajjadi ⋅ Sarath Chandar ⋅ Ross Goroshin
Exhibit Hall I #438
PartField: Learning 3D Feature Fields for Part Segmentation and Beyond Poster Session 2 & Exhibit Hall with Coffee Break
Minghua Liu ⋅ Mikaela Uy ⋅ Donglai Xiang ⋅ Hao Su ⋅ Sanja Fidler ⋅ Nicholas Sharp ⋅ Jun Gao
Exhibit Hall I #439
MoMaps: Semantics-Aware Scene Motion Generation with Motion Maps Poster Session 3 & Exhibit Hall
Jiahui Lei ⋅ Kyle Genova ⋅ George Kopanas ⋅ Noah Snavely ⋅ Leonidas Guibas
Exhibit Hall I #2
Bridging the Sky and Ground: Towards View-Invariant Feature Learning for Aerial-Ground Person Re-Identification Poster Session 2 & Exhibit Hall with Coffee Break
Wajahat Khalid ⋅ Bin Liu ⋅ Xulin Li ⋅ MUHAMMAD WAQAS ⋅ MUHAMMAD SHER AFGAN
Exhibit Hall I #443
PASD: A Pixel-Adaptive Swarm Dynamics Approach for Unsupervised Low-Light Image Enhancement Poster Session 2 & Exhibit Hall with Coffee Break
Shuai Jin ⋅ Yuhua Qian ⋅ Feijiang Li ⋅ Guoqing Liu ⋅ Xinyan Liang
Exhibit Hall I #380
CoA-VLA: Improving Vision-Language-Action Models via Visual-Text Chain-of-Affordance Poster Session 2 & Exhibit Hall with Coffee Break
Jinming Li ⋅ Yichen Zhu ⋅ Zhibin Tang ⋅ Junjie Wen ⋅ Minjie Zhu ⋅ Xiaoyu Liu ⋅ Chengmeng Li ⋅ Ran Cheng ⋅ Yaxin Peng ⋅ Yan Peng ⋅ Feifei Feng
Exhibit Hall I #444
Proactive Scene Decomposition and Reconstruction Poster Session 2 & Exhibit Hall with Coffee Break
Baicheng Li ⋅ Zike Yan ⋅ Dong Wu ⋅ Hongbin Zha
Exhibit Hall I #446
Unified Category-Level Object Detection and Pose Estimation from RGB Images using 3D Prototypes Poster Session 2 & Exhibit Hall with Coffee Break
Tom Fischer ⋅ Xiaojie Zhang ⋅ Eddy Ilg
Exhibit Hall I #447
EvRT-DETR: Latent Space Adaptation of Image Detectors for Event-based Vision Poster Session 2 & Exhibit Hall with Coffee Break
Dmitrii Torbunov ⋅ Yihui Ren ⋅ Animesh Ghose ⋅ Odera Dim ⋅ Yonggang Cui
Exhibit Hall I #449
A Hyperdimensional One Place Signature to Represent Them All: Stackable Descriptors For Visual Place Recognition Poster Session 2 & Exhibit Hall with Coffee Break
Connor Malone ⋅ Somayeh Hussaini ⋅ Tobias Fischer ⋅ Michael Milford
Exhibit Hall I #450
IRASim: A Fine-Grained World Model for Robot Manipulation Poster Session 2 & Exhibit Hall with Coffee Break
Fangqi Zhu ⋅ Hongtao Wu ⋅ Song Guo ⋅ Yuxiao Liu ⋅ Chilam Cheang ⋅ Tao Kong
Exhibit Hall I #451
WalkVLM: Aid Visually Impaired People Walking by Vision Language Model Poster Session 2 & Exhibit Hall with Coffee Break
Zhiqiang Yuan ⋅ Ting Zhang ⋅ Yeshuang Zhu ⋅ Jiapei Zhang ⋅ Ying Deng ⋅ Zexi Jia ⋅ Peixiang Luo ⋅ Xiaoyue Duan ⋅ Jie Zhou ⋅ Jinchao Zhang
Exhibit Hall I #452
Error Recognition in Procedural Videos using Generalized Task Graph Poster Session 3 & Exhibit Hall
Shih-Po Lee ⋅ Ehsan Elhamifar
Exhibit Hall I #1
VIGFace: Virtual Identity Generation for Privacy-Free Face Recognition Dataset Poster Session 3 & Exhibit Hall
Minsoo Kim ⋅ Min-Cheol Sagong ⋅ Gi Pyo Nam ⋅ Junghyun Cho ⋅ Ig-Jae Kim
Exhibit Hall I #4
RoboFactory: Exploring Embodied Agent Collaboration with Compositional Constraints Poster Session 3 & Exhibit Hall
Yiran Qin ⋅ Li Kang ⋅ Xiufeng Song ⋅ Zhenfei Yin ⋅ Xiaohong Liu ⋅ Xihui Liu ⋅ Ruimao Zhang ⋅ LEI BAI
Exhibit Hall I #7
MotionStreamer: Streaming Motion Generation via Diffusion-based Autoregressive Model in Causal Latent Space Poster Session 3 & Exhibit Hall
Lixing Xiao ⋅ Shunlin Lu ⋅ Huaijin Pi ⋅ Ke Fan ⋅ Liang Pan ⋅ Yueer Zhou ⋅ Ziyong Feng ⋅ Xiaowei Zhou ⋅ Sida Peng ⋅ Jingbo Wang
Exhibit Hall I #8
RapVerse: Coherent Vocals and Whole-Body Motion Generation from Text Poster Session 3 & Exhibit Hall
Jiaben Chen ⋅ Xin Yan ⋅ Yihang Chen ⋅ Siyuan Cen ⋅ Zixin Wang ⋅ Qinwei Ma ⋅ Haoyu Zhen ⋅ Kaizhi Qian ⋅ Lie Lu ⋅ Chuang Gan
Exhibit Hall I #9
RoboPearls: Editable Video Simulation for Robot Manipulation Poster Session 3 & Exhibit Hall
Tao Tang ⋅ Likui Zhang ⋅ Youpeng Wen ⋅ Kaidong Zhang ⋅ Jia-Wang Bian ⋅ xia zhou ⋅ Tianyi Yan ⋅ Kun Zhan ⋅ Peng Jia ⋅ Hefeng Wu ⋅ Liang Lin ⋅ Xiaodan Liang
Exhibit Hall I #11
GraspCoT: Integrating Physical Property Reasoning for 6-DoF Grasping under Flexible Language Instructions Poster Session 3 & Exhibit Hall
Xiaomeng Chu ⋅ Jiajun Deng ⋅ Guoliang You ⋅ Wei Liu ⋅ Xingchen Li ⋅ Jianmin Ji ⋅ Yanyong Zhang
Exhibit Hall I #12
Towards Immersive Human-X Interaction: A Real-Time Framework for Physically Plausible Motion Synthesis Poster Session 3 & Exhibit Hall
Kaiyang Ji ⋅ Ye Shi ⋅ Zichen Jin ⋅ Kangyi Chen ⋅ Lan Xu ⋅ Yuexin Ma ⋅ Jingyi Yu ⋅ Jingya Wang
Exhibit Hall I #16
SynFER: Towards Boosting Facial Expression Recognition with Synthetic Data Poster Session 3 & Exhibit Hall
Xilin He ⋅ Cheng Luo ⋅ Xiaole Xian ⋅ Bing Li ⋅ Siyang Song ⋅ Muhammad Haris Khan ⋅ Weicheng Xie ⋅ Linlin Shen ⋅ Zongyuan Ge ⋅ Bernard Ghanem ⋅ Xiangyu Yue
Exhibit Hall I #17
Multi-modal Multi-platform Person Re-Identification: Benchmark and Method Poster Session 3 & Exhibit Hall
Ruiyang Ha ⋅ Songyi Jiang ⋅ Bin Li ⋅ Bikang Pan ⋅ Yihang Zhu ⋅ Junjie Zhang ⋅ Xiatian Zhu ⋅ Shaogang Gong ⋅ Jingya Wang
Exhibit Hall I #23
Mixture of Experts Guided by Gaussian Splatters Matters: A new Approach to Weakly-Supervised Video Anomaly Detection Poster Session 3 & Exhibit Hall
Giacomo D'Amicantonio ⋅ Snehashis Majhi ⋅ Quan Kong ⋅ Lorenzo Garattoni ⋅ Gianpiero Francesca ⋅ Egor Bondarev ⋅ Francois Bremond
Exhibit Hall I #25
What If: Understanding Motion Through Sparse Interactions Poster Session 3 & Exhibit Hall
Stefan A. Baumann ⋅ Nick Stracke ⋅ Timy Phan ⋅ Björn Ommer
Exhibit Hall I #26
PROGRESSOR: A Perceptually Guided Reward Estimator with Self-Supervised Online Refinement Poster Session 3 & Exhibit Hall
Tewodros W. Ayalew ⋅ Xiao Zhang ⋅ Kevin Y Wu ⋅ Tianchong Jiang ⋅ Michael Maire ⋅ Matthew Walter
Exhibit Hall I #27
UniEgoMotion: A Unified Model for Egocentric Motion Reconstruction, Forecasting, and Generation Poster Session 3 & Exhibit Hall
Chaitanya Patel ⋅ Hiroki Nakamura ⋅ Yuta Kyuragi ⋅ Kazuki Kozuka ⋅ Juan Carlos Niebles ⋅ Ehsan Adeli
Exhibit Hall I #29
DAViD: Modeling Dynamic Affordance of 3D Objects Using Pre-trained Video Diffusion Models Poster Session 3 & Exhibit Hall
Hyeonwoo Kim ⋅ Sangwon Baik ⋅ Hanbyul Joo
Exhibit Hall I #30
RoboAnnotatorX: A Comprehensive and Universal Annotation Framework for Accurate Understanding of Long-horizon Robot Demonstration Poster Session 3 & Exhibit Hall
Longxin Kou ⋅ Fei Ni ⋅ Jianye HAO ⋅ Han Peilong ⋅ Jinyi Liu ⋅ Haiqin Cui ⋅ Rui Liu ⋅ YAN ZHENG
Exhibit Hall I #32
FaceShield: Defending Facial Image against Deepfake Threats Poster Session 3 & Exhibit Hall
Jaehwan Jeong ⋅ Sumin In ⋅ Sieun Kim ⋅ Shin yi ⋅ Jongheon Jeong ⋅ Sang Yoon ⋅ Jaewook Chung ⋅ Sangpil Kim
Exhibit Hall I #33
Task-Oriented Human Grasp Synthesis via Context- and Task-Aware Diffusers Poster Session 3 & Exhibit Hall
An Lun Liu ⋅ Yu-Wei Chao ⋅ Yi-Ting Chen
Exhibit Hall I #34
Ouroboros: Single-step Diffusion Models for Cycle-consistent Forward and Inverse Rendering Poster Session 3 & Exhibit Hall
shanlin sun ⋅ Yifan Wang ⋅ Hanwen Zhang ⋅ Yifeng Xiong ⋅ Qin Ren ⋅ Ruogu Fang ⋅ Xiaohui Xie ⋅ Chenyu You
Exhibit Hall I #35
Expressive Talking Human from Single-Image with Imperfect Priors Poster Session 3 & Exhibit Hall
Jun Xiang ⋅ Yudong Guo ⋅ Leipeng Hu ⋅ Boyang Guo ⋅ Yancheng Yuan ⋅ Juyong Zhang
Exhibit Hall I #36
Beyond Label Semantics: Language-Guided Action Anatomy for Few-shot Action Recognition Poster Session 3 & Exhibit Hall
Zefeng Qian ⋅ Xincheng Yao ⋅ Yifei Huang ⋅ Chong-Yang Zhang ⋅ Jiangyong Ying ⋅ Hong Sun
Exhibit Hall I #38
Few-Shot Image Quality Assessment via Adaptation of Vision-Language Models Poster Session 3 & Exhibit Hall
Xudong Li ⋅ Zihao Huang ⋅ Yan Zhang ⋅ Yunhang Shen ⋅ Ke Li ⋅ Xiawu Zheng ⋅ Liujuan Cao ⋅ Rongrong Ji
Exhibit Hall I #40
Unleashing High-Quality Image Generation in Diffusion Sampling Using Second-Order Levenberg-Marquardt-Langevin Poster Session 3 & Exhibit Hall
Fangyikang Wang ⋅ Hubery Yin ⋅ Lei Qian ⋅ Yinan Li ⋅ SHAOBIN ZHUANG ⋅ Huminhao Zhu ⋅ Yilin Zhang ⋅ Yanlong Tang ⋅ Chao Zhang ⋅ Hanbin Zhao ⋅ Hui Qian ⋅ Chen Li
Exhibit Hall I #41
Reverse Convolution and Its Applications to Image Restoration Poster Session 3 & Exhibit Hall
Xuhong Huang ⋅ Shiqi Liu ⋅ Kai Zhang ⋅ Ying Tai ⋅ Jian Yang ⋅ Hui Zeng ⋅ Lei Zhang
Exhibit Hall I #46
MamTiff-CAD: Multi-Scale Latent Diffusion with Mamba+ for Complex Parametric Sequence Poster Session 3 & Exhibit Hall
Liyuan Deng ⋅ Yunpeng Bai ⋅ Yongkang Dai ⋅ Xiaoshui Huang ⋅ Hongping Gan ⋅ Dongshuo Huang ⋅ Hao jiacheng ⋅ Yilei Shi
Exhibit Hall I #47
Local Scale Equivariance with Latent Deep Equilibrium Canonicalizer Poster Session 3 & Exhibit Hall
Md Ashiqur Rahman ⋅ Chiao-An Yang ⋅ Michael N Cheng ⋅ Lim Hao ⋅ Jeremiah Jiang ⋅ Teck-Yian Lim ⋅ Raymond A. Yeh
Exhibit Hall I #48
DynamicID: Zero-Shot Multi-ID Image Personalization with Flexible Facial Editability Poster Session 3 & Exhibit Hall
Xirui Hu ⋅ Jiahao Wang ⋅ Hao chen ⋅ Weizhan Zhang ⋅ Benqi Wang ⋅ yikun Li ⋅ Haishun Nan
Exhibit Hall I #50
EfficientMT: Efficient Temporal Adaptation for Motion Transfer in Text-to-Video Diffusion Models Poster Session 3 & Exhibit Hall
Yufei Cai ⋅ Hu Han ⋅ Yuxiang Wei ⋅ Shiguang Shan ⋅ Xilin Chen
Exhibit Hall I #54
InteractAvatar: Modeling Hand-Face Interaction in Photorealistic Avatars with Deformable Gaussians Poster Session 3 & Exhibit Hall
Kefan Chen ⋅ Sergiu Oprea ⋅ Justin Theiss ⋅ Sreyas Mohan ⋅ Srinath Sridhar ⋅ Aayush Prakash
Exhibit Hall I #37
X-Dancer: Expressive Music to Human Dance Video Generation Poster Session 3 & Exhibit Hall
Zeyuan Chen ⋅ Hongyi Xu ⋅ Guoxian Song ⋅ You Xie ⋅ Chenxu Zhang ⋅ Xin Chen ⋅ Chao Wang ⋅ Di Chang ⋅ Linjie Luo
Exhibit Hall I #55
DeepMesh: Auto-Regressive Artist-mesh Creation with Reinforcement Learning Poster Session 3 & Exhibit Hall
Ruowen Zhao ⋅ James Jun Liang Chen Ye ⋅ Zhengyi Wang ⋅ Guangce Liu ⋅ Yiwen Chen ⋅ Yikai Wang ⋅ Jun Zhu
Exhibit Hall I #56
Im2Haircut: Single-view Strand-based Hair Reconstruction for Human Avatars Poster Session 3 & Exhibit Hall
Vanessa Sklyarova ⋅ Egor Zakharov ⋅ Malte Prinzler ⋅ Giorgio Becherini ⋅ Michael Black ⋅ Justus Thies
Exhibit Hall I #60
AFUNet: Cross-Iterative Alignment-Fusion Synergy for HDR Reconstruction via Deep Unfolding Paradigm Poster Session 3 & Exhibit Hall
Xinyue Li ⋅ Zhangkai Ni ⋅ Wenhan Yang
Exhibit Hall I #61
PINO: Person-Interaction Noise Optimization for Long-Duration and Customizable Motion Generation of Arbitrary-Sized Groups Poster Session 3 & Exhibit Hall
Sakuya Ota ⋅ Qing Yu ⋅ Kent Fujiwara ⋅ Satoshi Ikehata ⋅ Ikuro Sato
Exhibit Hall I #62
TeRA: Rethinking Text-guided Realistic 3D Avatar Generation Poster Session 3 & Exhibit Hall
Yanwen Wang ⋅ Yiyu Zhuang ⋅ Jiawei Zhang ⋅ Li Wang ⋅ Yifei Zeng ⋅ Xun Cao ⋅ Xinxin Zuo ⋅ Hao Zhu
Exhibit Hall I #63
A Unified Framework for Motion Reasoning and Generation in Human Interaction Poster Session 3 & Exhibit Hall
Jeongeun Park ⋅ Sungjoon Choi ⋅ Sangdoo Yun
Exhibit Hall I #64
Open-World Skill Discovery from Unsegmented Demonstration Videos Poster Session 3 & Exhibit Hall
Jingwen Deng ⋅ Zihao Wang ⋅ Shaofei Cai ⋅ Anji Liu ⋅ Yitao Liang
Exhibit Hall I #65
Deep Adaptive Unfolded Network via Spatial Morphology Stripping and Spectral Filtration for Pan-sharpening Poster Session 3 & Exhibit Hall
Hebaixu Wang ⋅ Jiayi Ma
Exhibit Hall I #67
EgoAdapt: Adaptive Multisensory Distillation and Policy Learning for Efficient Egocentric Perception Poster Session 3 & Exhibit Hall
Sanjoy Chowdhury ⋅ Subrata Biswas ⋅ Sayan Nag ⋅ Tushar Nagarajan ⋅ Calvin Murdock ⋅ Ishwarya Ananthabhotla ⋅ Yijun Qian ⋅ Vamsi Ithapu ⋅ Dinesh Manocha ⋅ Ruohan Gao
Exhibit Hall I #68
Reference-based Super-Resolution via Image-based Retrieval-Augmented Generation Diffusion Poster Session 3 & Exhibit Hall
Byeonghun Lee ⋅ Hyunmin Cho ⋅ Honggyu Choi ⋅ Soo Min Kang ⋅ ILJUN AHN ⋅ Kyong Hwan Jin
Exhibit Hall I #70
Vulnerability-Aware Spatio-Temporal Learning for Generalizable Deepfake Video Detection Poster Session 3 & Exhibit Hall
Dat NGUYEN ⋅ Marcella Astrid ⋅ Anis Kacem ⋅ Enjie Ghorbel ⋅ Djamila Aouada
Exhibit Hall I #72
EgoM2P: Egocentric Multimodal Multitask Pretraining Poster Session 3 & Exhibit Hall
Gen Li ⋅ Yutong Chen ⋅ Yiqian Wu ⋅ KAIFENG ZHAO ⋅ Marc Pollefeys ⋅ Siyu Tang
Exhibit Hall I #78
E-NeMF: Event-based Neural Motion Field for Novel Space-time View Synthesis of Dynamic Scenes Poster Session 3 & Exhibit Hall
Yan Liu ⋅ Zehao Chen ⋅ Haojie Yan ⋅ De Ma ⋅ Huajin Tang ⋅ Qian Zheng ⋅ Gang Pan
Exhibit Hall I #80
HUMOTO: A 4D Dataset of Mocap Human Object Interactions Poster Session 3 & Exhibit Hall
Jiaxin Lu ⋅ Chun-Hao Huang ⋅ Uttaran Bhattacharya ⋅ Qixing Huang ⋅ Yi Zhou
Exhibit Hall I #83
CombatVLA: An Efficient Vision-Language-Action Model for Combat Tasks in 3D Action Role-Playing Games Poster Session 3 & Exhibit Hall
Peng Chen ⋅ Pi Bu ⋅ Yingyao Wang ⋅ Xinyi Wang ⋅ Ziming Wang ⋅ Jie Guo ⋅ Yingxiu Zhao ⋅ Qi Zhu ⋅ Jun Song ⋅ Siran Yang ⋅ Jiamang Wang ⋅ Bo Zheng
Exhibit Hall I #86
CharaConsist: Fine-Grained Consistent Character Generation Poster Session 4 & Exhibit Hall with Coffee Break
Mengyu Wang ⋅ Henghui Ding ⋅ Jianing Peng ⋅ Yao Zhao ⋅ Yunpeng Chen ⋅ Yunchao Wei
Exhibit Hall I #111
MonSTeR: a Unified Model for Motion, Scene, Text Retrieval Poster Session 3 & Exhibit Hall
Luca Collorone ⋅ Matteo Gioia ⋅ Massimiliano Pappa ⋅ Paolo Leoni ⋅ Giovanni Ficarra ⋅ Or Litany ⋅ Indro Spinelli ⋅ Fabio Galasso
Exhibit Hall I #88
Nautilus: Locality-aware Autoencoder for Scalable Mesh Generation Poster Session 3 & Exhibit Hall
Yuxuan Wang ⋅ Xuanyu Yi ⋅ Haohan Weng ⋅ Qingshan Xu ⋅ xiaokang wei ⋅ Xianghui Yang ⋅ Chunchao Guo ⋅ Long Chen ⋅ Hanwang Zhang
Exhibit Hall I #90
F-Bench: Rethinking Human Preference Evaluation Metrics for Benchmarking Face Generation, Customization, and Restoration Poster Session 3 & Exhibit Hall
Lu Liu ⋅ Huiyu Duan ⋅ Qiang Hu ⋅ Liu Yang ⋅ Chunlei Cai ⋅ Tianxiao Ye ⋅ Huayu Liu ⋅ Xiaoyun Zhang ⋅ Guangtao Zhai
Exhibit Hall I #92
Decouple and Track: Benchmarking and Improving Video Diffusion Transformers For Motion Transfer Poster Session 3 & Exhibit Hall
Qingyu Shi ⋅ Jianzong Wu ⋅ Jinbin Bai ⋅ Lu Qi ⋅ Jiangning Zhang ⋅ Yunhai Tong ⋅ Xiangtai Li
Exhibit Hall I #93
Latent Swap Joint Diffusion for 2D Long-Form Latent Generation Poster Session 3 & Exhibit Hall
Yusheng Dai ⋅ Chenxi Wang ⋅ Chang Li ⋅ Chen Wang ⋅ Kewei Li ⋅ Jun Du ⋅ Lei Sun ⋅ Jianqing Gao ⋅ Ruoyu Wang ⋅ Jiefeng Ma
Exhibit Hall I #94
Blind Noisy Image Deblurring Using Residual Guidance Strategy Poster Session 3 & Exhibit Hall
Heyan Liu ⋅ Jianing Sun ⋅ Jun Liu ⋅ Xi-Le Zhao ⋅ Tingting WU ⋅ Tieyong Zeng
Exhibit Hall I #95
Drawing Developmental Trajectory from Cortical Surface Reconstruction Poster Session 3 & Exhibit Hall
WENXUAN WU ⋅ ruowen qu ⋅ Zhongliang Liu ⋅ Zhuoyan Dai ⋅ Dongzi Shi ⋅ Sijin Yu ⋅ Tong Xiong ⋅ Shiping Liu ⋅ Xiangmin Xu ⋅ Xiaofen Xing ⋅ Xin Zhang
Exhibit Hall I #96
Diffuman4D: 4D Consistent Human View Synthesis from Sparse-View Videos with Spatio-Temporal Diffusion Models Poster Session 3 & Exhibit Hall
Yudong Jin ⋅ Sida Peng ⋅ Xuan Wang ⋅ Tao Xie ⋅ Zhen Xu ⋅ Yifan Yang ⋅ Yujun Shen ⋅ Hujun Bao ⋅ Xiaowei Zhou
Exhibit Hall I #98
Less is More: Improving Motion Diffusion Models with Sparse Keyframes Poster Session 3 & Exhibit Hall
Jinseok Bae ⋅ Inwoo Hwang ⋅ Young-Yoon Lee ⋅ Ziyu Guo ⋅ Joseph Liu ⋅ Yizhak Ben-Shabat ⋅ Young Min Kim ⋅ Mubbasir Kapadia
Exhibit Hall I #100
DGTalker: Disentangled Generative Latent Space Learning for Audio-Driven Gaussian Talking Heads Poster Session 3 & Exhibit Hall
Xiaoxi Liang ⋅ Yanbo Fan ⋅ Qiya Yang ⋅ Xuan Wang ⋅ Wei Gao ⋅ Ge Li
Exhibit Hall I #101
VQ-VLA: Improving Vision-Language-Action Models via Scaling Vector-Quantized Action Tokenizers Poster Session 3 & Exhibit Hall
Yating Wang ⋅ Haoyi Zhu ⋅ Mingyu Liu ⋅ Jiange Yang ⋅ Hao-Shu Fang ⋅ Tong He
Exhibit Hall I #102
Augmented and Softened Matching for Unsupervised Visible-Infrared Person Re-Identification Poster Session 3 & Exhibit Hall
Zhiqi Pang ⋅ Chunyu Wang ⋅ Lingling Zhao ⋅ Junjie Wang
Exhibit Hall I #103
Temporal Unlearnable Examples: Preventing Personal Video Data from Unauthorized Exploitation by Object Tracking Poster Session 3 & Exhibit Hall
Qiangqiang Wu ⋅ Yi Yu ⋅ Chenqi Kong ⋅ Ziquan Liu ⋅ Jia Wan ⋅ Haoliang Li ⋅ Alex Kot ⋅ Antoni Chan
Exhibit Hall I #104
VLABench: A Large-Scale Benchmark for Language-Conditioned Robotics Manipulation with Long-Horizon Reasoning Tasks Poster Session 3 & Exhibit Hall
shiduo zhang ⋅ Zhe Xu ⋅ Peiju Liu ⋅ Xiaopeng Yu ⋅ Qinghui Gao ⋅ Yuan Li ⋅ Zhaoye Fei ⋅ Zhangyue Yin ⋅ Zuxuan Wu ⋅ Yu-Gang Jiang ⋅ Xipeng Qiu
Exhibit Hall I #107
TrackVerse: A Large-Scale Object-Centric Video Dataset for Image-Level Representation Learning Poster Session 3 & Exhibit Hall
Yibing Wei ⋅ Samuel Church ⋅ Victor Suciu ⋅ Jinhong Lin ⋅ Cheng-En Wu ⋅ Pedro Morgado
Exhibit Hall I #108
Reangle-A-Video: 4D Video Generation as Video-to-Video Translation Poster Session 3 & Exhibit Hall
Hyeonho Jeong ⋅ Suhyeon Lee ⋅ Jong Ye
Exhibit Hall I #109
Beyond Spatial Frequency: Pixel-wise Temporal Frequency-based Deepfake Video Detection Poster Session 3 & Exhibit Hall
Taehoon Kim ⋅ Jongwook Choi ⋅ Yonghyun Jeong ⋅ Haeun Noh ⋅ Jaejun Yoo ⋅ Seungryul Baek ⋅ Jongwon Choi
Exhibit Hall I #112
Causal-Entity Reflected Egocentric Traffic Accident Video Synthesis Poster Session 3 & Exhibit Hall
Lei-lei Li ⋅ Jianwu Fang ⋅ Junbin Xiao ⋅ Shanmin Pang ⋅ Hongkai Yu ⋅ Chen Lv ⋅ Jianru Xue ⋅ Tat-Seng Chua
Exhibit Hall I #113
Robust Test-Time Adaptation for Single Image Denoising Using Deep Gaussian Prior Poster Session 3 & Exhibit Hall
Qing Ma ⋅ Pengwei Liang ⋅ Xiong Zhou ⋅ Jiayi Ma ⋅ Junjun Jiang ⋅ Zhe Peng
Exhibit Hall I #115
Hierarchical-aware Orthogonal Disentanglement Framework for Fine-grained Skeleton-based Action Recognition Poster Session 3 & Exhibit Hall
Haochen Chang ⋅ Pengfei Ren ⋅ Haoyang Zhang ⋅ Liang Xie ⋅ Hongbo Chen ⋅ Erwei Yin
Exhibit Hall I #117
MBTI: Masked Blending Transformers with Implicit Positional Encoding for Frame-rate Agnostic Motion Estimation Poster Session 3 & Exhibit Hall
Jungwoo Huh ⋅ Yeseung Park ⋅ Seongjean Kim ⋅ Jungsu Kim ⋅ Sanghoon Lee
Exhibit Hall I #146
Balancing Task-invariant Interaction and Task-specific Adaptation for Unified Image Fusion Poster Session 3 & Exhibit Hall
Xingyu Hu ⋅ Junjun Jiang ⋅ Chenyang Wang ⋅ Kui Jiang ⋅ Xianming Liu ⋅ Jiayi Ma
Exhibit Hall I #118
PatchScaler: An Efficient Patch-Independent Diffusion Model for Image Super-Resolution Poster Session 3 & Exhibit Hall
Yong Liu ⋅ Hang Dong ⋅ Jinshan Pan ⋅ Qingji dong ⋅ Kai Chen ⋅ Rongxiang Zhang ⋅ Lean Fu ⋅ Fei Wang
Exhibit Hall I #120
Disentangled Clothed Avatar Generation with Layered Representation Poster Session 3 & Exhibit Hall
Weitian Zhang ⋅ Yichao Yan ⋅ Sijing Wu ⋅ Manwen Liao ⋅ Xiaokang Yang
Exhibit Hall I #124
Augmented Mass-Spring Model for Real-Time Dense Hair Simulation Poster Session 3 & Exhibit Hall
Jorge Herrera ⋅ Yi Zhou ⋅ Xin Sun ⋅ Zhixin Shu ⋅ Chengan He ⋅ Soren Pirk ⋅ Dominik Michels
Exhibit Hall I #125
Punching Bag vs. Punching Person: Motion Transferability in Videos Poster Session 3 & Exhibit Hall
Raiyaan Abdullah ⋅ Jared Claypoole ⋅ Michael Cogswell ⋅ Ajay Divakaran ⋅ Yogesh Rawat
Exhibit Hall I #126
G-DexGrasp: Generalizable Dexterous Grasping Synthesis Via Part-Aware Prior Retrieval and Prior-Assisted Generation Poster Session 3 & Exhibit Hall
Juntao Jian ⋅ Xiuping Liu ⋅ Zixuanchen Zixuanchen ⋅ Manyi Li ⋅ Jian Liu ⋅ Ruizhen Hu
Exhibit Hall I #135
WarpHE4D: Dense 4D Head Map toward Full Head Reconstruction Poster Session 3 & Exhibit Hall
Jongseob Yun ⋅ Yong-Hoon Kwon ⋅ Min-Gyu Park ⋅ Ju-Mi Kang ⋅ Min-Ho Lee ⋅ Inho Chang ⋅ Ju Yoon ⋅ Kuk-Jin Yoon
Exhibit Hall I #138
PrimHOI: Compositional Human-Object Interaction via Reusable Primitives Poster Session 3 & Exhibit Hall
Kai Jia ⋅ Tengyu Liu ⋅ Mingtao Pei ⋅ Yixin Zhu ⋅ Siyuan Huang
Exhibit Hall I #139
Continuous-Time Human Motion Field from Event Cameras Poster Session 3 & Exhibit Hall
Ziyun (Claude) Wang ⋅ Ruijun Zhang ⋅ Zi-Yan Liu ⋅ Yufu Wang ⋅ Kostas Daniilidis
Exhibit Hall I #140
GENMO: A GENeralist Model for Human MOtion Poster Session 3 & Exhibit Hall
Jiefeng Li ⋅ Jinkun Cao ⋅ Haotian Zhang ⋅ Davis Rempe ⋅ Jan Kautz ⋅ Umar Iqbal ⋅ Ye Yuan
Exhibit Hall I #166
Efficient Track Anything Poster Session 3 & Exhibit Hall
Yunyang Xiong ⋅ Chong Zhou ⋅ Xiaoyu Xiang ⋅ Lemeng Wu ⋅ Chenchen Zhu ⋅ Zechun Liu ⋅ Saksham Suri ⋅ Balakrishnan Varadarajan ⋅ Ramya Akula ⋅ Forrest Iandola ⋅ Raghuraman Krishnamoorthi ⋅ Bilge Soran ⋅ Vikas Chandra
Exhibit Hall I #141
HAMoBE: Hierarchical and Adaptive Mixture of Biometric Experts for Video-based Person ReID Poster Session 3 & Exhibit Hall
Yiyang Su ⋅ Yunping Shi ⋅ Feng Liu ⋅ Xiaoming Liu
Exhibit Hall I #142
Multi-Object Sketch Animation by Scene Decomposition and Motion Planning Poster Session 3 & Exhibit Hall
Jingyu Liu ⋅ Zijie Xin ⋅ Yuhan Fu ⋅ Ruixiang Zhao ⋅ Bangxiang Lan ⋅ Xirong Li
Exhibit Hall I #143
ISP2HRNet: Learning to Reconstruct High Resolution Image from Irregularly Sampled Pixels via Hierarchical Gradient Learning Poster Session 3 & Exhibit Hall
Yuanlin Wang ⋅ Ruiqin Xiong ⋅ Rui Zhao ⋅ Jin Wang ⋅ Xiaopeng Fan ⋅ Tiejun Huang
Exhibit Hall I #144
Sequential keypoint density estimator: an overlooked baseline of skeleton-based video anomaly detection Poster Session 3 & Exhibit Hall
Anja Delić ⋅ Matej Grcic ⋅ Siniša Šegvić
Exhibit Hall I #147
GameFactory: Creating New Games with Generative Interactive Videos Poster Session 3 & Exhibit Hall
Jiwen Yu ⋅ Yiran Qin ⋅ Xintao Wang ⋅ Pengfei Wan ⋅ Di ZHANG ⋅ Xihui Liu
Exhibit Hall I #150
FaceCraft4D: Animated 3D Facial Avatar Generation from a Single Image Poster Session 3 & Exhibit Hall
Fei Yin ⋅ Mallikarjun Reddy ⋅ Chun-Han Yao ⋅ Rafal Mantiuk ⋅ Varun Jampani
Exhibit Hall I #152
Event-Driven Storytelling with Multiple Lifelike Humans in a 3D Scene Poster Session 3 & Exhibit Hall
Donggeun Lim ⋅ Jinseok Bae ⋅ Inwoo Hwang ⋅ Seungmin Lee ⋅ Hwanhee Lee ⋅ Young Min Kim
Exhibit Hall I #156
EvolvingGrasp: Evolutionary Grasp Generation via Efficient Preference Alignment Poster Session 3 & Exhibit Hall
Yufei Zhu ⋅ Yiming Zhong ⋅ Zemin Yang ⋅ Peishan Cong ⋅ Jingyi Yu ⋅ Xinge Zhu ⋅ Yuexin Ma
Exhibit Hall I #157
Efficient Autoregressive Shape Generation via Octree-Based Adaptive Tokenization Poster Session 3 & Exhibit Hall
Kangle Deng ⋅ Hsueh-Ti Derek Liu ⋅ Yiheng Zhu ⋅ Xiaoxia Sun ⋅ Chong Shang ⋅ Kiran Bhat ⋅ Deva Ramanan ⋅ Jun-Yan Zhu ⋅ Maneesh Agrawala ⋅ Tinghui Zhou
Exhibit Hall I #159
EAMamba: Efficient All-Around Vision State Space Model for Image Restoration Poster Session 3 & Exhibit Hall
Yu-Cheng Lin ⋅ Yu-Syuan Xu ⋅ Hao-Wei Chen ⋅ Hsien-Kai Kuo ⋅ Chun-Yi Lee
Exhibit Hall I #161
SyncDiff: Synchronized Motion Diffusion for Multi-Body Human-Object Interaction Synthesis Poster Session 3 & Exhibit Hall
Wenkun He ⋅ Yun Liu ⋅ Ruitao Liu ⋅ Li Yi
Exhibit Hall I #163
Fast Image Super-Resolution via Consistency Rectified Flow Poster Session 3 & Exhibit Hall
Jiaqi Xu ⋅ Wenbo Li ⋅ Haoze Sun ⋅ Fan Li ⋅ Zhixin Wang ⋅ Long Peng ⋅ Jingjing Ren ⋅ HAORAN YANG ⋅ Xiaowei Hu ⋅ Renjing Pei ⋅ Pheng-Ann Heng
Exhibit Hall I #165
Event-guided HDR Reconstruction with Diffusion Priors Poster Session 3 & Exhibit Hall
Yixin Yang ⋅ jiawei zhang ⋅ Yang Zhang ⋅ Yunxuan Wei ⋅ Dongqing Zou ⋅ Jimmy Ren ⋅ Boxin Shi
Exhibit Hall I #168
Learning Efficient and Generalizable Human Representation with Human Gaussian Model Poster Session 3 & Exhibit Hall
Yifan Liu ⋅ Shengjun Zhang ⋅ Chensheng Dai ⋅ Yang Chen ⋅ Hao Liu ⋅ Chen Li ⋅ Yueqi Duan
Exhibit Hall I #169
SMGDiff: Soccer Motion Generation using Diffusion Probabilistic Models Poster Session 3 & Exhibit Hall
Hongdi Yang ⋅ Chengyang Li ⋅ Zhenxuan Wu ⋅ Gaozheng Li ⋅ Jingya Wang ⋅ Jingyi Yu ⋅ Zhuo Su ⋅ Lan Xu
Exhibit Hall I #170
AffordDexGrasp: Open-set Language-guided Dexterous Grasp with Generalizable-Instructive Affordance Poster Session 3 & Exhibit Hall
Yilin Wei ⋅ Mu Lin ⋅ Yuhao Lin ⋅ Jian-Jian Jiang ⋅ Xiao-Ming Wu ⋅ Ling-An Zeng ⋅ Wei-Shi Zheng
Exhibit Hall I #171
Robust Adverse Weather Removal via Spectral-based Spatial Grouping Poster Session 3 & Exhibit Hall
Yuhwan Jeong ⋅ Yunseo Yang ⋅ Youngho Yoon ⋅ Kuk-Jin Yoon
Exhibit Hall I #176
Switch-a-View: View Selection Learned from Unlabeled In-the-wild Videos Poster Session 3 & Exhibit Hall
Sagnik Majumder ⋅ Tushar Nagarajan ⋅ Ziad Al-Halah ⋅ Kristen Grauman
Exhibit Hall I #185
DiTaiListener: Controllable High Fidelity Listener Video Generation with Diffusion Poster Session 3 & Exhibit Hall
Maksim Siniukov ⋅ Di Chang ⋅ Minh Tran ⋅ Hongkun Gong ⋅ Ashutosh Chaubey ⋅ Mohammad Soleymani
Exhibit Hall I #187
Hipandas: Hyperspectral Image Joint Denoising and Super-Resolution by Image Fusion with the Panchromatic Image Poster Session 3 & Exhibit Hall
Shuang Xu ⋅ Zixiang Zhao ⋅ Haowen Bai ⋅ Chang Yu ⋅ Jiangjun Peng ⋅ Xiangyong Cao ⋅ Deyu Meng
Exhibit Hall I #188
Ponimator: Unfolding Interactive Pose for Versatile Human-human Interaction Animation Poster Session 3 & Exhibit Hall
Shaowei Liu ⋅ chuan guo ⋅ Bing Zhou ⋅ Jian Wang
Exhibit Hall I #194
Scaling Action Detection: AdaTAD++ with Transformer-Enhanced Temporal-Spatial Adaptation Poster Session 3 & Exhibit Hall
Tanay Agrawal ⋅ Abid Ali ⋅ Antitza Dantcheva ⋅ Francois Bremond
Exhibit Hall I #208
Avat3r: Large Animatable Gaussian Reconstruction Model for High-fidelity 3D Head Avatars Poster Session 3 & Exhibit Hall
Tobias Kirschstein ⋅ Javier Romero ⋅ Artem Sevastopolsky ⋅ Matthias Nießner ⋅ Shunsuke Saito
Exhibit Hall I #196
Skeleton Motion Words for Unsupervised Skeleton-based Temporal Action Segmentation Poster Session 3 & Exhibit Hall
Uzay Gökay ⋅ Federico Spurio ⋅ Dominik Bach ⋅ Juergen Gall
Exhibit Hall I #197
DH-FaceVid-1K: A Large-Scale High-Quality Dataset for Face Video Generation Poster Session 3 & Exhibit Hall
Donglin Di ⋅ He Feng ⋅ Wenzhang SUN ⋅ Yongjia Ma ⋅ Hao Li ⋅ Chen Wei ⋅ Lei Fan ⋅ Tonghua Su ⋅ Xun Yang
Exhibit Hall I #199
Synthetic Video Enhances Physical Fidelity in Video Synthesis Poster Session 3 & Exhibit Hall
Qi Zhao ⋅ Xingyu Ni ⋅ Ziyu Wang ⋅ Feng Cheng ⋅ Ziyan Yang ⋅ Lu Jiang ⋅ Bohan Wang
Exhibit Hall I #200
TimeBooth: Disentangled Facial Invariant Representation for Diverse and Personalized Face Aging Poster Session 3 & Exhibit Hall
Zepeng Su ⋅ zhulin liu ⋅ Zongyan Zhang ⋅ Tong Zhang ⋅ C.L.Philip Chen
Exhibit Hall I #201
Identity Preserving 3D Head Stylization with Multiview Score Distillation Poster Session 3 & Exhibit Hall
Bahri Batuhan Bilecen ⋅ Ahmet Berke Gokmen ⋅ Furkan Güzelant ⋅ Aysegul Dundar
Exhibit Hall I #203
IDF: Iterative Dynamic Filtering Networks for Generalizable Image Denoising Poster Session 3 & Exhibit Hall
Dongjin Kim ⋅ Jaekyun Ko ⋅ Muhammad Kashif Ali ⋅ Tae Hyun Kim
Exhibit Hall I #204
Who is a Better Talker: Subjective and Objective Quality Assessment for AI-Generated Talking Heads Poster Session 3 & Exhibit Hall
Yingjie Zhou ⋅ Jiezhang Cao ⋅ Zicheng Zhang ⋅ Farong Wen ⋅ Jiang Yanwei ⋅ Jun Jia ⋅ Xiaohong Liu ⋅ Xiongkuo Min ⋅ Guangtao Zhai
Exhibit Hall I #206
Towards Efficient General Feature Prediction in Masked Skeleton Modeling Poster Session 3 & Exhibit Hall
Shengkai Sun ⋅ Zefan Zhang ⋅ Jianfeng Dong ⋅ Zhiyong Cheng ⋅ Xiaojun Chang ⋅ Meng Wang
Exhibit Hall I #207
How Would It Sound? Material-Controlled Multimodal Acoustic Profile Generation for Indoor Scenes Poster Session 3 & Exhibit Hall
Mahnoor Saad ⋅ Ziad Al-Halah
Exhibit Hall I #209
VideoSetDiff: Identifying and Reasoning Similarities and Differences in Similar Videos Poster Session 3 & Exhibit Hall
YUE QIU ⋅ Yanjun Sun ⋅ Takuma Yagi ⋅ Shusaku Egami ⋅ Natsuki Miyata ⋅ Ken Fukuda ⋅ Kensho Hara ⋅ Ryusuke Sagawa
Exhibit Hall I #210
Occlusion-robust Stylization for Drawing-based 3D Animation Poster Session 3 & Exhibit Hall
Sunjae Yoon ⋅ Gwanhyeong Koo ⋅ Younghwan Lee ⋅ Ji Woo Hong ⋅ Chang Yoo
Exhibit Hall I #212
Video Individual Counting for Moving Drones Poster Session 3 & Exhibit Hall
Yaowu Fan ⋅ Jia Wan ⋅ Tao Han ⋅ Antoni Chan ⋅ Jinhua Ma
Exhibit Hall I #214
NAPPure: Adversarial Purification for Robust Image Classification under Non-Additive Perturbations Poster Session 1 & Exhibit Hall
Junjie Nan ⋅ Jianing Li ⋅ Wei Chen ⋅ Mingkun Zhang ⋅ Xueqi Cheng
Exhibit Hall I #205
FaceLift: Learning Generalizable Single Image 3D Face Reconstruction from Synthetic Heads Poster Session 3 & Exhibit Hall
Weijie Lyu ⋅ Yi Zhou ⋅ Ming-Hsuan Yang ⋅ Zhixin Shu
Exhibit Hall I #253
What Changed and What Could Have Changed? State-Change Counterfactuals for Procedure-Aware Video Representation Learning Poster Session 3 & Exhibit Hall
Chi-Hsi Kung ⋅ Frangil Ramirez ⋅ Juhyung Ha ⋅ Yi-Hsuan Tsai ⋅ Yi-Ting Chen ⋅ David Crandall
Exhibit Hall I #215
HADES: Human Avatar with Dynamic Explicit Hair Strands Poster Session 3 & Exhibit Hall
Zhanfeng Liao ⋅ Hanzhang Tu ⋅ Cheng Peng ⋅ Hongwen Zhang ⋅ Boyao Zhou ⋅ Yebin Liu
Exhibit Hall I #217
FlowDPS : Flow-Driven Posterior Sampling for Inverse Problems Poster Session 3 & Exhibit Hall
Jeongsol Kim ⋅ Bryan Sangwoo Kim ⋅ Jong Ye
Exhibit Hall I #218
ZFusion: Efficient Deep Compositional Zero-shot Learning for Blind Image Super-Resolution with Generative Diffusion Prior Poster Session 3 & Exhibit Hall
Alireza Esmaeilzehi ⋅ Hossein Zaredar ⋅ Yapeng Tian ⋅ Laleh Seyyed-Kalantari
Exhibit Hall I #219
Stable Virtual Camera: Generative View Synthesis with Diffusion Models Poster Session 3 & Exhibit Hall
Jensen Zhou ⋅ Hang Gao ⋅ Vikram Voleti ⋅ Aaryaman Vasishta ⋅ Chun-Han Yao ⋅ Mark Boss ⋅ Philip Torr ⋅ Christian Rupprecht ⋅ Varun Jampani
Exhibit Hall I #227
VLIPP: Towards Physically Plausible Video Generation with Vision and Language Informed Physical Prior Poster Session 3 & Exhibit Hall
Xindi Yang ⋅ Baolu Li ⋅ Yiming Zhang ⋅ Zhenfei Yin ⋅ LEI BAI ⋅ Liqian Ma ⋅ Zhiyong Wang ⋅ Jianfei Cai ⋅ Tien-Tsin Wong ⋅ Huchuan Lu ⋅ Xu Jia
Exhibit Hall I #221
StreamDiffusion: A Pipeline-level Solution for Real-Time Interactive Generation Poster Session 3 & Exhibit Hall
Akio Kodaira ⋅ Chenfeng Xu ⋅ Toshiki Hazama ⋅ Takanori Yoshimoto ⋅ Kohei Ohno ⋅ Shogo Mitsuhori ⋅ Soichi Sugano ⋅ Hanying Cho ⋅ Zhijian Liu ⋅ Masayoshi Tomizuka ⋅ Kurt Keutzer
Exhibit Hall I #222
DreamRelation: Relation-Centric Video Customization Poster Session 3 & Exhibit Hall
Yujie Wei ⋅ Shiwei Zhang ⋅ Hangjie Yuan ⋅ Biao Gong ⋅ Longxiang Tang ⋅ Xiang Wang ⋅ Haonan Qiu ⋅ Hengjia Li ⋅ Shuai Tan ⋅ Yingya Zhang ⋅ Hongming Shan
Exhibit Hall I #225
ModSkill: Physical Character Skill Modularization Poster Session 3 & Exhibit Hall
Yiming Huang ⋅ Zhiyang Dou ⋅ Lingjie Liu
Exhibit Hall I #226
Rethinking Bimanual Robotic Manipulation: Learning with Decoupled Interaction Framework Poster Session 3 & Exhibit Hall
Jian-Jian Jiang ⋅ Xiao-Ming Wu ⋅ Yi-Xiang He ⋅ Ling-An Zeng ⋅ Yilin Wei ⋅ Dandan Zhang ⋅ Wei-Shi Zheng
Exhibit Hall I #229
Free-Form Motion Control: Controlling the 6D Poses of Camera and Objects in Video Generation Poster Session 3 & Exhibit Hall
Xincheng Shuai ⋅ Henghui Ding ⋅ Zhenyuan Qin ⋅ Hao Luo ⋅ Xingjun Ma ⋅ Dacheng Tao
Exhibit Hall I #231
Learning A Unified Template for Gait Recognition Poster Session 3 & Exhibit Hall
Panjian Huang ⋅ Saihui Hou ⋅ Junzhou Huang ⋅ Yongzhen Huang
Exhibit Hall I #232
Synchronization of Multiple Videos Poster Session 3 & Exhibit Hall
Avihai Naaman ⋅ Ron Shapira Weber ⋅ Oren Freifeld
Exhibit Hall I #237
DeepShield: Fortifying Deepfake Video Detection with Local and Global Forgery Analysis Poster Session 3 & Exhibit Hall
Yinqi Cai ⋅ Jichang Li ⋅ Zhaolun Li ⋅ Weikai Chen ⋅ Rushi Lan ⋅ Xi Xie ⋅ Xiaonan Luo ⋅ Guanbin Li
Exhibit Hall I #238
VertexRegen: Mesh Generation with Continuous Level of Detail Poster Session 3 & Exhibit Hall
Xiang Zhang ⋅ Yawar Siddiqui ⋅ Armen Avetisyan ⋅ Christopher Xie ⋅ Jakob Engel ⋅ Henry Howard-Jenkins
Exhibit Hall I #242
GestureHYDRA: Semantic Co-speech Gesture Synthesis via Hybrid Modality Diffusion Transformer and Cascaded-Synchronized Retrieval-Augmented Generation Poster Session 3 & Exhibit Hall
Quanwei Yang ⋅ Luying Huang ⋅ Kaisiyuan Wang ⋅ Jiazhi Guan ⋅ Shengyi He ⋅ Fengguo Li ⋅ Hang Zhou ⋅ Lingyun Yu ⋅ Yingying Li ⋅ Haocheng Feng ⋅ Hongtao Xie
Exhibit Hall I #246
FoundIR: Unleashing Million-scale Training Data to Advance Foundation Models for Image Restoration Poster Session 3 & Exhibit Hall
Hao Li ⋅ Xiang Chen ⋅ Jiangxin Dong ⋅ Jinhui Tang ⋅ Jinshan Pan
Exhibit Hall I #247
Highlight What You Want: Weakly-Supervised Instance-Level Controllable Infrared-Visible Image Fusion Poster Session 3 & Exhibit Hall
Zeyu Wang ⋅ Jizheng Zhang ⋅ Haiyu Song ⋅ Mingyu Ge ⋅ Jiayu Wang ⋅ Haoran Duan
Exhibit Hall I #248
Adaptive Hyper-Graph Convolution Network for Skeleton-based Human Action Recognition with Virtual Connections Poster Session 3 & Exhibit Hall
Youwei Zhou ⋅ Tianyang Xu ⋅ Cong Wu ⋅ Xiaojun Wu ⋅ Josef Kittler
Exhibit Hall I #249
Precise Action-to-Video Generation Through Visual Action Prompts Poster Session 3 & Exhibit Hall
Yuang Wang ⋅ Chao Wen ⋅ Haoyu Guo ⋅ Sida Peng ⋅ Minghan Qin ⋅ Hujun Bao ⋅ Ruizhen Hu ⋅ Xiaowei Zhou
Exhibit Hall I #255
PRIMAL: Physically Reactive and Interactive Motor Model for Avatar Learning Poster Session 3 & Exhibit Hall
Yan Zhang ⋅ Yao Feng ⋅ Alpár Cseke ⋅ Nitin Saini ⋅ Nathan Bajandas ⋅ Nicolas Heron ⋅ Michael Black
Exhibit Hall I #256
Bridging the Skeleton-Text Modality Gap: Diffusion-Powered Modality Alignment for Zero-shot Skeleton-based Action Recognition Poster Session 3 & Exhibit Hall
Jeonghyeok Do ⋅ Munchurl Kim
Exhibit Hall I #259
Visual Chronicles: Using Multimodal LLMs to Analyze Massive Collections of Images Poster Session 3 & Exhibit Hall
Boyang Deng ⋅ Kyle Genova ⋅ Songyou Peng ⋅ Gordon Wetzstein ⋅ Noah Snavely ⋅ Leonidas Guibas ⋅ Thomas Funkhouser
Exhibit Hall I #260
Latent-Reframe: Enabling Camera Control for Video Diffusion Models without Training Poster Session 3 & Exhibit Hall
Zhenghong Zhou ⋅ Jie An ⋅ Jiebo Luo
Exhibit Hall I #261
GeoAvatar: Adaptive Geometrical Gaussian Splatting for 3D Head Avatar Poster Session 3 & Exhibit Hall
SeungJun Moon ⋅ Hah Min Lew ⋅ Seungeun Lee ⋅ Ji-Su Kang ⋅ Gyeong-Moon Park
Exhibit Hall I #264
Image Intrinsic Scale Assessment: Bridging the Gap Between Quality and Resolution Poster Session 3 & Exhibit Hall
Vlad Hosu ⋅ Lorenzo Agnolucci ⋅ Daisuke Iso ⋅ Dietmar Saupe
Exhibit Hall I #269
Frequency-Guided Posterior Sampling for Diffusion-Based Image Restoration Poster Session 3 & Exhibit Hall
Darshan Thaker ⋅ Abhishek Goyal ⋅ Rene Vidal
Exhibit Hall I #270
GAS: Generative Avatar Synthesis from a Single Image Poster Session 3 & Exhibit Hall
Yixing Lu ⋅ Junting Dong ⋅ YoungJoong Kwon ⋅ Qin Zhao ⋅ Bo Dai ⋅ Fernando De la Torre
Exhibit Hall I #271
Less Static, More Private: Towards Transferable Privacy-Preserving Action Recognition by Generative Decoupled Learning Poster Session 3 & Exhibit Hall
Zhi-Wei Xia ⋅ Kun-Yu Lin ⋅ Yuan-Ming Li ⋅ Wei-Jin Huang ⋅ Xian-Tuo Tan ⋅ Wei-Shi Zheng
Exhibit Hall I #272
Bitrate-Controlled Diffusion for Disentangling Motion and Content in Video Poster Session 3 & Exhibit Hall
Xiao Li ⋅ Qi Chen ⋅ Xiulian Peng ⋅ Kai Yu ⋅ Xie Chen ⋅ Yan Lu
Exhibit Hall I #273
Blind2Sound: Self-Supervised Image Denoising without Residual Noise Poster Session 3 & Exhibit Hall
Jiazheng Liu ⋅ Zejin Wang ⋅ Bohao Chen ⋅ Hua Han
Exhibit Hall I #276
Unified Multimodal Understanding via Byte-Pair Visual Encoding Poster Session 3 & Exhibit Hall
Wanpeng Zhang ⋅ Yicheng Feng ⋅ Hao Luo ⋅ Yijiang Li ⋅ Zihao Yue ⋅ Sipeng Zheng ⋅ Zongqing Lu
Exhibit Hall I #280
IMoRe: Implicit Program-Guided Reasoning for Human Motion Q&A Poster Session 3 & Exhibit Hall
Chen Li ⋅ Chinthani Sugandhika ⋅ Ee Yeo Keat ⋅ Eric Peh ⋅ Hao Zhang ⋅ HONG YANG ⋅ Deepu Rajan ⋅ Basura Fernando
Exhibit Hall I #281
Privacy-centric Deep Motion Retargeting for Anonymization of Skeleton-Based Motion Visualization Poster Session 3 & Exhibit Hall
Thomas Carr ⋅ Depeng Xu ⋅ Shuhan Yuan ⋅ Aidong Lu
Exhibit Hall I #297
AdaDCP: Learning an Adapter with Discrete Cosine Prior for Clear-to-Adverse Domain Generalization Poster Session 3 & Exhibit Hall
Qi Bi ⋅ Yixian Shen ⋅ Jingjun Yi ⋅ Gui-Song Xia
Exhibit Hall I #282
MorphoGen: Efficient Unconditional Generation of Long-Range Projection Neuronal Morphology via a Global-to-Local Framework Poster Session 3 & Exhibit Hall
Tianfang Zhu ⋅ Hongyang Zhou ⋅ Anan LI
Exhibit Hall I #284
GaussianSpeech: Audio-Driven Personalized 3D Gaussian Avatars Poster Session 3 & Exhibit Hall
Shivangi Aneja ⋅ Artem Sevastopolsky ⋅ Tobias Kirschstein ⋅ Justus Thies ⋅ Angela Dai ⋅ Matthias Nießner
Exhibit Hall I #288
A Quality-Guided Mixture of Score-Fusion Experts Framework for Human Recognition Poster Session 3 & Exhibit Hall
Jie Zhu ⋅ Yiyang Su ⋅ Minchul Kim ⋅ Anil Jain ⋅ Xiaoming Liu
Exhibit Hall I #289
Capturing head avatar with hand contacts from a monocular video Poster Session 3 & Exhibit Hall
Haonan He ⋅ Yufeng Zheng ⋅ Jie Song
Exhibit Hall I #291
Tiling artifacts and trade-offs of feature normalization in the segmentation of large biological images Poster Session 3 & Exhibit Hall
Elena Buglakova ⋅ Anwai Archit ⋅ Edoardo D'Imprima ⋅ Julia Mahamid ⋅ Constantin Pape ⋅ Anna Kreshuk
Exhibit Hall I #292
GenM3: Generative Pretrained Multi-path Motion Model for Text Conditional Human Motion Generation Poster Session 3 & Exhibit Hall
Junyu Shi ⋅ Lijiang LIU ⋅ Yong Sun ⋅ Zhiyuan Zhang ⋅ JINNI ZHOU ⋅ Qiang Nie
Exhibit Hall I #294
Saliency-Aware Quantized Imitation Learning for Efficient Robotic Control Poster Session 3 & Exhibit Hall
Seongmin Park ⋅ Hyungmin Kim ⋅ Sangwoo kim ⋅ Wonseok Jeon ⋅ Juyoung Yang ⋅ Byeongwook Jeon ⋅ Yoonseon Oh ⋅ Jungwook Choi
Exhibit Hall I #295
MAVFlow: Preserving Paralinguistic Elements with Conditional Flow Matching for Zero-Shot AV2AV Multilingual Translation Poster Session 3 & Exhibit Hall
Sungwoo Cho ⋅ Jeongsoo Choi ⋅ Sungnyun Kim ⋅ Se-Young Yun
Exhibit Hall I #296
UniPhys: Unified Planner and Controller with Diffusion for Flexible Physics-Based Character Control Poster Session 3 & Exhibit Hall
Yan Wu ⋅ Korrawe Karunratanakul ⋅ Zhengyi Luo ⋅ Siyu Tang
Exhibit Hall I #304
UniRes: Universal Image Restoration for Complex Degradations Poster Session 3 & Exhibit Hall
Mo Zhou ⋅ Keren Ye ⋅ Mauricio Delbracio ⋅ Peyman Milanfar ⋅ Vishal Patel ⋅ Hossein Talebi
Exhibit Hall I #306
SV4D 2.0: Enhancing Spatio-Temporal Consistency in Multi-View Video Diffusion for High-Quality 4D Generation Poster Session 3 & Exhibit Hall
Chun-Han Yao ⋅ Yiming Xie ⋅ Vikram Voleti ⋅ Huaizu Jiang ⋅ Varun Jampani
Exhibit Hall I #307
Light-A-Video: Training-free Video Relighting via Progressive Light Fusion Poster Session 3 & Exhibit Hall
Yujie Zhou ⋅ Jiazi Bu ⋅ Pengyang Ling ⋅ Pan Zhang ⋅ Tong Wu ⋅ Qidong Huang ⋅ Jinsong Li ⋅ Xiaoyi Dong ⋅ Yuhang Zang ⋅ Yuhang Cao ⋅ Anyi Rao ⋅ Jiaqi Wang ⋅ Li Niu
Exhibit Hall I #313
Group-wise Scaling and Orthogonal Decomposition for Domain-Invariant Feature Extraction in Face Anti-Spoofing Poster Session 3 & Exhibit Hall
Seungjin Jung ⋅ Kanghee Lee ⋅ Yonghyun Jeong ⋅ Haeun Noh ⋅ Jungmin Lee ⋅ Jongwon Choi
Exhibit Hall I #318
SVG-Head: Hybrid Surface-Volumetric Gaussians for High-Fidelity Head Reconstruction and Real-Time Editing Poster Session 3 & Exhibit Hall
Heyi Sun ⋅ Cong Wang ⋅ Tian-Xing Xu ⋅ Jingwei Huang ⋅ Di Kang ⋅ Chunchao Guo ⋅ Song-Hai Zhang
Exhibit Hall I #314
Go to Zero: Towards Zero-shot Motion Generation with Million-scale Data Poster Session 3 & Exhibit Hall
Ke Fan ⋅ Shunlin Lu ⋅ Minyue Dai ⋅ Runyi Yu ⋅ Lixing Xiao ⋅ Zhiyang Dou ⋅ Junting Dong ⋅ Lizhuang Ma ⋅ Jingbo Wang
Exhibit Hall I #315
StyleMotif: Multi-Modal Motion Stylization using Style-Content Cross Fusion Poster Session 3 & Exhibit Hall
Ziyu Guo ⋅ Young-Yoon Lee ⋅ Joseph Liu ⋅ Yizhak Ben-Shabat ⋅ Victor Zordan ⋅ Mubbasir Kapadia
Exhibit Hall I #316
I2V3D: Controllable Image-to-video Generation with 3D Guidance Poster Session 3 & Exhibit Hall
Zhiyuan Zhang ⋅ Dongdong Chen ⋅ Jing Liao
Exhibit Hall I #317
FakeRadar: Probing Forgery Outliers to Detect Unknown Deepfake Videos Poster Session 3 & Exhibit Hall
Zhaolun Li ⋅ Jichang Li ⋅ Yinqi Cai ⋅ Junye Chen ⋅ Xiaonan Luo ⋅ Guanbin Li ⋅ Rushi Lan
Exhibit Hall I #319
CameraCtrl II: Dynamic Scene Exploration via Camera-controlled Video Diffusion Models Poster Session 3 & Exhibit Hall
Hao He ⋅ Ceyuan Yang ⋅ Shanchuan Lin ⋅ Yinghao Xu ⋅ Meng Wei ⋅ Liangke Gui ⋅ Qi Zhao ⋅ Gordon Wetzstein ⋅ Lu Jiang ⋅ Hongsheng Li
Exhibit Hall I #322
DynamicFace: High-Quality and Consistent Face Swapping for Image and Video using Composable 3D Facial Priors Poster Session 3 & Exhibit Hall
Runqi Wang ⋅ Yang Chen ⋅ Sijie Xu ⋅ Tianyao He ⋅ Wei Zhu ⋅ Dejia Song ⋅ Nemo Chen ⋅ Xu Tang ⋅ Yao Hu
Exhibit Hall I #324
CARP: Visuomotor Policy Learning via Coarse-to-Fine Autoregressive Prediction Poster Session 3 & Exhibit Hall
Zhefei Gong ⋅ Pengxiang Ding ⋅ Shangke Lyu ⋅ Siteng Huang ⋅ Mingyang Sun ⋅ Wei Zhao ⋅ Zhaoxin Fan ⋅ Donglin Wang
Exhibit Hall I #326
AdaHuman: Animatable Detailed 3D Human Generation with Compositional Multiview Diffusion Poster Session 3 & Exhibit Hall
Yangyi Huang ⋅ Ye Yuan ⋅ Xueting Li ⋅ Jan Kautz ⋅ Umar Iqbal
Exhibit Hall I #333
Trokens: Semantic-Aware Relational Trajectory Tokens for Few-Shot Action Recognition Poster Session 3 & Exhibit Hall
Pulkit Kumar ⋅ Shuaiyi Huang ⋅ Matthew Walmer ⋅ Sai Saketh Rambhatla ⋅ Abhinav Shrivastava
Exhibit Hall I #334
Controllable Weather Synthesis and Removal with Video Diffusion Models Poster Session 3 & Exhibit Hall
Chih-Hao Lin ⋅ Zian Wang ⋅ Ruofan Liang ⋅ Yuxuan Zhang ⋅ Sanja Fidler ⋅ Shenlong Wang ⋅ Zan Gojcic
Exhibit Hall I #337
Sequential Gaussian Avatars with Hierarchical Motion Context Poster Session 3 & Exhibit Hall
Wangze Xu ⋅ Yifan Zhan ⋅ Zhihang Zhong ⋅ Xiao Sun
Exhibit Hall I #338
TokenUnify: Scaling Up Autoregressive Pretraining for Neuron Segmentation Poster Session 3 & Exhibit Hall
Yinda Chen ⋅ Haoyuan Shi ⋅ Xiaoyu Liu ⋅ Te Shi ⋅ Ruobing Zhang ⋅ Dong Liu ⋅ Zhiwei Xiong ⋅ Feng Wu
Exhibit Hall I #339
SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree Poster Session 3 & Exhibit Hall
Shuangrui Ding ⋅ Rui Qian ⋅ Xiaoyi Dong ⋅ Pan Zhang ⋅ Yuhang Zang ⋅ Yuhang Cao ⋅ Yuwei Guo ⋅ Dahua Lin ⋅ Jiaqi Wang
Exhibit Hall I #340
T2Bs: Text-to-Character Blendshapes via Video Generation Poster Session 3 & Exhibit Hall
Jiahao Luo ⋅ Chaoyang Wang ⋅ Michael Vasilkovsky ⋅ Vladislav Shakhrai ⋅ Di Liu ⋅ Peiye Zhuang ⋅ Sergey Tulyakov ⋅ Peter Wonka ⋅ Hsin-Ying Lee ⋅ James Davis ⋅ Jian Wang
Exhibit Hall I #341
Unfolding-Associative Encoder-Decoder Network with Progressive Alignment for Pansharpening Poster Session 3 & Exhibit Hall
Shijie Fang ⋅ Hongping Gan
Exhibit Hall I #343
MOERL: When Mixture-of-Experts Meet Reinforcement Learning for Adverse Weather Image Restoration Poster Session 3 & Exhibit Hall
Tao Wang ⋅ Peiwen Xia ⋅ Bo Li ⋅ Peng-Tao Jiang ⋅ Zhe Kong ⋅ Kaihao Zhang ⋅ Tong Lu ⋅ Wenhan Luo
Exhibit Hall I #345
DimensionX: Create Any 3D and 4D Scenes from a Single Image with Decoupled Video Diffusion Poster Session 3 & Exhibit Hall
Wenqiang Sun ⋅ Shuo Chen ⋅ Fangfu Liu ⋅ Zilong Chen ⋅ Yueqi Duan ⋅ Jun Zhu ⋅ Jun Zhang ⋅ Yikai Wang
Exhibit Hall I #347
LOMM: Latest Object Memory Management for Temporally Consistent Video Instance Segmentation Poster Session 3 & Exhibit Hall
Seunghun Lee ⋅ Jiwan Seo ⋅ Minwoo Choi ⋅ Kiljoon Han ⋅ Jaehoon Jeong ⋅ Zane Durante ⋅ Ehsan Adeli ⋅ Sang Hyun Park ⋅ Sunghoon Im
Exhibit Hall I #349
VoluMe – Authentic 3D Video Calls from Live Gaussian Splat Prediction Poster Session 3 & Exhibit Hall
Martin de La Gorce ⋅ Charlie Hewitt ⋅ Tibor Takács ⋅ Robert Gerdisch ⋅ Zafiirah Hosenie ⋅ Givi Meishvili ⋅ Marek Kowalski ⋅ Thomas J. Cashman ⋅ Antonio Criminisi
Exhibit Hall I #355
EVDM: Event-based Real-world Video Deblurring with Mamba Poster Session 3 & Exhibit Hall
Zhijing Sun ⋅ Senyan Xu ⋅ Kean Liu ⋅ Runze Tian ⋅ Xueyang Fu ⋅ Zheng-Jun Zha
Exhibit Hall I #356
iManip: Skill-Incremental Learning for Robotic Manipulation Poster Session 3 & Exhibit Hall
Zexin Zheng ⋅ Jia-Feng Cai ⋅ Xiao-Ming Wu ⋅ Yilin Wei ⋅ Yu-Ming Tang ⋅ Wei-Shi Zheng ⋅ Ancong Wu
Exhibit Hall I #365
Q-Norm: Robust Representation Learning via Quality-Adaptive Normalization Poster Session 3 & Exhibit Hall
Lanning Zhang ⋅ Ying Zhou ⋅ Fei Gao ⋅ Ziyun Li ⋅ Maoying Qiao ⋅ Jinlan Xu ⋅ Nannan Wang
Exhibit Hall I #366
Proxy-Bridged Game Transformer for Interactive Extreme Motion Prediction Poster Session 3 & Exhibit Hall
Yanwen Fang ⋅ Wenqi Jia ⋅ Xu Cao ⋅ Peng-Tao Jiang ⋅ Guodong Li ⋅ Jintai CHEN
Exhibit Hall I #367
MeshAnything V2: Artist-Created Mesh Generation with Adjacent Mesh Tokenization Poster Session 3 & Exhibit Hall
Yiwen Chen ⋅ Yikai Wang ⋅ Yihao Luo ⋅ Zhengyi Wang ⋅ Zilong Chen ⋅ Jun Zhu ⋅ Chi Zhang ⋅ Guosheng Lin
Exhibit Hall I #368
π-AVAS: Can Physics-Integrated Audio-Visual Modeling Boost Neural Acoustic Synthesis? Poster Session 3 & Exhibit Hall
Susan Liang ⋅ Chao Huang ⋅ Yolo Yunlong Tang ⋅ Zeliang Zhang ⋅ Chenliang Xu
Exhibit Hall I #370
SemGes: Semantics-aware Co-Speech Gesture Generation using Semantic Coherence and Relevance Learning Poster Session 3 & Exhibit Hall
Lanmiao Liu ⋅ Esam Ghaleb ⋅ asli ozyurek ⋅ Zerrin Yumak
Exhibit Hall I #372
Metric Convolutions: A Unifying Theory to Adaptive Image Convolutions Poster Session 3 & Exhibit Hall
Thomas Dagès ⋅ Michael Lindenbaum ⋅ Alfred Bruckstein
Exhibit Hall I #373
RobAVA: A Large-scale Dataset and Baseline Towards Video based Robotic Arm Action Understanding Poster Session 3 & Exhibit Hall
Baoli Sun ⋅ Ning Wang ⋅ Xinzhu Ma ⋅ Anqi Zou ⋅ Lu Yihang ⋅ Chuixuan Fan ⋅ Zhihui Wang ⋅ Kun Lu ⋅ Zhiyong Wang
Exhibit Hall I #374
IDFace: Face Template Protection for Efficient and Secure Identification Poster Session 3 & Exhibit Hall
Sunpill Kim ⋅ Seunghun Paik ⋅ Chanwoo Hwang ⋅ Dongsoo Kim ⋅ Junbum Shin ⋅ Jae Hong Seo
Exhibit Hall I #375
Not All Degradations Are Equal: A Targeted Feature Denoising Framework for Generalizable Image Super-Resolution Poster Session 3 & Exhibit Hall
hongjun wang ⋅ Jiyuan Chen ⋅ Zhengwei Yin ⋅ Xuan Song ⋅ Yinqiang Zheng
Exhibit Hall I #391
I2VControl: Disentangled and Unified Video Motion Synthesis Control Poster Session 3 & Exhibit Hall
Wanquan Feng ⋅ Tianhao Qi ⋅ Jiawei Liu ⋅ Mingzhen Sun ⋅ Pengqi Tu ⋅ Tianxiang Ma ⋅ Fei Dai ⋅ Songtao Zhao ⋅ SiYu Zhou ⋅ Qian HE
Exhibit Hall I #382
MeshLLM: Empowering Large Language Models to Progressively Understand and Generate 3D Mesh Poster Session 3 & Exhibit Hall
Shuangkang Fang ⋅ I-Chao Shen ⋅ Yufeng Wang ⋅ Yi-Hsuan Tsai ⋅ Yi Yang ⋅ Shuchang Zhou ⋅ Wenrui Ding ⋅ Takeo Igarashi ⋅ Ming-Hsuan Yang
Exhibit Hall I #383
On-Device Diffusion Transformer Policy for Efficient Robot Manipulation Poster Session 3 & Exhibit Hall
Yiming Wu ⋅ Huan Wang ⋅ Zhenghao Chen ⋅ Jianxin Pang ⋅ Dong Xu
Exhibit Hall I #384
Generic Event Boundary Detection via Denoising Diffusion Poster Session 3 & Exhibit Hall
Jaejun Hwang ⋅ Dayoung Gong ⋅ Manjin Kim ⋅ Minsu Cho
Exhibit Hall I #385
LiT: Delving into a Simple Linear Diffusion Transformer for Image Generation Poster Session 4 & Exhibit Hall with Coffee Break
Jiahao Wang ⋅ Ning Kang ⋅ Lewei Yao ⋅ Mengzhao Chen ⋅ Chengyue Wu ⋅ Songyang Zhang ⋅ Shuchen Xue ⋅ Yong Liu ⋅ Taiqiang Wu ⋅ Xihui Liu ⋅ Kaipeng Zhang ⋅ Shifeng Zhang ⋅ Wenqi Shao ⋅ Zhenguo Li ⋅ Ping Luo
Exhibit Hall I #112
SHeaP: Self-supervised Head Geometry Predictor Learned via 2D Gaussians Poster Session 3 & Exhibit Hall
Liam Schoneveld ⋅ Zhe Chen ⋅ Davide Davoli ⋅ Jiapeng Tang ⋅ Saimon Terazawa ⋅ Ko Nishino ⋅ Matthias Nießner
Exhibit Hall I #392
TARO: Timestep-Adaptive Representation Alignment with Onset-Aware Conditioning for Synchronized Video-to-Audio Synthesis Poster Session 3 & Exhibit Hall
Tri Ton ⋅ Ji Woo Hong ⋅ Chang Yoo
Exhibit Hall I #398
DexVLG: Dexterous Vision-Language-Grasp Model at Scale Poster Session 3 & Exhibit Hall
Jiawei He ⋅ Danshi Li ⋅ Xinqiang Yu ⋅ Zekun Qi ⋅ Wenyao Zhang ⋅ Jiayi Chen ⋅ Zhaoxiang Zhang ⋅ Zhizheng Zhang ⋅ Li Yi ⋅ He Wang
Exhibit Hall I #400
Towards Explicit Exoskeleton for the Reconstruction of Complicated 3D Human Avatars Poster Session 3 & Exhibit Hall
Yifan Zhan ⋅ Qingtian Zhu ⋅ Muyao Niu ⋅ Mingze Ma ⋅ Jiancheng Zhao ⋅ Zhihang Zhong ⋅ Xiao Sun ⋅ Yu Qiao ⋅ Yinqiang Zheng
Exhibit Hall I #401
Fine-Grained 3D Gaussian Head Avatars Modeling from Static Captures via Joint Reconstruction and Registration Poster Session 3 & Exhibit Hall
Yuan Sun ⋅ Xuan Wang ⋅ Cong Wang ⋅ WeiLi Zhang ⋅ Yanbo Fan ⋅ Yu Guo ⋅ Fei Wang
Exhibit Hall I #404
IM-LUT: Interpolation Mixing Look-Up Tables for Image Super-Resolution Poster Session 3 & Exhibit Hall
Sejin Park ⋅ Sangmin Lee ⋅ Kyong Hwan Jin ⋅ Seung-Won Jung
Exhibit Hall I #406
Attention to Trajectory: Trajectory-Aware Open-Vocabulary Tracking Poster Session 3 & Exhibit Hall
Yunhao Li ⋅ Yifan Jiao ⋅ Dan Meng ⋅ Heng Fan ⋅ Libo Zhang
Exhibit Hall I #413
UniPortrait: A Unified Framework for Identity-Preserving Single- and Multi-Human Image Personalization Poster Session 3 & Exhibit Hall
Junjie He ⋅ Yifeng Geng ⋅ Liefeng Bo
Exhibit Hall I #414
Music-Aligned Holistic 3D Dance Generation via Hierarchical Motion Modeling Poster Session 3 & Exhibit Hall
LI XIAOJIE ⋅ Ronghui Li ⋅ Shukai Fang ⋅ Shuzhao Xie ⋅ Xiaoyang Guo ⋅ Jiaqing Zhou ⋅ Junkun Peng ⋅ Zhi Wang
Exhibit Hall I #416
NoiseController: Towards Consistent Multi-view Video Generation via Noise Decomposition and Collaboration Poster Session 3 & Exhibit Hall
Haotian Dong ⋅ Xin WANG ⋅ Di Lin ⋅ Yipeng Wu ⋅ Qin Chen ⋅ Ruonan Liu ⋅ Kairui Yang ⋅ Ping Li ⋅ Qing Guo
Exhibit Hall I #418
FED-PsyAU: Privacy-Preserving Micro-Expression Recognition via Psychological AU Coordination and Dynamic Facial Motion Modeling Poster Session 3 & Exhibit Hall
Jingting Li ⋅ Yu Qian ⋅ Lin Zhao ⋅ Su-Jing Wang
Exhibit Hall I #419
PUMPS: Skeleton-Agnostic Point-based Universal Motion Pre-Training for Synthesis in Human Motion Tasks Poster Session 3 & Exhibit Hall
Clinton A Mo ⋅ Kun Hu ⋅ Chengjiang Long ⋅ Dong Yuan ⋅ Wan-Chi Siu ⋅ Zhiyong Wang
Exhibit Hall I #423
MistSense: Versatile Online Detection of Procedural and Execution Mistakes Poster Session 3 & Exhibit Hall
Constantin Patsch ⋅ Yuankai Wu ⋅ Marsil Zakour ⋅ Driton Salihu ⋅ Eckehard Steinbach
Exhibit Hall I #426
SEREP: Semantic Facial Expression Representation for Robust In-the-Wild Capture and Retargeting Poster Session 3 & Exhibit Hall
Arthur Josi ⋅ Luiz Gustavo Hafemann ⋅ Abdallah Dib ⋅ Emeline Got ⋅ Rafael M. O. Cruz ⋅ Marc-André Carbonneau
Exhibit Hall I #427
LUT-Fuse: Towards Extremely Fast Infrared and Visible Image Fusion via Distillation to Learnable Look-Up Tables Poster Session 3 & Exhibit Hall
Xunpeng Yi ⋅ yibing zhang ⋅ Xinyu Xiang ⋅ Qinglong Yan ⋅ Han Xu ⋅ Jiayi Ma
Exhibit Hall I #429
Morph: A Motion-free Physics Optimization Framework for Human Motion Generation Poster Session 3 & Exhibit Hall
Zhuo Li ⋅ Mingshuang Luo ⋅ RuiBing Hou ⋅ XIN ZHAO ⋅ Hao Liu ⋅ Hong Chang ⋅ Zimo Liu ⋅ Chen Li
Exhibit Hall I #431
DisCoRD: Discrete Tokens to Continuous Motion via Rectified Flow Decoding Poster Session 3 & Exhibit Hall
Jungbin Cho ⋅ Junwan Kim ⋅ Jisoo Kim ⋅ Minseo Kim ⋅ Mingu Kang ⋅ Sungeun Hong ⋅ Tae-Hyun Oh ⋅ Youngjae Yu
Exhibit Hall I #433
MixANT: Observation-dependent Memory Propagation for Stochastic Dense Action Anticipation Poster Session 3 & Exhibit Hall
Syed Talal Wasim ⋅ Hamid Suleman ⋅ Olga Zatsarynna ⋅ Muzammal Naseer ⋅ Juergen Gall
Exhibit Hall I #434
VoiceCraft-Dub: Automated Video Dubbing with Neural Codec Language Models Poster Session 3 & Exhibit Hall
Kim Sung-Bin ⋅ Jeongsoo Choi ⋅ Puyuan Peng ⋅ Joon Chung Chung ⋅ Tae-Hyun Oh ⋅ David Harwath
Exhibit Hall I #435
DeSPITE: Exploring Contrastive Deep Skeleton-Pointcloud-IMU-Text Embeddings for Advanced Point Cloud Human Activity Understanding Poster Session 3 & Exhibit Hall
Thomas Kreutz ⋅ Max Mühlhäuser ⋅ Alejandro Sanchez Guinea
Exhibit Hall I #436
DISTA-Net: Dynamic Closely-Spaced Infrared Small Target Unmixing Poster Session 3 & Exhibit Hall
Shengdong Han ⋅ Shangdong Yang ⋅ Yuxuan Li ⋅ Xin Zhang ⋅ Xiang Li ⋅ jian Yang ⋅ Ming-Ming Cheng ⋅ Yimian Dai
Exhibit Hall I #438
FLOAT: Generative Motion Latent Flow Matching for Audio-driven Talking Portrait Poster Session 3 & Exhibit Hall
Taekyung Ki ⋅ Dongchan Min ⋅ Gyeongsu Chae
Exhibit Hall I #442
VSRM: A Robust Mamba-Based Framework for Video Super-Resolution Poster Session 3 & Exhibit Hall
Phu Tran Dinh ⋅ Hung Dao ⋅ Daeyoung Kim
Exhibit Hall I #443
2HandedAfforder: Learning Precise Actionable Bimanual Affordances from Human Videos Poster Session 3 & Exhibit Hall
Marvin Heidinger ⋅ Snehal Jauhri ⋅ Vignesh Prasad ⋅ Georgia Chalvatzaki
Exhibit Hall I #446
AnimalClue: Recognizing Animals by their Traces Poster Session 3 & Exhibit Hall
Risa Shinoda ⋅ Nakamasa Inoue ⋅ Iro Laina ⋅ Christian Rupprecht ⋅ Hirokatsu Kataoka
Exhibit Hall I #451
TIP-I2V: A Million-Scale Real Text and Image Prompt Dataset for Image-to-Video Generation Poster Session 4 & Exhibit Hall with Coffee Break
Wenhao Wang ⋅ Yi Yang
Exhibit Hall I #1
SCFlow: Implicitly Learning Style and Content Disentanglement with Flow Models Poster Session 4 & Exhibit Hall with Coffee Break
Pingchuan Ma ⋅ Xiaopei Yang ⋅ Ming Gui ⋅ Yusong Li ⋅ Felix Krause ⋅ Johannes Schusterbauer ⋅ Björn Ommer
Exhibit Hall I #3
OminiControl: Minimal and Universal Control for Diffusion Transformer Poster Session 4 & Exhibit Hall with Coffee Break
Zhenxiong Tan ⋅ Songhua Liu ⋅ Xingyi Yang ⋅ Qiaochu Xue ⋅ Xinchao Wang
Exhibit Hall I #5
Penalizing Boundary Activation for Object Completeness in Diffusion Models Poster Session 4 & Exhibit Hall with Coffee Break
Haoyang Xu ⋅ Tianhao Zhao ⋅ Sibei Yang ⋅ Yutian Lin
Exhibit Hall I #7
RayZer: A Self-supervised Large View Synthesis Model Poster Session 2 & Exhibit Hall with Coffee Break
Hanwen Jiang ⋅ Hao Tan ⋅ Peng Wang ⋅ Haian Jin ⋅ Yue Zhao ⋅ Sai Bi ⋅ Kai Zhang ⋅ Fujun Luan ⋅ Kalyan Sunkavalli ⋅ Qixing Huang ⋅ Georgios Pavlakos
Exhibit Hall I #74
MatchDiffusion: Training-free Generation of Match-Cuts Poster Session 4 & Exhibit Hall with Coffee Break
Alejandro Pardo ⋅ Fabio Pizzati ⋅ Tong Zhang ⋅ Alexander Pondaven ⋅ Philip Torr ⋅ Juan Perez ⋅ Bernard Ghanem
Exhibit Hall I #8
Dual-Expert Consistency Model for Efficient and High-Quality Video Generation Poster Session 4 & Exhibit Hall with Coffee Break
Zhengyao Lyu ⋅ Chenyang Si ⋅ Tianlin Pan ⋅ Zhaoxi Chen ⋅ Kwan-Yee K. Wong ⋅ Yu Qiao ⋅ Ziwei Liu
Exhibit Hall I #9
Straighten Viscous Rectified Flow via Noise Optimization Poster Session 4 & Exhibit Hall with Coffee Break
Jimin Dai ⋅ Jiexi Yan ⋅ Jian Yang ⋅ lei luo
Exhibit Hall I #11
Scalable Dual Fingerprinting for Hierarchical Attribution of Text-to-Image Models Poster Session 4 & Exhibit Hall with Coffee Break
Jianwei Fei ⋅ Yunshu Dai ⋅ Peipeng Yu ⋅ Zhe Kong ⋅ Jiantao Zhou ⋅ Zhihua Xia
Exhibit Hall I #13
QuantCache: Adaptive Importance-Guided Quantization with Hierarchical Latent and Layer Caching for Video Generation Poster Session 4 & Exhibit Hall with Coffee Break
Junyi Wu ⋅ Zhiteng Li ⋅ Zheng Hui ⋅ YULUN ZHANG ⋅ Linghe Kong ⋅ Xiaokang Yang
Exhibit Hall I #14
CRAM: Large Scale Video Continual Learning with Bootstrapped Compression Poster Session 4 & Exhibit Hall with Coffee Break
Shivani Mall ⋅ Joao F. Henriques
Exhibit Hall I #15
Tree-NeRV: Efficient Non-Uniform Sampling for Neural Video Representation via Tree-Structured Feature Grids Poster Session 4 & Exhibit Hall with Coffee Break
Jiancheng Zhao ⋅ Yifan Zhan ⋅ Qingtian Zhu ⋅ Mingze Ma ⋅ Muyao Niu ⋅ Zunian Wan ⋅ Xiang Ji ⋅ Yinqiang Zheng
Exhibit Hall I #18
MaTe: Images Are All You Need for Material Transfer via Diffusion Transformer Poster Session 4 & Exhibit Hall with Coffee Break
Nisha Huang ⋅ Henglin Liu ⋅ Yizhou Lin ⋅ Kaer Huang ⋅ Chubin Chen ⋅ Jie Guo ⋅ Tong-Yee Lee ⋅ Xiu Li
Exhibit Hall I #22
ForCenNet: Foreground-Centric Network for Document Image Rectification Poster Session 4 & Exhibit Hall with Coffee Break
Peng Cai ⋅ liqiang liqiang ⋅ Kaicheng Yang ⋅ guodong guodong ⋅ lijia lijia ⋅ zhounan zhounan ⋅ Xiang An ⋅ Ninghua Yang ⋅ Jiankang Deng
Exhibit Hall I #24
VEGGIE: Instructional Editing and Reasoning Video Concepts with Grounded Generation Poster Session 4 & Exhibit Hall with Coffee Break
Shoubin Yu ⋅ Difan Liu ⋅ Ziqiao Ma ⋅ Yicong Hong ⋅ Yang Zhou ⋅ Hao Tan ⋅ Joyce Chai ⋅ Mohit Bansal
Exhibit Hall I #25
Scale Your Instructions: Enhance the Instruction-Following Fidelity of Unified Image Generation Model by Self-Adaptive Attention Scaling Poster Session 4 & Exhibit Hall with Coffee Break
Chao Zhou ⋅ Tianyi Wei ⋅ Nenghai Yu
Exhibit Hall I #27
CycleVAR: Repurposing Autoregressive Model for Unsupervised One-Step Image Translation Poster Session 4 & Exhibit Hall with Coffee Break
Yi Liu ⋅ Shengqian Li ⋅ Zuzeng Lin ⋅ Feng Wang ⋅ Si Liu
Exhibit Hall I #29
SDMatte: Grafting Diffusion Models for Interactive Matting Poster Session 4 & Exhibit Hall with Coffee Break
Longfei Huang ⋅ Yu Liang ⋅ Hao Zhang ⋅ Jinwei Chen ⋅ Wei Dong ⋅ Lunde Chen ⋅ Wanyu Liu ⋅ Bo Li ⋅ Peng-Tao Jiang
Exhibit Hall I #32
Adaptive Caching for Faster Video Generation with Diffusion Transformers Poster Session 4 & Exhibit Hall with Coffee Break
Kumara Kahatapitiya ⋅ Haozhe Liu ⋅ Sen He ⋅ Ding Liu ⋅ Menglin Jia ⋅ Chenyang Zhang ⋅ Michael Ryoo ⋅ Tian Xie
Exhibit Hall I #33
CoMPaSS: Enhancing Spatial Understanding in Text-to-Image Diffusion Models Poster Session 4 & Exhibit Hall with Coffee Break
Gaoyang Zhang ⋅ Bingtao Fu ⋅ Qingnan Fan ⋅ Qi Zhang ⋅ Runxing Liu ⋅ Hong Gu ⋅ Huaqi Zhang ⋅ Xinguo Liu
Exhibit Hall I #34
Edicho: Consistent Image Editing in the Wild Poster Session 4 & Exhibit Hall with Coffee Break
Qingyan Bai ⋅ Hao Ouyang ⋅ Yinghao Xu ⋅ Qiuyu Wang ⋅ Ceyuan Yang ⋅ Ka Leong Cheng ⋅ Yujun Shen ⋅ Qifeng Chen
Exhibit Hall I #36
LUSD: Localized Update Score Distillation for Text-Guided Image Editing Poster Session 4 & Exhibit Hall with Coffee Break
Worameth Chinchuthakun ⋅ Tossaporn Saengja ⋅ Nontawat Tritrong ⋅ Pitchaporn Rewatbowornwong ⋅ Pramook Khungurn ⋅ Supasorn Suwajanakorn
Exhibit Hall I #38
FlowChef: Steering of Rectified Flow Models for Controlled Generations Poster Session 4 & Exhibit Hall with Coffee Break
Maitreya Patel ⋅ Song Wen ⋅ Dimitris Metaxas ⋅ Yezhou Yang
Exhibit Hall I #39
From Reflection to Perfection: Scaling Inference-Time Optimization for Text-to-Image Diffusion Models via Reflection Tuning Poster Session 4 & Exhibit Hall with Coffee Break
Le Zhuo ⋅ Liangbing Zhao ⋅ Sayak Paul ⋅ Yue Liao ⋅ Renrui Zhang ⋅ Yi Xin ⋅ Peng Gao ⋅ Mohamed Elhoseiny ⋅ Hongsheng Li
Exhibit Hall I #41
Translation of Text Embedding via Delta Vector to Suppress Strongly Entangled Content in Text-to-Image Diffusion Models Poster Session 4 & Exhibit Hall with Coffee Break
Eunseo Koh ⋅ SeungHoo Hong ⋅ Tae-Young Kim ⋅ Jae-Pil Heo ⋅ Simon Woo
Exhibit Hall I #44
Grouped Speculative Decoding for Autoregressive Image Generation Poster Session 4 & Exhibit Hall with Coffee Break
Junhyuk So ⋅ Juncheol Shin ⋅ Hyunho Kook ⋅ Eunhyeok Park
Exhibit Hall I #45
Generating, Fast and Slow: Scalable Parallel Video Generation with Video Interface Networks Poster Session 4 & Exhibit Hall with Coffee Break
Bhishma Dedhia ⋅ David Bourgin ⋅ Krishna Kumar Singh ⋅ Yuheng Li ⋅ Yan Kang ⋅ Zhan Xu ⋅ Niraj Jha ⋅ Yuchen Liu
Exhibit Hall I #46
SynTag: Enhancing the Geometric Robustness of Inversion-based Generative Image Watermarking Poster Session 4 & Exhibit Hall with Coffee Break
Han Fang ⋅ Kejiang Chen ⋅ Zehua Ma ⋅ Jiajun Deng ⋅ Yicong Li ⋅ Weiming Zhang ⋅ Ee-Chien Chang
Exhibit Hall I #49
Draw Your Mind: Personalized Generation via Condition-Level Modeling in Text-to-Image Diffusion Models Poster Session 4 & Exhibit Hall with Coffee Break
Hyungjin Kim ⋅ Seokho Ahn ⋅ Young-Duk Seo
Exhibit Hall I #218
Text Embedding Knows How to Quantize Text-Guided Diffusion Models Poster Session 4 & Exhibit Hall with Coffee Break
Hongjae Lee ⋅ Myungjun Son ⋅ Dongjea Kang ⋅ Seung-Won Jung
Exhibit Hall I #50
PUMA: Empowering Unified MLLM with Multi-granular Visual Generation Poster Session 4 & Exhibit Hall with Coffee Break
Rongyao Fang ⋅ Chengqi Duan ⋅ Kun Wang ⋅ Hao Li ⋅ Linjiang Huang ⋅ Hao Tian ⋅ Xingyu Zeng ⋅ Rui Zhao ⋅ Jifeng Dai ⋅ Hongsheng Li ⋅ Xihui Liu
Exhibit Hall I #52
NeuralSVG: An Implicit Representation for Text-to-Vector Generation Poster Session 4 & Exhibit Hall with Coffee Break
Sagi Polaczek ⋅ Yuval Alaluf ⋅ Elad Richardson ⋅ Yael Vinker ⋅ Daniel Cohen-Or
Exhibit Hall I #53
IQA-Adapter: Exploring Knowledge Transfer from Image Quality Assessment to Diffusion-based Generative Models Poster Session 4 & Exhibit Hall with Coffee Break
Khaled Abud ⋅ Sergey Lavrushkin ⋅ Alexey Kirillov ⋅ Dmitriy Vatolin
Exhibit Hall I #54
Global and Local Entailment Learning for Natural World Imagery Poster Session 4 & Exhibit Hall with Coffee Break
Srikumar Sastry ⋅ Aayush Dhakal ⋅ Eric Xing ⋅ Subash Khanal ⋅ Nathan Jacobs
Exhibit Hall I #84
Dual Recursive Feedback on Generation and Appearance Latents for Pose-Robust Text-to-Image Diffusion Poster Session 4 & Exhibit Hall with Coffee Break
Jiwon Kim ⋅ Pureum Kim ⋅ SeonHwa Kim ⋅ Soobin Park ⋅ Eunju Cha ⋅ Kyong Hwan Jin
Exhibit Hall I #56
Anti-Tamper Protection for Unauthorized Individual Image Generation Poster Session 4 & Exhibit Hall with Coffee Break
Zelin Li ⋅ Ruohan Zong ⋅ Yifan Liu ⋅ Ruichen Yao ⋅ Yaokun Liu ⋅ Yang Zhang ⋅ Dong Wang
Exhibit Hall I #57
Continual Personalization for Diffusion Models Poster Session 4 & Exhibit Hall with Coffee Break
Yu-Chien Liao ⋅ Jr-Jen Chen ⋅ Chi-Pin Huang ⋅ Ci-Siang Lin ⋅ Meng-Lin Wu ⋅ Yu-Chiang Frank Wang
Exhibit Hall I #58
WikiAutoGen: Towards Multi-Modal Wikipedia-Style Article Generation Poster Session 4 & Exhibit Hall with Coffee Break
Zhongyu Yang ⋅ Jun Chen ⋅ Dannong Xu ⋅ Junjie Fei ⋅ Xiaoqian Shen ⋅ Liangbing Zhao ⋅ Chun-Mei Feng ⋅ Mohamed Elhoseiny
Exhibit Hall I #60
Spectral Image Tokenizer Poster Session 4 & Exhibit Hall with Coffee Break
Carlos Esteves ⋅ Mohammed Suhail ⋅ Ameesh Makadia
Exhibit Hall I #219
QuEST: Low-bit Diffusion Model Quantization via Efficient Selective Finetuning Poster Session 4 & Exhibit Hall with Coffee Break
Haoxuan Wang ⋅ Yuzhang Shang ⋅ Zhihang Yuan ⋅ Junyi Wu ⋅ Junchi Yan ⋅ Yan Yan
Exhibit Hall I #61
SPADE: Spatial-Aware Denoising Network for Open-vocabulary Panoptic Scene Graph Generation with Long- and Local-range Context Reasoning Poster Session 4 & Exhibit Hall with Coffee Break
XIN Hu ⋅ Ke Qin ⋅ Guiduo Duan ⋅ Ming Li ⋅ Yuan-Fang Li ⋅ Tao He
Exhibit Hall I #63
DropletVideo: A Dataset and Approach to Explore Integral Spatio-Temporal Consistent Video Generation Poster Session 4 & Exhibit Hall with Coffee Break
Runze Zhang ⋅ Guoguang Du ⋅ Xiaochuan Li ⋅ Qi Jia ⋅ Liang Jin ⋅ Lu Liu ⋅ Jingjing Wang ⋅ Cong Xu ⋅ Zhenhua Guo ⋅ Yaqian Zhao ⋅ Xiaoli Gong ⋅ Rengang Li ⋅ Baoyu Fan
Exhibit Hall I #65
Split-and-Combine: Enhancing Style Augmentation for Single Domain Generalization Poster Session 4 & Exhibit Hall with Coffee Break
Zhen Zhang ⋅ Zhen Zhang ⋅ Qianlong Dang ⋅ Zhize Wu ⋅ LiChuan Gu
Exhibit Hall I #68
RefEdit: A Benchmark and Method for Improving Instruction-based Image Editing Model on Referring Expressions Poster Session 4 & Exhibit Hall with Coffee Break
Bimsara Pathiraja ⋅ Maitreya Patel ⋅ Shivam Singh ⋅ Yezhou Yang ⋅ Chitta Baral
Exhibit Hall I #71
CuRe: Cultural Gaps in the Long Tail of Text-to-Image Systems Poster Session 4 & Exhibit Hall with Coffee Break
Aniket Rege ⋅ Zinnia Nie ⋅ Unmesh Raskar ⋅ Mahesh Ramesh ⋅ Zhuoran Yu ⋅ Aditya Kusupati ⋅ Yong Jae Lee ⋅ Ramya Vinayak
Exhibit Hall I #76
TREAD: Token Routing for Efficient Architecture-agnostic Diffusion Training Poster Session 4 & Exhibit Hall with Coffee Break
Felix Krause ⋅ Timy Phan ⋅ Ming Gui ⋅ Stefan A. Baumann ⋅ Vincent Tao Hu ⋅ Björn Ommer
Exhibit Hall I #78
Bootstrap3D: Improving Multi-view Diffusion Model with Synthetic Data Poster Session 4 & Exhibit Hall with Coffee Break
Zeyi Sun ⋅ Tong Wu ⋅ Pan Zhang ⋅ Yuhang Zang ⋅ Xiaoyi Dong ⋅ Yuanjun Xiong ⋅ Dahua Lin ⋅ Jiaqi Wang
Exhibit Hall I #79
Zero-Shot Depth Aware Image Editing with Diffusion Models Poster Session 4 & Exhibit Hall with Coffee Break
Rishubh Parihar ⋅ Sachidanand VS ⋅ Venkatesh Babu Radhakrishnan
Exhibit Hall I #82
StyleKeeper: Prevent Content Leakage using Negative Visual Query Guidance Poster Session 4 & Exhibit Hall with Coffee Break
Jaeseok Jeong ⋅ Junho Kim ⋅ Youngjung Uh ⋅ Gayoung Lee ⋅ Yunjey Choi
Exhibit Hall I #83
TRKT: Weakly Supervised Dynamic Scene Graph Generation with Temporal-enhanced Relation-aware Knowledge Transferring Poster Session 4 & Exhibit Hall with Coffee Break
Zhu Xu ⋅ Ting Lei ⋅ Zhimin Li ⋅ Guan Wang ⋅ Qingchao Chen ⋅ Yuxin Peng ⋅ Yang Liu
Exhibit Hall I #88
Pose-Star: Anatomy-Aware Editing for Open-World Fashion Images Poster Session 4 & Exhibit Hall with Coffee Break
Yuran Dong ⋅ Mang Ye
Exhibit Hall I #89
Who Controls the Authorization? Invertible Networks for Copyright Protection in Text-to-Image Synthesis Poster Session 4 & Exhibit Hall with Coffee Break
Baoyue Hu ⋅ Yang Wei ⋅ Junhao Xiao ⋅ Wendong Huang ⋅ Xiuli Bi ⋅ Bin Xiao
Exhibit Hall I #90
SegmentDreamer: Towards High-fidelity Text-to-3D Synthesis with Segmented Consistency Trajectory Distillation Poster Session 4 & Exhibit Hall with Coffee Break
Jiahao Zhu ⋅ Zixuan Chen ⋅ Guangcong Wang ⋅ Xiaohua Xie ⋅ Yi Zhou
Exhibit Hall I #93
MUSE: Multi-Subject Unified Synthesis via Explicit Layout Semantic Expansion Poster Session 4 & Exhibit Hall with Coffee Break
Fei Peng ⋅ Junqiang Wu ⋅ Yan Li ⋅ Tingting Gao ⋅ Di ZHANG ⋅ Huiyuan Fu
Exhibit Hall I #95
Magic Insert: Style-Aware Drag-and-Drop Poster Session 4 & Exhibit Hall with Coffee Break
Nataniel Ruiz ⋅ Yuanzhen Li ⋅ Neal Wadhwa ⋅ Yael Pritch ⋅ Michael Rubinstein ⋅ David Jacobs ⋅ Shlomi Fruchter
Exhibit Hall I #103
DIVE: Taming DINO for Subject-Driven Video Editing Poster Session 4 & Exhibit Hall with Coffee Break
Yi Huang ⋅ Wei Xiong ⋅ He Zhang ⋅ Chaoqi Chen ⋅ Jianzhuang Liu ⋅ Mingfu Yan ⋅ Shifeng Chen
Exhibit Hall I #106
FontAnimate: High Quality Few-shot Font Generation via Animating Font Transfer Process Poster Session 4 & Exhibit Hall with Coffee Break
Bin Fu ⋅ Zixuan Wang ⋅ Kainan Yan ⋅ Shitian Zhao ⋅ Qi Qin ⋅ Jie Wen ⋅ Junjun He ⋅ Peng Gao
Exhibit Hall I #107
PromptDresser: Improving the Quality and Controllability of Virtual Try-On via Generative Textual Prompt and Prompt-aware Mask Poster Session 4 & Exhibit Hall with Coffee Break
Jeongho Kim ⋅ Hoiyeong Jin ⋅ Sunghyun Park ⋅ Jaegul Choo
Exhibit Hall I #108
IMG: Calibrating Diffusion Models via Implicit Multimodal Guidance Poster Session 4 & Exhibit Hall with Coffee Break
Jiayi Guo ⋅ Chuanhao Yan ⋅ Xingqian Xu ⋅ Yulin Wang ⋅ Kai Wang ⋅ Gao Huang ⋅ Humphrey Shi
Exhibit Hall I #113
SAGI: Semantically Aligned and Uncertainty Guided AI Image Inpainting Poster Session 4 & Exhibit Hall with Coffee Break
Paschalis Giakoumoglou ⋅ Dimitrios Karageorgiou ⋅ Symeon Papadopoulos ⋅ Panagiotis Petrantonakis
Exhibit Hall I #114
TextMaster: A Unified Framework for Realistic Text Editing via Glyph-Style Dual-Control Poster Session 4 & Exhibit Hall with Coffee Break
Zhenyu Yan ⋅ Jian Wang ⋅ Aoqiang Wang ⋅ Yuhan Li ⋅ Wenxiang Shang ⋅ Zhu Hangcheng
Exhibit Hall I #116
LoRA.rar: Learning to Merge LoRAs via Hypernetworks for Subject-Style Conditioned Image Generation Poster Session 4 & Exhibit Hall with Coffee Break
Donald Shenaj ⋅ Ondrej Bohdal ⋅ Mete Ozay ⋅ Pietro Zanuttigh ⋅ Umberto Michieli
Exhibit Hall I #118
Beyond Perspective: Neural 360-Degree Video Compression Poster Session 4 & Exhibit Hall with Coffee Break
Andy Regensky ⋅ Marc Windsheimer ⋅ Fabian Brand ⋅ Andre Kaup
Exhibit Hall I #119
MCID: Multi-aspect Copyright Infringement Detection for Generated Images Poster Session 4 & Exhibit Hall with Coffee Break
Chuanwei Huang ⋅ Zexi Jia ⋅ Hongyan Fei ⋅ Yeshuang Zhu ⋅ Zhiqiang Yuan ⋅ Ying Deng ⋅ Jiapei Zhang ⋅ Xiaoyue Duan ⋅ Jinchao Zhang ⋅ Jie Zhou
Exhibit Hall I #120
Text2Outfit: Controllable Outfit Generation with Multimodal Language Models Poster Session 4 & Exhibit Hall with Coffee Break
Yuanhao Zhai ⋅ Yen-Liang Lin ⋅ Minxu Peng ⋅ Larry Davis ⋅ Ashwin Chandramouli ⋅ Junsong Yuan ⋅ David Doermann
Exhibit Hall I #121
Outlier-Aware Post-Training Quantization for Image Super-Resolution Poster Session 4 & Exhibit Hall with Coffee Break
Hailing Wang ⋅ Jianglin Lu ⋅ Yitian Zhang ⋅ Yun Fu
Exhibit Hall I #122
DualReal: Adaptive Joint Training for Lossless Identity-Motion Fusion in Video Customization Poster Session 4 & Exhibit Hall with Coffee Break
Wenchuan Wang ⋅ Mengqi Huang ⋅ Yijing Tu ⋅ Zhendong Mao
Exhibit Hall I #161
Modeling Human Gaze Behavior with Diffusion Models for Unified Scanpath Prediction Poster Session 4 & Exhibit Hall with Coffee Break
Giuseppe Cartella ⋅ Vittorio Cuculo ⋅ Alessandro D'Amelio ⋅ Marcella Cornia ⋅ Giuseppe Boccignone ⋅ Rita Cucchiara
Exhibit Hall I #125
What Changed? Detecting and Evaluating Instruction-Guided Image Edits with Multimodal Large Language Models Poster Session 4 & Exhibit Hall with Coffee Break
Lorenzo Baraldi ⋅ Davide Bucciarelli ⋅ Federico Betti ⋅ Marcella Cornia ⋅ Lorenzo Baraldi ⋅ Nicu Sebe ⋅ Rita Cucchiara
Exhibit Hall I #126
MeshPad: Interactive Sketch-Conditioned Artist-Reminiscent Mesh Generation and Editing Poster Session 4 & Exhibit Hall with Coffee Break
Haoxuan Li ⋅ Ziya Erkoç ⋅ Lei Li ⋅ Daniele Sirigatti ⋅ Vladislav Rosov ⋅ Angela Dai ⋅ Matthias Nießner
Exhibit Hall I #127
PLADIS: Pushing the Limits of Attention in Diffusion Models at Inference Time by Leveraging Sparsity Poster Session 4 & Exhibit Hall with Coffee Break
Kwanyoung Kim ⋅ Byeongsu Sim
Exhibit Hall I #128
STIV: Scalable Text and Image Conditioned Video Generation Poster Session 4 & Exhibit Hall with Coffee Break
Zongyu Lin ⋅ Wei Liu ⋅ Chen Chen ⋅ Jiasen Lu ⋅ Wenze Hu ⋅ Tsu-Jui Fu ⋅ Jesse Allardice ⋅ Zhengfeng Lai ⋅ Liangchen Song ⋅ Bowen Zhang ⋅ cha chen ⋅ Yiran Fei ⋅ Lezhi Li ⋅ Yizhou Sun ⋅ Kai-Wei Chang ⋅ Yinfei Yang
Exhibit Hall I #129
D3QE: Learning Discrete Distribution Discrepancy-aware Quantization Error for Autoregressive-Generated Image Detection Poster Session 4 & Exhibit Hall with Coffee Break
Yanran Zhang ⋅ Bingyao Yu ⋅ Yu Zheng ⋅ Wenzhao Zheng ⋅ Yueqi Duan ⋅ Lei Chen ⋅ Jie Zhou ⋅ Jiwen Lu
Exhibit Hall I #133
OmniCache: A Trajectory-Oriented Global Perspective on Training-Free Cache Reuse for Diffusion Transformer Models Poster Session 4 & Exhibit Hall with Coffee Break
Huanpeng Chu ⋅ Wei Wu ⋅ Guanyu Feng ⋅ Yutao Zhang
Exhibit Hall I #134
One-Step Specular Highlight Removal with Adapted Diffusion Models Poster Session 4 & Exhibit Hall with Coffee Break
Mahir Atmis ⋅ LEVENT KARACAN ⋅ Mehmet SARIGÜL
Exhibit Hall I #135
DiGA3D: Coarse-to-Fine Diffusional Propagation of Geometry and Appearance for Versatile 3D Inpainting Poster Session 4 & Exhibit Hall with Coffee Break
Jingyi Pan ⋅ Dan Xu ⋅ Qiong Luo
Exhibit Hall I #138
Holistic Unlearning Benchmark: A Multi-Faceted Evaluation for Text-to-Image Diffusion Model Unlearning Poster Session 4 & Exhibit Hall with Coffee Break
Saemi Moon ⋅ Minjong Lee ⋅ Sangdon Park ⋅ Dongwoo Kim
Exhibit Hall I #139
MV-Adapter: Multi-View Consistent Image Generation Made Easy Poster Session 4 & Exhibit Hall with Coffee Break
Zehuan Huang ⋅ Yuan-Chen Guo ⋅ Haoran Wang ⋅ Ran Yi ⋅ Lizhuang Ma ⋅ Yanpei Cao ⋅ Lu Sheng
Exhibit Hall I #141
On Large Multimodal Models as Open-World Image Classifiers Poster Session 4 & Exhibit Hall with Coffee Break
Alessandro Conti ⋅ Massimiliano Mancini ⋅ Enrico Fini ⋅ Yiming Wang ⋅ Paolo Rota ⋅ Elisa Ricci
Exhibit Hall I #142
VACE: All-in-One Video Creation and Editing Poster Session 4 & Exhibit Hall with Coffee Break
Zeyinzi Jiang ⋅ Zhen Han ⋅ Chaojie Mao ⋅ Jingfeng Zhang ⋅ Yulin Pan ⋅ Yu Liu
Exhibit Hall I #220
DiTFastAttnV2: Head-wise Attention Compression for Multi-Modality Diffusion Transformers Poster Session 4 & Exhibit Hall with Coffee Break
Hanling Zhang ⋅ Rundong Su ⋅ Zhihang Yuan ⋅ Pengtao Chen ⋅ Mingzhu Shen ⋅ Yibo Fan ⋅ Shengen Yan ⋅ Guohao Dai ⋅ Yu Wang
Exhibit Hall I #143
DIMCIM: A Quantitative Evaluation Framework for Default-mode Diversity and Generalization in Text-to-Image Generative Models Poster Session 4 & Exhibit Hall with Coffee Break
Revant Teotia ⋅ Candace Ross ⋅ Karen Ullrich ⋅ Sumit Chopra ⋅ Adriana Romero-Soriano ⋅ Melissa Hall ⋅ Matthew Muckley
Exhibit Hall I #146
From Linearity to Non-Linearity: How Masked Autoencoders Capture Spatial Correlations Poster Session 4 & Exhibit Hall with Coffee Break
Anthony Bisulco ⋅ Rahul Ramesh ⋅ Randall Balestriero ⋅ Pratik Chaudhari
Exhibit Hall I #147
Reusing Computation in Text-to-Image Diffusion for Efficient Generation of Image Sets Poster Session 4 & Exhibit Hall with Coffee Break
Dale Decatur ⋅ Thibault Groueix ⋅ Wang Yifan ⋅ Rana Hanocka ⋅ Vladimir Kim ⋅ Matheus Gadelha
Exhibit Hall I #153
Repurposing 2D Diffusion Models with Gaussian Atlas for 3D Generation Poster Session 4 & Exhibit Hall with Coffee Break
Tiange Xiang ⋅ Kai Li ⋅ Chengjiang Long ⋅ Christian Häne ⋅ Peihong Guo ⋅ Scott Delp ⋅ Ehsan Adeli ⋅ Li Fei-Fei
Exhibit Hall I #154
Cross-Granularity Online Optimization with Masked Compensated Information for Learned Image Compression Poster Session 4 & Exhibit Hall with Coffee Break
Haowei Kuang ⋅ Wenhan Yang ⋅ Zongming Guo ⋅ Jiaying Liu
Exhibit Hall I #156
Generating Multi-Image Synthetic Data for Text-to-Image Customization Poster Session 4 & Exhibit Hall with Coffee Break
Nupur Kumari ⋅ Xi Yin ⋅ Jun-Yan Zhu ⋅ Ishan Misra ⋅ Samaneh Azadi
Exhibit Hall I #157
Deeply Supervised Flow-Based Generative Models Poster Session 4 & Exhibit Hall with Coffee Break
Inkyu Shin ⋅ Chenglin Yang ⋅ Liang-Chieh (Jay) Chen
Exhibit Hall I #158
Stroke2Sketch: Harnessing Stroke Attributes for Training-Free Sketch Generation Poster Session 4 & Exhibit Hall with Coffee Break
Rui Yang ⋅ Huining Li ⋅ Yiyi Long ⋅ Xiaojun Wu ⋅ Shengfeng He
Exhibit Hall I #159
ICE-Bench: A Unified and Comprehensive Benchmark for Image Creating and Editing Poster Session 4 & Exhibit Hall with Coffee Break
Yulin Pan ⋅ Xiangteng He ⋅ Chaojie Mao ⋅ Zhen Han ⋅ Zeyinzi Jiang ⋅ Jingfeng Zhang ⋅ Yu Liu
Exhibit Hall I #163
Edit360: 2D Image Edits to 3D Assets from Any Angle Poster Session 4 & Exhibit Hall with Coffee Break
Junchao Huang ⋅ Xinting Hu ⋅ Shaoshuai Shi ⋅ Zhuotao Tian ⋅ Li Jiang
Exhibit Hall I #166
FlowTok: Flowing Seamlessly Across Text and Image Tokens Poster Session 4 & Exhibit Hall with Coffee Break
Ju He ⋅ Qihang Yu ⋅ Qihao Liu ⋅ Liang-Chieh (Jay) Chen
Exhibit Hall I #167
TeEFusion: Blending Text Embeddings to Distill Classifier-Free Guidance Poster Session 4 & Exhibit Hall with Coffee Break
Minghao Fu ⋅ Guo-Hua Wang ⋅ Xiaohao Chen ⋅ Qing-Guo Chen ⋅ Zhao Xu ⋅ Weihua Luo ⋅ Kaifu Zhang
Exhibit Hall I #169
YOLO-Count: Differentiable Object Counting for Text-to-Image Generation Poster Session 4 & Exhibit Hall with Coffee Break
Guanning Zeng ⋅ Xiang Zhang ⋅ Zirui Wang ⋅ Haiyang Xu ⋅ Zeyuan Chen ⋅ Bingnan Li ⋅ Zhuowen Tu
Exhibit Hall I #180
TITAN-Guide: Taming Inference-Time Alignment for Guided Text-to-Video Diffusion Models Poster Session 4 & Exhibit Hall with Coffee Break
Christian Simon ⋅ Masato Ishii ⋅ Akio Hayakawa ⋅ Zhi Zhong ⋅ Shusuke Takahashi ⋅ Takashi Shibuya ⋅ Yuki Mitsufuji
Exhibit Hall I #170
FiVE-Bench: A Fine-grained Video Editing Benchmark for Evaluating Emerging Diffusion and Rectified Flow Models Poster Session 4 & Exhibit Hall with Coffee Break
Minghan LI ⋅ Chenxi Xie ⋅ Yichen Wu ⋅ Lei Zhang ⋅ Mengyu Wang
Exhibit Hall I #171
CompSlider: Compositional Slider for Disentangled Multiple-Attribute Image Generation Poster Session 4 & Exhibit Hall with Coffee Break
Zixin Zhu ⋅ Kevin Duarte ⋅ Mamshad Nayeem Rizve ⋅ Chengyuan Xu ⋅ Ratheesh Kalarot ⋅ Junsong Yuan
Exhibit Hall I #172
DreamRenderer: Taming Multi-Instance Attribute Control in Large-Scale Text-to-Image Models Poster Session 4 & Exhibit Hall with Coffee Break
Dewei Zhou ⋅ Mingwei Li ⋅ Zongxin Yang ⋅ Yi Yang
Exhibit Hall I #175
DiffSim: Taming Diffusion Models for Evaluating Visual Similarity Poster Session 4 & Exhibit Hall with Coffee Break
Yiren Song ⋅ Xiaokang Liu ⋅ Mike Zheng Shou
Exhibit Hall I #193
Adversarial Distribution Matching for Diffusion Distillation Towards Efficient Image and Video Synthesis Poster Session 4 & Exhibit Hall with Coffee Break
Yanzuo Lu ⋅ Yuxi Ren ⋅ Xin Xia ⋅ Shanchuan Lin ⋅ XING WANG ⋅ Xuefeng Xiao ⋅ Jinhua Ma ⋅ Xiaohua Xie ⋅ Jianhuang Lai
Exhibit Hall I #185
Co-Painter: Fine-Grained Controllable Image Stylization via Implicit Decoupling and Adaptive Injection Poster Session 4 & Exhibit Hall with Coffee Break
Bowen Fu ⋅ Wei Wei ⋅ Jiaqi Tang ⋅ Jiangtao Nie ⋅ Yanyu Ye ⋅ Xiaogang Xu ⋅ Ying-Cong Chen ⋅ Lei Zhang
Exhibit Hall I #186
PLA: Prompt Learning Attack against Text-to-Image Generative Models Poster Session 4 & Exhibit Hall with Coffee Break
XINQI LYU ⋅ Yihao LIU ⋅ Yanjie Li ⋅ Bin Xiao
Exhibit Hall I #188
FreeScale: Unleashing the Resolution of Diffusion Models via Tuning-Free Scale Fusion Poster Session 4 & Exhibit Hall with Coffee Break
Haonan Qiu ⋅ Shiwei Zhang ⋅ Yujie Wei ⋅ Ruihang Chu ⋅ Hangjie Yuan ⋅ Xiang Wang ⋅ Yingya Zhang ⋅ Ziwei Liu
Exhibit Hall I #192
Holistic Tokenizer for Autoregressive Image Generation Poster Session 4 & Exhibit Hall with Coffee Break
Anlin Zheng ⋅ Haochen Wang ⋅ Yucheng Zhao ⋅ Weipeng DENG ⋅ Tiancai Wang ⋅ Xiangyu Zhang ⋅ Xiaojuan Qi
Exhibit Hall I #194
Toward Better Out-painting: Improving the Image Composition with Initialization Policy Model Poster Session 4 & Exhibit Hall with Coffee Break
Xuan Han ⋅ Yihao Zhao ⋅ Yanhao Ge ⋅ Mingyu You
Exhibit Hall I #196
From Image to Video: An Empirical Study of Diffusion Representations Poster Session 4 & Exhibit Hall with Coffee Break
Pedro Vélez ⋅ Luisa Polania Cabrera ⋅ Yi Yang ⋅ Chuhan Zhang ⋅ Rishabh Kabra ⋅ Anurag Arnab ⋅ Mehdi S. M. Sajjadi
Exhibit Hall I #197
Versatile Transition Generation with Image-to-Video Diffusion Poster Session 4 & Exhibit Hall with Coffee Break
Zuhao Yang ⋅ Jiahui Zhang ⋅ Yingchen Yu ⋅ Shijian Lu ⋅ Song Bai
Exhibit Hall I #200
MetaMorph: Multimodal Understanding and Generation via Instruction Tuning Poster Session 4 & Exhibit Hall with Coffee Break
Shengbang Tong ⋅ David Fan ⋅ Jiachen Zhu ⋅ Yunyang Xiong ⋅ Xinlei Chen ⋅ Koustuv Sinha ⋅ Michael Rabbat ⋅ Yann LeCun ⋅ Saining Xie ⋅ Zhuang Liu
Exhibit Hall I #202
SafetyDPO: Scalable Safety Alignment for Text-to-Image Generation Poster Session 4 & Exhibit Hall with Coffee Break
Runtao Liu ⋅ I Chen ⋅ Jindong Gu ⋅ Jipeng Zhang ⋅ Renjie Pi ⋅ Qifeng Chen ⋅ Philip Torr ⋅ Ashkan Khakzar ⋅ Fabio Pizzati
Exhibit Hall I #204
DiffIP: Representation Fingerprints for Robust IP Protection of Diffusion Models Poster Session 4 & Exhibit Hall with Coffee Break
Zhuoling Li ⋅ Haoxuan Qu ⋅ Jason Kuen ⋅ Jiuxiang Gu ⋅ Qiuhong Ke ⋅ Jun Liu ⋅ Hossein Rahmani
Exhibit Hall I #205
FairHuman: Boosting Hand and Face Quality in Human Image Generation with Minimum Potential Delay Fairness in Diffusion Models Poster Session 4 & Exhibit Hall with Coffee Break
Yuxuan Wang ⋅ Tianwei Cao ⋅ Huayu Zhang ⋅ Zhongjiang He ⋅ Kongming Liang ⋅ Zhanyu Ma
Exhibit Hall I #206
Processing and acquisition traces in visual encoders: What does CLIP know about your camera? Poster Session 4 & Exhibit Hall with Coffee Break
Ryan Ramos ⋅ Vladan Stojnić ⋅ Giorgos Kordopatis-Zilos ⋅ Yuta Nakashima ⋅ Giorgos Tolias ⋅ Noa Garcia
Exhibit Hall I #207
AM-Adapter: Appearance Matching Adapter for Exemplar-based Semantic Image Synthesis in-the-Wild Poster Session 4 & Exhibit Hall with Coffee Break
Siyoon Jin ⋅ Jisu Nam ⋅ Jiyoung Kim ⋅ Dahyun Chung ⋅ Yeong-Seok Kim ⋅ Joonhyung Park ⋅ HeonJeong Chu ⋅ Seungryong Kim
Exhibit Hall I #209
Diffusion Epistemic Uncertainty with Asymmetric Learning for Diffusion-Generated Image Detection Poster Session 4 & Exhibit Hall with Coffee Break
Yingsong Huang ⋅ Hui Guo ⋅ Jing Huang ⋅ Bing Bai ⋅ Qi Xiong
Exhibit Hall I #211
HypDAE: Hyperbolic Diffusion Autoencoders for Hierarchical Few-shot Image Generation Poster Session 4 & Exhibit Hall with Coffee Break
Lingxiao Li ⋅ Kaixuan Fan ⋅ Boqing Gong ⋅ Xiangyu Yue
Exhibit Hall I #212
Model Reveals What to Cache: Profiling-Based Feature Reuse for Video Diffusion Models Poster Session 4 & Exhibit Hall with Coffee Break
Xuran Ma ⋅ Yexin Liu ⋅ Yaofu LIU ⋅ Xianfeng Wu ⋅ Mingzhe Zheng ⋅ Zihao Wang ⋅ Ser-Nam Lim ⋅ Harry Yang
Exhibit Hall I #216
Rectifying Magnitude Neglect in Linear Attention Poster Session 5 & Exhibit Hall
Qihang Fan ⋅ Huaibo Huang ⋅ Yuang Ai ⋅ Ran He
Exhibit Hall I #160
RomanTex: Decoupling 3D-aware Rotary Positional Embedded Multi-Attention Network for Texture Synthesis Poster Session 4 & Exhibit Hall with Coffee Break
yifei feng ⋅ Mx Yang ⋅ Shuhui Yang ⋅ Sheng Zhang ⋅ Jiaao Yu ⋅ Zibo Zhao ⋅ Lliu Yuhong ⋅ Jie Jiang ⋅ Chunchao Guo
Exhibit Hall I #221
Calibrating MLLM-as-a-judge via Multimodal Bayesian Prompt Ensembles Poster Session 4 & Exhibit Hall with Coffee Break
Eric Slyman ⋅ Mehrab Tanjim ⋅ Kushal Kafle ⋅ Stefan Lee
Exhibit Hall I #223
V.I.P. : Iterative Online Preference Distillation for Efficient Video Diffusion Models Poster Session 4 & Exhibit Hall with Coffee Break
Jisoo Kim ⋅ Wooseok Seo ⋅ Junwan Kim ⋅ Seungho Park ⋅ Sooyeon Park ⋅ Youngjae Yu
Exhibit Hall I #224
LOTA: Bit-Planes Guided AI-Generated Image Detection Poster Session 4 & Exhibit Hall with Coffee Break
Renxi Cheng ⋅ Hongsong Wang ⋅ Yang Zhang ⋅ Chaolei Han ⋅ Jie Gui
Exhibit Hall I #225
Balanced Image Stylization with Style Matching Score Poster Session 4 & Exhibit Hall with Coffee Break
Yuxin Jiang ⋅ Liming Jiang ⋅ Shuai Yang ⋅ Jia-Wei Liu ⋅ Ivor Tsang ⋅ Mike Zheng Shou
Exhibit Hall I #236
Trade-offs in Image Generation: How Do Different Dimensions Interact? Poster Session 4 & Exhibit Hall with Coffee Break
Sicheng Zhang ⋅ Binzhu Xie ⋅ Zhonghao Yan ⋅ Yuli Zhang ⋅ Donghao Zhou ⋅ Xiaofei Chen ⋅ Shi Qiu ⋅ Jiaqi Liu ⋅ Guoyang Xie ⋅ Zhichao Lu
Exhibit Hall I #228
X-Prompt: Generalizable Auto-Regressive Visual Learning with In-Context Prompting Poster Session 4 & Exhibit Hall with Coffee Break
Zeyi Sun ⋅ Ziyang Chu ⋅ Pan Zhang ⋅ Tong Wu ⋅ Xiaoyi Dong ⋅ Yuhang Zang ⋅ Yuanjun Xiong ⋅ Dahua Lin ⋅ Jiaqi Wang
Exhibit Hall I #229
Long Context Tuning for Video Generation Poster Session 4 & Exhibit Hall with Coffee Break
Yuwei Guo ⋅ Ceyuan Yang ⋅ Ziyan Yang ⋅ Zhibei Ma ⋅ Zhijie Lin ⋅ Zhenheng Yang ⋅ Dahua Lin ⋅ Lu Jiang
Exhibit Hall I #230
DreamFuse: Adaptive Image Fusion with Diffusion Transformer Poster Session 4 & Exhibit Hall with Coffee Break
Junjia Huang ⋅ Pengxiang Yan ⋅ Jiyang Liu ⋅ Jie Wu ⋅ Zhao Wang ⋅ Yitong Wang ⋅ Liang Lin ⋅ Guanbin Li
Exhibit Hall I #231
AnyI2V: Animating Any Conditional Image with Motion Control Poster Session 4 & Exhibit Hall with Coffee Break
Ziye Li ⋅ Xincheng Shuai ⋅ Hao Luo ⋅ Henghui Ding
Exhibit Hall I #232
EEdit : Rethinking the Spatial and Temporal Redundancy for Efficient Image Editing Poster Session 4 & Exhibit Hall with Coffee Break
Zexuan Yan ⋅ Yue Ma ⋅ Chang Zou ⋅ Wenteng Chen ⋅ Qifeng Chen ⋅ Linfeng Zhang
Exhibit Hall I #248
RAGDiffusion: Faithful Cloth Generation via External Knowledge Assimilation Poster Session 4 & Exhibit Hall with Coffee Break
Yuhan Li ⋅ Xianfeng Tan ⋅ Wenxiang Shang ⋅ Yubo Wu ⋅ Jian Wang ⋅ Xuanhong Chen ⋅ Yi Zhang ⋅ Zhu Hangcheng ⋅ Bingbing Ni
Exhibit Hall I #249
Instruction-based Image Editing with Planning, Reasoning, and Generation Poster Session 4 & Exhibit Hall with Coffee Break
Liya Ji ⋅ Chenyang Qi ⋅ Qifeng Chen
Exhibit Hall I #251
HDR Image Generation via Gain Map Decomposed Diffusion Poster Session 4 & Exhibit Hall with Coffee Break
Yuanshen Guan ⋅ Ruikang Xu ⋅ Yinuo Liao ⋅ Mingde Yao ⋅ Lizhi Wang ⋅ Zhiwei Xiong
Exhibit Hall I #254
ESSENTIAL: Episodic and Semantic Memory Integration for Video Class-Incremental Learning Poster Session 4 & Exhibit Hall with Coffee Break
Jongseo Lee ⋅ Kyungho Bae ⋅ Kyle Min ⋅ Gyeong-Moon Park ⋅ Jinwoo Choi
Exhibit Hall I #255
Fair Generation without Unfair Distortions: Debiasing Text-to-Image Generation with Entanglement-Free Attention Poster Session 4 & Exhibit Hall with Coffee Break
Jeonghoon Park ⋅ Juyoung Lee ⋅ Chaeyeon Chung ⋅ Jaeseong Lee ⋅ Jaegul Choo ⋅ Jindong Gu
Exhibit Hall I #257
Training-Free Text-Guided Image Editing with Visual Autoregressive Model Poster Session 4 & Exhibit Hall with Coffee Break
Yufei Wang ⋅ Lanqing Guo ⋅ Zhihao Li ⋅ Jiaxing Huang ⋅ Pichao WANG ⋅ Bihan Wen ⋅ Jian Wang
Exhibit Hall I #258
Accelerating Diffusion Transformer via Gradient-Optimized Cache Poster Session 4 & Exhibit Hall with Coffee Break
Junxiang Qiu ⋅ Lin Liu ⋅ Shuo Wang ⋅ Jinda Lu ⋅ Kezhou Chen ⋅ Yanbin Hao
Exhibit Hall I #261
The Silent Assistant: NoiseQuery as Implicit Guidance for Goal-Driven Image Generation Poster Session 4 & Exhibit Hall with Coffee Break
Ruoyu Wang ⋅ Huayang Huang ⋅ Ye Zhu ⋅ Olga Russakovsky ⋅ Yu Wu
Exhibit Hall I #262
Progressive Growing of Video Tokenizers for Temporally Compact Latent Spaces Poster Session 4 & Exhibit Hall with Coffee Break
Aniruddha Mahapatra ⋅ Long Mai ⋅ David Bourgin ⋅ Yitian Zhang ⋅ Feng Liu
Exhibit Hall I #263
ArtEditor: Learning Customized Instructional Image Editor from Few-Shot Examples Poster Session 4 & Exhibit Hall with Coffee Break
Shijie Huang ⋅ Yiren Song ⋅ Yuxuan Zhang ⋅ Hailong Guo ⋅ Xueyin Wang ⋅ Jiaming Liu
Exhibit Hall I #265
MC-Bench: A Benchmark for Multi-Context Visual Grounding in the Era of MLLMs Poster Session 4 & Exhibit Hall with Coffee Break
Yunqiu Xu ⋅ Linchao Zhu ⋅ Yi Yang
Exhibit Hall I #267
Disrupting Model Merging: A Parameter-Level Defense Without Sacrificing Accuracy Poster Session 4 & Exhibit Hall with Coffee Break
JUNHAO WEI ⋅ YU ZHE ⋅ Jun Sakuma
Exhibit Hall I #269
Harmonizing Visual Representations for Unified Multimodal Understanding and Generation Poster Session 4 & Exhibit Hall with Coffee Break
Size Wu ⋅ Wenwei Zhang ⋅ Lumin Xu ⋅ Sheng Jin ⋅ Zhonghua Wu ⋅ Qingyi Tao ⋅ Wentao Liu ⋅ Wei Li ⋅ Chen Change Loy
Exhibit Hall I #273
A3GS: Arbitrary Artistic Style into Arbitrary 3D Gaussian Splatting Poster Session 4 & Exhibit Hall with Coffee Break
Zhiyuan Fang ⋅ Rengan Xie ⋅ Xuancheng Jin ⋅ Qi Ye ⋅ Wei Chen ⋅ Wenting Zheng ⋅ Rui Wang ⋅ Yuchi Huo
Exhibit Hall I #274
LayerD: Decomposing Raster Graphic Designs into Layers Poster Session 4 & Exhibit Hall with Coffee Break
Tomoyuki Suzuki ⋅ Kang-Jun Liu ⋅ Naoto Inoue ⋅ Kota Yamaguchi
Exhibit Hall I #277
ViLU: Learning Vision-Language Uncertainties for Failure Prediction Poster Session 4 & Exhibit Hall with Coffee Break
Marc Lafon ⋅ Yannis Karmim ⋅ Julio Silva-Rodríguez ⋅ Paul Couairon ⋅ Clément Rambour ⋅ Raphael Fournier-Sniehotta ⋅ Ismail Ayed ⋅ Jose Dolz ⋅ Nicolas THOME
Exhibit Hall I #279
Subjective Camera 1.0: Bridging Human Cognition and Visual Reconstruction through Sequence-Aware Sketch-Guided Diffusion Poster Session 4 & Exhibit Hall with Coffee Break
Haoyang Chen ⋅ Dongfang Sun ⋅ Caoyuan Ma ⋅ Shiqin Wang ⋅ Kewei Zhang ⋅ Zheng Wang ⋅ Zhixiang Wang
Exhibit Hall I #282
GlassWizard: Harvesting Diffusion Priors for Glass Surface Detection Poster Session 4 & Exhibit Hall with Coffee Break
Wenxue Li ⋅ Tian Ye ⋅ Xinyu Xiong ⋅ Jinbin Bai ⋅ feilong tang ⋅ Wenxuan Song ⋅ Zhaohu Xing ⋅ Lie Ju ⋅ Guanbin Li ⋅ Lei Zhu
Exhibit Hall I #283
Zero-Shot Compositional Video Learning with Coding Rate Reduction Poster Session 5 & Exhibit Hall
Heeseok Jung ⋅ Jun-Hyeon Bak ⋅ Yujin Jeong ⋅ Gyugeun Lee ⋅ Jinwoo Ahn ⋅ Eun-Sol Kim
Exhibit Hall I #66
FedMVP: Federated Multimodal Visual Prompt Tuning for Vision-Language Models Poster Session 4 & Exhibit Hall with Coffee Break
Mainak Singha ⋅ Subhankar Roy ⋅ Sarthak Mehrotra ⋅ Ankit Jha ⋅ Moloud Abdar ⋅ Biplab Banerjee ⋅ Elisa Ricci
Exhibit Hall I #285
HyTIP: Hybrid Temporal Information Propagation for Masked Conditional Residual Video Coding Poster Session 4 & Exhibit Hall with Coffee Break
Yi-Hsin Chen ⋅ Yi-Chen Yao ⋅ Kuan-Wei Ho ⋅ Chun-Hung Wu ⋅ Huu-Tai Phung ⋅ Martin Benjak ⋅ Jörn Ostermann ⋅ Wen-Hsiao Peng
Exhibit Hall I #287
DACoN: DINO for Anime Paint Bucket Colorization with Any Number of Reference Images Poster Session 4 & Exhibit Hall with Coffee Break
Kazuma Nagata ⋅ Naoshi Kaneko
Exhibit Hall I #288
Lay-Your-Scene: Natural Scene Layout Generation with Diffusion Transformers Poster Session 4 & Exhibit Hall with Coffee Break
Divyansh Srivastava ⋅ Xiang Zhang ⋅ He Wen ⋅ Chenru Wen ⋅ Zhuowen Tu
Exhibit Hall I #289
Free2Guide: Training-Free Text-to-Video Alignment using Image LVLM Poster Session 4 & Exhibit Hall with Coffee Break
Jaemin Kim ⋅ Bryan Sangwoo Kim ⋅ Jong Ye
Exhibit Hall I #290
InfGen: A Resolution-Agnostic Paradigm for Scalable Image Synthesis Poster Session 4 & Exhibit Hall with Coffee Break
Tao Han ⋅ Wanghan Xu ⋅ Junchao Gong ⋅ Xiaoyu Yue ⋅ Song Guo ⋅ Luping Zhou ⋅ LEI BAI
Exhibit Hall I #292
VideoVAE+: Large Motion Video Autoencoding with Cross-modal Video VAE Poster Session 4 & Exhibit Hall with Coffee Break
Yazhou Xing ⋅ Yang Fei ⋅ Yingqing He ⋅ Jingye Chen ⋅ Pengjun Fang ⋅ Xiaowei Chi ⋅ Qifeng Chen
Exhibit Hall I #293
SpecGuard: Spectral Projection-based Advanced Invisible Watermarking Poster Session 4 & Exhibit Hall with Coffee Break
Inzamamul Alam ⋅ Md Islam ⋅ Simon Woo ⋅ Khan Muhammad
Exhibit Hall I #296
DIA: The Adversarial Exposure of Deterministic Inversion in Diffusion Models Poster Session 4 & Exhibit Hall with Coffee Break
SeungHoo Hong ⋅ GeonHo Son ⋅ Juhun Lee ⋅ Simon Woo
Exhibit Hall I #297
Supercharged One-step Text-to-Image Diffusion Models with Negative Prompts Poster Session 4 & Exhibit Hall with Coffee Break
Viet Nguyen ⋅ Anh Nguyen ⋅ Trung Dao ⋅ Khoi Nguyen ⋅ Cuong Pham ⋅ Toan Tran ⋅ Anh Tran
Exhibit Hall I #298
GFPack++: Attention-Driven Gradient Fields for Optimizing 2D Irregular Packing Poster Session 4 & Exhibit Hall with Coffee Break
Tianyang Xue ⋅ Lin Lu ⋅ Yang Liu ⋅ Mingdong Wu ⋅ Hao Dong ⋅ Yanbin Zhang ⋅ Renmin Han ⋅ Baoquan Chen
Exhibit Hall I #299
Denoising Token Prediction in Masked Autoregressive Models Poster Session 4 & Exhibit Hall with Coffee Break
Ting Yao ⋅ Yehao Li ⋅ Yingwei Pan ⋅ Zhaofan Qiu ⋅ Tao Mei
Exhibit Hall I #300
LACONIC: A 3D Layout Adapter for Controllable Image Creation Poster Session 4 & Exhibit Hall with Coffee Break
Léopold Maillard ⋅ Tom Durand ⋅ Adrien RAMANANA RAHARY ⋅ Maks Ovsjanikov
Exhibit Hall I #302
Preserve Anything: Controllable Image Synthesis with Object Preservation Poster Session 4 & Exhibit Hall with Coffee Break
Prasen Kumar Sharma ⋅ Neeraj Matiyali ⋅ Siddharth Srivastava ⋅ Gaurav Sharma
Exhibit Hall I #303
Contrastive Test-Time Composition of Multiple LoRA Models for Image Generation Poster Session 4 & Exhibit Hall with Coffee Break
Tuna Meral ⋅ Enis Simsar ⋅ Federico Tombari ⋅ Pinar Yanardag
Exhibit Hall I #308
FramePainter: Endowing Interactive Image Editing with Video Diffusion Priors Poster Session 4 & Exhibit Hall with Coffee Break
Yabo Zhang ⋅ xinpeng zhou ⋅ Yihan Zeng ⋅ Hang Xu ⋅ Hui Li ⋅ Wangmeng Zuo
Exhibit Hall I #311
PlanGen: Towards Unified Layout Planning and Image Generation in Auto-Regressive Vision Language Models Poster Session 4 & Exhibit Hall with Coffee Break
Runze He ⋅ bo cheng ⋅ Yuhang Ma ⋅ QingxiangJia QingxiangJia ⋅ Shanyuan Liu ⋅ Ao Ma ⋅ Xiaoyu Wu ⋅ Liebucha Wu ⋅ Dawei Leng ⋅ Yuhui Yin
Exhibit Hall I #313
Turbo2K: Towards Ultra-Efficient and High-Quality 2K Video Synthesis Poster Session 4 & Exhibit Hall with Coffee Break
Jingjing Ren ⋅ Wenbo Li ⋅ Zhongdao Wang ⋅ Haoze Sun ⋅ Bangzhen Liu ⋅ Haoyu Chen ⋅ Jiaqi Xu ⋅ Aoxue Li ⋅ Shifeng Zhang ⋅ Bin Shao ⋅ Yong Guo ⋅ Lei Zhu
Exhibit Hall I #314
Anchor Token Matching: Implicit Structure Locking for Training-free AR Image Editing Poster Session 4 & Exhibit Hall with Coffee Break
Taihang Hu ⋅ Linxuan Li ⋅ Kai Wang ⋅ Yaxing Wang ⋅ jian Yang ⋅ Ming-Ming Cheng
Exhibit Hall I #315
Parametric Shadow Control for Portrait Generation in Text-to-Image Diffusion Models Poster Session 4 & Exhibit Hall with Coffee Break
Haoming Cai ⋅ Tsung-Wei Huang ⋅ Shiv Gehlot ⋅ Brandon Feng ⋅ Sachin Shah ⋅ Guan-Ming Su ⋅ Christopher Metzler
Exhibit Hall I #319
GenDoP: Auto-regressive Camera Trajectory Generation as a Director of Photography Poster Session 4 & Exhibit Hall with Coffee Break
Mengchen Zhang ⋅ Tong Wu ⋅ Jing Tan ⋅ Ziwei Liu ⋅ Gordon Wetzstein ⋅ Dahua Lin
Exhibit Hall I #321
CompleteMe: Reference-based Human Image Completion Poster Session 4 & Exhibit Hall with Coffee Break
Yu-Ju Tsai ⋅ Brian Price ⋅ Qing Liu ⋅ Luis Figueroa ⋅ Daniil Pakhomov ⋅ Zhihong Ding ⋅ Scott Cohen ⋅ Ming-Hsuan Yang
Exhibit Hall I #323
REPA-E: Unlocking VAE for End-to-End Tuning of Latent Diffusion Transformers Poster Session 4 & Exhibit Hall with Coffee Break
Xingjian Leng ⋅ Jaskirat Singh ⋅ Yunzhong Hou ⋅ Zhenchang Xing ⋅ Saining Xie ⋅ Liang Zheng
Exhibit Hall I #324
EEGMirror: Leveraging EEG data in the wild via Montage-Agnostic Self-Supervision for EEG to Video Decoding Poster Session 4 & Exhibit Hall with Coffee Break
Xuan-Hao Liu ⋅ Bao-liang Lu ⋅ Wei-Long Zheng
Exhibit Hall I #325
Accelerating Diffusion Sampling via Exploiting Local Transition Coherence Poster Session 4 & Exhibit Hall with Coffee Break
shangwen zhu ⋅ Han Zhang ⋅ Zhantao Yang ⋅ Qianyu Peng ⋅ Zhao Pu ⋅ Huangji Wang ⋅ Fan Cheng
Exhibit Hall I #326
SA-LUT: Spatial Adaptive 4D Look-Up Table for Photorealistic Style Transfer Poster Session 4 & Exhibit Hall with Coffee Break
Zerui Gong ⋅ Zhonghua Wu ⋅ Qingyi Tao ⋅ Qinyue Li ⋅ Chen Change Loy
Exhibit Hall I #327
UniversalBooth: Model-Agnostic Personalized Text-to-Image Generation Poster Session 4 & Exhibit Hall with Coffee Break
Songhua Liu ⋅ Ruonan Yu ⋅ Xinchao Wang
Exhibit Hall I #329
UniGlyph: Unified Segmentation-Conditioned Diffusion for Precise Visual Text Synthesis Poster Session 4 & Exhibit Hall with Coffee Break
Yuanrui Wang ⋅ Cong Han ⋅ Yafei Li ⋅ Zhipeng Jin ⋅ Xiawei Li ⋅ Sinan Du ⋅ Wen Tao ⋅ Yi Yang ⋅ shuanglong li ⋅ Chun Yuan ⋅ LIU LIN
Exhibit Hall I #331
ADIEE: Automatic Dataset Creation and Scorer for Instruction-Guided Image Editing Evaluation Poster Session 4 & Exhibit Hall with Coffee Break
Sherry Chen ⋅ Yi Wei ⋅ Luowei Zhou ⋅ Suren Kumar
Exhibit Hall I #332
Semantic Discrepancy-aware Detector for Image Forgery Identification Poster Session 4 & Exhibit Hall with Coffee Break
Wang Ziye ⋅ Minghang Yu ⋅ Chunyan Xu ⋅ Zhen Cui
Exhibit Hall I #336
Scalable Ranked Preference Optimization for Text-to-Image Generation Poster Session 4 & Exhibit Hall with Coffee Break
Shyamgopal Karthik ⋅ Huseyin Coskun ⋅ Zeynep Akata ⋅ Sergey Tulyakov ⋅ Jian Ren ⋅ Anil Kag
Exhibit Hall I #337
FairGen: Enhancing Fairness in Text-to-Image Diffusion Models via Self-Discovering Latent Directions Poster Session 4 & Exhibit Hall with Coffee Break
Yilei Jiang ⋅ Wei-Hong Li ⋅ Yiyuan Zhang ⋅ Minghong Cai ⋅ Xiangyu Yue
Exhibit Hall I #338
Benchmarking and Learning Multi-Dimensional Quality Evaluator for Text-to-3D Generation Poster Session 4 & Exhibit Hall with Coffee Break
Yujie Zhang ⋅ Bingyang Cui ⋅ Qi Yang ⋅ Zhu Li ⋅ Yiling Xu
Exhibit Hall I #352
REGEN: Learning Compact Video Embedding with (Re-)Generative Decoder Poster Session 4 & Exhibit Hall with Coffee Break
Yitian Zhang ⋅ Long Mai ⋅ Aniruddha Mahapatra ⋅ David Bourgin ⋅ Yicong Hong ⋅ Jonah Casebeer ⋅ Feng Liu ⋅ Yun Fu
Exhibit Hall I #342
FonTS: Text Rendering With Typography and Style Controls Poster Session 4 & Exhibit Hall with Coffee Break
Wenda SHI ⋅ Yiren Song ⋅ Dengming Zhang ⋅ Jiaming Liu ⋅ XINGXING ZOU
Exhibit Hall I #343
CreatiLayout: Siamese Multimodal Diffusion Transformer for Creative Layout-to-Image Generation Poster Session 4 & Exhibit Hall with Coffee Break
Hui Zhang ⋅ Dexiang Hong ⋅ Yitong Wang ⋅ Jie Shao ⋅ Xinglong Wu ⋅ Zuxuan Wu ⋅ Yu-Gang Jiang
Exhibit Hall I #345
CoMatch: Dynamic Covisibility-Aware Transformer for Bilateral Subpixel-Level Semi-Dense Image Matching Poster Session 4 & Exhibit Hall with Coffee Break
Zizhuo Li ⋅ Yifan Lu ⋅ Linfeng Tang ⋅ Shihua Zhang ⋅ Jiayi Ma
Exhibit Hall I #348
G2SF: Geometry-Guided Score Fusion for Multimodal Industrial Anomaly Detection Poster Session 5 & Exhibit Hall
Chengyu Tao ⋅ Xuanming Cao ⋅ Juan Du
Exhibit Hall I #70
PASTA: Part-Aware Sketch-to-3D Shape Generation with Text-Aligned Prior Poster Session 4 & Exhibit Hall with Coffee Break
Seunggwan Lee ⋅ Hwanhee Jung ⋅ ByoungSoo Koh ⋅ Qixing Huang ⋅ Sang Yoon ⋅ Sangpil Kim
Exhibit Hall I #354
Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation Poster Session 4 & Exhibit Hall with Coffee Break
Yuqing Wang ⋅ Zhijie Lin ⋅ Yao Teng ⋅ Yuanzhi Zhu ⋅ Shuhuai Ren ⋅ Jiashi Feng ⋅ Xihui Liu
Exhibit Hall I #355
Gain-MLP: Improving HDR Gain Map Encoding via a Lightweight MLP Poster Session 4 & Exhibit Hall with Coffee Break
Trevor Canham ⋅ SaiKiran Tedla ⋅ Michael Murdoch ⋅ Michael Brown
Exhibit Hall I #357
From Prompt to Progression: Taming Video Diffusion Models for Seamless Attribute Transition Poster Session 4 & Exhibit Hall with Coffee Break
Ling Lo ⋅ Kelvin Chan ⋅ Wen-Huang Cheng ⋅ Ming-Hsuan Yang
Exhibit Hall I #360
Dense2MoE: Restructuring Diffusion Transformer to MoE for Efficient Text-to-Image Generation Poster Session 4 & Exhibit Hall with Coffee Break
Youwei Zheng ⋅ Yuxi Ren ⋅ Xin Xia ⋅ Xuefeng Xiao ⋅ Xiaohua Xie
Exhibit Hall I #361
Less-to-More Generalization: Unlocking More Controllability by In-Context Generation Poster Session 4 & Exhibit Hall with Coffee Break
shaojin wu ⋅ Mengqi Huang ⋅ wenxu wu ⋅ Yufeng Cheng ⋅ Fei Ding ⋅ Qian HE
Exhibit Hall I #363
Sparse Fine-Tuning of Transformers for Generative Tasks Poster Session 4 & Exhibit Hall with Coffee Break
Wei Chen ⋅ Jingxi Yu ⋅ Zichen Miao ⋅ Qiang Qiu
Exhibit Hall I #365
FlexGen: Flexible Multi-View Generation from Text and Image Inputs Poster Session 4 & Exhibit Hall with Coffee Break
Xinli Xu ⋅ Wenhang Ge ⋅ Jiantao Lin ⋅ Jiawei Feng ⋅ Lie XU ⋅ hanfeng Zhao ⋅ Shunsi Zhang ⋅ Ying-Cong Chen
Exhibit Hall I #366
Prompt-A-Video: Prompt Your Video Diffusion Model via Preference-Aligned LLM Poster Session 4 & Exhibit Hall with Coffee Break
Yatai Ji ⋅ Jiacheng Zhang ⋅ Jie Wu ⋅ Shilong Zhang ⋅ Shoufa Chen ⋅ Chongjian GE ⋅ Peize Sun ⋅ Weifeng Chen ⋅ Wenqi Shao ⋅ Xuefeng Xiao ⋅ Weilin Huang ⋅ Ping Luo
Exhibit Hall I #367
Dynamic-VLM: Simple Dynamic Visual Token Compression for VideoLLM Poster Session 5 & Exhibit Hall
Han Wang ⋅ Yuxiang Nie ⋅ Yongjie Ye ⋅ Yanjie Wang ⋅ SHUAI LI ⋅ Haiyang Yu ⋅ Jinghui Lu ⋅ Can Huang
Exhibit Hall I #96
Learning Implicit Features with Flow-Infused Transformations for Realistic Virtual Try-On Poster Session 4 & Exhibit Hall with Coffee Break
Delong Zhang ⋅ Qiwei Huang ⋅ Yang Sun ⋅ Yuanliu Liu ⋅ Wei-Shi Zheng ⋅ Pengfei Xiong ⋅ Wei Zhang
Exhibit Hall I #368
AIGI-Holmes: Towards Explainable and Generalizable AI-Generated Image Detection via Multimodal Large Language Models Poster Session 4 & Exhibit Hall with Coffee Break
Ziyin Zhou ⋅ Yunpeng Luo ⋅ Yuanchen Wu ⋅ Ke Sun ⋅ Jiayi Ji ⋅ Ke Yan ⋅ Shouhong Ding ⋅ Xiaoshuai Sun ⋅ Yunsheng Wu ⋅ Rongrong Ji
Exhibit Hall I #369
GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation Poster Session 4 & Exhibit Hall with Coffee Break
Tianwei Xiong ⋅ Jun Hao Liew ⋅ Zilong Huang ⋅ Jiashi Feng ⋅ Xihui Liu
Exhibit Hall I #371
Collaborative Instance Object Navigation: Leveraging Uncertainty-Awareness to Minimize Human-Agent Dialogues Poster Session 4 & Exhibit Hall with Coffee Break
Francesco Taioli ⋅ Edoardo Zorzi ⋅ Gianni Franchi ⋅ Alberto Castellini ⋅ Alessandro Farinelli ⋅ Marco Cristani ⋅ Yiming Wang
Exhibit Hall I #372
Puppet-Master: Scaling Interactive Video Generation as a Motion Prior for Part-Level Dynamics Poster Session 3 & Exhibit Hall
Ruining Li ⋅ Chuanxia Zheng ⋅ Christian Rupprecht ⋅ Andrea Vedaldi
Exhibit Hall I #321
ReasonVQA: A Multi-hop Reasoning Benchmark with Structural Knowledge for Visual Question Answering Poster Session 4 & Exhibit Hall with Coffee Break
Duong T. Tran ⋅ Trung-Kien Tran ⋅ Manfred Hauswirth ⋅ Danh Le-Phuoc
Exhibit Hall I #373
LOCATEdit: Graph Laplacian Optimized Cross Attention for Localized Text-Guided Image Editing Poster Session 4 & Exhibit Hall with Coffee Break
Achint Soni ⋅ Meet Soni ⋅ Sirisha Rambhatla
Exhibit Hall I #374
Learned Image Compression with Hierarchical Progressive Context Modeling Poster Session 4 & Exhibit Hall with Coffee Break
Yuqi Li ⋅ Haotian Zhang ⋅ Li Li ⋅ Dong Liu
Exhibit Hall I #377
Early Timestep Zero-Shot Candidate Selection for Instruction-Guided Image Editing Poster Session 4 & Exhibit Hall with Coffee Break
Joowon Kim ⋅ Ziseok Lee ⋅ Donghyeon Cho ⋅ Sanghyun Jo ⋅ Yeonsung Jung ⋅ Kyungsu Kim ⋅ Eunho Yang
Exhibit Hall I #378
Teleportraits: Training-Free People Insertion into Any Scene Poster Session 4 & Exhibit Hall with Coffee Break
Jialu Gao ⋅ Joseph K J ⋅ Fernando De la Torre
Exhibit Hall I #380
DCT-Shield: A Robust Frequency Domain Defense against Malicious Image Editing Poster Session 4 & Exhibit Hall with Coffee Break
Aniruddha Bala ⋅ Rohit Chowdhury ⋅ Rohan Jaiswal ⋅ Siddharth Roheda
Exhibit Hall I #381
Context Guided Transformer Entropy Modeling for Video Compression Poster Session 4 & Exhibit Hall with Coffee Break
Junlong Tong ⋅ Wei Zhang ⋅ Yaohui Jin ⋅ Xiaoyu Shen
Exhibit Hall I #382
UIP2P: Unsupervised Instruction-based Image Editing via Edit Reversibility Constraint Poster Session 4 & Exhibit Hall with Coffee Break
Enis Simsar ⋅ Alessio Tonioni ⋅ Yongqin Xian ⋅ Thomas Hofmann ⋅ Federico Tombari
Exhibit Hall I #385
DiT4SR: Taming Diffusion Transformer for Real-World Image Super-Resolution Poster Session 4 & Exhibit Hall with Coffee Break
Zheng-Peng Duan ⋅ jiawei zhang ⋅ Xin Jin ⋅ Ziheng Zhang ⋅ Zheng Xiong ⋅ Dongqing Zou ⋅ Jimmy Ren ⋅ Chun-Le Guo ⋅ Chongyi Li
Exhibit Hall I #390
USP: Unified Self-Supervised Pretraining for Image Generation and Understanding Poster Session 4 & Exhibit Hall with Coffee Break
Xiangxiang Chu ⋅ Renda Li ⋅ Yong Wang
Exhibit Hall I #344
Bi-Level Optimization for Self-Supervised AI-Generated Face Detection Poster Session 4 & Exhibit Hall with Coffee Break
Mian Zou ⋅ Nan Zhong ⋅ Baosheng Yu ⋅ Yibing Zhan ⋅ Kede Ma
Exhibit Hall I #391
VisualCloze: A Universal Image Generation Framework via Visual In-Context Learning Poster Session 4 & Exhibit Hall with Coffee Break
Zhong-Yu Li ⋅ Ruoyi Du ⋅ Juncheng Yan ⋅ Le Zhuo ⋅ Zhen Li ⋅ Peng Gao ⋅ Zhanyu Ma ⋅ Ming-Ming Cheng
Exhibit Hall I #392
Neighboring Autoregressive Modeling for Efficient Visual Generation Poster Session 4 & Exhibit Hall with Coffee Break
Yefei He ⋅ Yuanyu He ⋅ Shaoxuan He ⋅ Feng Chen ⋅ Hong Zhou ⋅ Kaipeng Zhang ⋅ Bohan Zhuang
Exhibit Hall I #395
FastVAR: Linear Visual Autoregressive Modeling via Cached Token Pruning Poster Session 4 & Exhibit Hall with Coffee Break
Hang Guo ⋅ Yawei Li ⋅ Taolin Zhang ⋅ Jiangshan Wang ⋅ Tao Dai ⋅ Shu-Tao Xia ⋅ Luca Benini
Exhibit Hall I #396
Tune-Your-Style: Intensity-tunable 3D Style Transfer with Gaussian Splatting Poster Session 4 & Exhibit Hall with Coffee Break
Yian Zhao ⋅ rushi ye ⋅ Ruochong Zheng ⋅ Zesen Cheng ⋅ Chaoran Feng ⋅ Jiashu Yang ⋅ Pengchong Qiao ⋅ Chang Liu ⋅ Jie Chen
Exhibit Hall I #398
QK-Edit: Revisiting Attention-based Injection in MM-DiT for Image and Video Editing Poster Session 4 & Exhibit Hall with Coffee Break
Tiancheng SHEN ⋅ Jun Hao Liew ⋅ Zilong Huang ⋅ Xiangtai Li ⋅ Zhijie Lin ⋅ Jiyang Liu ⋅ Yitong Wang ⋅ Jiashi Feng ⋅ Ming-Hsuan Yang
Exhibit Hall I #399
Beyond Isolated Words: Diffusion Brush for Handwritten Text-Line Generation Poster Session 4 & Exhibit Hall with Coffee Break
Gang Dai ⋅ Yifan Zhang ⋅ Yutao Qin ⋅ Qiangya Guo ⋅ Shuangping Huang ⋅ Shuicheng YAN
Exhibit Hall I #400
Always Skip Attention Poster Session 5 & Exhibit Hall
Yiping Ji ⋅ Hemanth Saratchandran ⋅ Peyman Moghadam ⋅ Simon Lucey
Exhibit Hall I #313
BadVideo: Stealthy Backdoor Attack against Text-to-Video Generation Poster Session 4 & Exhibit Hall with Coffee Break
Ruotong Wang ⋅ Mingli Zhu ⋅ Jiarong Ou ⋅ Rui Chen ⋅ Xin Tao ⋅ Pengfei Wan ⋅ Baoyuan Wu
Exhibit Hall I #402
Any2AnyTryon: Leveraging Adaptive Position Embeddings for Versatile Virtual Clothing Tasks Poster Session 4 & Exhibit Hall with Coffee Break
Hailong Guo ⋅ Bohan Zeng ⋅ Yiren Song ⋅ Wentao Zhang ⋅ Jiaming Liu ⋅ Chuang Zhang
Exhibit Hall I #403
Blended Point Cloud Diffusion for Localized Text-guided Shape Editing Poster Session 4 & Exhibit Hall with Coffee Break
Etai Sella ⋅ Noam Atia ⋅ Ron Mokady ⋅ Hadar Averbuch-Elor
Exhibit Hall I #406
VSC: Visual Search Compositional Text-to-Image Diffusion Model Poster Session 4 & Exhibit Hall with Coffee Break
Do Dat ⋅ Nam Hyeon-Woo ⋅ Po-Yuan Mao ⋅ Tae-Hyun Oh
Exhibit Hall I #409
Fine-Tuning Visual Autogressive Models for Subject-Driven Generation Poster Session 4 & Exhibit Hall with Coffee Break
Jiwoo Chung ⋅ Sangeek Hyun ⋅ Hyunjun Kim ⋅ Eunseo Koh ⋅ Minkyu Lee ⋅ Jae-Pil Heo
Exhibit Hall I #411
Flow to the Mode: Mode-Seeking Diffusion Autoencoders for State-of-the-Art Image Tokenization Poster Session 4 & Exhibit Hall with Coffee Break
Kyle Sargent ⋅ Kyle Hsu ⋅ Justin Johnson ⋅ Li Fei-Fei ⋅ Jiajun Wu
Exhibit Hall I #439
Pretrained Reversible Generation as Unsupervised Visual Representation Learning Poster Session 4 & Exhibit Hall with Coffee Break
Rongkun Xue ⋅ Jinouwen Zhang ⋅ Yazhe Niu ⋅ Dazhong Shen ⋅ Bingqi Ma ⋅ Yu Liu ⋅ Jing Yang
Exhibit Hall I #415
DLF: Extreme Image Compression with Dual-generative Latent Fusion Poster Session 4 & Exhibit Hall with Coffee Break
Naifu Xue ⋅ Zhaoyang Jia ⋅ Jiahao Li ⋅ Bin Li ⋅ Yuan Zhang ⋅ Yan Lu
Exhibit Hall I #416
Tracing Copied Pixels and Regularizing Patch Affinity in Copy Detection Poster Session 4 & Exhibit Hall with Coffee Break
Yichen Lu ⋅ Siwei Nie ⋅ Minlong Lu ⋅ Xudong Yang ⋅ Xiaobo Zhang ⋅ Peng Zhang
Exhibit Hall I #418
Beyond Brain Decoding: Visual-Semantic Reconstructions to Mental Creation Extension Based on fMRI Poster Session 4 & Exhibit Hall with Coffee Break
Haodong Jing ⋅ Dongyao Jiang ⋅ Yongqiang Ma ⋅ Haibo Hua ⋅ Bo Huang ⋅ Nanning Zheng
Exhibit Hall I #419
Exploiting Domain Properties in Language-Driven Domain Generalization for Semantic Segmentation Poster Session 5 & Exhibit Hall
Seogkyu Jeon ⋅ Kibeom Hong ⋅ Hyeran Byun
Exhibit Hall I #94
PixTalk: Controlling Photorealistic Image Processing and Editing with Language Poster Session 4 & Exhibit Hall with Coffee Break
Marcos Conde ⋅ Zihao Lu ⋅ Radu Timofte
Exhibit Hall I #420
ADCD-Net: Robust Document Image Forgery Localization via Adaptive DCT Feature and Hierarchical Content Disentanglement Poster Session 4 & Exhibit Hall with Coffee Break
KA WONG ⋅ Jicheng Zhou ⋅ Haiwei Wu ⋅ Yain-Whar Si ⋅ Jiantao Zhou
Exhibit Hall I #421
Towards Robust Defense against Customization via Protective Perturbation Resistant to Diffusion-based Purification Poster Session 4 & Exhibit Hall with Coffee Break
Wenkui Yang ⋅ Jie Cao ⋅ Junxian Duan ⋅ Ran He
Exhibit Hall I #422
A Unified Framework for Industrial Cel-Animation Colorization with Temporal-Structural Awareness Poster Session 4 & Exhibit Hall with Coffee Break
Xiaoyi Feng ⋅ Tao Huang ⋅ Peng Wang ⋅ Zizhou Huang ⋅ Haihang Zhang ⋅ Yuntao Zou ⋅ Dagang Li ⋅ Kaifeng Zou
Exhibit Hall I #423
Generative Video Bi-flow Poster Session 4 & Exhibit Hall with Coffee Break
Chen Liu ⋅ Tobias Ritschel
Exhibit Hall I #429
AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation Poster Session 4 & Exhibit Hall with Coffee Break
Moayed Haji-Ali ⋅ Willi Menapace ⋅ Aliaksandr Siarohin ⋅ Ivan Skorokhodov ⋅ Alper Canberk ⋅ Kwot Sin Lee ⋅ Vicente Ordonez ⋅ Sergey Tulyakov
Exhibit Hall I #430
T2I-Copilot: A Training-Free Multi-Agent Text-to-Image System for Enhanced Prompt Interpretation and Interactive Generation Poster Session 4 & Exhibit Hall with Coffee Break
Chieh-Yun Chen ⋅ Min Shi ⋅ Gong Zhang ⋅ Humphrey Shi
Exhibit Hall I #432
LayerLock: Non-collapsing Representation Learning with Progressive Freezing Poster Session 4 & Exhibit Hall with Coffee Break
Goker Erdogan ⋅ Nikhil Parthasarathy ⋅ Catalin Ionescu ⋅ Drew Hudson ⋅ Alexander Lerchner ⋅ Andrew Zisserman ⋅ Mehdi S. M. Sajjadi ⋅ Joao Carreira
Exhibit Hall I #438
Adaptive Routing of Text-to-Image Generation Requests Between Large Cloud Model and Light-Weight Edge Model Poster Session 4 & Exhibit Hall with Coffee Break
Zewei Xin ⋅ Qinya Li ⋅ Chaoyue Niu ⋅ Fan Wu ⋅ Guihai Chen
Exhibit Hall I #440
Exploring Multimodal Diffusion Transformers for Enhanced Prompt-based Image Editing Poster Session 4 & Exhibit Hall with Coffee Break
Joonghyuk Shin ⋅ Alchan Hwang ⋅ Yujin Kim ⋅ Daneul Kim ⋅ Jaesik Park
Exhibit Hall I #441
JPEG Processing Neural Operator for Backward-Compatible Coding Poster Session 4 & Exhibit Hall with Coffee Break
Woo Kyoung Han ⋅ Yongjun Lee ⋅ Byeonghun Lee ⋅ Sang Hyun Park ⋅ Sunghoon Im ⋅ Kyong Hwan Jin
Exhibit Hall I #442
EasyControl: Adding Efficient and Flexible Control for Diffusion Transformer Poster Session 4 & Exhibit Hall with Coffee Break
Yuxuan Zhang ⋅ Yirui Yuan ⋅ Yiren Song ⋅ Haofan Wang ⋅ Jiaming Liu
Exhibit Hall I #443
All Parts Matter: A Unified Mask-Free Virtual Try-On Framework Poster Session 4 & Exhibit Hall with Coffee Break
Chenghu Du ⋅ Shengwu Xiong ⋅ Yi Rong
Exhibit Hall I #444
Function-centric Bayesian Network for Zero-Shot Object Goal Navigation Poster Session 4 & Exhibit Hall with Coffee Break
Sixian Zhang ⋅ Xinyao Yu ⋅ Xinhang Song ⋅ Yiyao Wang ⋅ Shuqiang Jiang
Exhibit Hall I #445
Attention to Neural Plagiarism: Diffusion Models Can Plagiarize Your Copyrighted Images! Poster Session 4 & Exhibit Hall with Coffee Break
zihang zou ⋅ Boqing Gong ⋅ Liqiang Wang
Exhibit Hall I #446
Distilling Parallel Gradients for Fast ODE Solvers of Diffusion Models Poster Session 4 & Exhibit Hall with Coffee Break
Beier Zhu ⋅ Ruoyu Wang ⋅ Tong Zhao ⋅ Hanwang Zhang ⋅ Chi Zhang
Exhibit Hall I #447
LATINO-PRO: LAtent consisTency INverse sOlver with PRompt Optimization Poster Session 4 & Exhibit Hall with Coffee Break
Alessio Spagnoletti ⋅ Jean Prost ⋅ Andres Almansa ⋅ Nicolas Papadakis ⋅ Marcelo Pereyra
Exhibit Hall I #451
EDiT: Efficient Diffusion Transformers with Linear Compressed Attention Poster Session 4 & Exhibit Hall with Coffee Break
Philipp Becker ⋅ Abhinav Mehrotra ⋅ Ruchika Chavhan ⋅ Malcolm Chadwick ⋅ Luca Morreale ⋅ Mehdi Noroozi ⋅ Alberto Gil Couto Pimentel Ramos ⋅ Sourav Bhattacharya
Exhibit Hall I #452
Hate in Plain Sight: On the Risks of Moderating AI-Generated Hateful Illusions Poster Session 4 & Exhibit Hall with Coffee Break
Yiting Qu ⋅ Ziqing Yang ⋅ Yihan Ma ⋅ Michael Backes ⋅ Savvas Zannettou ⋅ Yang Zhang
Exhibit Hall I #453
DC-AE 1.5: Accelerating Diffusion Model Convergence with Structured Latent Space Poster Session 4 & Exhibit Hall with Coffee Break
Junyu Chen ⋅ Dongyun Zou ⋅ Wenkun He ⋅ Junsong Chen ⋅ Enze Xie ⋅ Song Han ⋅ Han Cai
Exhibit Hall I #454
MH-LVC: Multi-Hypothesis Temporal Prediction for Learned Conditional Residual Video Coding Poster Session 4 & Exhibit Hall with Coffee Break
Gao Zong lin ⋅ Huu-Tai Phung ⋅ Yi-Chen Yao ⋅ Kuan-Wei Ho ⋅ Yi-Hsin Chen ⋅ Yu-Hsiang Lin ⋅ Alessandro Gnutti ⋅ Wen-Hsiao Peng
Exhibit Hall I #456
Sculpting Memory: Multi-Concept Forgetting in Diffusion Models via Dynamic Mask and Concept-Aware Optimization Poster Session 4 & Exhibit Hall with Coffee Break
Li ⋅ Yang Xiao ⋅ Jie Ji ⋅ Kaiyuan Deng ⋅ Bo Hui ⋅ Linke Guo ⋅ Xiaolong Ma
Exhibit Hall I #457
On the Provable Importance of Gradients for Autonomous Language-Assisted Image Clustering Poster Session 5 & Exhibit Hall
Bo Peng ⋅ Jie Lu ⋅ Guangquan Zhang ⋅ Zhen Fang
Exhibit Hall I #1
Inter2Former: Dynamic Hybrid Attention for Efficient High-Precision Interactive Segmentation Poster Session 5 & Exhibit Hall
You Huang ⋅ Lichao Chen ⋅ Jiayi Ji ⋅ Liujuan Cao ⋅ Shengchuan Zhang ⋅ Rongrong Ji
Exhibit Hall I #2
OphCLIP: Hierarchical Retrieval-Augmented Learning for Ophthalmic Surgical Video-Language Pretraining Poster Session 5 & Exhibit Hall
Ming Hu ⋅ Kun yuan ⋅ Yaling Shen ⋅ feilong tang ⋅ Xiaohao Xu ⋅ Lin Zhou ⋅ Wei Li ⋅ Ying Chen ⋅ Zhongxing Xu ⋅ Zelin Peng ⋅ Siyuan Yan ⋅ Vinkle Srivastav ⋅ Diping Song ⋅ Tianbin Li ⋅ Danli Shi ⋅ Jin Ye ⋅ Nicolas Padoy ⋅ Nassir Navab ⋅ Junjun He ⋅ Zongyuan Ge
Exhibit Hall I #4
HiERO: Understanding the Hierarchy of Human Behavior Enhances Reasoning on Egocentric Videos Poster Session 5 & Exhibit Hall
Simone Alberto Peirone ⋅ Francesca Pistilli ⋅ Giuseppe Averta
Exhibit Hall I #6
CaptionSmiths: Flexibly Controlling Language Pattern in Image Captioning Poster Session 5 & Exhibit Hall
Kuniaki Saito ⋅ Donghyun Kim ⋅ Kwanyong Park ⋅ Atsushi Hashimoto ⋅ Yoshitaka Ushiku
Exhibit Hall I #7
An Efficient Hybrid Vision Transformer for TinyML Applications Poster Session 5 & Exhibit Hall
Fanhong Zeng ⋅ Huanan LI ⋅ Juntao Guan ⋅ Rui Fan ⋅ Tong Wu ⋅ Xilong Wang ⋅ Lai Rui
Exhibit Hall I #11
Graph Domain Adaptation with Dual-branch Encoder and Two-level Alignment for Whole Slide Image-based Survival Prediction Poster Session 5 & Exhibit Hall
Yuntao Shou ⋅ Xiangyong Cao ⋅ PeiqiangYan PeiqiangYan ⋅ Qiaohui Qiaohui ⋅ Qian Zhao ⋅ Deyu Meng
Exhibit Hall I #12
CNS-Bench: Benchmarking Image Classifier Robustness Under Continuous Nuisance Shifts Poster Session 5 & Exhibit Hall
Olaf Dünkel ⋅ Artur Jesslen ⋅ Jiahao Xie ⋅ Christian Theobalt ⋅ Christian Rupprecht ⋅ Adam Kortylewski
Exhibit Hall I #17
Visual Test-time Scaling for GUI Agent Grounding Poster Session 5 & Exhibit Hall
Tiange Luo ⋅ Lajanugen Logeswaran ⋅ Justin Johnson ⋅ Honglak Lee
Exhibit Hall I #18
Multi-Schema Proximity Network for Composed Image Retrieval Poster Session 5 & Exhibit Hall
Jiangming Shi ⋅ Xiangbo Yin ⋅ yeyunchen yeyunchen ⋅ Yachao Zhang ⋅ zhizhong zhang ⋅ Yuan Xie ⋅ Yanyun Qu
Exhibit Hall I #19
ReferDINO: Referring Video Object Segmentation with Visual Grounding Foundations Poster Session 5 & Exhibit Hall
Tianming Liang ⋅ Kun-Yu Lin ⋅ Chaolei Tan ⋅ Jianguo Zhang ⋅ Wei-Shi Zheng ⋅ Jian-Fang Hu
Exhibit Hall I #20
GECKO: Gigapixel Vision-Concept Contrastive Pretraining in Histopathology Poster Session 5 & Exhibit Hall
Saarthak Kapse ⋅ Pushpak Pati ⋅ Srikar Yellapragada ⋅ Srijan Das ⋅ Rajarsi Gupta ⋅ Joel Saltz ⋅ Dimitris Samaras ⋅ Prateek Prasanna
Exhibit Hall I #21
Lumina-Image 2.0: A Unified and Efficient Image Generative Framework Poster Session 5 & Exhibit Hall
Qi Qin ⋅ Le Zhuo ⋅ Yi Xin ⋅ Ruoyi Du ⋅ Zhen Li ⋅ Bin Fu ⋅ Yiting Lu ⋅ Xinyue Li ⋅ Dongyang Liu ⋅ Xiangyang Zhu ⋅ Will Beddow ⋅ Erwann Millon ⋅ Victor Perez ⋅ Wenhai Wang ⋅ Yu Qiao ⋅ Bo Zhang ⋅ Xiaohong Liu ⋅ Hongsheng Li ⋅ Chang Xu ⋅ Peng Gao
Exhibit Hall I #22
DiSCO-3D : Discovering and Segmenting Sub-Concepts from Open-vocabulary Queries in NeRF Poster Session 5 & Exhibit Hall
Doriand Petit ⋅ Steve Bourgeois ⋅ Vincent Gay-Bellile ⋅ Florian Chabot ⋅ Loïc Barthe
Exhibit Hall I #23
ESCNet:Edge-Semantic Collaborative Network for Camouflaged Object Detection Poster Session 5 & Exhibit Hall
Sheng Ye ⋅ Xin Chen ⋅ Yan Zhang ⋅ Xianming Lin ⋅ Liujuan Cao
Exhibit Hall I #24
Test-time Adaptation for Foundation Medical Segmentation Model Without Parametric Updates Poster Session 5 & Exhibit Hall
Kecheng Chen ⋅ Xinyu Luo ⋅ Tiexin Qin ⋅ Jie Liu ⋅ Hui Liu ⋅ Victor Ho Fun Lee ⋅ Hong Yan ⋅ Haoliang Li
Exhibit Hall I #26
ResQ: A Novel Framework to Implement Residual Neural Networks on Analog Rydberg Atom Quantum Computers Poster Session 5 & Exhibit Hall
Nicholas DiBrita ⋅ Jason Han ⋅ Tirthak Patel
Exhibit Hall I #27
M-Net: MRI Brain Tumor Sequential Segmentation Network via Mesh-Cast Poster Session 5 & Exhibit Hall
Jiacheng Lu ⋅ Hui Ding ⋅ Shiyu Zhang ⋅ Guoping Huo
Exhibit Hall I #30
Moment Quantization for Video Temporal Grounding Poster Session 5 & Exhibit Hall
Xiaolong Sun ⋅ Le Wang ⋅ Sanping Zhou ⋅ Liushuai Shi ⋅ Kun Xia ⋅ Mengnan Liu ⋅ Yabing Wang ⋅ Gang Hua
Exhibit Hall I #32
SVTRv2: CTC Beats Encoder-Decoder Models in Scene Text Recognition Poster Session 5 & Exhibit Hall
Yongkun Du ⋅ Zhineng Chen ⋅ Hongtao Xie ⋅ Caiyan Jia ⋅ Yu-Gang Jiang
Exhibit Hall I #33
ROVI: A VLM-LLM Re-Captioned Dataset for Open-Vocabulary Instance-Grounded Text-to-Image Generation Poster Session 5 & Exhibit Hall
Cihang Peng ⋅ Qiming HOU ⋅ Zhong Ren ⋅ Kun Zhou
Exhibit Hall I #38
S⁴M: Boosting Semi-Supervised Instance Segmentation with SAM Poster Session 5 & Exhibit Hall
Heeji Yoon ⋅ Heeseong Shin ⋅ Eunbeen Hong ⋅ Hyunwook Choi ⋅ Hansang Cho ⋅ Daun Jeong ⋅ Seungryong Kim
Exhibit Hall I #40
Structure-aware Semantic Discrepancy and Consistency for 3D Medical Image Self-supervised Learning Poster Session 5 & Exhibit Hall
Tan Pan ⋅ Zhaorui Tan ⋅ Kaiyu Guo ⋅ Dongli Xu ⋅ Weidi Xu ⋅ Chen Jiang ⋅ Xin Guo ⋅ Yuan Qi ⋅ Yuan Cheng
Exhibit Hall I #43
ARGUS: Hallucination and Omission Evaluation in Video-LLMs Poster Session 5 & Exhibit Hall
Ruchit Rawal ⋅ Reza Shirkavand ⋅ Heng Huang ⋅ Gowthami Somepalli ⋅ Tom Goldstein
Exhibit Hall I #45
Feature Purification Matters: Suppressing Outlier Propagation for Training-Free Open-Vocabulary Semantic Segmentation Poster Session 5 & Exhibit Hall
Shuo Jin ⋅ Siyue Yu ⋅ Bingfeng Zhang ⋅ Mingjie Sun ⋅ Yi Dong ⋅ Jimin XIAO
Exhibit Hall I #46
DiffPS: Leveraging Prior Knowledge of Diffusion Model for Person Search Poster Session 5 & Exhibit Hall
Giyeol Kim ⋅ Sooyoung Yang ⋅ Jihyong Oh ⋅ Myungjoo Kang ⋅ Chanho Eom
Exhibit Hall I #47
Mind the Gap: Aligning Vision Foundation Models to Image Feature Matching Poster Session 5 & Exhibit Hall
Yuhan Liu ⋅ Jingwen Fu ⋅ Yang Wu ⋅ Kangyi Wu ⋅ Pengna Li ⋅ Jiayi Wu ⋅ Sanping Zhou ⋅ Jingmin Xin
Exhibit Hall I #48
COIN: Confidence Score-Guided Distillation for Annotation-Free Cell Segmentation Poster Session 5 & Exhibit Hall
Sanghyun Jo ⋅ Seo Lee ⋅ Seungwoo Lee ⋅ Seohyung Hong ⋅ Hyungseok Seo ⋅ Kyungsu Kim
Exhibit Hall I #49
OVG-HQ: Online Video Grounding with Hybrid-modal Queries Poster Session 5 & Exhibit Hall
Runhao Zeng ⋅ Jiaqi Mao ⋅ Minghao Lai ⋅ Vu Phan ⋅ Yanjie Dong ⋅ Wei Wang ⋅ Qi Chen ⋅ Xiping Hu
Exhibit Hall I #120
Learn2Synth: Learning Optimal Data Synthesis Using Hypergradients for Brain Image Segmentation Poster Session 5 & Exhibit Hall
Xiaoling Hu ⋅ Xiangrui Zeng ⋅ Oula Puonti ⋅ Juan Iglesias ⋅ Bruce Fischl ⋅ Yaël Balbastre
Exhibit Hall I #53
Representation Shift: Unifying Token Compression with FlashAttention Poster Session 5 & Exhibit Hall
Joonmyung Choi ⋅ Sanghyeok Lee ⋅ Byungoh Ko ⋅ Eunseo Kim ⋅ Jihyung Kil ⋅ Hyunwoo Kim
Exhibit Hall I #61
ZipVL: Accelerating Vision-Language Models through Dynamic Token Sparsity Poster Session 5 & Exhibit Hall
Yefei He ⋅ Feng Chen ⋅ Jing Liu ⋅ Wenqi Shao ⋅ Hong Zhou ⋅ Kaipeng Zhang ⋅ Bohan Zhuang
Exhibit Hall I #63
ProSAM: Enhancing the Robustness of SAM-based Visual Reference Segmentation with Probabilistic Prompts Poster Session 5 & Exhibit Hall
Xiaoqi Wang ⋅ Clint Sebastian ⋅ Wenbin He ⋅ Liu Ren
Exhibit Hall I #64
LaCoOT: Layer Collapse through Optimal Transport Poster Session 5 & Exhibit Hall
Victor Quétu ⋅ Zhu LIAO ⋅ Nour Hezbri ⋅ Fabio Pizzati ⋅ Enzo Tartaglione
Exhibit Hall I #65
Fuzzy Contrastive Decoding to Alleviate Object Hallucination in Large Vision-Language Models Poster Session 5 & Exhibit Hall
Jieun Kim ⋅ Jinmyeong Kim ⋅ Yoonji Kim ⋅ Sung-Bae Cho
Exhibit Hall I #72
Semantic versus Identity: A Divide-and-Conquer Approach towards Adjustable Medical Image De-Identification Poster Session 5 & Exhibit Hall
Yuan Tian ⋅ Shuo Wang ⋅ Rongzhao Zhang ⋅ Zijian Chen ⋅ Yankai Jiang ⋅ Chunyi Li ⋅ Xiangyang Zhu ⋅ Fang Yan ⋅ Qiang Hu ⋅ Xiaosong Wang ⋅ Guangtao Zhai
Exhibit Hall I #78
Towards Video Thinking Test: A Holistic Benchmark for Advanced Video Reasoning and Understanding Poster Session 5 & Exhibit Hall
Yuanhan Zhang ⋅ Yunice Chew ⋅ Yuhao Dong ⋅ Aria Leo ⋅ Bo Hu ⋅ Ziwei Liu
Exhibit Hall I #79
Cross-View Isolated Sign Language Recognition via View Synthesis and Feature Disentanglement Poster Session 5 & Exhibit Hall
Xin Shen ⋅ Xinyu Wang ⋅ Lei Shen ⋅ Kaihao Zhang ⋅ Xin Yu
Exhibit Hall I #81
Geo4D: Leveraging Video Generators for Geometric 4D Scene Reconstruction Poster Session 5 & Exhibit Hall
Zeren Jiang ⋅ Chuanxia Zheng ⋅ Iro Laina ⋅ Diane Larlus ⋅ Andrea Vedaldi
Exhibit Hall I #82
Fix-CLIP: Dual-Branch Hierarchical Contrastive Learning via Synthetic Captions for Better Understanding of Long Text Poster Session 5 & Exhibit Hall
Bingchao Wang ⋅ Zhiwei Ning ⋅ Jianyu Ding ⋅ Xuanang Gao ⋅ Yin Li ⋅ Dongsheng Jiang ⋅ JIE YANG ⋅ Wei Liu
Exhibit Hall I #85
Superpowering Open-Vocabulary Object Detectors for X-ray Vision Poster Session 5 & Exhibit Hall
Pablo Garcia-Fernandez ⋅ Lorenzo Vaquero ⋅ Mingxuan Liu ⋅ Feng Xue ⋅ Daniel Cores ⋅ Nicu Sebe ⋅ Manuel Mucientes ⋅ Elisa Ricci
Exhibit Hall I #92
RhythmGuassian: Repurposing Generalizable Gaussian Model For Remote Physiological Measurement Poster Session 5 & Exhibit Hall
Hao LU ⋅ Yuting Zhang ⋅ Jiaqi Tang ⋅ Bowen Fu ⋅ Wenhang Ge ⋅ Wei Wei ⋅ Kaishun Wu ⋅ Ying-Cong Chen
Exhibit Hall I #93
Beyond Text-Visual Attention: Exploiting Visual Cues for Effective Token Pruning in VLMs Poster Session 5 & Exhibit Hall
Qizhe Zhang ⋅ Aosong Cheng ⋅ Ming Lu ⋅ Renrui Zhang ⋅ Zhiyong Zhuo ⋅ Jiajun Cao ⋅ Shaobo Guo ⋅ Qi She ⋅ Shanghang Zhang
Exhibit Hall I #100
MaTVLM: Hybrid Mamba-Transformer for Efficient Vision-Language Modeling Poster Session 5 & Exhibit Hall
Yingyue Li ⋅ Bencheng Liao ⋅ Wenyu Liu ⋅ Xinggang Wang
Exhibit Hall I #102
UniConvNet: Expanding Effective Receptive Field while Maintaining Asymptotically Gaussian Distribution for ConvNets of Any Scale Poster Session 5 & Exhibit Hall
Yuhao Wang ⋅ Wei Xi
Exhibit Hall I #106
On the Recovery of Cameras from Fundamental Matrices Poster Session 5 & Exhibit Hall
Rakshith Madhavan ⋅ Federica Arrigoni
Exhibit Hall I #107
Wavelet Policy: Lifting Scheme for Policy Learning in Long-Horizon Tasks Poster Session 3 & Exhibit Hall
Hao Huang ⋅ Shuaihang Yuan ⋅ Geeta Chandra Raju Bethala ⋅ Congcong Wen ⋅ Anthony Tzes ⋅ Yi Fang
Exhibit Hall I #220
MultiverSeg: Scalable Interactive Segmentation of Biomedical Imaging Datasets with In-Context Guidance Poster Session 5 & Exhibit Hall
Hallee Wong ⋅ Jose Javier Gonzalez Ortiz ⋅ John Guttag ⋅ Adrian Dalca
Exhibit Hall I #110
The Devil is in the Spurious Correlations: Boosting Moment Retrieval with Dynamic Learning Poster Session 5 & Exhibit Hall
Xinyang Zhou ⋅ Fanyue Wei ⋅ Lixin Duan ⋅ Angela Yao ⋅ Wen Li
Exhibit Hall I #111
CABLD: Contrast-Agnostic Brain Landmark Detection with Consistency-Based Regularization Poster Session 5 & Exhibit Hall
Soorena Salari ⋅ Arash Harirpoush ⋅ Hassan Rivaz ⋅ Yiming Xiao
Exhibit Hall I #112
EVEv2: Improved Baselines for Encoder-Free Vision-Language Models Poster Session 5 & Exhibit Hall
Haiwen Diao ⋅ Xiaotong Li ⋅ Yufeng Cui ⋅ Yueze Wang ⋅ Haoge Deng ⋅ Ting Pan ⋅ Wenxuan Wang ⋅ Huchuan Lu ⋅ Xinlong Wang
Exhibit Hall I #114
Describe, Adapt and Combine: Empowering CLIP Encoders for Open-set 3D Object Retrieval Poster Session 5 & Exhibit Hall
Zhichuan Wang ⋅ Yang Zhou ⋅ Zhe Liu ⋅ Rui Yu ⋅ Song Bai ⋅ Yulong Wang ⋅ Xinwei He ⋅ Xiang Bai
Exhibit Hall I #115
Robustifying Zero-Shot Vision Language Models by Subspaces Alignment Poster Session 5 & Exhibit Hall
Junhao Dong ⋅ Piotr Koniusz ⋅ Liaoyuan Feng ⋅ Yifei Zhang ⋅ Hao Zhu ⋅ Weiming Liu ⋅ Xinghua Qu ⋅ YEW-SOON ONG
Exhibit Hall I #116
V2PE: Improving Multimodal Long-Context Capability of Vision-Language Models with Variable Visual Position Encoding Poster Session 5 & Exhibit Hall
Junqi Ge ⋅ Ziyi Chen ⋅ Jintao Lin ⋅ Jinguo Zhu ⋅ Xihui Liu ⋅ Jifeng Dai ⋅ Xizhou Zhu
Exhibit Hall I #119
Enhancing Zero-shot Object Counting via Text-guided Local Ranking and Number-evoked Global Attention Poster Session 5 & Exhibit Hall
Shiwei Zhang ⋅ Qi Zhou ⋅ Wei Ke
Exhibit Hall I #121
SegAnyPET: Universal Promptable Segmentation from Positron Emission Tomography Images Poster Session 5 & Exhibit Hall
Yichi Zhang ⋅ Le Xue ⋅ Wenbo zhang ⋅ Lanlan Li ⋅ Yuchen Liu ⋅ Chen Jiang ⋅ Yuan Cheng ⋅ Yuan Qi
Exhibit Hall I #122
Multi-View Slot Attention Using Paraphrased Texts for Face Anti-Spoofing Poster Session 5 & Exhibit Hall
Jeongmin Yu ⋅ Susang Kim ⋅ Kisu Lee ⋅ Taekyoung Kwon ⋅ Won-Yong Shin ⋅ Ha Young Kim
Exhibit Hall I #123
4D-Bench: Benchmarking Multi-modal Large Language Models for 4D Object Understanding Poster Session 5 & Exhibit Hall
Wenxuan Zhu ⋅ Bing Li ⋅ Cheng Zheng ⋅ Jinjie Mai ⋅ Jun Chen ⋅ Letian Jiang ⋅ Abdullah Hamdi ⋅ Sara Rojas Martinez ⋅ Chia-Wen Lin ⋅ Mohamed Elhoseiny ⋅ Bernard Ghanem
Exhibit Hall I #124
Uncertainty-Driven Expert Control: Enhancing the Reliability of Medical Vision-Language Models Poster Session 5 & Exhibit Hall
Xiao Liang ⋅ Di Wang ⋅ Zhicheng Jiao ⋅ Ronghan Li ⋅ Pengfei Yang ⋅ Quan Wang ⋅ Tat-Seng Chua
Exhibit Hall I #125
OuroMamba: A Data-Free Quantization Framework for Vision Mamba Poster Session 5 & Exhibit Hall
Akshat Ramachandran ⋅ Mingyu Lee ⋅ Huan Xu ⋅ Souvik Kundu ⋅ Tushar Krishna
Exhibit Hall I #128
Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers Poster Session 5 & Exhibit Hall
Weiming Ren ⋅ Wentao Ma ⋅ Huan Yang ⋅ Cong Wei ⋅ Ge Zhang ⋅ Wenhu Chen
Exhibit Hall I #130
SAMora: Enhancing SAM through Hierarchical Self-Supervised Pre-Training for Medical Images Poster Session 5 & Exhibit Hall
Shuhang Chen ⋅ Hangjie Yuan ⋅ Pengwei Liu ⋅ Hanxue Gu ⋅ Tao Feng ⋅ Dong Ni
Exhibit Hall I #131
FE-CLIP: Frequency Enhanced CLIP Model for Zero-Shot Anomaly Detection and Segmentation Poster Session 5 & Exhibit Hall
Tao Gong ⋅ Qi Chu ⋅ Bin Liu ⋅ Zhou Wei ⋅ Nenghai Yu
Exhibit Hall I #132
Referring Expression Comprehension for Small Objects Poster Session 5 & Exhibit Hall
Kanoko Goto ⋅ Takumi Hirose ⋅ Mahiro Ukai ⋅ Shuhei Kurita ⋅ Nakamasa Inoue
Exhibit Hall I #133
AID: Adapting Image2Video Diffusion Models for Instruction-guided Video Prediction Poster Session 5 & Exhibit Hall
Zhen Xing ⋅ Qi Dai ⋅ Zejia Weng ⋅ Zuxuan Wu ⋅ Yu-Gang Jiang
Exhibit Hall I #134
CutS3D: Cutting Semantics in 3D for 2D Unsupervised Instance Segmentation Poster Session 5 & Exhibit Hall
Leon Sick ⋅ Dominik Engel ⋅ Sebastian Hartwig ⋅ Pedro Hermosilla ⋅ Timo Ropinski
Exhibit Hall I #136
Text-guided Visual Prompt DINO for Generic Segmentation Poster Session 5 & Exhibit Hall
Yuchen Guan ⋅ Chong Sun ⋅ Canmiao Fu ⋅ Zhipeng Huang ⋅ Chun Yuan ⋅ Chen Li
Exhibit Hall I #138
ETVA: Evaluation of Text-to-Video Alignment via Fine-grained Question Generation and Answering Poster Session 5 & Exhibit Hall
Kaisi Guan ⋅ Zhengfeng Lai ⋅ Yuchong Sun ⋅ Peng Zhang ⋅ Wei Liu ⋅ Xiaojiang Liu ⋅ Meng Cao ⋅ Ruihua Song
Exhibit Hall I #139
GEMeX: A Large-Scale, Groundable, and Explainable Medical VQA Benchmark for Chest X-ray Diagnosis Poster Session 5 & Exhibit Hall
Bo Liu ⋅ Ke Zou ⋅ Li-Ming Zhan ⋅ ZEXIN LU ⋅ Xiaoyu DONG ⋅ Chengqiang Xie ⋅ Yidi Chen ⋅ Jiannong Cao ⋅ Xiao-Ming Wu ⋅ Huazhu Fu
Exhibit Hall I #140
Bias-Resilient Weakly Supervised Semantic Segmentation Using Normalizing Flows Poster Session 5 & Exhibit Hall
Xianglin Qiu ⋅ Xiaoyang Wang ⋅ Zhen Zhang ⋅ Jimin XIAO
Exhibit Hall I #141
MedSegFactory: Text-Guided Generation of Medical Image-Mask Pairs Poster Session 5 & Exhibit Hall
Jiawei Mao ⋅ Yuhan Wang ⋅ Yucheng Tang ⋅ Daguang Xu ⋅ Kang Wang ⋅ Yang Yang ⋅ Zongwei Zhou ⋅ Yuyin Zhou
Exhibit Hall I #162
Cracking Instance Jigsaw Puzzles: A Superior Alternative to Multiple Instance Learning for Whole Slide Image Analysis Poster Session 5 & Exhibit Hall
Xiwen Chen ⋅ Peijie Qiu ⋅ Wenhui Zhu ⋅ Hao Wang ⋅ Huayu Li ⋅ XUANZHAO DONG ⋅ Xiaotong Sun ⋅ Xiaobing Yu ⋅ Yalin Wang ⋅ Abolfazl Razi ⋅ Aristedis Sotiras
Exhibit Hall I #144
STDDNet: Harnessing Mamba for Video Polyp Segmentation via Spatial-aligned Temporal Modeling and Discriminative Dynamic Representation Learning Poster Session 5 & Exhibit Hall
Guilian Chen ⋅ Huisi Wu ⋅ Jing Qin
Exhibit Hall I #145
FLOSS: Free Lunch in Open-vocabulary Semantic Segmentation Poster Session 5 & Exhibit Hall
Yasser Benigmim ⋅ Mohammad Fahes ⋅ Tuan-Hung Vu ⋅ Andrei Bursuc ⋅ Raoul de Charette
Exhibit Hall I #157
Sparse-Dense Side-Tuner for efficient Video Temporal Grounding Poster Session 5 & Exhibit Hall
David Pujol-Perich ⋅ Sergio Escalera ⋅ Albert Clapés
Exhibit Hall I #161
Towards a Universal 3D Medical Multi-modality Generalization via Learning Personalized Invariant Representation Poster Session 5 & Exhibit Hall
Zhaorui Tan ⋅ Xi Yang ⋅ Tan Pan ⋅ TIANYI LIU ⋅ Chen Jiang ⋅ Xin Guo ⋅ Qiufeng Wang ⋅ Anh Nguyen ⋅ Yuan Qi ⋅ Kaizhu Huang ⋅ Yuan Cheng
Exhibit Hall I #196
DecAD: Decoupling Anomalies in Latent Space for Multi-Class Unsupervised Anomaly Detection Poster Session 5 & Exhibit Hall
Xiaolei Wang ⋅ Xiaoyang Wang ⋅ Huihui Bai ⋅ ENG Gee LIM ⋅ Jimin XIAO
Exhibit Hall I #166
Few-Shot Pattern Detection via Template Matching and Regression Poster Session 5 & Exhibit Hall
Eunchan Jo ⋅ Dahyun Kang ⋅ Sanghyun Kim ⋅ Yunseon Choi ⋅ Minsu Cho
Exhibit Hall I #167
Hierarchical Event Memory for Accurate and Low-latency Online Video Temporal Grounding Poster Session 5 & Exhibit Hall
Minghang Zheng ⋅ Yuxin Peng ⋅ Benyuan Sun ⋅ Yi Yang ⋅ Yang Liu
Exhibit Hall I #168
Learning Yourself: Class-Incremental Semantic Segmentation with Language-Inspired Bootstrapped Disentanglement Poster Session 5 & Exhibit Hall
Ruitao Wu ⋅ Yifan Zhao ⋅ Jia Li
Exhibit Hall I #171
Aligning Information Capacity Between Vision and Language via Dense-to-Sparse Feature Distillation for Image-Text matching Poster Session 5 & Exhibit Hall
Yang Liu ⋅ Wentao Feng ⋅ Zhuoyao Liu ⋅ Shudong Huang ⋅ Jiancheng Lv
Exhibit Hall I #176
RA-BUSSeg: Relation-aware Semi-supervised Breast Ultrasound Image Segmentation via Adjacent Propagation and Cross-layer Alignment Poster Session 5 & Exhibit Hall
Wanting ZHANG ⋅ Zhenhui Ding ⋅ Guilian Chen ⋅ Huisi Wu ⋅ Jing Qin
Exhibit Hall I #177
ExCap3D: Expressive 3D Scene Understanding via Object Captioning with Varying Detail Poster Session 5 & Exhibit Hall
Chandan Yeshwanth ⋅ David Rozenberszki ⋅ Angela Dai
Exhibit Hall I #178
DisCo: Towards Distinct and Coherent Visual Encapsulation in Video MLLMs Poster Session 5 & Exhibit Hall
JIAHE ZHAO ⋅ rongkun Zheng ⋅ Yi Wang ⋅ Helin WANG ⋅ Hengshuang Zhao
Exhibit Hall I #179
CC-OCR: A Comprehensive and Challenging OCR Benchmark for Evaluating Large Multimodal Models in Literacy Poster Session 5 & Exhibit Hall
Zhibo Yang ⋅ Jun Tang ⋅ Zhaohai Li ⋅ Pengfei Wang ⋅ Jianqiang Wan ⋅ Humen Zhong ⋅ Xuejing Liu ⋅ Mingkun Yang ⋅ Peng Wang ⋅ Shuai Bai ⋅ Lianwen Jin ⋅ Junyang Lin
Exhibit Hall I #182
Exploring Probabilistic Modeling Beyond Domain Generalization for Semantic Segmentation Poster Session 5 & Exhibit Hall
I-Hsiang Chen ⋅ Hua-En Chang ⋅ Wei-Ting Chen ⋅ Jenq-Newng Hwang ⋅ Sy-Yen Kuo
Exhibit Hall I #183
Prototypes are Balanced Units for Efficient and Effective Partially Relevant Video Retrieval Poster Session 5 & Exhibit Hall
WonJun Moon ⋅ Cheol-Ho Cho ⋅ Woojin Jun ⋅ Minho Shim ⋅ Taeoh Kim ⋅ Inwoong Lee ⋅ Dongyoon Wee ⋅ Jae-Pil Heo
Exhibit Hall I #186
VideoAds for Fast-Paced Video Understanding Poster Session 5 & Exhibit Hall
Zheyuan Zhang ⋅ Wanying Dou ⋅ Linkai Peng ⋅ Hongyi Pan ⋅ Ulas Bagci ⋅ Boqing Gong
Exhibit Hall I #188
Auto-Controlled Image Perception in MLLMs via Visual Perception Tokens Poster Session 5 & Exhibit Hall
Runpeng Yu ⋅ Xinyin Ma ⋅ Xinchao Wang
Exhibit Hall I #189
Refer to Any Segmentation Mask Group With Vision-Language Prompts Poster Session 5 & Exhibit Hall
Shengcao Cao ⋅ Zijun Wei ⋅ Jason Kuen ⋅ Kangning Liu ⋅ Lingzhi Zhang ⋅ Jiuxiang Gu ⋅ HyunJoon Jung ⋅ Liangyan Gui ⋅ Yu-Xiong Wang
Exhibit Hall I #192
Triad: Empowering LMM-based Anomaly Detection with Expert-guided Region-of-Interest Tokenizer and Manufacturing Process Poster Session 5 & Exhibit Hall
Yuanze Li ⋅ Shihao Yuan ⋅ Haolin Wang ⋅ Qizhang Li ⋅ Ming Liu ⋅ Chen Xu ⋅ Guangming Shi ⋅ Wangmeng Zuo
Exhibit Hall I #198
Bridging the Gap between Brain and Machine in Interpreting Visual Semantics: Towards Self-adaptive Brain-to-Text Decoding Poster Session 5 & Exhibit Hall
Jiaxuan Chen ⋅ Yu Qi ⋅ Yueming Wang ⋅ Gang Pan
Exhibit Hall I #200
DisTime: Distribution-based Time Representation for Video Large Language Models Poster Session 5 & Exhibit Hall
yingsen zeng ⋅ Zepeng Huang ⋅ Yujie Zhong ⋅ Chengjian Feng ⋅ Jie Hu ⋅ Lin Ma ⋅ Yang Liu
Exhibit Hall I #202
WeaveSeg: Iterative Contrast-weaving and Spectral Feature-refining for Nuclei Instance Segmentation Poster Session 5 & Exhibit Hall
Jiajia Li ⋅ Huisi Wu ⋅ Jing Qin
Exhibit Hall I #204
How Can Objects Help Video-Language Understanding? Poster Session 5 & Exhibit Hall
Zitian Tang ⋅ Shijie Wang ⋅ Junho Cho ⋅ Jaewook Yoo ⋅ Chen Sun
Exhibit Hall I #205
Everything is a Video: Unifying Modalities through Next-Frame Prediction Poster Session 5 & Exhibit Hall
G Thomas Hudson ⋅ Dean Slack ⋅ Thomas Winterbottom ⋅ Jamie Stirling ⋅ Chenghao Xiao ⋅ Junjie Shentu ⋅ Noura Al Moubayed
Exhibit Hall I #206
Talking to DINO: Bridging Self-Supervised Vision Backbones with Language for Open-Vocabulary Segmentation Poster Session 5 & Exhibit Hall
Luca Barsellotti ⋅ Lorenzo Bianchi ⋅ Nicola Messina ⋅ Fabio Carrara ⋅ Marcella Cornia ⋅ Lorenzo Baraldi ⋅ Fabrizio Falchi ⋅ Rita Cucchiara
Exhibit Hall I #208
CARIM: Caption-Based Autonomous Driving Scene Retrieval via Inclusive Text Matching Poster Session 5 & Exhibit Hall
Minjoo Ki ⋅ Dae Jung Kim ⋅ Kisung Kim ⋅ Seon Joo Kim ⋅ Jinhan Lee
Exhibit Hall I #209
Beyond Training: Dynamic Token Merging for Zero-Shot Video Understanding Poster Session 5 & Exhibit Hall
Yiming Zhang ⋅ Zhuokai Zhao ⋅ Zhaorun Chen ⋅ Zenghui Ding ⋅ Xianjun Yang ⋅ Yining Sun
Exhibit Hall I #210
Modeling Saliency Dataset Bias Poster Session 5 & Exhibit Hall
Matthias Kümmerer ⋅ Harneet Singh Khanuja ⋅ Matthias Bethge
Exhibit Hall I #213
Beyond Single Images: Retrieval Self-Augmented Unsupervised Camouflaged Object Detection Poster Session 5 & Exhibit Hall
Ji Du ⋅ Xin WANG ⋅ Fangwei Hao ⋅ Mingyang Yu ⋅ Chunyuan Chen ⋅ Jiesheng Wu ⋅ Bin Wang ⋅ Jing Xu ⋅ Ping Li
Exhibit Hall I #218
Advancing Visual Large Language Model for Multi-granular Versatile Perception Poster Session 5 & Exhibit Hall
Wentao Xiang ⋅ Haoxian Tan ⋅ Cong Wei ⋅ Yujie Zhong ⋅ Dengjie Li ⋅ Yujiu Yang
Exhibit Hall I #220
Controllable Latent Space Augmentation for Digital Pathology Poster Session 5 & Exhibit Hall
Sofiène Boutaj ⋅ Marin Scalbert ⋅ Pierre Marza ⋅ Florent Couzinie-Devy ⋅ Maria Vakalopoulou ⋅ Stergios Christodoulidis
Exhibit Hall I #221
PS3: A Multimodal Transformer Integrating Pathology Reports with Histology Images and Biological Pathways for Cancer Survival Prediction Poster Session 5 & Exhibit Hall
Manahil Raza ⋅ Ayesha Azam ⋅ Talha Qaiser ⋅ Nasir Rajpoot
Exhibit Hall I #222
Balanced Sharpness-Aware Minimization for Imbalanced Regression Poster Session 2 & Exhibit Hall with Coffee Break
Yahao Liu ⋅ Qin Wang ⋅ Lixin Duan ⋅ Wen Li
Exhibit Hall I #114
MIEB: Massive Image Embedding Benchmark Poster Session 5 & Exhibit Hall
Chenghao Xiao ⋅ Isaac Chung ⋅ Imene Kerboua ⋅ Jamie Stirling ⋅ Xin Zhang ⋅ Márton Kardos ⋅ Roman Solomatin ⋅ Noura Al Moubayed ⋅ Kenneth Enevoldsen ⋅ Niklas Muennighoff
Exhibit Hall I #223
Interpretable point cloud classification using multiple instance learning Poster Session 5 & Exhibit Hall
Matt De Vries ⋅ Reed Naidoo ⋅ Olga Fourkioti ⋅ Lucas Dent ⋅ Nathan Curry ⋅ Chris Dunsby ⋅ Chris Bakal
Exhibit Hall I #225
Fine-grained Abnormality Prompt Learning for Zero-shot Anomaly Detection Poster Session 5 & Exhibit Hall
Jiawen Zhu ⋅ YEW-SOON ONG ⋅ Chunhua Shen ⋅ Guansong Pang
Exhibit Hall I #228
Bidirectional Likelihood Estimation with Multi-Modal Large Language Models for Text-Video Retrieval Poster Session 5 & Exhibit Hall
Dohwan Ko ⋅ Ji Soo Lee ⋅ Minhyuk Choi ⋅ Zihang Meng ⋅ Hyunwoo Kim
Exhibit Hall I #232
Alleviating Textual Reliance in Medical Language-guided Segmentation via Prototype-driven Semantic Approximation Poster Session 5 & Exhibit Hall
Shuchang Ye ⋅ Usman Naseem ⋅ Mingyuan Meng ⋅ jinman kim
Exhibit Hall I #237
Controllable-LPMoE: Adapting to Challenging Object Segmentation via Dynamic Local Priors from Mixture-of-Experts Poster Session 5 & Exhibit Hall
Yanguang Sun ⋅ Jiawei Lian ⋅ jian Yang ⋅ lei luo
Exhibit Hall I #238
Progressive Test Time Energy Adaptation for Medical Image Segmentation Poster Session 5 & Exhibit Hall
Xiaoran Zhang ⋅ Byung-Woo Hong ⋅ Hyoungseob Park ⋅ Daniel Pak ⋅ Anne-Marie Rickmann ⋅ Lawrence Staib ⋅ James Duncan ⋅ Alex Wong
Exhibit Hall I #239
Revisiting Efficient Semantic Segmentation: Learning Offsets for Better Spatial and Class Feature Alignment Poster Session 5 & Exhibit Hall
Shi-Chen Zhang ⋅ Yunheng Li ⋅ Yu-Huan Wu ⋅ Qibin Hou ⋅ Ming-Ming Cheng
Exhibit Hall I #241
SignRep: Enhancing Self-Supervised Sign Representations Poster Session 5 & Exhibit Hall
Ryan Wong ⋅ Necati Cihan Camgoz ⋅ Richard Bowden
Exhibit Hall I #282
GUIOdyssey: A Comprehensive Dataset for Cross-App GUI Navigation on Mobile Devices Poster Session 5 & Exhibit Hall
Quanfeng Lu ⋅ Wenqi Shao ⋅ Zitao Liu ⋅ Lingxiao Du ⋅ Fanqing Meng ⋅ Boxuan Li ⋅ Botong Chen ⋅ Siyuan Huang ⋅ Kaipeng Zhang ⋅ Ping Luo
Exhibit Hall I #245
Learning Beyond Still Frames: Scaling Vision-Language Models with Video Poster Session 5 & Exhibit Hall
Yiyuan Zhang ⋅ Handong Li ⋅ Jing Liu ⋅ Xiangyu Yue
Exhibit Hall I #247
Is CLIP ideal? No. Can we fix it? Yes! Poster Session 5 & Exhibit Hall
Raphaela Kang ⋅ Yue Song ⋅ Georgia Gkioxari ⋅ Pietro Perona
Exhibit Hall I #248
HQ-CLIP: Leveraging Large Vision-Language Models to Create High-Quality Image-Text Datasets and CLIP Models Poster Session 5 & Exhibit Hall
ZHIXIANG WEI ⋅ Guangting Wang ⋅ Xiaoxiao Ma ⋅ Ke Mei ⋅ Fengyun Rao ⋅ Huaian Chen ⋅ Yi Jin
Exhibit Hall I #249
Dynamic Dictionary Learning for Remote Sensing Image Segmentation Poster Session 5 & Exhibit Hall
Xuechao Zou ⋅ Yue Li ⋅ Shun Zhang ⋅ Kai Li ⋅ Shiying Wang ⋅ Pin Tao ⋅ Junliang Xing ⋅ congyan lang
Exhibit Hall I #250
Temporal-aware Query Routing for Real-time Video Instance Segmentation Poster Session 5 & Exhibit Hall
Zesen Cheng ⋅ Kehan Li ⋅ Yian Zhao ⋅ Hang Zhang ⋅ Chang Liu ⋅ Jie Chen
Exhibit Hall I #251
Free-MoRef: Instantly Multiplexing Context Perception Capabilities of Video-MLLMs within Single Inference Poster Session 5 & Exhibit Hall
KUO WANG ⋅ Quanlong Zheng ⋅ Junlin Xie ⋅ Yanhao Zhang ⋅ Jinguo Luo ⋅ Haonan Lu ⋅ Liang Lin ⋅ Fan Zhou ⋅ Guanbin Li
Exhibit Hall I #254
Towards Fine-grained Interactive Segmentation in Images and Videos Poster Session 5 & Exhibit Hall
Yuan Yao ⋅ Qiushi Yang ⋅ Miaomiao Cui ⋅ Liefeng Bo
Exhibit Hall I #255
Learnable Retrieval Enhanced Visual-Text Alignment and Fusion for Radiology Report Generation Poster Session 5 & Exhibit Hall
Qin Zhou ⋅ Guoyan Liang ⋅ Xindi Li ⋅ Jingyuan CHEN ⋅ Zhe Wang ⋅ Chang Yao ⋅ Sai Wu
Exhibit Hall I #257
Generalizable Object Re-Identification via Visual In-Context Prompting Poster Session 5 & Exhibit Hall
Zhizhong Huang ⋅ Xiaoming Liu
Exhibit Hall I #258
TAB: Transformer Attention Bottlenecks enable User Intervention and Debugging in Vision-Language Models Poster Session 5 & Exhibit Hall
Pooyan Rahmanzadehgervi ⋅ Hung Nguyen ⋅ Rosanne Liu ⋅ Long Mai ⋅ Anh Nguyen
Exhibit Hall I #259
Anomaly Detection of Integrated Circuits Package Substrates Using the Large Vision Model SAIC: Dataset Construction, Methodology, and Application Poster Session 5 & Exhibit Hall
Ruiyun Yu ⋅ Bingyang Guo ⋅ Haoyuan Li
Exhibit Hall I #260
Streaming VideoLLMs for Real-Time Procedural Video Understanding Poster Session 5 & Exhibit Hall
Dibyadip Chatterjee ⋅ Edoardo Remelli ⋅ Yale Song ⋅ Bugra Tekin ⋅ Abhay Mittal ⋅ Bharat Bhatnagar ⋅ Necati Cihan Camgoz ⋅ Shreyas Hampali ⋅ Eric Sauser ⋅ Shugao Ma ⋅ Angela Yao ⋅ Fadime Sener
Exhibit Hall I #262
Prompt-driven Transferable Adversarial Attack on Person Re-Identification with Attribute-aware Textual Inversion Poster Session 5 & Exhibit Hall
Yuan Bian ⋅ Min Liu ⋅ Yunqi Yi ⋅ Xueping Wang ⋅ Shuai Jiang ⋅ Yaonan Wang
Exhibit Hall I #263
FrameFusion: Combining Similarity and Importance for Video Token Reduction on Large Vision Language Models Poster Session 5 & Exhibit Hall
Tianyu Fu ⋅ Tengxuan Liu ⋅ Qinghao Han ⋅ Guohao Dai ⋅ Shengen Yan ⋅ Huazhong Yang ⋅ Xuefei Ning ⋅ Yu Wang
Exhibit Hall I #268
Aligning Effective Tokens with Video Anomaly in Large Language Models Poster Session 5 & Exhibit Hall
YINGXIAN Chen ⋅ Jiahui Liu ⋅ Ruidi Fan ⋅ Yanwei Li ⋅ Chirui CHANG ⋅ Shizhen Zhao ⋅ Wilton.W.T. Fok ⋅ Xiaojuan Qi ⋅ Yik WU
Exhibit Hall I #272
No More Sibling Rivalry: Debiasing Human-Object Interaction Detection Poster Session 5 & Exhibit Hall
Bin Yang ⋅ Yulin Zhang ⋅ Hong-Yu Zhou ⋅ Sibei Yang
Exhibit Hall I #273
Borrowing Eyes for the Blind Spot: Overcoming Data Scarcity in Malicious Video Detection via Cross-Domain Retrieval Augmentation Poster Session 5 & Exhibit Hall
Rongpei Hong ⋅ Jian Lang ⋅ Ting Zhong ⋅ Fan Zhou
Exhibit Hall I #275
DASH: Detection and Assessment of Systematic Hallucinations of VLMs Poster Session 5 & Exhibit Hall
Maximilian Augustin ⋅ Yannic Neuhaus ⋅ Matthias Hein
Exhibit Hall I #277
Sim-DETR: Unlock DETR for Temporal Sentence Grounding Poster Session 5 & Exhibit Hall
Jiajin Tang ⋅ Zhengxuan Wei ⋅ Yuchen Zhu ⋅ Cheng Shi ⋅ Guanbin Li ⋅ Liang Lin ⋅ Sibei Yang
Exhibit Hall I #278
ViCTr: Vital Consistency Transfer for Pathology Aware Image Synthesis Poster Session 5 & Exhibit Hall
Onkar Susladkar ⋅ Gayatri Deshmukh ⋅ Yalcin Tur ⋅ Gorkem Durak ⋅ Ulas Bagci
Exhibit Hall I #279
DIH-CLIP: Unleashing the Diversity of Multi-Head Self-Attention for Training-Free Open-Vocabulary Semantic Segmentation Poster Session 5 & Exhibit Hall
Songsong Duan ⋅ Xi Yang ⋅ Nannan Wang
Exhibit Hall I #281
Plug-in Feedback Self-adaptive Attention in CLIP for Training-free Open-Vocabulary Segmentation Poster Session 5 & Exhibit Hall
Zhixiang Chi ⋅ Yanan Wu ⋅ Li Gu ⋅ Huan Liu ⋅ Ziqiang Wang ⋅ Yang Zhang ⋅ Yang Wang ⋅ Konstantinos Plataniotis
Exhibit Hall I #283
Feather the Throttle: Revisiting Visual Token Pruning for Vision-Language Model Acceleration Poster Session 5 & Exhibit Hall
Mark Endo ⋅ Xiaohan Wang ⋅ Serena Yeung-Levy
Exhibit Hall I #284
Intermediate Connectors and Geometric Priors for Language-Guided Affordance Segmentation on Unseen Object Categories Poster Session 5 & Exhibit Hall
Yicong Li ⋅ Yiyang Chen ⋅ Zhenyuan Ma ⋅ Junbin Xiao ⋅ Xiang Wang ⋅ Angela Yao
Exhibit Hall I #285
LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models Poster Session 5 & Exhibit Hall
Yuzhang Shang ⋅ Mu Cai ⋅ Bingxin Xu ⋅ Yong Jae Lee ⋅ Yan Yan
Exhibit Hall I #287
AURELIA: Test-time Reasoning Distillation in Audio-Visual LLMs Poster Session 5 & Exhibit Hall
Sanjoy Chowdhury ⋅ Hanan Gani ⋅ Nishit Anand ⋅ Sayan Nag ⋅ Ruohan Gao ⋅ Mohamed Elhoseiny ⋅ Salman Khan ⋅ Dinesh Manocha
Exhibit Hall I #291
HERMES: temporal-coHERent long-forM understanding with Episodes and Semantics Poster Session 5 & Exhibit Hall
Gueter Josmy Faure ⋅ Jia-Fong Yeh ⋅ Min-Hung Chen ⋅ Hung-Ting Su ⋅ Shang-Hong Lai ⋅ Winston Hsu
Exhibit Hall I #292
Cycle Consistency as Reward: Learning Image-Text Alignment without Human Preferences Poster Session 5 & Exhibit Hall
Hyojin Bahng ⋅ Caroline Chan ⋅ Fredo Durand ⋅ Phillip Isola
Exhibit Hall I #294
Griffon v2: Advancing Multimodal Perception with High-Resolution Scaling and Visual-Language Co-Referring Poster Session 5 & Exhibit Hall
Yufei Zhan ⋅ Shurong Zheng ⋅ Yousong Zhu ⋅ Hongyin Zhao ⋅ Fan Yang ⋅ Ming Tang ⋅ Jinqiao Wang
Exhibit Hall I #295
LVBench: An Extreme Long Video Understanding Benchmark Poster Session 5 & Exhibit Hall
Weihan Wang ⋅ zehai he ⋅ Wenyi Hong ⋅ Yean Cheng ⋅ Xiaohan Zhang ⋅ Ji Qi ⋅ Ming Ding ⋅ Xiaotao Gu ⋅ Shiyu Huang ⋅ Bin Xu ⋅ Yuxiao Dong ⋅ Jie Tang
Exhibit Hall I #296
Debiasing Trace Guidance: Top-down Trace Distillation and Bottom-up Velocity Alignment for Unsupervised Anomaly Detection
Xingjian Wang ⋅ Li Chai ⋅ Jiming Chen
#299
Beyond [cls]: Exploring the True Potential of Masked Image Modeling Representations Poster Session 5 & Exhibit Hall
Marcin Przewięźlikowski ⋅ Randall Balestriero ⋅ Wojciech Jasiński ⋅ Marek Śmieja ⋅ Bartosz Zieliński
Exhibit Hall I #343
MultiADS: Defect-aware Supervision for Multi-type Anomaly Detection and Segmentation in Zero-Shot Learning Poster Session 5 & Exhibit Hall
Ylli Sadikaj ⋅ Hongkuan Zhou ⋅ Lavdim Halilaj ⋅ Stefan Schmid ⋅ Steffen Staab ⋅ Claudia Plant
Exhibit Hall I #298
ODDR: Outlier Detection & Dimension Reduction Based Defense Against Adversarial Patches Poster Session 5 & Exhibit Hall
Nandish Chattopadhyay ⋅ Amira Guesmi ⋅ Muhammad Abdullah Hanif ⋅ Bassem ouni ⋅ Muhammad Shafique
Exhibit Hall I #300
Similarity Memory Prior is All You Need for Medical Image Segmentation Poster Session 5 & Exhibit Hall
Hao Tang ⋅ Zhiqing Guo ⋅ Liejun Wang ⋅ Chao Liu
Exhibit Hall I #301
CalliReader: Contextualizing Chinese Calligraphy via an Embedding-Aligned Vision-Language Model Poster Session 5 & Exhibit Hall
Yuxuan Luo ⋅ Jiaqi Tang ⋅ Chenyi Huang ⋅ Feiyang Hao ⋅ Zhouhui Lian
Exhibit Hall I #305
Bringing RNNs Back to Efficient Open-Ended Video Understanding Poster Session 5 & Exhibit Hall
Weili Xu ⋅ Enxin Song ⋅ Wenhao Chai ⋅ Xuexiang Wen ⋅ Tian Ye ⋅ Gaoang Wang
Exhibit Hall I #344
Boosting Vision Semantic Density with Anatomy Normality Modeling for Medical Vision-language Pre-training Poster Session 5 & Exhibit Hall
Weiwei Cao ⋅ Jianpeng Zhang ⋅ Zhongyi Shui ⋅ Sinuo Wang ⋅ Zeli Chen ⋅ Xi Li ⋅ Le Lu ⋅ Xianghua Ye ⋅ Qi Zhang ⋅ Tingbo Liang ⋅ Ling Zhang
Exhibit Hall I #306
Accelerate 3D Object Detection Models via Zero-Shot Attention Key Pruning Poster Session 5 & Exhibit Hall
Lizhen Xu ⋅ Xiuxiu Bai ⋅ Xiaojun Jia ⋅ Jianwu Fang ⋅ Shanmin Pang
Exhibit Hall I #310
SparseMM: Head Sparsity Emerges from Visual Concept Responses in MLLMs Poster Session 5 & Exhibit Hall
Jiahui Wang ⋅ Zuyan Liu ⋅ Yongming Rao ⋅ Jiwen Lu
Exhibit Hall I #319
ReferEverything: Towards Segmenting Everything We Can Speak of in Videos Poster Session 5 & Exhibit Hall
Anurag Bagchi ⋅ Zhipeng Bao ⋅ Yu-Xiong Wang ⋅ Pavel Tokmakov ⋅ Martial Hebert
Exhibit Hall I #323
Continual Multiple Instance Learning with Enhanced Localization for Histopathological Whole Slide Image Analysis Poster Session 5 & Exhibit Hall
Byung Hyun Lee ⋅ Wongi Jeong ⋅ Woojae Han ⋅ KYOUNGBUN LEE ⋅ Se Young Chun
Exhibit Hall I #324
From Trial to Triumph: Advancing Long Video Understanding via Visual Context Sample Scaling and Self-reward Alignment Poster Session 5 & Exhibit Hall
Yucheng Suo ⋅ Fan Ma ⋅ Linchao Zhu ⋅ Tianyi Wang ⋅ Fengyun Rao ⋅ Yi Yang
Exhibit Hall I #325
Cross-Architecture Distillation Made Simple with Redundancy Suppression Poster Session 5 & Exhibit Hall
Weijia Zhang ⋅ Yuehao Liu ⋅ Wu Ran ⋅ Chao Ma
Exhibit Hall I #326
DC-TTA: Divide-and-Conquer Framework for Test-Time Adaptation of Interactive Segmentation Poster Session 5 & Exhibit Hall
Jihun Kim ⋅ Hoyong Kwon ⋅ Hyeokjun Kweon ⋅ Wooseong Jeong ⋅ Kuk-Jin Yoon
Exhibit Hall I #328
FIND: Few-Shot Anomaly Inspection with Normal-Only Multi-Modal Data Poster Session 5 & Exhibit Hall
YITING LI ⋅ Fayao Liu ⋅ Jingyi Liao ⋅ Sichao Tian ⋅ Chuan-Sheng Foo ⋅ Xulei Yang
Exhibit Hall I #329
VISO: Accelerating In-orbit Object Detection with Language-Guided Mask Learning and Sparse Inference Poster Session 5 & Exhibit Hall
Meiqi Wang ⋅ Han Qiu
Exhibit Hall I #330
Unsupervised Histopathological Image Semantic Segmentation with Overlapping Patches Consistency Constraint Poster Session 5 & Exhibit Hall
Wentian Cai ⋅ Weizhao Weng ⋅ Zihao Huang ⋅ Yandan Chen ⋅ Siquan Huang ⋅ Ping Gao ⋅ Victor Leung ⋅ Ying Gao
Exhibit Hall I #333
How Do Optical Flow and Textual Prompts Collaborate to Assist in Audio-Visual Semantic Segmentation? Poster Session 5 & Exhibit Hall
Yujian Lee ⋅ Peng Gao ⋅ Yongqi Xu ⋅ Wentao Fan
Exhibit Hall I #334
UINavBench: A Framework for Comprehensive Evaluation of Interactive Digital Agents Poster Session 5 & Exhibit Hall
Harsh Agrawal ⋅ Eldon Schoop ⋅ Xinlei Pan ⋅ Ari Seff ⋅ Anuj Mahajan ⋅ Di Feng ⋅ Ruijia Cheng ⋅ Andres Romero Mier y Teran ⋅ Esteban Gomez ⋅ Abhishek Sundararajan ⋅ Forrest Huang ⋅ Amanda Swearngin ⋅ Mohana Moorthy ⋅ Jeffrey Nichols ⋅ Alexander Toshev
Exhibit Hall I #335
VIPerson: Flexibly Generating Virtual Identity for Person Re-Identification Poster Session 5 & Exhibit Hall
Xiao-Wen Zhang ⋅ Delong Zhang ⋅ Yi-Xing Peng ⋅ Zhi Ouyang ⋅ Jingke Meng ⋅ Wei-Shi Zheng
Exhibit Hall I #337
Towards Robustness of Person Search against Corruptions Poster Session 5 & Exhibit Hall
Woojung Son ⋅ Yoonki Cho ⋅ Guoyuan An ⋅ Chanmi Lee ⋅ Sung-eui Yoon
Exhibit Hall I #340
Flow-MIL: Constructing Highly-expressive Latent Feature Space For Whole Slide Image Classification Using Normalizing Flow Poster Session 5 & Exhibit Hall
Yingfan MA ⋅ Bohan An ⋅ Ao Shen ⋅ Mingzhi Yuan ⋅ Minghong Duan ⋅ Manning Wang
Exhibit Hall I #354
HarmonySeg: Tubular Structure Segmentation with Deep-Shallow Feature Fusion and Growth-Suppression Balanced Loss Poster Session 5 & Exhibit Hall
Ke Zhang ⋅ Yi Huang ⋅ Wei Liu ⋅ Yuanyuan Wang ⋅ Vishal Patel ⋅ Le Lu ⋅ Xu Han ⋅ Dakai Jin ⋅ Ke Yan
Exhibit Hall I #355
CompCap: Improving Multimodal Large Language Models with Composite Captions Poster Session 5 & Exhibit Hall
Xiaohui Chen ⋅ Satya Narayan Shukla ⋅ Mahmoud Azab ⋅ Aashu Singh ⋅ Qifan Wang ⋅ David Yang ⋅ ShengYun Peng ⋅ Hanchao Yu ⋅ Shen Yan ⋅ Xuewen Zhang ⋅ Baosheng He
Exhibit Hall I #356
Stable Diffusion Models are Secretly Good at Visual In-Context Learning Poster Session 5 & Exhibit Hall
Trevine Oorloff ⋅ Vishwanath Sindagi ⋅ Wele Gedara Chaminda Bandara ⋅ Ali Shafahi ⋅ Amin Ghiasi ⋅ Charan Prakash ⋅ Reza Ardekani
Exhibit Hall I #358
Prior2Former - Evidential Modeling of Mask Transformers for Assumption-Free Open-World Panoptic Segmentation Poster Session 5 & Exhibit Hall
Sebastian Schmidt ⋅ Julius Koerner ⋅ Dominik Fuchsgruber ⋅ Stefano Gasperini ⋅ Federico Tombari ⋅ Stephan Günnemann
Exhibit Hall I #362
Seeing the Unseen: A Semantic Alignment and Context-Aware Prompt Framework for Open-Vocabulary Camouflaged Object Segmentation Poster Session 5 & Exhibit Hall
Peng Ren ⋅ Tian Bai ⋅ Jing Sun ⋅ Fuming Sun
Exhibit Hall I #363
ViLLa: Video Reasoning Segmentation with Large Language Model Poster Session 5 & Exhibit Hall
rongkun Zheng ⋅ Lu Qi ⋅ Xi Chen ⋅ Yi Wang ⋅ Kun Wang ⋅ Hengshuang Zhao
Exhibit Hall I #364
DynImg: Key Frames with Visual Prompts are Good Representation for Multi-Modal Video Understanding Poster Session 5 & Exhibit Hall
Xiaoyi Bao ⋅ Chen-Wei Xie ⋅ Hao Tang ⋅ Tingyu Weng ⋅ Xiaofeng Wang ⋅ Yun Zheng ⋅ Xingang Wang
Exhibit Hall I #365
Object-level Correlation for Few-Shot Segmentation Poster Session 5 & Exhibit Hall
chunlin wen ⋅ Yu Zhang ⋅ Jie Fan ⋅ Hongyuan Zhu ⋅ Xiu-Shen Wei ⋅ Yijun Wang ⋅ Zhiqiang Kou ⋅ Shuzhou Sun
Exhibit Hall I #366
Vision-Language Neural Graph Featurization for Extracting Retinal Lesions Poster Session 5 & Exhibit Hall
Taimur Hassan ⋅ Anabia Sohail ⋅ Muzammal Naseer ⋅ Naoufel Werghi
Exhibit Hall I #367
SSVQ: Unleashing the Potential of Vector Quantization with Sign-Splitting Poster Session 5 & Exhibit Hall
Shuaiting Li ⋅ Juncan Deng ⋅ Chengxuan Wang ⋅ Kedong Xu ⋅ Rongtao Deng ⋅ Hong Gu ⋅ Haibin Shen ⋅ Kejie Huang
Exhibit Hall I #368
RadGPT: Constructing 3D Image-Text Tumor Datasets Poster Session 5 & Exhibit Hall
Pedro Bassi ⋅ Mehmet Yavuz ⋅ Ibrahim Ethem Hamamci ⋅ Sezgin Er ⋅ Xiaoxi Chen ⋅ Wenxuan Li ⋅ Bjoern Menze ⋅ Sergio Decherchi ⋅ Andrea Cavalli ⋅ Kang Wang ⋅ Yang Yang ⋅ Alan Yuille ⋅ Zongwei Zhou
Exhibit Hall I #369
LawDIS: Language-Window-based Controllable Dichotomous Image Segmentation Poster Session 5 & Exhibit Hall
Xinyu Yan ⋅ Meijun Sun ⋅ Ge-Peng Ji ⋅ Fahad Khan ⋅ Salman Khan ⋅ Deng-Ping Fan
Exhibit Hall I #385
VideoMiner: Iteratively Grounding Key Frames of Hour-Long Videos via Tree-based Group Relative Policy Optimization Poster Session 5 & Exhibit Hall
Xinye Cao ⋅ Hongcan Guo ⋅ Jiawen Qian ⋅ Guoshun Nan ⋅ Chao Wang ⋅ Yuqi Pan ⋅ Tianhao Hou ⋅ Xiaojuan Wang ⋅ Yutong Gao
Exhibit Hall I #374
Flow4Agent: Long-form Video Understanding via Motion Prior from Optical Flow Poster Session 5 & Exhibit Hall
Ruyang Liu ⋅ Shangkun Sun ⋅ Haoran Tang ⋅ Wei Gao ⋅ Ge Li
Exhibit Hall I #380
An OpenMind for 3D Medical Vision Self-supervised Learning Poster Session 5 & Exhibit Hall
Tassilo Wald ⋅ Constantin Ulrich ⋅ Jonathan Suprijadi ⋅ Sebastian Ziegler ⋅ Michal Nohel ⋅ Robin Peretzke ⋅ Gregor Koehler ⋅ Klaus Maier-Hein
Exhibit Hall I #382
OmniSAM: Omnidirectional Segment Anything Model for UDA in Panoramic Semantic Segmentation Poster Session 5 & Exhibit Hall
Ding Zhong ⋅ Xu Zheng ⋅ Chenfei Liao ⋅ Yuanhuiyi Lyu ⋅ Jialei Chen ⋅ Shengyang Wu ⋅ Linfeng Zhang ⋅ Xuming Hu
Exhibit Hall I #384
ModalTune: Fine-Tuning Slide-Level Foundation Models with Multi-Modal Information for Multi-task Learning in Digital Pathology Poster Session 5 & Exhibit Hall
Vishwesh Ramanathan ⋅ Tony Xu ⋅ Pushpak Pati ⋅ Faruk Ahmed ⋅ Maged Goubran ⋅ Anne Martel
Exhibit Hall I #386
VFlowOpt: A Token Pruning Framework for LMMs with Visual Information Flow-Guided Optimization Poster Session 5 & Exhibit Hall
Sihan Yang ⋅ Runsen Xu ⋅ Chenhang Cui ⋅ Tai Wang ⋅ Dahua Lin ⋅ Jiangmiao Pang
Exhibit Hall I #387
Open-Vocabulary HOI Detection with Interaction-aware Prompt and Concept Calibration Poster Session 5 & Exhibit Hall
Ting Lei ⋅ Shaofeng Yin ⋅ Qingchao Chen ⋅ Yuxin Peng ⋅ Yang Liu
Exhibit Hall I #389
MINERVA: Evaluating Complex Video Reasoning Poster Session 5 & Exhibit Hall
Arsha Nagrani ⋅ Sachit Menon ⋅ Ahmet Iscen ⋅ Shyamal Buch ⋅ Nilpa Jha ⋅ Ramin Mehran ⋅ Anja Hauth ⋅ Mikhail Sirotenko ⋅ Yukun Zhu ⋅ Carl Vondrick ⋅ Cordelia Schmid ⋅ Tobias Weyand
Exhibit Hall I #391
Multi-Granular Spatio-Temporal Token Merging for Training-Free Acceleration of Video LLMs Poster Session 5 & Exhibit Hall
Jeongseok Hyun ⋅ Sukjun Hwang ⋅ Su Ho Han ⋅ Taeoh Kim ⋅ Inwoong Lee ⋅ Dongyoon Wee ⋅ Joon-Young Lee ⋅ Seon Joo Kim ⋅ Minho Shim
Exhibit Hall I #393
Scaling Tumor Segmentation: Best Lessons from Real and Synthetic Data Poster Session 5 & Exhibit Hall
Qi Chen ⋅ Xinze Zhou ⋅ Chen Liu ⋅ Hao Chen ⋅ Wenxuan Li ⋅ Zekun Jiang ⋅ Ziyan Huang ⋅ Yuxuan Zhao ⋅ Dexin Yu ⋅ Junjun He ⋅ Yefeng Zheng ⋅ Ling Shao ⋅ Alan Yuille ⋅ Zongwei Zhou
Exhibit Hall I #394
TAViS: Text-bridged Audio-Visual Segmentation with Foundation Models Poster Session 5 & Exhibit Hall
Ziyang Luo ⋅ Nian Liu ⋅ Xuguang Yang ⋅ Salman Khan ⋅ Rao Anwer ⋅ Hisham Cholakkal ⋅ Fahad Khan ⋅ Junwei Han
Exhibit Hall I #395
Teaching AI the Anatomy Behind the Scan: Addressing Anatomical Flaws in Medical Image Segmentation with Learnable Prior Poster Session 5 & Exhibit Hall
Young Seok Jeon ⋅ Hongfei Yang ⋅ Huazhu Fu ⋅ Young Seok Jeon
Exhibit Hall I #396
LIRA: Inferring Segmentation in Large Multi-modal Models with Local Interleaved Region Assistance Poster Session 5 & Exhibit Hall
Zhang Li ⋅ Biao Yang ⋅ Qiang Liu ⋅ Shuo Zhang ⋅ Zhiyin Ma ⋅ Liang Yin ⋅ Deng Linger ⋅ Yabo Sun ⋅ Yuliang Liu ⋅ Xiang Bai
Exhibit Hall I #399
SimMLM: A Simple Framework for Multi-modal Learning with Missing Modality Poster Session 5 & Exhibit Hall
Sijie Li ⋅ Chen Chen ⋅ Jungong Han
Exhibit Hall I #400
NAVER: A Neuro-Symbolic Compositional Automaton for Visual Grounding with Explicit Logic Reasoning Poster Session 5 & Exhibit Hall
Zhixi Cai ⋅ Fucai Ke ⋅ Simindokht Jahangard ⋅ Maria Garcia de la Banda ⋅ Gholamreza Haffari ⋅ Peter Stuckey ⋅ Hamid Rezatofighi
Exhibit Hall I #401
MDP3: A Training-free Approach for List-wise Frame Selection in Video-LLMs Poster Session 5 & Exhibit Hall
Hui Sun ⋅ Shiyin Lu ⋅ Huanyu Wang ⋅ Qing-Guo Chen ⋅ Zhao Xu ⋅ Weihua Luo ⋅ Kaifu Zhang ⋅ Ming Li
Exhibit Hall I #402
Emulating Self-attention with Convolution for Efficient Image Super-Resolution Poster Session 5 & Exhibit Hall
Dongheon Lee ⋅ Seokju Yun ⋅ Youngmin Ro
Exhibit Hall I #437
Token-Efficient VLM: High-Resolution Image Understanding via Dynamic Region Proposal Poster Session 5 & Exhibit Hall
Yitong Jiang ⋅ Jinwei Gu ⋅ Tianfan Xue ⋅ Ka Chun Cheung ⋅ Pavlo Molchanov ⋅ Hongxu Yin ⋅ Sifei Liu
Exhibit Hall I #407
Vision-Language Models Can't See the Obvious Poster Session 5 & Exhibit Hall
YASSER ABDELAZIZ DAHOU DJILALI ⋅ Ngoc Huynh ⋅ Phúc Lê Khắc ⋅ Wamiq Para ⋅ Ankit Singh ⋅ Sanath Narayan
Exhibit Hall I #408
Region-aware Anchoring Mechanism for Efficient Referring Visual Grounding Poster Session 5 & Exhibit Hall
Shuyi Ouyang ⋅ Ziwei Niu ⋅ Hongyi Wang ⋅ Yen-wei Chen ⋅ Lanfen Lin
Exhibit Hall I #411
VTimeCoT: Thinking by Drawing for Video Temporal Grounding and Reasoning Poster Session 5 & Exhibit Hall
Jinglei Zhang ⋅ Yuanfan Guo ⋅ Rolandos Alexandros Potamias ⋅ Jiankang Deng ⋅ Hang Xu ⋅ Chao Ma
Exhibit Hall I #412
Kaputt: A Large-Scale Dataset for Visual Defect Detection Poster Session 5 & Exhibit Hall
Sebastian Höfer ⋅ Dorian Henning ⋅ Artemij Amiranashvili ⋅ Douglas Morrison ⋅ Mariliza Tzes ⋅ Ingmar Posner ⋅ Marc Matvienko ⋅ Alessandro Rennola ⋅ Anton Milan
Exhibit Hall I #414
ChatReID: Open-ended Interactive Person Retrieval via Hierarchical Progressive Tuning for Vision Language Models Poster Session 5 & Exhibit Hall
Ke Niu ⋅ Haiyang Yu ⋅ Mengyang Zhao ⋅ Teng Fu ⋅ Siyang Yi ⋅ Wei Lu ⋅ Bin Li ⋅ Xuelin Qian ⋅ Xiangyang Xue
Exhibit Hall I #416
Auto-Vocabulary Semantic Segmentation Poster Session 5 & Exhibit Hall
Osman Ülger ⋅ Maksymilian Kulicki ⋅ Yuki Asano ⋅ Martin R. Oswald
Exhibit Hall I #418
Enrich and Detect: Video Temporal Grounding with Multimodal LLMs Poster Session 5 & Exhibit Hall
Shraman Pramanick ⋅ Effrosyni Mavroudi ⋅ Yale Song ⋅ Rama Chellappa ⋅ Lorenzo Torresani ⋅ Triantafyllos Afouras
Exhibit Hall I #421
Player-Centric Multimodal Prompt Generation for Large Language Model Based Identity-Aware Basketball Video Captioning Poster Session 5 & Exhibit Hall
Zeyu Xi ⋅ Haoying Sun ⋅ Yaofei Wu ⋅ Junchi Yan ⋅ Haoran Zhang ⋅ Lifang Wu ⋅ Liang Wang ⋅ Chang Wen Chen
Exhibit Hall I #424
Synchronizing Task Behavior: Aligning Multiple Tasks during Test-Time Training Poster Session 5 & Exhibit Hall
Wooseong Jeong ⋅ Jegyeong Cho ⋅ Youngho Yoon ⋅ Kuk-Jin Yoon
Exhibit Hall I #425
Conditional Latent Diffusion Models for Zero-Shot Instance Segmentation Poster Session 5 & Exhibit Hall
Maximilian Ulmer ⋅ Wout Boerdijk ⋅ Rudolph Triebel ⋅ Maximilian Durner
Exhibit Hall I #427
GenHancer: Imperfect Generative Models are Secretly Strong Vision-Centric Enhancers Poster Session 5 & Exhibit Hall
Shijie Ma ⋅ Yuying Ge ⋅ Teng Wang ⋅ Yuxin Guo ⋅ Yixiao Ge ⋅ Ying Shan
Exhibit Hall I #431
Breaking Grid Constraints: Dynamic Graph Reconstruction Network for Multi-organ Segmentation Poster Session 5 & Exhibit Hall
Junhao Xiao ⋅ Yang Wei ⋅ Jingyu Wang ⋅ Yongchao Wang ⋅ Xiuli Bi ⋅ Bin Xiao
Exhibit Hall I #432
MaskSAM: Auto-prompt SAM with Mask Classification for Volumetric Medical Image Segmentation Poster Session 5 & Exhibit Hall
Bin Xie ⋅ Hao Tang ⋅ Bin Duan ⋅ Dawen Cai ⋅ Yan Yan ⋅ Gady Agam
Exhibit Hall I #433
Large-scale Pre-training for Grounded Video Caption Generation Poster Session 5 & Exhibit Hall
Evangelos Kazakos ⋅ Cordelia Schmid ⋅ Josef Sivic
Exhibit Hall I #434
MEH: A Multi-Style Dataset and Toolkit for Advancing Egyptian Hieroglyph Recognition Poster Session 5 & Exhibit Hall
Maksim Golyadkin ⋅ Rubanova Alexandrovna ⋅ Aleksandr Utkov ⋅ Dmitry Nikolotov ⋅ Ilya Makarov
Exhibit Hall I #439
Hybrid-Tower: Fine-grained Pseudo-query Interaction and Generation for Text-to-Video Retrieval Poster Session 5 & Exhibit Hall
Bangxiang Lan ⋅ Ruobing Xie ⋅ Ruixiang Zhao ⋅ Xingwu Sun ⋅ Zhanhui Kang ⋅ Gang Yang ⋅ Xirong Li
Exhibit Hall I #440
Unbiased Missing-modality Multimodal Learning Poster Session 5 & Exhibit Hall
Ruiting Dai ⋅ Chenxi Li ⋅ Yandong Yan ⋅ Lisi Mo ⋅ Ke Qin ⋅ Tao He
Exhibit Hall I #441
ViM-VQ: Efficient Post-Training Vector Quantization for Visual Mamba Poster Session 5 & Exhibit Hall
Juncan Deng ⋅ Shuaiting Li ⋅ Zeyu Wang ⋅ Kedong Xu ⋅ Hong Gu ⋅ Kejie Huang
Exhibit Hall I #442
Axis-level Symmetry Detection with Group-Equivariant Representation Poster Session 6 & Exhibit Hall with Coffee Break
Wongyun Yu ⋅ Ahyun Seo ⋅ Minsu Cho
Exhibit Hall I #7
B-VLLM: A Vision Large Language Model with Balanced Spatio-Temporal Tokens Poster Session 5 & Exhibit Hall
Zhuqiang Lu ⋅ Zhenfei Yin ⋅ Mengwei He ⋅ Zhihui Wang ⋅ Zicheng Liu ⋅ Zhiyong Wang ⋅ Kun Hu
Exhibit Hall I #445
DiffTell: A High-Quality Dataset for Describing Image Manipulation Changes Poster Session 5 & Exhibit Hall
Zonglin Di ⋅ Jing Shi ⋅ Yifei Fan ⋅ Hao Tan ⋅ Alexander Black ⋅ John Collomosse ⋅ Yang Liu
Exhibit Hall I #448
YOLOE: Real-Time Seeing Anything Poster Session 5 & Exhibit Hall
Ao Wang ⋅ Lihao Liu ⋅ Hui Chen ⋅ Zijia Lin ⋅ Jungong Han ⋅ Guiguang Ding
Exhibit Hall I #449
Mixture-of-Scores: Robust Image-Text Data Valuation via Three Lines of Code Poster Session 5 & Exhibit Hall
WU Sitong ⋅ Haoru Tan ⋅ Yukang Chen ⋅ Shaofeng Zhang ⋅ Jingyao Li ⋅ Bei Yu ⋅ Xiaojuan Qi ⋅ Jiaya Jia
Exhibit Hall I #450
Benchmarking Burst Super-Resolution for Polarization Images: Noise Dataset and Analysis Poster Session 6 & Exhibit Hall with Coffee Break
Inseung Hwang ⋅ Kiseok Choi ⋅ Hyunho Ha ⋅ Min H. Kim
Exhibit Hall I #17
X2-Gaussian: 4D Radiative Gaussian Splatting for Continuous-time Tomographic Reconstruction Poster Session 6 & Exhibit Hall with Coffee Break
Weihao Yu ⋅ Yuanhao Cai ⋅ Ruyi Zha ⋅ Zhiwen Fan ⋅ Chenxin Li ⋅ Yixuan Yuan
Exhibit Hall I #1
HyperGCT: A Dynamic Hyper-GNN-Learned Geometric Constraint for 3D Registration Poster Session 6 & Exhibit Hall with Coffee Break
Xiyu Zhang ⋅ Jiayi Ma ⋅ Jianwei Guo ⋅ Wei Hu ⋅ Zhaoshuai Qi ⋅ Fei HUI ⋅ Jiaqi Yang ⋅ Yanning Zhang
Exhibit Hall I #3
AD-GS: Object-Aware B-Spline Gaussian Splatting for Self-Supervised Autonomous Driving Poster Session 6 & Exhibit Hall with Coffee Break
Jiawei Xu ⋅ Kai Deng ⋅ Zexin Fan ⋅ Shenlong Wang ⋅ Jin Xie ⋅ jian Yang
Exhibit Hall I #5
EvaGaussians: Event Stream Assisted Gaussian Splatting from Blurry Images Poster Session 6 & Exhibit Hall with Coffee Break
Wangbo Yu ⋅ Chaoran Feng ⋅ Jianing Li ⋅ Jiye Tang ⋅ Jiashu Yang ⋅ Zhenyu Tang ⋅ Meng Cao ⋅ Xu Jia ⋅ Yuchao Yang ⋅ Li Yuan ⋅ Yonghong Tian
Exhibit Hall I #6
All in One: Visual-Description-Guided Unified Point Cloud Segmentation Poster Session 6 & Exhibit Hall with Coffee Break
Zongyan Han ⋅ Mohamed El Amine Boudjoghra ⋅ Jiahua Dong ⋅ Jinhong Wang ⋅ Rao Anwer
Exhibit Hall I #11
Bolt3D: Generating 3D Scenes in Seconds Poster Session 6 & Exhibit Hall with Coffee Break
Stanislaw Szymanowicz ⋅ Jason Y. Zhang ⋅ Pratul Srinivasan ⋅ Ruiqi Gao ⋅ Arthur Brussee ⋅ Aleksander Holynski ⋅ Ricardo Martin Brualla ⋅ Jonathan Barron ⋅ Philipp Henzler
Exhibit Hall I #12
Semantic Causality-Aware Vision-Based 3D Occupancy Prediction Poster Session 6 & Exhibit Hall with Coffee Break
Dubing Chen ⋅ Huan Zheng ⋅ Yucheng Zhou ⋅ Xianfei Li ⋅ Wenlong Liao ⋅ Tao He ⋅ Pai Peng ⋅ Jianbing Shen
Exhibit Hall I #15
U-ViLAR: Uncertainty-Aware Visual Localization for Autonomous Driving via Differentiable Association and Registration Poster Session 6 & Exhibit Hall with Coffee Break
Xiaofan Li ⋅ Zhihao Xu ⋅ Chenming Wu ⋅ Zhao Yang ⋅ Yumeng Zhang ⋅ Jiang-Jiang Liu ⋅ Haibao Yu ⋅ Xiaoqing Ye ⋅ YuAn Wang ⋅ Shirui Li ⋅ Xun Sun ⋅ Ji Wan ⋅ Jun Wang
Exhibit Hall I #16
Large Scene Generation with Cube-Absorb Discrete Diffusion Poster Session 6 & Exhibit Hall with Coffee Break
Qianjiang Hu ⋅ Wei Hu
Exhibit Hall I #44
Group Inertial Poser: Multi-Person Pose and Global Translation from Sparse Inertial Sensors and Ultra-Wideband Ranging Poster Session 6 & Exhibit Hall with Coffee Break
Ying Xue ⋅ Jiaxi Jiang ⋅ Rayan Armani ⋅ Dominik Hollidt ⋅ Yi-Chi Liao ⋅ Christian Holz
Exhibit Hall I #18
RESCUE: Crowd Evacuation Simulation via Controlling SDM-United Characters Poster Session 6 & Exhibit Hall with Coffee Break
Xiaolin Liu ⋅ Tianyi zhou ⋅ Hongbo Kang ⋅ Jian Ma ⋅ Ziwen Wang ⋅ Jing Huang ⋅ Wenguo Weng ⋅ Yu-Kun Lai ⋅ Kun Li
Exhibit Hall I #22
SG-LDM: Semantic-Guided LiDAR Generation via Latent-Aligned Diffusion Poster Session 6 & Exhibit Hall with Coffee Break
Zhengkang Xiang ⋅ Zizhao Li ⋅ Amir Khodabandeh ⋅ Kourosh Khoshelham
Exhibit Hall I #23
LookOut: Real-World Humanoid Egocentric Navigation Poster Session 6 & Exhibit Hall with Coffee Break
Boxiao Pan ⋅ Adam Harley ⋅ Francis Engelmann ⋅ Karen Liu ⋅ Leonidas Guibas
Exhibit Hall I #24
Occupancy Learning with Spatiotemporal Memory Poster Session 6 & Exhibit Hall with Coffee Break
Ziyang Leng ⋅ Jiawei Yang ⋅ Wenlong Yi ⋅ Bolei Zhou
Exhibit Hall I #179
PointGAC: Geometric-Aware Codebook for Masked Point Modeling Poster Session 6 & Exhibit Hall with Coffee Break
Abiao Li ⋅ Chenlei Lv ⋅ Guofeng Mei ⋅ Yifan Zuo ⋅ Jian Zhang ⋅ Yuming Fang
Exhibit Hall I #25
Statistical Confidence Rescoring for Robust 3D Scene Graph Generation from Multi-View Images Poster Session 6 & Exhibit Hall with Coffee Break
Qi Xun Yeo ⋅ Yanyan Li ⋅ Gim Hee Lee
Exhibit Hall I #26
PRM: Photometric Stereo based Large Reconstruction Model Poster Session 6 & Exhibit Hall with Coffee Break
Wenhang Ge ⋅ Jiantao Lin ⋅ Guibao SHEN ⋅ Jiawei Feng ⋅ Tao Hu ⋅ Xinli Xu ⋅ Ying-Cong Chen
Exhibit Hall I #27
4D Gaussian Splatting SLAM Poster Session 6 & Exhibit Hall with Coffee Break
Yanyan Li ⋅ Youxu Fang ⋅ Zunjie Zhu ⋅ Kunyi Li ⋅ Yong Ding ⋅ Federico Tombari
Exhibit Hall I #28
Generalizable Non-Line-of-Sight Imaging with Learnable Physical Priors Poster Session 6 & Exhibit Hall with Coffee Break
Shida Sun ⋅ Yue Li ⋅ Yueyi Zhang ⋅ Zhiwei Xiong
Exhibit Hall I #30
Hi3DGen: High-fidelity 3D Geometry Generation from Images via Normal Bridging Poster Session 6 & Exhibit Hall with Coffee Break
Chongjie Ye ⋅ Yushuang Wu ⋅ Ziteng Lu ⋅ Jiahao Chang ⋅ Xiaoyang Guo ⋅ Jiaqing Zhou ⋅ Hao Zhao ⋅ Xiaoguang Han
Exhibit Hall I #31
SuperMat: Physically Consistent PBR Material Estimation at Interactive Rates Poster Session 6 & Exhibit Hall with Coffee Break
Yijia Hong ⋅ Yuan-Chen Guo ⋅ Ran Yi ⋅ Yulong Chen ⋅ Yanpei Cao ⋅ Lizhuang Ma
Exhibit Hall I #34
RI3D: Few-Shot Gaussian Splatting With Repair and Inpainting Diffusion Priors Poster Session 6 & Exhibit Hall with Coffee Break
Avinash Paliwal ⋅ xilong zhou ⋅ Wei Ye ⋅ Jinhui Xiong ⋅ Rakesh Ranjan ⋅ Nima Kalantari
Exhibit Hall I #35
Dual-S3D: Hierarchical Dual-Path Selective SSM-CNN for High-Fidelity Implicit Reconstruction Poster Session 6 & Exhibit Hall with Coffee Break
Luoxi Zhang ⋅ Pragyan Shrestha ⋅ Yu Zhou ⋅ Chun Xie ⋅ Itaru Kitahara
Exhibit Hall I #36
AlignDiff: Learning Physically-Grounded Camera Alignment via Diffusion Poster Session 6 & Exhibit Hall with Coffee Break
Liuyue Xie ⋅ Jiancong Guo ⋅ Ozan Cakmakci ⋅ Andre Araujo ⋅ Laszlo A. A. Jeni ⋅ zhiheng jia
Exhibit Hall I #210
FastPoint: Accelerating 3D Point Cloud Model Inference via Sample Point Distance Prediction Poster Session 6 & Exhibit Hall with Coffee Break
Donghyun Lee ⋅ Dawoon Jeong ⋅ Jae W. Lee ⋅ Hongil Yoon
Exhibit Hall I #37
RobuSTereo: Robust Zero-Shot Stereo Matching under Adverse Weather Poster Session 6 & Exhibit Hall with Coffee Break
Yuran Wang ⋅ Yingping Liang ⋅ Yutao Hu ⋅ Ying Fu
Exhibit Hall I #39
Gaussian Splatting with Discretized SDF for Relightable Assets Poster Session 6 & Exhibit Hall with Coffee Break
Zuo-Liang Zhu ⋅ jian Yang ⋅ Beibei Wang
Exhibit Hall I #41
MMGeo: Multimodal Compositional Geo-Localization for UAVs Poster Session 6 & Exhibit Hall with Coffee Break
Yuxiang Ji ⋅ Boyong He ⋅ Zhuoyue Tan ⋅ Liaoni Wu
Exhibit Hall I #42
AdaptiveAE: An Adaptive Exposure Strategy for HDR Capturing in Dynamic Scenes Poster Session 6 & Exhibit Hall with Coffee Break
Tianyi Xu ⋅ Fan Zhang ⋅ Boxin Shi ⋅ Tianfan Xue ⋅ Yujin Wang
Exhibit Hall I #43
SynAD: Enhancing Real-World End-to-End Autonomous Driving Models through Synthetic Data Integration Poster Session 6 & Exhibit Hall with Coffee Break
Jongsuk Kim ⋅ Jae Young Lee ⋅ Gyojin Han ⋅ Dong-Jae Lee ⋅ Minki Jeong ⋅ Junmo Kim
Exhibit Hall I #45
Benchmarking Egocentric Visual-Inertial SLAM at City Scale Poster Session 6 & Exhibit Hall with Coffee Break
Anusha Krishnan ⋅ Shaohui Liu ⋅ Paul-Edouard Sarlin ⋅ Oscar Gentilhomme ⋅ David Caruso ⋅ Maurizio Monge ⋅ Richard Newcombe ⋅ Jakob Engel ⋅ Marc Pollefeys
Exhibit Hall I #46
Neural Shell Texture Splatting: More Details and Fewer Primitives Poster Session 6 & Exhibit Hall with Coffee Break
Xin Zhang ⋅ Anpei Chen ⋅ Jincheng Xiong ⋅ Pinxuan Dai ⋅ Yujun Shen ⋅ Weiwei Xu
Exhibit Hall I #48
Gaussian-based World Model: Gaussian Priors for Voxel-Based Occupancy Prediction and Future Motion Prediction Poster Session 6 & Exhibit Hall with Coffee Break
Tuo Feng ⋅ Wenguan Wang ⋅ Yi Yang
Exhibit Hall I #49
Momentum-GS: Momentum Gaussian Self-Distillation for High-Quality Large Scene Reconstruction Poster Session 6 & Exhibit Hall with Coffee Break
JIXUAN FAN ⋅ Wanhua Li ⋅ Yifei Han ⋅ Tianru Dai ⋅ Yansong Tang
Exhibit Hall I #50
JointDiT: Enhancing RGB-Depth Joint Modeling with Diffusion Transformers Poster Session 6 & Exhibit Hall with Coffee Break
Kwon Byung-Ki ⋅ Qi Dai ⋅ Lee Hyoseok ⋅ Chong Luo ⋅ Tae-Hyun Oh
Exhibit Hall I #51
A Real-world Display Inverse Rendering Dataset Poster Session 6 & Exhibit Hall with Coffee Break
Seokjun Choi ⋅ Hoon-Gyu Chung ⋅ Yujin Jeon ⋅ Giljoo Nam ⋅ Seung-Hwan Baek
Exhibit Hall I #52
RCTDistill: Cross-Modal Knowledge Distillation Framework for Radar-Camera 3D Object Detection with Temporal Fusion Poster Session 6 & Exhibit Hall with Coffee Break
Geonho Bang ⋅ Minjae Seong ⋅ Jisong Kim ⋅ Geunju Baek ⋅ Daye Oh ⋅ Junhyung Kim ⋅ Junho Koh ⋅ Jun Won Choi
Exhibit Hall I #56
Federated Domain Generalization with Domain-specific Soft Prompts Generation Poster Session 1 & Exhibit Hall
Jianhan Wu ⋅ Xiaoyang Qu ⋅ Zhangcheng Huang ⋅ Jianzong Wang
Exhibit Hall I #215
GCRayDiffusion: Pose-Free Surface Reconstruction via Geometric Consistent Ray Diffusion Poster Session 6 & Exhibit Hall with Coffee Break
Li-Heng Chen ⋅ Zi-Xin Zou ⋅ Chang Liu ⋅ Tianjiao Jing ⋅ Yanpei Cao ⋅ Shi-Sheng Huang ⋅ Hongbo Fu ⋅ Hua Huang
Exhibit Hall I #58
GSRecon: Efficient Generalizable Gaussian Splatting for Surface Reconstruction from Sparse Views Poster Session 6 & Exhibit Hall with Coffee Break
Hang Yang ⋅ Le Hui ⋅ Jianjun Qian ⋅ Jin Xie ⋅ Jian Yang
Exhibit Hall I #59
REPARO: Compositional 3D Assets Generation with Differentiable 3D Layout Alignment Poster Session 6 & Exhibit Hall with Coffee Break
Haonan Han ⋅ Rui Yang ⋅ Huan Liao ⋅ Haonan Han ⋅ Zunnan Xu ⋅ Xiaoming Yu ⋅ Junwei Zha ⋅ Xiu Li ⋅ Wanhua Li
Exhibit Hall I #61
Towards Safer and Understandable Driver Intention Prediction Poster Session 6 & Exhibit Hall with Coffee Break
Mukilan Karuppasamy ⋅ Shankar Gangisetty ⋅ Shyam Nandan Rai ⋅ Carlo Masone ⋅ C.V. Jawahar
Exhibit Hall I #62
V2XPnP: Vehicle-to-Everything Spatio-Temporal Fusion for Multi-Agent Perception and Prediction Poster Session 6 & Exhibit Hall with Coffee Break
Zewei Zhou ⋅ Hao Xiang ⋅ Zhaoliang Zheng ⋅ Zhihao Zhao ⋅ Mingyue Lei ⋅ Yun Zhang ⋅ Tianhui Cai ⋅ Xinyi Liu ⋅ Johnson Liu ⋅ Maheswari Bajji ⋅ Xin Xia ⋅ Zhiyu Huang ⋅ Bolei Zhou ⋅ Jiaqi Ma
Exhibit Hall I #64
InstaDrive: Instance-Aware Driving World Models for Realistic and Consistent Video Generation Poster Session 6 & Exhibit Hall with Coffee Break
Zhuoran Yang ⋅ Xi Guo ⋅ Chenjing Ding ⋅ Chiyu Wang ⋅ Wei Wu ⋅ Yanyong Zhang
Exhibit Hall I #65
NormalLoc: Visual Localization on Textureless 3D Models using Surface Normals Poster Session 6 & Exhibit Hall with Coffee Break
Jiro Abe ⋅ Gaku Nakano ⋅ Kazumine Ogura
Exhibit Hall I #66
EmbodiedSplat: Personalized Real-to-Sim-to-Real Navigation with Gaussian Splats from a Mobile Device Poster Session 6 & Exhibit Hall with Coffee Break
Gunjan Chhablani ⋅ Xiaomeng Ye ⋅ Muhammad Zubair Irshad ⋅ Zsolt Kira
Exhibit Hall I #67
FreeSplatter: Pose-free Gaussian Splatting for Sparse-view 3D Reconstruction Poster Session 6 & Exhibit Hall with Coffee Break
Jiale Xu ⋅ Shenghua Gao ⋅ Ying Shan
Exhibit Hall I #68
NGD: Neural Gradient Based Deformation for Monocular Garment Reconstruction Poster Session 6 & Exhibit Hall with Coffee Break
Soham Dasgupta ⋅ Shanthika Naik ⋅ Preet Savalia ⋅ Sujay Kumar Ingle ⋅ Avinash Sharma
Exhibit Hall I #72
Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations Poster Session 6 & Exhibit Hall with Coffee Break
Xiang Xu ⋅ Lingdong Kong ⋅ Song Wang ⋅ Chuanwei Zhou ⋅ Qingshan Liu
Exhibit Hall I #76
Lifting the Structural Morphing for Wide-Angle Images Rectification: Unified Content and Boundary Modeling Poster Session 6 & Exhibit Hall with Coffee Break
Wenting Luan ⋅ Siqi Lu ⋅ Yongbin Zheng ⋅ Wanying XU ⋅ Lang Nie ⋅ Zongtan Zhou ⋅ Kang Liao
Exhibit Hall I #78
Global Regulation and Excitation via Attention Tuning for Stereo Matching Poster Session 6 & Exhibit Hall with Coffee Break
Jiahao LI ⋅ Xinhong Chen ⋅ Zhengmin JIANG ⋅ Qian Zhou ⋅ Yung-Hui Li ⋅ Jianping Wang
Exhibit Hall I #79
UniOcc: A Unified Benchmark for Occupancy Forecasting and Prediction in Autonomous Driving Poster Session 6 & Exhibit Hall with Coffee Break
Yuping Wang ⋅ Xiangyu Huang ⋅ Xiaokang Sun ⋅ Mingxuan Yan ⋅ Shuo Xing ⋅ Zhengzhong Tu ⋅ Jiachen Li
Exhibit Hall I #81
Free4D: Tuning-free 4D Scene Generation with Spatial-Temporal Consistency Poster Session 6 & Exhibit Hall with Coffee Break
Tianqi Liu ⋅ Zihao Huang ⋅ Zhaoxi Chen ⋅ Guangcong Wang ⋅ Shoukang Hu ⋅ Liao Shen ⋅ Huiqiang Sun ⋅ Zhiguo Cao ⋅ Wei Li ⋅ Ziwei Liu
Exhibit Hall I #82
HiP-AD: Hierarchical and Multi-Granularity Planning with Deformable Attention for Autonomous Driving in a Single Decoder Poster Session 6 & Exhibit Hall with Coffee Break
Yingqi Tang ⋅ Zhuoran Xu ⋅ Zhaotie Meng ⋅ Erkang Cheng
Exhibit Hall I #85
RayletDF: Raylet Distance Fields for Generalizable 3D Surface Reconstruction from Point Clouds or Gaussians Poster Session 6 & Exhibit Hall with Coffee Break
Shenxing Wei ⋅ Jinxi Li ⋅ Yafei YANG ⋅ Siyuan Zhou ⋅ Bo Yang
Exhibit Hall I #86
Semantic-guided Camera Ray Regression for Visual Localization Poster Session 6 & Exhibit Hall with Coffee Break
Yesheng Zhang ⋅ Xu Zhao
Exhibit Hall I #88
SketchSplat: 3D Edge Reconstruction via Differentiable Multi-view Sketch Splatting Poster Session 6 & Exhibit Hall with Coffee Break
Haiyang Ying ⋅ Matthias Zwicker
Exhibit Hall I #89
Polarimetric Neural Field via Unified Complex-Valued Wave Representation Poster Session 6 & Exhibit Hall with Coffee Break
Chu Zhou ⋅ Yixin Yang ⋅ Junda Liao ⋅ Heng Guo ⋅ Boxin Shi ⋅ Imari Sato
Exhibit Hall I #90
High-Precision 3D Measurement of Complex Textured Surfaces Using Multiple Filtering Approach Poster Session 6 & Exhibit Hall with Coffee Break
Yuchong Chen ⋅ Jian Yu ⋅ Shaoyan Gai ⋅ Zeyu Cai ⋅ Feipeng Da
Exhibit Hall I #91
AutoScape: Geometry-Consistent Long-Horizon Scene Generation Poster Session 6 & Exhibit Hall with Coffee Break
Jiacheng Chen ⋅ Ziyu Jiang ⋅ Mingfu Liang ⋅ Bingbing Zhuang ⋅ Jong-Chyi Su ⋅ Sparsh Garg ⋅ Ying Wu ⋅ Manmohan Chandraker
Exhibit Hall I #94
From Gallery to Wrist: Realistic 3D Bracelet Insertion in Videos Poster Session 6 & Exhibit Hall with Coffee Break
Chenjian Gao ⋅ Lihe Ding ⋅ Rui Han ⋅ Zhanpeng Huang ⋅ Zibin Wang ⋅ Tianfan Xue
Exhibit Hall I #95
Street Gaussians without 3D Object Tracker Poster Session 6 & Exhibit Hall with Coffee Break
Ruida Zhang ⋅ Chengxi Li ⋅ Chenyangguang Zhang ⋅ Xingyu Liu ⋅ Haili Yuan ⋅ Yanyan Li ⋅ Xiangyang Ji ⋅ Gim Hee Lee
Exhibit Hall I #96
HiNeuS: High-fidelity Neural Surface Mitigating Low-texture and Reflective Ambiguity Poster Session 6 & Exhibit Hall with Coffee Break
Yida Wang ⋅ Xueyang Zhang ⋅ Kun Zhan ⋅ Peng Jia ⋅ XianPeng Lang
Exhibit Hall I #98
RGE-GS: Reward-Guided Expansive Driving Scene Reconstruction via Diffusion Priors Poster Session 6 & Exhibit Hall with Coffee Break
Sicong Du ⋅ Jiarun Liu ⋅ Qifeng Chen ⋅ Hao-Xiang Chen ⋅ Tai-Jiang Mu ⋅ Sheng Yang
Exhibit Hall I #99
Scene Coordinate Reconstruction Priors Poster Session 6 & Exhibit Hall with Coffee Break
Wenjing Bian ⋅ Axel Barroso-Laguna ⋅ Tommaso Cavallari ⋅ Victor Prisacariu ⋅ Eric Brachmann
Exhibit Hall I #100
Resonance: Learning to Predict Social-Aware Pedestrian Trajectories as Co-Vibrations Poster Session 6 & Exhibit Hall with Coffee Break
Conghao Wong ⋅ Ziqian Zou ⋅ Beihao Xia
Exhibit Hall I #102
I2-World: Intra-Inter Tokenization for Efficient Dynamic 4D Scene Forecasting Poster Session 6 & Exhibit Hall with Coffee Break
Zhimin Liao ⋅ Ping Wei ⋅ Ruijie Zhang ⋅ Shuaijia Chen ⋅ Haoxuan Wang ⋅ Ziyang Ren
Exhibit Hall I #104
InsideOut: Integrated RGB-Radiative Gaussian Splatting for Comprehensive 3D Object Representation Poster Session 6 & Exhibit Hall with Coffee Break
Jungmin Lee ⋅ Seonghyuk Hong ⋅ Juyong Lee ⋅ Jaeyoon Lee ⋅ Jongwon Choi
Exhibit Hall I #105
RIOcc: Efficient Cross-Modal Fusion Transformer with Collaborative Feature Refinement for 3D Semantic Occupancy Prediction Poster Session 6 & Exhibit Hall with Coffee Break
Baojie Fan ⋅ Xiaotian Li ⋅ Yuhan Zhou ⋅ Yuyu Jiang ⋅ Jiandong Tian ⋅ Huijie Fan
Exhibit Hall I #108
TeethGenerator: A two-stage framework for paired pre- and post-orthodontic 3D dental data generation Poster Session 6 & Exhibit Hall with Coffee Break
Changsong Lei ⋅ Yaqian Liang ⋅ Shaofeng Wang ⋅ Jiajia Dai ⋅ Yong-Jin Liu
Exhibit Hall I #110
Removing Out-of-Focus Reflective Flares via Color Alignment Poster Session 2 & Exhibit Hall with Coffee Break
Fengbo Lan ⋅ Chang Wen Chen
Exhibit Hall I #445
Towards Accurate and Efficient 3D Object Detection for Autonomous Driving: A Mixture of Experts Computing System on Edge Poster Session 6 & Exhibit Hall with Coffee Break
Linshen Liu ⋅ Boyan Su ⋅ Junyue Jiang ⋅ Guanlin Wu ⋅ Cong Guo ⋅ Ceyu Xu ⋅ Hao Frank Yang
Exhibit Hall I #113
Degradation-Modeled Multipath Diffusion for Tunable Metalens Photography Poster Session 6 & Exhibit Hall with Coffee Break
Jianing Zhang ⋅ Jiayi Zhu ⋅ Feiyu Ji ⋅ Xiaokang Yang ⋅ Xiaoyun Yuan
Exhibit Hall I #114
GS-Occ3D: Scaling Vision-only Occupancy Reconstruction with Gaussian Splatting Poster Session 6 & Exhibit Hall with Coffee Break
Baijun Ye ⋅ Minghui Qin ⋅ Saining Zhang ⋅ Moonjun Gong ⋅ Shaoting Zhu ⋅ Hao Zhao ⋅ Hang Zhao
Exhibit Hall I #115
MetaScope: Optics-Driven Neural Network for Ultra-Micro Metalens Endoscopy Poster Session 6 & Exhibit Hall with Coffee Break
Wuyang Li ⋅ Wentao Pan ⋅ Xiaoyuan Liu ⋅ Zhendong Luo ⋅ Chenxin Li ⋅ Hengyu Liu ⋅ Din Tsai ⋅ Mu Chen ⋅ Yixuan Yuan
Exhibit Hall I #116
CoLMDriver: LLM-based Negotiation Benefits Cooperative Autonomous Driving Poster Session 6 & Exhibit Hall with Coffee Break
Changxing Liu ⋅ Genjia Liu ⋅ Zijun Wang ⋅ Jinchang Yang ⋅ Siheng Chen
Exhibit Hall I #117
Free-running vs Synchronous: Single-Photon Lidar for High-flux 3D Imaging Poster Session 6 & Exhibit Hall with Coffee Break
Ruangrawee Kitichotkul ⋅ Shashwath Bharadwaj ⋅ Joshua Rapp ⋅ Yanting Ma ⋅ Alexander Mehta ⋅ Vivek Goyal
Exhibit Hall I #119
Mitigating Geometric Degradation in Fast DownSampling via FastAdapter for Point Cloud Segmentation Poster Session 6 & Exhibit Hall with Coffee Break
Shuofeng Sun ⋅ Haibin Yan
Exhibit Hall I #120
Noise2Score3D: Tweedie's Approach for Unsupervised Point Cloud Denoising Poster Session 6 & Exhibit Hall with Coffee Break
Xiangbin Wei ⋅ Yuanfeng Wang ⋅ Ao XU ⋅ Lingyu Zhu ⋅ Dongyong Sun ⋅ Keren Li ⋅ Yang Li ⋅ Qi Qin
Exhibit Hall I #121
ClaraVid: A Holistic Scene Reconstruction Benchmark From Aerial Perspective With Delentropy-Based Complexity Profiling Poster Session 6 & Exhibit Hall with Coffee Break
Radu Beche ⋅ Sergiu Nedevschi
Exhibit Hall I #123
Discontinuity-aware Normal Integration for Generic Central Camera Models Poster Session 6 & Exhibit Hall with Coffee Break
Francesco Milano ⋅ Manuel Lopez-Antequera ⋅ Naina Dhingra ⋅ Roland Siegwart ⋅ Robert Thiel
Exhibit Hall I #124
SEHDR: Single-Exposure HDR Novel View Synthesis via 3D Gaussian Bracketing Poster Session 6 & Exhibit Hall with Coffee Break
Yiyu Li ⋅ Haoyuan Wang ⋅ Ke Xu ⋅ Gerhard Hancke ⋅ Rynson W.H. Lau
Exhibit Hall I #126
SL2A-INR: Single-Layer Learnable Activation for Implicit Neural Representation Poster Session 6 & Exhibit Hall with Coffee Break
Reza Rezaeian ⋅ Moein Heidari ⋅ Reza Azad ⋅ Dorit Merhof ⋅ Hamid Soltanian-Zadeh ⋅ Ilker Hacihaliloglu
Exhibit Hall I #128
TARS: Traffic-Aware Radar Scene Flow Estimation Poster Session 6 & Exhibit Hall with Coffee Break
Jialong Wu ⋅ Marco Braun ⋅ Dominic Spata ⋅ Matthias Rottmann
Exhibit Hall I #129
DoppDrive: Doppler-Driven Temporal Aggregation for Improved Radar Object Detection Poster Session 6 & Exhibit Hall with Coffee Break
Yuval Haitman ⋅ Oded Bialer
Exhibit Hall I #130
Leaps and Bounds: An Improved Point Cloud Winding Number Formulation for Fast Normal Estimation and Surface Reconstruction Poster Session 6 & Exhibit Hall with Coffee Break
Chamin Hewa Koneputugodage ⋅ Dylan Campbell ⋅ Stephen Gould
Exhibit Hall I #133
GVDepth: Zero-Shot Monocular Depth Estimation for Ground Vehicles based on Probabilistic Cue Fusion Poster Session 6 & Exhibit Hall with Coffee Break
Karlo Koledic ⋅ Luka Petrovic ⋅ Ivan Marković ⋅ Ivan Petrovic
Exhibit Hall I #134
Harnessing Text-to-Image Diffusion Models for Point Cloud Self-Supervised Learning Poster Session 6 & Exhibit Hall with Coffee Break
Yiyang Chen ⋅ Shanshan Zhao ⋅ Lunhao Duan ⋅ Changxing Ding ⋅ Dacheng Tao
Exhibit Hall I #138
OD-RASE: Ontology-Driven Risk Assessment and Safety Enhancement for Autonomous Driving Poster Session 6 & Exhibit Hall with Coffee Break
Kota Shimomura ⋅ Masaki Nambata ⋅ Atsuya Ishikawa ⋅ Ryota Mimura ⋅ Takayuki Kawabuchi ⋅ Takayoshi Yamashita ⋅ Koki Inoue
Exhibit Hall I #139
MDP-Omni: Parameter-free Multimodal Depth Prior-based Sampling for Omnidirectional Stereo Matching Poster Session 6 & Exhibit Hall with Coffee Break
Eunjin Son ⋅ HyungGi Jo ⋅ Wookyong Kwon ⋅ Sang Jun Lee
Exhibit Hall I #140
DistillDrive: End-to-End Multi-Mode Autonomous Driving Distillation by Isomorphic Hetero-Source Planning Model Poster Session 6 & Exhibit Hall with Coffee Break
Rui Yu ⋅ Xianghang Zhang ⋅ Runkai Zhao ⋅ Huaicheng Yan ⋅ Meng Wang
Exhibit Hall I #141
EDM: Efficient Deep Feature Matching Poster Session 6 & Exhibit Hall with Coffee Break
Xi Li ⋅ Tong Rao ⋅ Cihui Pan
Exhibit Hall I #142
GS-ID: Illumination Decomposition on Gaussian Splatting via Adaptive Light Aggregation and Diffusion-Guided Material Priors Poster Session 6 & Exhibit Hall with Coffee Break
Kang DU ⋅ Zhihao Liang ⋅ Yulin Shen ⋅ Zeyu Wang
Exhibit Hall I #146
NeRF Is a Valuable Assistant for 3D Gaussian Splatting Poster Session 6 & Exhibit Hall with Coffee Break
Shuangkang Fang ⋅ I-Chao Shen ⋅ Takeo Igarashi ⋅ Yufeng Wang ⋅ ZeSheng Wang ⋅ Yi Yang ⋅ Wenrui Ding ⋅ Shuchang Zhou
Exhibit Hall I #147
UniGS: Modeling Unitary 3D Gaussians for Novel View Synthesis from Sparse-view Images Poster Session 6 & Exhibit Hall with Coffee Break
Jiamin WU ⋅ Kenkun Liu ⋅ Xiaoke Jiang ⋅ Yuan Yao ⋅ Lei Zhang
Exhibit Hall I #148
TOTP: Transferable Online Pedestrian Trajectory Prediction with Temporal-Adaptive Mamba Latent Diffusion Poster Session 6 & Exhibit Hall with Coffee Break
Ziyang Ren ⋅ Ping Wei ⋅ Shangqi Deng ⋅ Haowen Tang ⋅ Jiapeng Li ⋅ Huan Li
Exhibit Hall I #150
UnMix-NeRF: Spectral Unmixing Meets Neural Radiance Fields Poster Session 6 & Exhibit Hall with Coffee Break
Fabian Perez ⋅ Sara Rojas Martinez ⋅ Carlos Hinojosa ⋅ Hoover Rueda-Chacón ⋅ Bernard Ghanem
Exhibit Hall I #152
MaterialMVP: Illumination-Invariant Material Generation via Multi-view PBR Diffusion Poster Session 6 & Exhibit Hall with Coffee Break
Zebin He ⋅ Mx Yang ⋅ Shuhui Yang ⋅ Yixuan Tang ⋅ Tao Wang ⋅ Kaihao Zhang ⋅ Guanying Chen ⋅ Lliu Yuhong ⋅ Jie Jiang ⋅ Chunchao Guo ⋅ Wenhan Luo
Exhibit Hall I #153
7DGS: Unified Spatial-Temporal-Angular Gaussian Splatting Poster Session 6 & Exhibit Hall with Coffee Break
Zhongpai Gao ⋅ Benjamin Planche ⋅ Meng Zheng ⋅ Anwesa Choudhuri ⋅ Terrence Chen ⋅ Ziyan Wu
Exhibit Hall I #155
StochasticSplats: Stochastic Rasterization for Sorting-Free 3D Gaussian Splatting Poster Session 6 & Exhibit Hall with Coffee Break
Shakiba Kheradmand ⋅ Delio Vicini ⋅ George Kopanas ⋅ Dmitry Lagun ⋅ Kwang Moo Yi ⋅ Mark Matthews ⋅ Andrea Tagliasacchi
Exhibit Hall I #156
TurboReg: TurboClique for Robust and Efficient Point Cloud Registration Poster Session 6 & Exhibit Hall with Coffee Break
Shaocheng Yan ⋅ Pengcheng Shi ⋅ Zhenjun Zhao ⋅ Kaixin Wang ⋅ Kuang Cao ⋅ Ji Wu ⋅ Jiayuan Li
Exhibit Hall I #160
Efficient Spiking Point Mamba for Point Cloud Analysis Poster Session 6 & Exhibit Hall with Coffee Break
Peixi Wu ⋅ Bosong Chai ⋅ Menghua Zheng ⋅ Wei Li ⋅ Zhangchi Hu ⋅ Jie Chen ⋅ Zheyu Zhang ⋅ Hebei Li ⋅ Xiaoyan Sun
Exhibit Hall I #162
SpatialSplat: Efficient Semantic 3D from Sparse Unposed Images Poster Session 6 & Exhibit Hall with Coffee Break
Yu Sheng ⋅ Jiajun Deng ⋅ Xinran Zhang ⋅ Yu Zhang ⋅ Bei Hua ⋅ Yanyong Zhang ⋅ Jianmin Ji
Exhibit Hall I #163
CoMoGaussian: Continuous Motion-Aware Gaussian Splatting from Motion-Blurred Images Poster Session 6 & Exhibit Hall with Coffee Break
Jungho Lee ⋅ DongHyeong Kim ⋅ Dogyoon Lee ⋅ Suhwan Cho ⋅ Minhyeok Lee ⋅ Wonjoon Lee ⋅ Taeoh Kim ⋅ Dongyoon Wee ⋅ Sangyoun Lee
Exhibit Hall I #164
Generalized and Efficient 2D Gaussian Splatting for Arbitrary-scale Super-Resolution Poster Session 6 & Exhibit Hall with Coffee Break
Du Chen ⋅ Liyi Chen ⋅ Zhengqiang ZHANG ⋅ Lei Zhang
Exhibit Hall I #166
Visual Surface Wave Elastography: Revealing Subsurface Physical Properties via Visible Surface Waves Poster Session 6 & Exhibit Hall with Coffee Break
Alexander Ogren ⋅ Berthy Feng ⋅ Jihoon Ahn ⋅ Katherine Bouman ⋅ Chiara Daraio
Exhibit Hall I #167
GaRe: Relightable 3D Gaussian Splatting for Outdoor Scenes from Unconstrained Photo Collections Poster Session 6 & Exhibit Hall with Coffee Break
Haiyang Bai ⋅ Jiaqi Zhu ⋅ Songru Jiang ⋅ Wei Huang ⋅ Tao Lu ⋅ Yuanqi Li ⋅ Jie Guo ⋅ Runze Fu ⋅ Yanwen Guo ⋅ Lijun Chen
Exhibit Hall I #168
PolarAnything: Diffusion-based Polarimetric Image Synthesis Poster Session 6 & Exhibit Hall with Coffee Break
Kailong Zhang ⋅ Youwei Lyu ⋅ Heng Guo ⋅ Si Li ⋅ Zhanyu Ma ⋅ Boxin Shi
Exhibit Hall I #169
LightCity: An Urban Dataset for Outdoor Inverse Rendering and Reconstruction under Multi-illumination Conditions Poster Session 6 & Exhibit Hall with Coffee Break
Jingjing Wang ⋅ Qirui Hu ⋅ Chong Bao ⋅ Yuke Zhu ⋅ Hujun Bao ⋅ Zhaopeng Cui ⋅ Guofeng Zhang
Exhibit Hall I #170
ETA: Efficiency through Thinking Ahead, A Dual Approach to Self-Driving with Large Models Poster Session 6 & Exhibit Hall with Coffee Break
Shadi Hamdan ⋅ Chonghao Sima ⋅ Zetong Yang ⋅ Hongyang Li ⋅ Fatma Guney
Exhibit Hall I #175
MergeOcc: Bridge the Domain Gap between Different LiDARs for Robust Occupancy Prediction Poster Session 6 & Exhibit Hall with Coffee Break
Zikun Xu ⋅ Shaobing Xu
Exhibit Hall I #176
Feature Extraction and Representation of Pre-training Point Cloud Based on Diffusion Models Poster Session 6 & Exhibit Hall with Coffee Break
Chang Qiu ⋅ Feipeng Da ⋅ Zilei Zhang
Exhibit Hall I #178
Towards Open-World Generation of Stereo Images and Unsupervised Matching Poster Session 6 & Exhibit Hall with Coffee Break
Feng Qiao ⋅ Zhexiao Xiong ⋅ Eric Xing ⋅ Nathan Jacobs
Exhibit Hall I #180
LoD-Loc v2: Aerial Visual Localization over Low Level-of-Detail City Models using Explicit Silhouette Alignment Poster Session 6 & Exhibit Hall with Coffee Break
Juelin Zhu ⋅ Shuaibang Peng ⋅ Long Wang ⋅ Hanlin Tan ⋅ Yu Liu ⋅ Maojun Zhang ⋅ Shen Yan
Exhibit Hall I #183
LANGTRAJ: Diffusion Model and Dataset for Language-Conditioned Trajectory Simulation Poster Session 6 & Exhibit Hall with Coffee Break
WEI-JER Chang ⋅ Masayoshi Tomizuka ⋅ Wei Zhan ⋅ Manmohan Chandraker ⋅ Francesco Pittaluga
Exhibit Hall I #184
Temporal Overlapping Prediction: A Self-supervised Pre-training Method for LiDAR Moving Object Segmentation Poster Session 6 & Exhibit Hall with Coffee Break
Ziliang Miao ⋅ Runjian Chen ⋅ Yixi Cai ⋅ Buwei He ⋅ Wenquan Zhao ⋅ Wenqi Shao ⋅ Bo Zhang ⋅ Fu Zhang
Exhibit Hall I #187
VideoRFSplat: Direct Scene-Level Text-to-3D Gaussian Splatting Generation with Flexible Pose and Multi-View Joint Modeling Poster Session 6 & Exhibit Hall with Coffee Break
Hyojun Go ⋅ Byeongjun Park ⋅ Hyelin Nam ⋅ Byung-Hoon Kim ⋅ Hyungjin Chung ⋅ Changick Kim
Exhibit Hall I #192
S²M²: Scalable Stereo Matching Model for Reliable Depth Estimation Poster Session 6 & Exhibit Hall with Coffee Break
JUNHONG MIN ⋅ YOUNGPIL JEON ⋅ Jimin Kim ⋅ Minyong Choi
Exhibit Hall I #194
ACE-G: Improving Generalization of Scene Coordinate Regression Through Query Pre-Training Poster Session 6 & Exhibit Hall with Coffee Break
Leonard Bruns ⋅ Axel Barroso-Laguna ⋅ Tommaso Cavallari ⋅ Áron Monszpart ⋅ Sowmya Munukutla ⋅ Victor Prisacariu ⋅ Eric Brachmann
Exhibit Hall I #196
VistaDream: Sampling multiview consistent images for single-view scene reconstruction Poster Session 6 & Exhibit Hall with Coffee Break
Haiping Wang ⋅ Yuan Liu ⋅ Ziwei Liu ⋅ Wenping Wang ⋅ Zhen Dong ⋅ Bisheng Yang
Exhibit Hall I #198
Towards Visual Localization Interoperability: Cross-Feature for Collaborative Visual Localization and Mapping Poster Session 6 & Exhibit Hall with Coffee Break
Alberto Jaenal ⋅ Paula Carbó Cubero ⋅ Jose Araujo ⋅ André Mateus
Exhibit Hall I #199
MiDSummer: Multi-Guidance Diffusion for Controllable Zero-Shot Immersive Gaussian Splatting Scene Generation Poster Session 6 & Exhibit Hall with Coffee Break
Anjun Hu ⋅ Richard Tomsett ⋅ Valentin Gourmet ⋅ Massimo Camplani ⋅ Jas Kandola ⋅ Hanting Xie
Exhibit Hall I #200
Spatio-Spectral Pattern Illumination for Direct and Indirect Separation from a Single Hyperspectral Image Poster Session 6 & Exhibit Hall with Coffee Break
Shin Ishihara ⋅ Imari Sato
Exhibit Hall I #203
Adversarial Exploitation of Data Diversity Improves Visual Localization Poster Session 6 & Exhibit Hall with Coffee Break
Sihang Li ⋅ Siqi Tan ⋅ Bowen Chang ⋅ Jing Zhang ⋅ Chen Feng ⋅ Yiming Li
Exhibit Hall I #205
GeoFormer: Geometry Point Encoder for 3D Object Detection with Graph-based Transformer Poster Session 6 & Exhibit Hall with Coffee Break
Xin Jin ⋅ Haisheng Su ⋅ Cong Ma ⋅ Kai Liu ⋅ Wei Wu ⋅ Fei HUI ⋅ Junchi Yan
Exhibit Hall I #208
Tile-wise vs. Image-wise: Random-Tile Loss and Training Paradigm for Gaussian Splatting Poster Session 6 & Exhibit Hall with Coffee Break
Xiaoyu Zhang ⋅ Weihong Pan ⋅ Xiaojun Xiang ⋅ Hongjia Zhai ⋅ Liyang Zhou ⋅ Hanqing Jiang ⋅ Guofeng Zhang
Exhibit Hall I #212
DriveArena: A Closed-loop Generative Simulation Platform for Autonomous Driving Poster Session 6 & Exhibit Hall with Coffee Break
Xuemeng Yang ⋅ Licheng Wen ⋅ Tiantian Wei ⋅ Yukai Ma ⋅ Jianbiao Mei ⋅ Xin Li ⋅ Wenjie Lei ⋅ Daocheng Fu ⋅ Pinlong Cai ⋅ Min Dou ⋅ Liang He ⋅ Yong Liu ⋅ Botian Shi ⋅ Yu Qiao
Exhibit Hall I #213
Explaining Human Preferences via Metrics for Structured 3D Reconstruction Poster Session 6 & Exhibit Hall with Coffee Break
Jack Langerman ⋅ Denis Rozumny ⋅ Yuzhong Huang ⋅ Dmytro Mishkin
Exhibit Hall I #214
CoopTrack: Exploring End-to-End Learning for Efficient Cooperative Sequential Perception Poster Session 6 & Exhibit Hall with Coffee Break
Jiaru Zhong ⋅ Jiahao Wang ⋅ Jiahui Xu ⋅ Xiaofan Li ⋅ Zaiqing Nie ⋅ Haibao Yu
Exhibit Hall I #215
UniVerse: Unleashing the Scene Prior of Video Diffusion Models for Robust Radiance Field Reconstruction Poster Session 6 & Exhibit Hall with Coffee Break
Jin Cao ⋅ Hongrui Wu ⋅ Ziyong Feng ⋅ Hujun Bao ⋅ Xiaowei Zhou ⋅ Sida Peng
Exhibit Hall I #224
RoCo-Sim: Enhancing Roadside Collaborative Perception through Foreground Simulation Poster Session 6 & Exhibit Hall with Coffee Break
Yuwen Du ⋅ Anning Hu ⋅ Zichen Chao ⋅ Yifan Lu ⋅ Junhao Ge ⋅ Genjia Liu ⋅ Wei-Tao Wu ⋅ Lanjun Wang ⋅ Siheng Chen
Exhibit Hall I #217
Inverse 3D Microscopy Rendering for Cell Shape Inference with Active Mesh Poster Session 6 & Exhibit Hall with Coffee Break
Sacha Ichbiah ⋅ Anshuman Sinha ⋅ Fabrice Delbary ⋅ Hervé Turlier
Exhibit Hall I #218
GaussRender: Learning 3D Occupancy with Gaussian Rendering Poster Session 6 & Exhibit Hall with Coffee Break
Loick Chambon ⋅ Eloi Zablocki ⋅ Alexandre Boulch ⋅ Mickael Chen ⋅ Matthieu Cord
Exhibit Hall I #222
SA-Occ: Satellite-Assisted 3D Occupancy Prediction in Real World Poster Session 6 & Exhibit Hall with Coffee Break
Chen Chen ⋅ Zhirui Wang ⋅ Taowei Sheng ⋅ Yi Jiang ⋅ Yundu Li ⋅ Peirui Cheng ⋅ Luning Zhang ⋅ Kaiqiang Chen ⋅ Yanfeng Hu ⋅ Xue Yang ⋅ Xian Sun
Exhibit Hall I #223
UPP: Unified Point-Level Prompting for Robust Point Cloud Analysis Poster Session 6 & Exhibit Hall with Coffee Break
Zixiang Ai ⋅ Zhenyu Cui ⋅ Yuxin Peng ⋅ Jiahuan Zhou
Exhibit Hall I #255
ExploreGS: Explorable 3D Scene Reconstruction with Virtual Camera Samplings and Diffusion Priors Poster Session 6 & Exhibit Hall with Coffee Break
Minsu Kim ⋅ Subin Jeon ⋅ In Cho ⋅ Mijin Yoo ⋅ Seon Joo Kim
Exhibit Hall I #225
LaneDiffusion: Improving Centerline Graph Learning via Prior Injected BEV Feature Generation Poster Session 6 & Exhibit Hall with Coffee Break
Zijie Wang ⋅ Weiming Zhang ⋅ Wei Zhang ⋅ Xiao Tan ⋅ hongxing liu ⋅ Yaowei Wang ⋅ Guanbin Li
Exhibit Hall I #226
Bridging 3D Anomaly Localization and Repair via High-Quality Continuous Geometric Representation Poster Session 6 & Exhibit Hall with Coffee Break
Bozhong Zheng ⋅ Jinye Gan ⋅ Xiaohao Xu ⋅ Xintao Chen ⋅ Wenqiao Li ⋅ Xiaonan Huang ⋅ Na Ni ⋅ Yingna Wu
Exhibit Hall I #227
CuMPerLay: Learning Cubical Multiparameter Persistence Vectorizations Poster Session 6 & Exhibit Hall with Coffee Break
Caner Korkmaz ⋅ Brighton Nuwagira ⋅ Baris Coskunuzer ⋅ Tolga Birdal
Exhibit Hall I #229
SGAD: Semantic and Geometric-aware Descriptor for Local Feature Matching Poster Session 6 & Exhibit Hall with Coffee Break
Xiangzeng Liu ⋅ CHI WANG ⋅ Guanglu Shi ⋅ Xiaodong Zhang ⋅ Qiguang Miao ⋅ Miao Fan
Exhibit Hall I #230
End-to-End Driving with Online Trajectory Evaluation via BEV World Model Poster Session 6 & Exhibit Hall with Coffee Break
Yingyan Li ⋅ Yuqi Wang ⋅ Yang Liu ⋅ Jiawei He ⋅ Lue Fan ⋅ Zhaoxiang Zhang
Exhibit Hall I #234
Planar Affine Rectification from Local Change of Scale and Orientation Poster Session 6 & Exhibit Hall with Coffee Break
Yuval Nissan ⋅ Marc Pollefeys ⋅ Daniel Barath
Exhibit Hall I #235
ERNet: Efficient Non-Rigid Registration Network for Point Sequences Poster Session 6 & Exhibit Hall with Coffee Break
Guangzhao He ⋅ Yuxi Xiao ⋅ Zhen Xu ⋅ Xiaowei Zhou ⋅ Sida Peng
Exhibit Hall I #236
SeqGrowGraph: Learning Lane Topology as a Chain of Graph Expansions Poster Session 6 & Exhibit Hall with Coffee Break
Mengwei Xie ⋅ Shuang Zeng ⋅ Xinyuan Chang ⋅ Xinran Liu ⋅ Zheng Pan ⋅ Mu Xu ⋅ Xing Wei
Exhibit Hall I #237
Doppler-Aware LiDAR-RADAR Fusion for Weather-Robust 3D Detection Poster Session 6 & Exhibit Hall with Coffee Break
Yujeong Chae ⋅ Heejun Park ⋅ Hyeonseong Kim ⋅ Kuk-Jin Yoon
Exhibit Hall I #240
Egocentric Action-aware Inertial Localization in Point Clouds with Vision-Language Guidance Poster Session 6 & Exhibit Hall with Coffee Break
Mingfang Zhang ⋅ Ryo Yonetani ⋅ Yifei Huang ⋅ Liangyang Ouyang ⋅ Ruicong Liu ⋅ Yoichi Sato
Exhibit Hall I #241
Epona: Autoregressive Diffusion World Model for Autonomous Driving Poster Session 6 & Exhibit Hall with Coffee Break
Kaiwen Zhang ⋅ Zhenyu Tang ⋅ Xiaotao Hu ⋅ Xingang Pan ⋅ Xiaoyang Guo ⋅ Yuan Liu ⋅ Jingwei Huang ⋅ Li Yuan ⋅ Qian Zhang ⋅ XIAOXIAO LONG ⋅ Xun Cao ⋅ Wei Yin
Exhibit Hall I #242
Leveraging Local Patch Alignment to Seam-cutting for Large Parallax Image Stitching Poster Session 6 & Exhibit Hall with Coffee Break
Tianli Liao ⋅ Chenyang Zhao ⋅ Lei Li ⋅ Heling Cao
Exhibit Hall I #246
InfiniCube: Unbounded and Controllable Dynamic 3D Driving Scene Generation with World-Guided Video Models Poster Session 6 & Exhibit Hall with Coffee Break
Yifan Lu ⋅ Xuanchi Ren ⋅ Jiawei Yang ⋅ Tianchang Shen ⋅ Jay Zhangjie Wu ⋅ Jun Gao ⋅ Yue Wang ⋅ Siheng Chen ⋅ Mike Chen ⋅ Sanja Fidler ⋅ Jiahui Huang
Exhibit Hall I #247
SynCity: Training-Free Generation of 3D Cities Poster Session 6 & Exhibit Hall with Coffee Break
Paul Engstler ⋅ Aleksandar Shtedritski ⋅ Iro Laina ⋅ Christian Rupprecht ⋅ Andrea Vedaldi
Exhibit Hall I #276
PriorMotion: Generative Class-Agnostic Motion Prediction with Raster-Vector Motion Field Priors Poster Session 6 & Exhibit Hall with Coffee Break
Kangan Qian ⋅ Jinyu Miao ⋅ Xinyu Jiao ⋅ Ziang Luo ⋅ Zheng Fu ⋅ Yining Shi ⋅ Yunlong Wang ⋅ Kun Jiang ⋅ Diange Yang
Exhibit Hall I #248
MGSR: 2D/3D Mutual-boosted Gaussian Splatting for High-fidelity Surface Reconstruction under Various Light Conditions Poster Session 6 & Exhibit Hall with Coffee Break
Qingyuan Zhou ⋅ Yuehu Gong ⋅ Weidong Yang ⋅ Jiaze Li ⋅ Yeqi Luo ⋅ Baixin Xu ⋅ Shuhao Li ⋅ Ben Fei ⋅ Ying He
Exhibit Hall I #249
ArgMatch: Adaptive Refinement Gathering for Efficient Dense Matching Poster Session 6 & Exhibit Hall with Coffee Break
Yuxin Deng ⋅ Kaining Zhang ⋅ Linfeng Tang ⋅ Jiaqi Yang ⋅ Jiayi Ma
Exhibit Hall I #256
RoboTron-Sim: Improving Real-World Driving via Simulated Hard-Case Poster Session 6 & Exhibit Hall with Coffee Break
Baihui Xiao ⋅ Chengjian Feng ⋅ Zhijian Huang ⋅ Feng yan ⋅ Yujie Zhong ⋅ Lin Ma
Exhibit Hall I #257
SAFT: Shape and Appearance of Fabrics from Template via Differentiable Physical Simulations from Monocular Video Poster Session 6 & Exhibit Hall with Coffee Break
David Stotko ⋅ Reinhard Klein
Exhibit Hall I #283
Thermal Polarimetric Multi-view Stereo Poster Session 6 & Exhibit Hall with Coffee Break
Takahiro Kushida ⋅ Kenichiro Tanaka
Exhibit Hall I #258
StealthAttack: Robust 3D Gaussian Splatting Poisoning via Density-Guided Illusions Poster Session 6 & Exhibit Hall with Coffee Break
Bo-Hsu Ke ⋅ You-Zhe Xie ⋅ Yu-Lun Liu ⋅ Wei-Chen Chiu
Exhibit Hall I #259
LongSplat: Robust Unposed 3D Gaussian Splatting for Casual Long Videos Poster Session 6 & Exhibit Hall with Coffee Break
Chin-Yang Lin ⋅ Cheng Sun ⋅ Fu-En Yang ⋅ Min-Hung Chen ⋅ Yen-Yu Lin ⋅ Yu-Lun Liu
Exhibit Hall I #260
WonderTurbo: Generating Interactive 3D World in 0.72 Seconds Poster Session 6 & Exhibit Hall with Coffee Break
Chaojun Ni ⋅ Xiaofeng Wang ⋅ Zheng Zhu ⋅ Weijie Wang ⋅ Haoyun Li ⋅ Guosheng Zhao ⋅ Jie Li ⋅ Wenkang Qin ⋅ Guan Huang ⋅ Wenjun Mei
Exhibit Hall I #261
SFUOD: Source-Free Unknown Object Detection Poster Session 1 & Exhibit Hall
Keon-Hee Park ⋅ Seun-An Choe ⋅ Gyeong-Moon Park
Exhibit Hall I #325
MOSAIC: Generating Consistent, Privacy-Preserving Scenes from Multiple Depth Views in Multi-Room Environments Poster Session 6 & Exhibit Hall with Coffee Break
Zhixuan Liu ⋅ Haokun Zhu ⋅ Rui Chen ⋅ Jonathan Francis ⋅ Soonmin Hwang ⋅ Ji Zhang ⋅ Jean Oh
Exhibit Hall I #264
Coordinate-based Speed of Sound Recovery for Aberration-Corrected Photoacoustic Computed Tomography Poster Session 6 & Exhibit Hall with Coffee Break
Tianao Li ⋅ Manxiu Cui ⋅ Cheng Ma ⋅ Emma Alexander
Exhibit Hall I #265
GenFlow3D: Generative Scene Flow Estimation and Prediction on Point Cloud Sequences Poster Session 6 & Exhibit Hall with Coffee Break
Hanlin Li ⋅ Wenming Weng ⋅ Yueyi Zhang ⋅ Zhiwei Xiong
Exhibit Hall I #267
Generative Gaussian Splatting: Generating 3D Scenes with Video Diffusion Priors Poster Session 6 & Exhibit Hall with Coffee Break
Katja Schwarz ⋅ Norman Müller ⋅ Peter Kontschieder
Exhibit Hall I #269
Curve-Aware Gaussian Splatting for 3D Parametric Curve Reconstruction Poster Session 6 & Exhibit Hall with Coffee Break
Zhirui Gao ⋅ Renjiao Yi ⋅ YaQiao Dai ⋅ Xuening Zhu ⋅ Wei Chen ⋅ Kai Xu ⋅ Chenyang Zhu
Exhibit Hall I #271
RadarSplat: Radar Gaussian Splatting for High-Fidelity Data Synthesis and 3D Reconstruction of Autonomous Driving Scenes Poster Session 6 & Exhibit Hall with Coffee Break
Pou-Chun Kung ⋅ Skanda Harisha ⋅ Ram Vasudevan ⋅ Aline Eid ⋅ Katherine A. Skinner
Exhibit Hall I #277
Tree Skeletonization from 3D Point Clouds by Denoising Diffusion Poster Session 6 & Exhibit Hall with Coffee Break
Elias Marks ⋅ Lucas Nunes ⋅ Federico Magistri ⋅ Matteo Sodano ⋅ Rodrigo Marcuzzi ⋅ Lars Zimmermann ⋅ Jens Behley ⋅ Cyrill Stachniss
Exhibit Hall I #278
Splat-LOAM: Gaussian Splatting LiDAR Odometry and Mapping Poster Session 6 & Exhibit Hall with Coffee Break
Emanuele Giacomini ⋅ Luca Di Giammarino ⋅ Lorenzo De Rebotti ⋅ Giorgio Grisetti ⋅ Martin R. Oswald
Exhibit Hall I #280
Purge-Gate: Efficient Backpropagation-Free Test-Time Adaptation for Point Clouds via Token purging Poster Session 6 & Exhibit Hall with Coffee Break
Moslem Yazdanpanah ⋅ Ali Bahri ⋅ Mehrdad Noori ⋅ Sahar Dastani ⋅ Gustavo Vargas Hakim ⋅ David OSOWIECHI ⋅ Ismail Ayed ⋅ Christian Desrosiers
Exhibit Hall I #281
AAA-Gaussians: Anti-Aliased and Artifact-Free 3D Gaussian Rendering Poster Session 6 & Exhibit Hall with Coffee Break
Michael Steiner ⋅ Thomas Köhler ⋅ Lukas Radl ⋅ Felix Windisch ⋅ Dieter Schmalstieg ⋅ Markus Steinberger
Exhibit Hall I #282
BridgeDepth: Bridging Monocular and Stereo Reasoning with Latent Alignment Poster Session 6 & Exhibit Hall with Coffee Break
Tongfan Guan ⋅ Jiaxin Guo ⋅ Chen Wang ⋅ Yun-Hui Liu
Exhibit Hall I #285
Neural Inverse Rendering for High-Accuracy 3D Measurement of Moving Objects with Fewer Phase-Shifting Patterns Poster Session 6 & Exhibit Hall with Coffee Break
Yuki Urakawa ⋅ Yoshihiro Watanabe
Exhibit Hall I #286
FlowR: Flowing from Sparse to Dense 3D Reconstructions Poster Session 6 & Exhibit Hall with Coffee Break
Tobias Fischer ⋅ Samuel Rota Bulò ⋅ Yung-Hsu Yang ⋅ Nikhil Keetha ⋅ Lorenzo Porzi ⋅ Norman Müller ⋅ Katja Schwarz ⋅ Jonathon Luiten ⋅ Marc Pollefeys ⋅ Peter Kontschieder
Exhibit Hall I #289
WorldScore: Unified Evaluation Benchmark for World Generation Poster Session 6 & Exhibit Hall with Coffee Break
Haoyi Duan ⋅ Hong-Xing Yu ⋅ Sirui Chen ⋅ Li Fei-Fei ⋅ Jiajun Wu
Exhibit Hall I #290
LightSwitch: Multi-view Relighting with Material-guided Diffusion Poster Session 6 & Exhibit Hall with Coffee Break
Yehonathan Litman ⋅ Fernando De la Torre ⋅ Shubham Tulsiani
Exhibit Hall I #293
Decoupled Diffusion Sparks Adaptive Scene Generation Poster Session 6 & Exhibit Hall with Coffee Break
Yunsong Zhou ⋅ Naisheng Ye ⋅ William Ljungbergh ⋅ Tianyu Li ⋅ Jiazhi Yang ⋅ Zetong Yang ⋅ Hongzi Zhu ⋅ Christoffer Petersson ⋅ Hongyang Li
Exhibit Hall I #294
Recover Biological Structure from Sparse-View Diffraction Images with Neural Volumetric Prior Poster Session 6 & Exhibit Hall with Coffee Break
Renzhi He ⋅ Haowen Zhou ⋅ Yubei Chen ⋅ Yi Xue
Exhibit Hall I #295
HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation Poster Session 6 & Exhibit Hall with Coffee Break
Xin Zhou ⋅ DINGKANG LIANG ⋅ Sifan Tu ⋅ Xiwu Chen ⋅ Yikang Ding ⋅ Dingyuan Zhang ⋅ Feiyang Tan ⋅ Hengshuang Zhao ⋅ Xiang Bai
Exhibit Hall I #299
Generative Active Learning for Long-tail Trajectory Prediction via Controllable Diffusion Model Poster Session 6 & Exhibit Hall with Coffee Break
Daehee Park ⋅ Monu Surana ⋅ Pranav Desai ⋅ Ashish Mehta ⋅ Reuben John ⋅ Kuk-Jin Yoon
Exhibit Hall I #301
QuickSplat: Fast 3D Surface Reconstruction via Learned Gaussian Initialization Poster Session 6 & Exhibit Hall with Coffee Break
Yueh-Cheng Liu ⋅ Lukas Höllein ⋅ Matthias Nießner ⋅ Angela Dai
Exhibit Hall I #302
SP2T: Sparse Proxy Attention for Dual-stream Point Transformer Poster Session 6 & Exhibit Hall with Coffee Break
Jiaxu Wan ⋅ Hong Zhang ⋅ Ziqi He ⋅ Yangyan Deng ⋅ Qishu Wang ⋅ Ding Yuan ⋅ Yifan Yang
Exhibit Hall I #305
Instant GaussianImage: A Generalizable and Self-Adaptive Image Representation via 2D Gaussian Splatting Poster Session 6 & Exhibit Hall with Coffee Break
Zhaojie Zeng ⋅ Yuesong Wang ⋅ Chao Yang ⋅ Tao Guan ⋅ Lili Ju
Exhibit Hall I #306
CF3: Compact and Fast 3D Feature Fields Poster Session 6 & Exhibit Hall with Coffee Break
Hyunjoon Lee ⋅ Joonkyu Min ⋅ Jaesik Park
Exhibit Hall I #307
When Anchors Meet Cold Diffusion: A Multi-Stage Approach to Lane Detection Poster Session 6 & Exhibit Hall with Coffee Break
Bo-Lun Huang ⋅ Tzu-Hsiang Ni ⋅ Feng-Kai Huang ⋅ Hong-Han Shuai ⋅ Wen-Huang Cheng
Exhibit Hall I #308
2D Gaussian Splatting-based Sparse-view Transparent Object Depth Reconstruction via Physics Simulation for Scene Update Poster Session 6 & Exhibit Hall with Coffee Break
Jeongyun Kim ⋅ Seunghoon Jeong ⋅ Giseop Kim ⋅ Myung-Hwan Jeon ⋅ Eunji Jun ⋅ Ayoung Kim
Exhibit Hall I #309
Faster and Better 3D Splatting via Group Training Poster Session 6 & Exhibit Hall with Coffee Break
Chengbo Wang ⋅ Guozheng Ma ⋅ Yizhen Lao ⋅ Yifei Xue
Exhibit Hall I #313
Sat2City: 3D City Generation from A Single Satellite Image with Cascaded Latent Diffusion Poster Session 6 & Exhibit Hall with Coffee Break
Tongyan Hua ⋅ Lutao Jiang ⋅ Ying-Cong Chen ⋅ Wufan Zhao
Exhibit Hall I #314
NeuFrameQ: Neural Frame Fields for Scalable and Generalizable Anisotropic Quadrangulation Poster Session 6 & Exhibit Hall with Coffee Break
Ying-Tian Liu ⋅ Jiajun Li ⋅ Yu-Tao Liu ⋅ Xin Yu ⋅ Yuan-Chen Guo ⋅ Yanpei Cao ⋅ Ding Liang ⋅ Ariel Shamir ⋅ Song-Hai Zhang
Exhibit Hall I #316
RTMap: Real-Time Recursive Mapping with Change Detection and Localization Poster Session 6 & Exhibit Hall with Coffee Break
Yuheng Du ⋅ Sheng Yang ⋅ Lingxuan Wang ⋅ Zhenghua.Hou Zhenghua.Hou ⋅ Chengying Cai ⋅ Zhitao Tan ⋅ Mingxia Chen ⋅ Shi-Sheng Huang ⋅ Qiang Li
Exhibit Hall I #318
Controllable 3D Outdoor Scene Generation via Scene Graphs Poster Session 6 & Exhibit Hall with Coffee Break
Yuheng Liu ⋅ Xinke Li ⋅ Yuning Zhang ⋅ Lu Qi ⋅ Xin Li ⋅ Wenping Wang ⋅ Chongshou Li ⋅ Xueting Li ⋅ Ming-Hsuan Yang
Exhibit Hall I #321
PolGS: Polarimetric Gaussian Splatting for Fast Reflective Surface Reconstruction Poster Session 6 & Exhibit Hall with Coffee Break
Yufei Han ⋅ Bowen Tie ⋅ Heng Guo ⋅ Youwei Lyu ⋅ Si Li ⋅ Boxin Shi ⋅ Yunpeng Jia ⋅ Zhanyu Ma
Exhibit Hall I #323
Driving View Synthesis on Free-form Trajectories with Generative Prior Poster Session 6 & Exhibit Hall with Coffee Break
Zeyu Yang ⋅ Zijie Pan ⋅ Yuankun Yang ⋅ Xiatian Zhu ⋅ Li Zhang
Exhibit Hall I #324
Wasserstein Style Distribution Analysis and Transform for Stylized Image Generation Poster Session 4 & Exhibit Hall with Coffee Break
Xi Yu ⋅ Xiang Gu ⋅ Zhihao Shi ⋅ Jian Sun
Exhibit Hall I #250
Constraint-Aware Feature Learning for Parametric Point Cloud Poster Session 6 & Exhibit Hall with Coffee Break
Xi Cheng ⋅ Ruiqi Lei ⋅ Di Huang ⋅ Zhichao Liao ⋅ Fengyuan Piao ⋅ Yan Chen ⋅ Pingfa Feng ⋅ Long ZENG
Exhibit Hall I #327
NeuraLeaf: Neural Parametric Leaf Models with Shape and Deformation Disentanglement Poster Session 6 & Exhibit Hall with Coffee Break
Yang Yang ⋅ Dongni Mao ⋅ Hiroaki Santo ⋅ Yasuyuki Matsushita ⋅ Fumio Okura
Exhibit Hall I #332
ZeroStereo: Zero-shot Stereo Matching from Single Images Poster Session 6 & Exhibit Hall with Coffee Break
Xianqi Wang ⋅ Hao Yang ⋅ Gangwei Xu ⋅ Junda Cheng ⋅ Min Lin ⋅ Yong Deng ⋅ Jinliang Zang ⋅ Yurui Chen ⋅ Xin Yang
Exhibit Hall I #333
CVFusion: Cross-View Fusion of 4D Radar and Camera for 3D Object Detection Poster Session 6 & Exhibit Hall with Coffee Break
Hanzhi Zhong ⋅ Zhiyu Xiang ⋅ Ruoyu Xu ⋅ Jingyun Fu ⋅ Peng Xu ⋅ Shaohong Wang ⋅ Zhihao Zhihao ⋅ Tianyu Pu ⋅ Eryun Liu
Exhibit Hall I #334
Stochastic Gradient Estimation for Higher-Order Differentiable Rendering Poster Session 6 & Exhibit Hall with Coffee Break
Zican Wang ⋅ Michael Fischer ⋅ Tobias Ritschel
Exhibit Hall I #335
CATSplat: Context-Aware Transformer with Spatial Guidance for Generalizable 3D Gaussian Splatting from A Single-View Image Poster Session 6 & Exhibit Hall with Coffee Break
Wonseok Roh ⋅ Hwanhee Jung ⋅ JongWook Kim ⋅ Seunggwan Lee ⋅ Innfarn Yoo ⋅ Andreas Lugmayr ⋅ Seunggeun Chi ⋅ Karthik Ramani ⋅ Sangpil Kim
Exhibit Hall I #338
Quadratic Gaussian Splatting: High Quality Surface Reconstruction with Second-order Geometric Primitives Poster Session 6 & Exhibit Hall with Coffee Break
ziyu zhang ⋅ Binbin Huang ⋅ Hanqing Jiang ⋅ Liyang Zhou ⋅ Xiaojun Xiang ⋅ Shuhan Shen
Exhibit Hall I #341
Uncertainty-Aware Diffusion-Guided Refinement of 3D Scenes Poster Session 6 & Exhibit Hall with Coffee Break
Sarosij Bose ⋅ Arindam Dutta ⋅ Sayak Nag ⋅ Junge Zhang ⋅ Jiachen Li ⋅ Konstantinos Karydis ⋅ Amit Roy-Chowdhury
Exhibit Hall I #342
Foresight in Motion: Reinforcing Trajectory Prediction with Reward Heuristics Poster Session 6 & Exhibit Hall with Coffee Break
Muleilan Pei ⋅ Shaoshuai Shi ⋅ Xuesong Chen ⋅ Xu Liu ⋅ Shaojie Shen
Exhibit Hall I #345
MAESTRO: Task-Relevant Optimization via Adaptive Feature Enhancement and Suppression for Multi-task 3D Perception Poster Session 6 & Exhibit Hall with Coffee Break
ChangWon Kang ⋅ Jisong Kim ⋅ Hongjae Shin ⋅ Junseo Park ⋅ Jun Won Choi
Exhibit Hall I #346
ToF-Splatting: Dense SLAM using Sparse Time-of-Flight Depth and Multi-Frame Integration Poster Session 6 & Exhibit Hall with Coffee Break
Andrea Conti ⋅ Matteo Poggi ⋅ Valerio Cambareri ⋅ Martin R. Oswald ⋅ Stefano Mattoccia
Exhibit Hall I #349
Joint Semantic and Rendering Enhancements in 3D Gaussian Modeling with Anisotropic Local Encoding Poster Session 6 & Exhibit Hall with Coffee Break
Jingming He ⋅ Chongyi Li ⋅ Shiqi Wang ⋅ Sam Kwong
Exhibit Hall I #350
Unsupervised Imaging Inverse Problems with Diffusion Distribution Matching Poster Session 6 & Exhibit Hall with Coffee Break
Giacomo Meanti ⋅ Thomas Ryckeboer ⋅ Michael Arbel ⋅ Julien Mairal
Exhibit Hall I #351
R-LiViT: A LiDAR-Visual-Thermal Dataset Enabling Vulnerable Road User Focused Roadside Perception Poster Session 6 & Exhibit Hall with Coffee Break
Jonas Mirlach ⋅ Lei Wan ⋅ Andreas Wiedholz ⋅ Hannan Keen ⋅ Andreas Eich
Exhibit Hall I #352
V2XScenes: A Multiple Challenging Traffic Conditions Dataset for Large-Range Vehicle-Infrastructure Collaborative Perception Poster Session 6 & Exhibit Hall with Coffee Break
Bowen Wang ⋅ Yafei Wang ⋅ Wei Gong ⋅ Siheng Chen ⋅ Genjia Liu ⋅ Minhao Xiong ⋅ Chin Long Ng
Exhibit Hall I #353
Robust 3D Object Detection using Probabilistic Point Clouds from Single-Photon LiDARs Poster Session 6 & Exhibit Hall with Coffee Break
Bhavya Goyal ⋅ Felipe Gutierrez-Barragan ⋅ Wei Lin ⋅ Andreas Velten ⋅ Yin Li ⋅ Mohit Gupta
Exhibit Hall I #358
SViM3D: Stable Video Material Diffusion for Single Image 3D Generation Poster Session 6 & Exhibit Hall with Coffee Break
Andreas Engelhardt ⋅ Mark Boss ⋅ Vikram Voleti ⋅ Chun-Han Yao ⋅ Hendrik Lensch ⋅ Varun Jampani
Exhibit Hall I #359
HouseCrafter: Lifting Floorplans to 3D Scenes with 2D Diffusion Models Poster Session 6 & Exhibit Hall with Coffee Break
YIWEN CHEN ⋅ Hieu (Hayden) Nguyen ⋅ Vikram Voleti ⋅ Varun Jampani ⋅ Huaizu Jiang
Exhibit Hall I #360
G2D: Boosting Multimodal Learning with Gradient-Guided Distillation Poster Session 1 & Exhibit Hall
Mohammed Rakib ⋅ Arunkumar Bagavathi
Exhibit Hall I #378
Leveraging BEV Paradigm for Ground-to-Aerial Image Synthesis Poster Session 6 & Exhibit Hall with Coffee Break
Junyan Ye ⋅ Jun He ⋅ Weijia Li ⋅ Zhutao Lv ⋅ Yi Lin ⋅ Jinhua Yu ⋅ Haote Yang ⋅ Conghui He
Exhibit Hall I #361
EMD: Explicit Motion Modeling for High-Quality Street Gaussian Splatting Poster Session 6 & Exhibit Hall with Coffee Break
Xiaobao Wei ⋅ Qingpo Wuwu ⋅ Zhongyu Zhao ⋅ Zhuangzhe Wu ⋅ Nan Huang ⋅ Ming Lu ⋅ ningning ma ⋅ Shanghang Zhang
Exhibit Hall I #362
Perspective-aware 3D Gaussian Inpainting with Multi-view Consistency Poster Session 6 & Exhibit Hall with Coffee Break
Yuxin CHENG ⋅ Binxiao Huang ⋅ Taiqiang Wu ⋅ Wenyong Zhou ⋅ Chenchen Ding ⋅ Zhengwu Liu ⋅ Graziano Chesi ⋅ Ngai Wong
Exhibit Hall I #366
SparseRecon: Neural Implicit Surface Reconstruction from Sparse Views with Feature and Depth Consistencies Poster Session 6 & Exhibit Hall with Coffee Break
Liang Han ⋅ Xu Zhang ⋅ Haichuan Song ⋅ Kanle Shi ⋅ Liang Han ⋅ Zhizhong Han
Exhibit Hall I #367
SAM4D: Segment Anything in Camera and LiDAR Streams Poster Session 6 & Exhibit Hall with Coffee Break
Jianyun Xu ⋅ Song Wang ⋅ Ziqian Ni ⋅ Chunyong Hu ⋅ Sheng Yang ⋅ Jianke Zhu ⋅ Qiang Li
Exhibit Hall I #369
Representing 3D Shapes With 64 Latent Vectors for 3D Diffusion Models Poster Session 6 & Exhibit Hall with Coffee Break
In Cho ⋅ Youngbeom Yoo ⋅ Subin Jeon ⋅ Seon Joo Kim
Exhibit Hall I #371
LINR-PCGC: Lossless Implicit Neural Representations for Point Cloud Geometry Compression Poster Session 6 & Exhibit Hall with Coffee Break
Wenjie Huang ⋅ Qi Yang ⋅ Shuting Xia ⋅ He Huang ⋅ Yiling Xu ⋅ Zhu Li
Exhibit Hall I #373
Interaction-Merged Motion Planning: Effectively Leveraging Diverse Motion Datasets for Robust Planning Poster Session 6 & Exhibit Hall with Coffee Break
Giwon Lee ⋅ Wooseong Jeong ⋅ Daehee Park ⋅ Jaewoo Jeong ⋅ Kuk-Jin Yoon
Exhibit Hall I #376
Communication-Efficient Multi-Vehicle Collaborative Semantic Segmentation via Sparse 3D Gaussian Sharing Poster Session 6 & Exhibit Hall with Coffee Break
Tianyu Hong ⋅ Xiaobo Zhou ⋅ Wenkai Hu ⋅ Qi Xie ⋅ Zhihui Ke ⋅ Tie Qiu
Exhibit Hall I #377
DATA: Domain-And-Time Alignment for High-Quality Feature Fusion in Collaborative Perception Poster Session 6 & Exhibit Hall with Coffee Break
Chengchang Tian ⋅ Jianwei Ma ⋅ Yan Huang ⋅ Zhanye Chen ⋅ Honghao Wei ⋅ Hui Zhang ⋅ Wei Hong
Exhibit Hall I #379
Hi-Gaussian: Hierarchical Gaussians under Normalized Spherical Projection for Single-View 3D Reconstruction Poster Session 6 & Exhibit Hall with Coffee Break
Binjian Xie ⋅ Pengju Zhang ⋅ Hao Wei ⋅ Yihong Wu
Exhibit Hall I #381
Towards More Diverse and Challenging Pre-training for Point Cloud Learning: Self-Supervised Cross Reconstruction with Decoupled Views Poster Session 6 & Exhibit Hall with Coffee Break
Xiangdong Zhang ⋅ Shaofeng Zhang ⋅ Junchi Yan
Exhibit Hall I #384
A Lesson in Splats: Teacher-Guided Diffusion for 3D Gaussian Splats Generation with 2D Supervision Poster Session 6 & Exhibit Hall with Coffee Break
Chensheng Peng ⋅ Ido Sobol ⋅ Masayoshi Tomizuka ⋅ Kurt Keutzer ⋅ Chenfeng Xu ⋅ Or Litany
Exhibit Hall I #385
MMAT-1M: A Large Reasoning Dataset for Multimodal Agent Tuning Poster Session 1 & Exhibit Hall
Tianhong Gao ⋅ Yannian Fu ⋅ Weiqun Wu ⋅ Haixiao Yue ⋅ Shanshan Liu ⋅ Gang Zhang
Exhibit Hall I #131
Extrapolated Urban View Synthesis Benchmark Poster Session 6 & Exhibit Hall with Coffee Break
Xiangyu Han ⋅ Zhen Jia ⋅ Boyi Li ⋅ Yan Wang ⋅ Boris Ivanovic ⋅ Yurong You ⋅ Lingjie Liu ⋅ Yue Wang ⋅ Marco Pavone ⋅ Chen Feng ⋅ Yiming Li
Exhibit Hall I #386
Heatmap Regression without Soft-Argmax for Facial Landmark Detection Poster Session 6 & Exhibit Hall with Coffee Break
Chiao-An Yang ⋅ Raymond A. Yeh
Exhibit Hall I #387
Demeter: A Parametric Model of Crop Plant Morphology from the Real World Poster Session 6 & Exhibit Hall with Coffee Break
Tianhang Cheng ⋅ Albert Zhai ⋅ Evan Chen ⋅ Rui Zhou ⋅ Yawen Deng ⋅ Zitong Li ⋅ Kejie Zhao ⋅ Janice Shiu ⋅ Qianyu Zhao ⋅ Yide Xu ⋅ Xinlei Wang ⋅ Yuan Shen ⋅ Sheng Wang ⋅ Lisa Ainsworth ⋅ Kaiyu Guan ⋅ Shenlong Wang
Exhibit Hall I #388
Mixed Signals: A Diverse Point Cloud Dataset for Heterogeneous LiDAR V2X Collaboration Poster Session 6 & Exhibit Hall with Coffee Break
Katie Luo ⋅ Minh-Quan Dao ⋅ Zhenzhen Liu ⋅ Mark Campbell ⋅ Wei-Lun (Harry) Chao ⋅ Kilian Weinberger ⋅ Ezio Malis ⋅ Vincent FREMONT ⋅ Bharath Hariharan ⋅ Mao Shan ⋅ Stewart Worrall ⋅ Julie Stephany Berrio Perez
Exhibit Hall I #390
Exploiting Vision Language Model for Training-Free 3D Point Cloud OOD Detection via Graph Score Propagation Poster Session 6 & Exhibit Hall with Coffee Break
Tiankai Chen ⋅ Yushu Li ⋅ Adam Goodge ⋅ Fei Teng ⋅ Xulei Yang ⋅ Tianrui Li ⋅ Xun Xu
Exhibit Hall I #393
FROSS: Faster-Than-Real-Time Online 3D Semantic Scene Graph Generation from RGB-D Images Poster Session 6 & Exhibit Hall with Coffee Break
Hao-Yu Hou ⋅ Chun-Yi Lee ⋅ Motoharu Sonogashira ⋅ Yasutomo Kawanishi
Exhibit Hall I #395
HUG: Hierarchical Urban Gaussian Splatting with Block-Based Reconstruction for Large-Scale Aerial Scenes Poster Session 6 & Exhibit Hall with Coffee Break
Mai Su ⋅ Zhongtao Wang ⋅ Huishan Au ⋅ Yilong Li ⋅ Xizhe Cao ⋅ Chengwei Pan ⋅ Yisong Chen ⋅ Guoping Wang
Exhibit Hall I #397
Unraveling the Effects of Synthetic Data on End-to-End Autonomous Driving Poster Session 6 & Exhibit Hall with Coffee Break
Junhao Ge ⋅ Zuhong Liu ⋅ Longteng Fan ⋅ Yifan Jiang ⋅ Jiaqi Su ⋅ Yiming Li ⋅ Zhejun Zhang ⋅ Siheng Chen
Exhibit Hall I #399
BANet: Bilateral Aggregation Network for Mobile Stereo Matching Poster Session 6 & Exhibit Hall with Coffee Break
Gangwei Xu ⋅ Jiaxin Liu ⋅ Xianqi Wang ⋅ Junda Cheng ⋅ Yong Deng ⋅ Jinliang Zang ⋅ Yurui Chen ⋅ Xin Yang
Exhibit Hall I #400
Puzzle Similarity: A Perceptually-guided Cross-Reference Metric for Artifact Detection in 3D Scene Reconstructions Poster Session 6 & Exhibit Hall with Coffee Break
Nicolai Hermann ⋅ Jorge Condor ⋅ Piotr Didyk
Exhibit Hall I #401
Authentic 4D Driving Simulation with a Video Generation Model Poster Session 6 & Exhibit Hall with Coffee Break
Lening Wang ⋅ Wenzhao Zheng ⋅ Dalong Du ⋅ Yunpeng Zhang ⋅ Yilong Ren ⋅ Han Jiang ⋅ Zhiyong Cui ⋅ Haiyang Yu ⋅ Jie Zhou ⋅ Shanghang Zhang
Exhibit Hall I #402
DONUT: A Decoder-Only Model for Trajectory Prediction Poster Session 6 & Exhibit Hall with Coffee Break
Markus Knoche ⋅ Daan de Geus ⋅ Bastian Leibe
Exhibit Hall I #403
Lidar Waveforms are Worth 40x128x33 Words Poster Session 6 & Exhibit Hall with Coffee Break
Dominik Scheuble ⋅ Hanno Holzhüter ⋅ Steven Peters ⋅ Mario Bijelic ⋅ Felix Heide
Exhibit Hall I #404
Spherical Epipolar Rectification for Deep Two-View Absolute Depth Estimation Poster Session 6 & Exhibit Hall with Coffee Break
Pierre-André Brousseau ⋅ Sébastien Roy
Exhibit Hall I #405
PanoSplatt3R: Leveraging Perspective Pretraining for Generalized Unposed Wide-Baseline Panorama Reconstruction Poster Session 6 & Exhibit Hall with Coffee Break
Jiahui Ren ⋅ Mochu Xiang ⋅ Jiajun Zhu ⋅ Yuchao Dai
Exhibit Hall I #408
GaussianOcc: Fully Self-supervised and Efficient 3D Occupancy Estimation with Gaussian Splatting Poster Session 6 & Exhibit Hall with Coffee Break
Wanshui Gan ⋅ Fang Liu ⋅ Hongbin Xu ⋅ Ningkai Mo ⋅ Naoto Yokoya
Exhibit Hall I #410
GeoSplatting: Towards Geometry Guided Gaussian Splatting for Physically-based Inverse Rendering Poster Session 6 & Exhibit Hall with Coffee Break
Kai Ye ⋅ Chong Gao ⋅ Guanbin Li ⋅ Wenzheng Chen ⋅ Baoquan Chen
Exhibit Hall I #411
Wide2Long: Learning Lens Compression and Perspective Adjustment for Wide-Angle to Telephoto Translation Poster Session 6 & Exhibit Hall with Coffee Break
Soumyadipta Banerjee ⋅ Jiaul Paik ⋅ Debashis Sen
Exhibit Hall I #412
EVER: Exact Volumetric Ellipsoid Rendering for Real-time View Synthesis Poster Session 2 & Exhibit Hall with Coffee Break
Alexander Mai ⋅ Peter Hedman ⋅ George Kopanas ⋅ Dor Verbin ⋅ David Futschik ⋅ Qiangeng Xu ⋅ Falko Kuester ⋅ Jonathan Barron ⋅ Yinda Zhang
Exhibit Hall I #381
LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion Poster Session 6 & Exhibit Hall with Coffee Break
Fangfu Liu ⋅ Hao Li ⋅ Jiawei Chi ⋅ Hanyang Wang ⋅ Minghui Yang ⋅ Fudong Wang ⋅ Yueqi Duan
Exhibit Hall I #413
Leveraging 2D Priors and SDF Guidance for Urban Scene Rendering Poster Session 6 & Exhibit Hall with Coffee Break
Siddharth Tourani ⋅ Jayaram Reddy ⋅ Akash Kumbar ⋅ Satyajit Tourani ⋅ Nishant Goyal ⋅ Madhava Krishna ⋅ Dinesh Reddy Narapureddy ⋅ Muhammad Haris Khan
Exhibit Hall I #417
LBM: Latent Bridge Matching for Fast Image-to-Image Translation Poster Session 6 & Exhibit Hall with Coffee Break
Clément Chadebec ⋅ Onur Tasar ⋅ Sanjeev Sreetharan ⋅ Benjamin Aubin
Exhibit Hall I #420
SparseLaneSTP: Leveraging Spatio-Temporal Priors with Sparse Transformers for 3D Lane Detection Poster Session 6 & Exhibit Hall with Coffee Break
Maximilian Pittner ⋅ Joel Janai ⋅ Mario Faigle ⋅ Alexandru Condurache
Exhibit Hall I #421
Relative Illumination Fields: Learning Medium and Light Independent Underwater Scenes Poster Session 6 & Exhibit Hall with Coffee Break
Mengkun She ⋅ Felix Seegräber ⋅ David Nakath ⋅ Patricia Schöntag ⋅ Kevin Köser
Exhibit Hall I #422
Super Resolved Imaging with Adaptive Optics Poster Session 6 & Exhibit Hall with Coffee Break
Robin Swanson ⋅ Esther Y. H. Lin ⋅ Masen Lamb ⋅ Suresh Sivanandam ⋅ Kiriakos N. Kutulakos
Exhibit Hall I #425
HVPUNet: Hybrid-Voxel Point-cloud Upsampling Network Poster Session 6 & Exhibit Hall with Coffee Break
Juhyung Ha ⋅ Vibhas Vats ⋅ Alimoor Reza ⋅ Soon-heung Jung ⋅ David Crandall
Exhibit Hall I #426
Stealthy Backdoor Attack in Federated Learning via Adaptive Layer-wise Gradient Alignment Poster Session 6 & Exhibit Hall with Coffee Break
Qingqian Yang ⋅ Peishen Yan ⋅ Xiaoyu Wu ⋅ Jiaru Zhang ⋅ Tao Song ⋅ Yang Hua ⋅ Hao Wang ⋅ Liangliang Wang ⋅ Haibing Guan
Exhibit Hall I #427
VMem: Consistent Interactive Video Scene Generation with Surfel-Indexed View Memory Poster Session 6 & Exhibit Hall with Coffee Break
Runjia Li ⋅ Philip Torr ⋅ Andrea Vedaldi ⋅ Tomas Jakab
Exhibit Hall I #93
Back on Track: Bundle Adjustment for Dynamic Scene Reconstruction Poster Session 2 & Exhibit Hall with Coffee Break
Weirong Chen ⋅ Ganlin Zhang ⋅ Felix Wimbauer ⋅ Rui Wang ⋅ Nikita Araslanov ⋅ Andrea Vedaldi ⋅ Daniel Cremers
Exhibit Hall I #75
Self-Ensembling Gaussian Splatting for Few-Shot Novel View Synthesis Poster Session 2 & Exhibit Hall with Coffee Break
Chen Zhao ⋅ Xuan Wang ⋅ Tong Zhang ⋅ Saqib Javed ⋅ Mathieu Salzmann
Exhibit Hall I #147
Importance-Based Token Merging for Efficient Image and Video Generation Poster Session 2 & Exhibit Hall with Coffee Break
Haoyu Wu ⋅ Jingyi Xu ⋅ Hieu Le ⋅ Dimitris Samaras
Exhibit Hall I #303
Knowledge Distillation for Learned Image Compression Poster Session 2 & Exhibit Hall with Coffee Break
Yunuo Chen ⋅ Zezheng Lyu ⋅ Bing He ⋅ Ning Cao ⋅ Gang chen ⋅ Guo Lu ⋅ Wenjun Zhang
Exhibit Hall I #304
Variance-Based Pruning for Accelerating and Compressing Trained Networks Poster Session 2 & Exhibit Hall with Coffee Break
Uranik Berisha ⋅ Jens Mehnert ⋅ Alexandru Condurache
Exhibit Hall I #382
LoftUp: Learning a Coordinate-Based Feature Upsampler for Vision Foundation Models Poster Session 3 & Exhibit Hall
Haiwen Huang ⋅ Anpei Chen ⋅ Volodymyr Havrylov ⋅ Andreas Geiger ⋅ Dan Zhang
Exhibit Hall I #75
MaskControl: Spatio-Temporal Control for Masked Motion Synthesis Poster Session 3 & Exhibit Hall
Ekkasit Pinyoanuntapong ⋅ Muhammad Usama Saleem ⋅ Korrawe Karunratanakul ⋅ Pu Wang ⋅ Hongfei Xue ⋅ Chen Chen ⋅ chuan guo ⋅ Junli Cao ⋅ Jian Ren ⋅ Sergey Tulyakov
Exhibit Hall I #148
RS-vHeat: Heat Conduction Guided Efficient Remote Sensing Foundation Model Poster Session 3 & Exhibit Hall
Huiyang Hu ⋅ Peijin Wang ⋅ Hanbo Bi ⋅ Boyuan Tong ⋅ Zhaozhi Wang ⋅ Wenhui Diao ⋅ Hao Chang ⋅ Yingchao Feng ⋅ Ziqi Zhang ⋅ Yaowei Wang ⋅ Qixiang Ye ⋅ Kun Fu ⋅ Xian Sun
Exhibit Hall I #149
HairCUP: Hair Compositional Universal Prior for 3D Gaussian Avatars Poster Session 3 & Exhibit Hall
Byungjun Kim ⋅ Shunsuke Saito ⋅ Giljoo Nam ⋅ Tomas Simon ⋅ Jason Saragih ⋅ Hanbyul Joo ⋅ Junxuan Li
Exhibit Hall I #223
Understanding Co-speech Gestures in-the-wild Poster Session 3 & Exhibit Hall
Sindhu Hegde ⋅ K R Prajwal ⋅ Taein Kwon ⋅ Andrew Zisserman
Exhibit Hall I #302
DPoser-X: Diffusion Model as Robust 3D Whole-body Human Pose Prior Poster Session 3 & Exhibit Hall
Junzhe Lu ⋅ Jing Lin ⋅ Hongkun Dou ⋅ Ailing Zeng ⋅ Yue Deng ⋅ Xian Liu ⋅ Zhongang Cai ⋅ Lei Yang ⋅ YULUN ZHANG ⋅ Haoqian Wang ⋅ Ziwei Liu
Exhibit Hall I #377
Towards a Unified Copernicus Foundation Model for Earth Vision Poster Session 3 & Exhibit Hall
Yi Wang ⋅ Zhitong Xiong ⋅ Chenying Liu ⋅ Adam Stewart ⋅ Thomas Dujardin ⋅ Nikolaos Ioannis Bountos ⋅ Angelos Zavras ⋅ Franziska Gerken ⋅ Ioannis Papoutsis ⋅ Laura Leal-Taixé ⋅ Xiao Xiang Zhu
Exhibit Hall I #449
Teeth Reconstruction and Performance Capture Using a Phone Camera Poster Session 3 & Exhibit Hall
Weixi Zheng ⋅ Jingwang Ling ⋅ Zhibo Wang ⋅ Quan Wang ⋅ Feng Xu
Exhibit Hall I #450
ReCamMaster: Camera-Controlled Generative Rendering from A Single Video Poster Session 4 & Exhibit Hall with Coffee Break
Jianhong Bai ⋅ Menghan Xia ⋅ Xiao Fu ⋅ Xintao Wang ⋅ Lianrui Mu ⋅ Jinwen Cao ⋅ Zuozhu Liu ⋅ Haoji Hu ⋅ Xiang Bai ⋅ Pengfei Wan ⋅ Di ZHANG
Exhibit Hall I #74
Spatially-Varying Autofocus Poster Session 6 & Exhibit Hall with Coffee Break
Yingsi Qin ⋅ Aswin Sankaranarayanan ⋅ Matthew O'Toole
Exhibit Hall I #74
SparseFlex: High-Resolution and Arbitrary-Topology 3D Shape Modeling Poster Session 4 & Exhibit Hall with Coffee Break
Xianglong He ⋅ Zi-Xin Zou ⋅ Chia Hao Chen ⋅ Yuan-Chen Guo ⋅ Ding Liang ⋅ Chun Yuan ⋅ Wanli Ouyang ⋅ Yanpei Cao ⋅ Yangguang Li
Exhibit Hall I #75
RePoseD: Efficient Relative Pose Estimation With Known Depth Information Poster Session 4 & Exhibit Hall with Coffee Break
Yaqing Ding ⋅ Viktor Kocur ⋅ VACLAV VAVRA ⋅ Zuzana Berger Haladova ⋅ jian Yang ⋅ Torsten Sattler ⋅ Zuzana Kukelova
Exhibit Hall I #151
Diving into the Fusion of Monocular Priors for Generalized Stereo Matching Poster Session 4 & Exhibit Hall with Coffee Break
Chengtang Yao ⋅ Lidong Yu ⋅ Zhidan Liu ⋅ Jiaxi Zeng ⋅ Yuwei Wu ⋅ Yunde Jia
Exhibit Hall I #152
Forecasting Continuous Non-Conservative Dynamical Systems in SO(3) Poster Session 4 & Exhibit Hall with Coffee Break
Lennart Bastian ⋅ Mohammad Rashed ⋅ Nassir Navab ⋅ Tolga Birdal
Exhibit Hall I #226
Dynamic Typography: Bringing Text to Life via Video Diffusion Prior Poster Session 4 & Exhibit Hall with Coffee Break
Zichen Liu ⋅ Yihao Meng ⋅ Hao Ouyang ⋅ Yue Yu ⋅ Bolin Zhao ⋅ Daniel Cohen-Or ⋅ Huamin Qu
Exhibit Hall I #227
Certifiably Optimal Anisotropic Rotation Averaging Poster Session 4 & Exhibit Hall with Coffee Break
Carl Olsson ⋅ Yaroslava Lochman ⋅ Johan Malmport ⋅ Christopher Zach
Exhibit Hall I #305
MIORe & VAR-MIORe: Benchmarks to Push the Boundaries of Restoration Poster Session 5 & Exhibit Hall
George Ciubotariu ⋅ Zhuyun Zhou ⋅ Zongwei Wu ⋅ Radu Timofte
Exhibit Hall I #155
MikuDance: Animating Character Art with Mixed Motion Dynamics Poster Session 5 & Exhibit Hall
Jiaxu Zhang ⋅ Xianfang Zeng ⋅ Xin Chen ⋅ Wei Zuo ⋅ Gang YU ⋅ Zhigang Tu
Exhibit Hall I #156
ROAR: Reducing Inversion Error in Generative Image Watermarking Poster Session 5 & Exhibit Hall
Hanyi Wang ⋅ Han Fang ⋅ Shi-Lin Wang ⋅ Ee-Chien Chang
Exhibit Hall I #230
Diffusion Transformer meets Multi-level Wavelet Spectrum for Single Image Super-Resolution Poster Session 5 & Exhibit Hall
Peng Du ⋅ Hui Li ⋅ Han Xu ⋅ Paul Jeon ⋅ Dongwook Lee ⋅ Daehyun Ji ⋅ Ran Yang ⋅ Feng Zhu
Exhibit Hall I #303
Automated Model Evaluation for Object Detection via Prediction Consistency and Reliability Poster Session 5 & Exhibit Hall
Seungju Yoo ⋅ Hyuk Kwon ⋅ Joong-Won Hwang ⋅ Kibok Lee
Exhibit Hall I #304
LOTS of Fashion! Multi-Conditioning for Image Generation via Sketch-Text Pairing Poster Session 5 & Exhibit Hall
Federico Girella ⋅ Davide Talon ⋅ Ziyue Liu ⋅ Zanxi Ruan ⋅ Yiming Wang ⋅ Marco Cristani
Exhibit Hall I #376
FlowEdit: Inversion-Free Text-Based Editing Using Pre-Trained Flow Models Poster Session 5 & Exhibit Hall
Vladimir Kulikov ⋅ Matan Kleiner ⋅ Inbar Huberman-Spiegelglas ⋅ Tomer Michaeli
Exhibit Hall I #452
LayerTracer: Cognitive-Aligned Layered SVG Synthesis via Diffusion Transformer Poster Session 5 & Exhibit Hall
Yiren Song ⋅ Danze Chen ⋅ Mike Zheng Shou
Exhibit Hall I #453
SuperDec: 3D Scene Decomposition with Superquadrics Primitives Poster Session 6 & Exhibit Hall with Coffee Break
Elisabetta Fedele ⋅ Boyang Sun ⋅ Francis Engelmann ⋅ Marc Pollefeys ⋅ Leonidas Guibas
Exhibit Hall I #144
E-SAM: Training-Free Segment Every Entity Model Poster Session 6 & Exhibit Hall with Coffee Break
WEIMING ZHANG ⋅ Dingwen Xiao ⋅ Lei Chen ⋅ Lin Wang
Exhibit Hall I #219
Online Reasoning Video Segmentation with Just-in-Time Digital Twins Poster Session 6 & Exhibit Hall with Coffee Break
Yiqing Shen ⋅ Bohan Liu ⋅ Chenjia Li ⋅ Lalithkumar Seenivasan ⋅ Mathias Unberath
Exhibit Hall I #220
Towards Foundational Models for Single-Chip Radar Poster Session 6 & Exhibit Hall with Coffee Break
Tianshu Huang ⋅ Akarsh Prabhakara ⋅ Chuhan Chen ⋅ Jay Karhade ⋅ Deva Ramanan ⋅ Matthew O'Toole ⋅ Anthony Rowe
Exhibit Hall I #287
Make Your Training Flexible: Towards Deployment-Efficient Video Models Poster Session 5 & Exhibit Hall
Chenting Wang ⋅ Kunchang Li ⋅ Tianxiang Jiang ⋅ Xiangyu Zeng ⋅ Yi Wang ⋅ Limin Wang
Exhibit Hall I #383
M2SFormer: Multi-Spectral and Multi-Scale Attention with Edge-Aware Difficulty Guidance for Image Forgery Localization Poster Session 4 & Exhibit Hall with Coffee Break
Ju-Hyeon Nam ⋅ Dong-Hyun Moon ⋅ Sang-Chul Lee
Exhibit Hall I #99
Articulate3D: Holistic Understanding of 3D Scenes as Universal Scene Description Poster Session 2 & Exhibit Hall with Coffee Break
Anna-Maria Halacheva ⋅ Yang Miao ⋅ Jan-Nico Zaech ⋅ Xi Wang ⋅ Luc Gool ⋅ Danda Pani Paudel
Exhibit Hall I #57
What You Have is What You Track: Adaptive and Robust Multimodal Tracking Poster Session 1 & Exhibit Hall
Yuedong Tan ⋅ Jiawei Shao ⋅ Eduard Zamfir ⋅ Ruanjun Li ⋅ Zhaochong An ⋅ Chao Ma ⋅ Danda Pani Paudel ⋅ Luc Gool ⋅ Radu Timofte ⋅ Zongwei Wu
Exhibit Hall I #321
Low-Light Image Enhancement using Event-Based Illumination Estimation Poster Session 2 & Exhibit Hall with Coffee Break
Lei Sun ⋅ Yuhan Bao ⋅ Jiajun Zhai ⋅ Jingyun Liang ⋅ YULUN ZHANG ⋅ Kaiwei Wang ⋅ Danda Pani Paudel ⋅ Luc Gool
Exhibit Hall I #156
Multi-Modal Few-Shot Temporal Action Segmentation Poster Session 3 & Exhibit Hall
Zijia Lu ⋅ Ehsan Elhamifar
Exhibit Hall I #387
WildSAT: Learning Satellite Image Representations from Wildlife Observations Poster Session 2 & Exhibit Hall with Coffee Break
Rangel Daroya ⋅ Elijah Cole ⋅ Oisin Mac Aodha ⋅ Grant Horn ⋅ Subhransu Maji
Exhibit Hall I #105
Forgetting Through Transforming: Enabling Federated Unlearning via Class-Aware Representation Transformation Poster Session 1 & Exhibit Hall
Qi Guo ⋅ Zhen Tian ⋅ Minghao Yao ⋅ Saiyu Qi ⋅ Yong Qi ⋅ Bingyi Liu
Exhibit Hall I #130
SU-RGS: Relightable 3D Gaussian Splatting from Sparse Views under Unconstrained Illuminations Poster Session 6 & Exhibit Hall with Coffee Break
Qi Zhang ⋅ Chi Huang ⋅ Qian Zhang ⋅ Nan Li ⋅ Wei Feng
Exhibit Hall I #206
SpectralAR: Spectral Autoregressive Visual Generation Poster Session 4 & Exhibit Hall with Coffee Break
Yuanhui Huang ⋅ Weiliang Chen ⋅ Wenzhao Zheng ⋅ Yueqi Duan ⋅ Jie Zhou ⋅ Jiwen Lu
Exhibit Hall I #91
Sibai: A Few-Shot Meta-Classifier for Poisoning Detection in Federated Learning Poster Session 1 & Exhibit Hall
Melanie Götz ⋅ Torsten Krauß ⋅ Alexandra Dmitrienko
Exhibit Hall I #352
Gradient Extrapolation for Debiased Representation Learning Poster Session 1 & Exhibit Hall
Ihab Asaad ⋅ Maha Shadaydeh ⋅ Joachim Denzler
Exhibit Hall I #355
Supercharging Floorplan Localization with Semantic Rays Poster Session 6 & Exhibit Hall with Coffee Break
Yuval Grader ⋅ Hadar Averbuch-Elor
Exhibit Hall I #232
Learning Streaming Video Representation via Multitask Training Poster Session 3 & Exhibit Hall
Yibin Yan ⋅ Jilan Xu ⋅ Shangzhe Di ⋅ Yikun Liu ⋅ Yudi Shi ⋅ Qirui Chen ⋅ Zeqian Li ⋅ Yifei Huang ⋅ Weidi Xie
Exhibit Hall I #224
InstantEdit: Text-Guided Few-Step Image Editing with Piecewise Rectified Flow Poster Session 4 & Exhibit Hall with Coffee Break
Yiming Gong ⋅ Zhen Zhu ⋅ Minjia Zhang
Exhibit Hall I #184
World4Drive: End-to-End Autonomous Driving via Intention-aware Physical Latent World Model Poster Session 6 & Exhibit Hall with Coffee Break
Yupeng Zheng ⋅ Pengxuan Yang ⋅ Zebin Xing ⋅ Qichao Zhang ⋅ Yuhang Zheng ⋅ Yinfeng Gao ⋅ Pengfei Li ⋅ Teng Zhang ⋅ Zhongpu Xia ⋅ Peng Jia ⋅ XianPeng Lang ⋅ Dongbin Zhao
Exhibit Hall I #378
CoHD: A Counting-Aware Hierarchical Decoding Framework for Generalized Referring Expression Segmentation Poster Session 5 & Exhibit Hall
Zhuoyan Luo ⋅ Yinghao Wu ⋅ Tianheng Cheng ⋅ Yong Liu ⋅ Yicheng Xiao ⋅ Hongfa Wang ⋅ Xiao-Ping Zhang ⋅ Yujiu Yang
Exhibit Hall I #271
Scaling Transformer-Based Novel View Synthesis with Models Token Disentanglement and Synthetic Data Poster Session 6 & Exhibit Hall with Coffee Break
Nithin Gopalakrishnan Nair ⋅ Srinivas Kaza ⋅ Xuan Luo ⋅ Jungyeon Park ⋅ Stephen Lombardi ⋅ Vishal Patel
Exhibit Hall I #372
Learning to See in the Extremely Dark Poster Session 2 & Exhibit Hall with Coffee Break
Hai Jiang ⋅ Binhao Guan ⋅ Zhen Liu ⋅ Xiaohong Liu ⋅ Jian Yu ⋅ Zheng Liu ⋅ Songchen Han ⋅ Shuaicheng Liu
Exhibit Hall I #250
Customizing Domain Adapters for Domain Generalization Poster Session 1 & Exhibit Hall
Yuyang Ji ⋅ Zeyi Huang ⋅ Haohan Wang ⋅ Yong Jae Lee
Exhibit Hall I #80
BATCLIP: Bimodal Online Test-Time Adaptation for CLIP Poster Session 1 & Exhibit Hall
Sarthak Kumar Maharana ⋅ Baoming Zhang ⋅ Leonid Karlinsky ⋅ Rogerio Feris ⋅ Yunhui Guo
Exhibit Hall I #139
BillBoard Splatting (BBSplat): Learnable Textured Primitives for Novel View Synthesis Poster Session 6 & Exhibit Hall with Coffee Break
David Svitov ⋅ Pietro Morerio ⋅ Lourdes Agapito ⋅ ALESSIO DEL BUE
Exhibit Hall I #29
Vivid4D: Improving 4D Reconstruction from Monocular Video by Video Inpainting Poster Session 3 & Exhibit Hall
Jiaxin Huang ⋅ Sheng Miao ⋅ Bangbang Yang ⋅ Yuewen Ma ⋅ Yiyi Liao
Exhibit Hall I #244
MemoryTalker: Personalized Speech-Driven 3D Facial Animation via Audio-Guided Stylization Poster Session 3 & Exhibit Hall
Hyung Kyu Kim ⋅ Sangmin Lee ⋅ HAK GU KIM
Exhibit Hall I #116
DriveX: Omni Scene Modeling for Learning Generalizable World Knowledge in Autonomous Driving Poster Session 6 & Exhibit Hall with Coffee Break
Chen Shi ⋅ Shaoshuai Shi ⋅ Kehua Sheng ⋅ Bo Zhang ⋅ Li Jiang
Exhibit Hall I #375
MamV2XCalib: V2X-based Target-less Infrastructure Camera Calibration with State Space Model Poster Session 6 & Exhibit Hall with Coffee Break
Yaoye Zhu ⋅ Zhe Wang ⋅ Yan Wang
Exhibit Hall I #191
SiM3D: Single-instance Multiview Multimodal and Multisetup 3D Anomaly Detection Benchmark Poster Session 5 & Exhibit Hall
Alex Costanzino ⋅ Pierluigi Zama Ramirez ⋅ Luigi Lella ⋅ Matteo Ragaglia ⋅ Alessandro Oliva ⋅ Giuseppe Lisanti ⋅ Luigi Stefano
Exhibit Hall I #108
Image as an IMU: Estimating Camera Motion from a Single Motion-Blurred Image Poster Session 1 & Exhibit Hall
Jerred Chen ⋅ Ronald Clark
Exhibit Hall I #228
SUB: Benchmarking CBM Generalization via Synthetic Attribute Substitutions Poster Session 5 & Exhibit Hall
Jessica Bader ⋅ Leander Girrbach ⋅ Stephan Alaniz ⋅ Zeynep Akata
Exhibit Hall I #320
PARTE: Part-Guided Texturing for 3D Human Reconstruction from a Single Image Poster Session 2 & Exhibit Hall with Coffee Break
Hyeongjin Nam ⋅ Donghwan Kim ⋅ Gyeongsik Moon ⋅ Kyoung Mu Lee
Exhibit Hall I #332
Cross-Subject Mind Decoding from Inaccurate Representations Poster Session 4 & Exhibit Hall with Coffee Break
Yangyang Xu ⋅ Bangzhen Liu ⋅ Wenqi Shao ⋅ Yong Du ⋅ Shengfeng He ⋅ Tingting Zhu
Exhibit Hall I #17
Boosting MLLM Reasoning with Text-Debiased Hint-GRPO Poster Session 1 & Exhibit Hall
Qihan Huang ⋅ Weilong Dai ⋅ Jinlong Liu ⋅ Wanggui He ⋅ Hao Jiang ⋅ Mingli Song ⋅ Jingyuan CHEN ⋅ Chang Yao ⋅ Jie Song
Exhibit Hall I #455
Adaptive Dual Uncertainty Optimization: Boosting Monocular 3D Object Detection under Test-Time Shifts Poster Session 2 & Exhibit Hall with Coffee Break
Zixuan Hu ⋅ Dongxiao Li ⋅ Xinzhu Ma ⋅ SHIXIANG TANG ⋅ Xiaotong Li ⋅ Wenhan Yang ⋅ LINGYU DUAN
Exhibit Hall I #211
SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation Poster Session 4 & Exhibit Hall with Coffee Break
Junsong Chen ⋅ Shuchen Xue ⋅ Yuyang Zhao ⋅ Jincheng YU ⋅ Sayak Paul ⋅ Junyu Chen ⋅ Han Cai ⋅ Enze Xie ⋅ Song Han
Exhibit Hall I #123
AirCache: Activating Inter-modal Relevancy KV Cache Compression for Efficient Large Vision-Language Model Inference Poster Session 5 & Exhibit Hall
Kai Huang ⋅ hao zou ⋅ Bochen Wang ⋅ Xi Ye ⋅ Zhen Xie ⋅ Hao Wang
Exhibit Hall I #390
LLM-enhanced Action-aware Multi-modal Prompt Tuning for Image-Text Matching Poster Session 5 & Exhibit Hall
Meng Tian ⋅ Shuo Yang ⋅ Xinxiao Wu
Exhibit Hall I #90
UKBOB: One Billion MRI Labeled Masks for Generalizable 3D Medical Image Segmentation Poster Session 5 & Exhibit Hall
Emmanuelle Bourigault ⋅ Amir Jamaludin ⋅ Abdullah Hamdi
Exhibit Hall I #169
FlowStyler: Artistic Video Stylization via Transformation Fields Transports Poster Session 3 & Exhibit Hall
YuNing Gong ⋅ Jiaming Chen ⋅ Xiaohua Ren ⋅ Yuanjun Liao ⋅ Yanci Zhang
Exhibit Hall I #21
ShadowHack: Hacking Shadows via Luminance-Color Divide and Conquer Poster Session 3 & Exhibit Hall
Jin Hu ⋅ Mingjia Li ⋅ Xiaojie Guo
Exhibit Hall I #131
Harnessing Massive Satellite Imagery with Efficient Masked Image Modeling Poster Session 2 & Exhibit Hall with Coffee Break
Fengxiang Wang ⋅ Hongzhen Wang ⋅ Di Wang ⋅ Zonghao Guo ⋅ Zhenyu Zhong ⋅ Long Lan ⋅ Wenjing Yang ⋅ Jing Zhang
Exhibit Hall I #180
Beyond Losses Reweighting: Empowering Multi-Task Learning via the Generalization Perspective Poster Session 1 & Exhibit Hall
Hoang Phan ⋅ Tung Lam Tran ⋅ Quyen Tran ⋅ Ngoc Tran ⋅ Tuan Truong ⋅ Qi Lei ⋅ Nhat Ho ⋅ Dinh Phung ⋅ Trung Le
Exhibit Hall I #222
StableCodec: Taming One-Step Diffusion for Extreme Image Compression Poster Session 4 & Exhibit Hall with Coffee Break
Tianyu Zhang ⋅ Xin Luo ⋅ Li Li ⋅ Dong Liu
Exhibit Hall I #239
FastJSMA: Accelerating Jacobian-based Saliency Map Attacks through Gradient Decoupling Poster Session 1 & Exhibit Hall
Zhenghao Gao ⋅ Shengjie Xu ⋅ Zijing Li ⋅ Meixi Chen ⋅ Chaojian Yu ⋅ Yuanjie Shao ⋅ Changxin Gao
Exhibit Hall I #133
Toward Fair and Accurate Cross-Domain Medical Image Segmentation: A VLM-Driven Active Domain Adaptation Paradigm Poster Session 5 & Exhibit Hall
Hongqiu Wang ⋅ Wu Chen ⋅ Xiangde Luo ⋅ Zhaohu Xing ⋅ Lihao Liu ⋅ Jing Qin ⋅ Shaozhi Wu ⋅ Lei Zhu
Exhibit Hall I #403
Decouple to Reconstruct: High Quality UHD Restoration via Active Feature Disentanglement and Reversible Fusion Poster Session 3 & Exhibit Hall
Yidi Liu ⋅ Dong Li ⋅ Yuxin Ma ⋅ Jie Huang ⋅ Wenlong Zhang ⋅ Xueyang Fu ⋅ Zheng-Jun Zha
Exhibit Hall I #153
Federated Continuous Category Discovery and Learning Poster Session 1 & Exhibit Hall
Lixu Wang ⋅ Chenxi Liu ⋅ Junfeng Guo ⋅ Qingqing Ye ⋅ Heng Huang ⋅ Haibo Hu ⋅ Wei Dong
Exhibit Hall I #221
Are They the Same? Exploring Visual Correspondence Shortcomings of Multimodal LLMs Poster Session 4 & Exhibit Hall with Coffee Break
Yikang Zhou ⋅ Tao Zhang ⋅ Shilin Xu ⋅ Shihao Chen ⋅ Qianyu Zhou ⋅ Yunhai Tong ⋅ Shunping Ji ⋅ Jiangning Zhang ⋅ Lu Qi ⋅ Xiangtai Li
Exhibit Hall I #266
Consensus-Driven Active Model Selection Poster Session 1 & Exhibit Hall
Justin Kay ⋅ Grant Horn ⋅ Subhransu Maji ⋅ Daniel Sheldon ⋅ Sara Beery
Exhibit Hall I #431
BlueNeg: A 35mm Negative Film Dataset for Restoring Channel-Heterogeneous Deterioration Poster Session 3 & Exhibit Hall
Hanyuan Liu ⋅ Chengze Li ⋅ Minshan Xie ⋅ Wang Zhenni ⋅ Jiawen Liang ⋅ Chi LEUNG ⋅ Tien-Tsin Wong
Exhibit Hall I #293
Rethinking Key-frame-based Micro-expression Recognition: A Robust and Accurate Framework Against Key-frame Errors Poster Session 3 & Exhibit Hall
Zheyuan Zhang ⋅ Weihao Tang ⋅ Hong Chen
Exhibit Hall I #213
Make Me Happier: Evoking Emotions Through Image Diffusion Models Poster Session 4 & Exhibit Hall with Coffee Break
Qing Lin ⋅ Jingfeng Zhang ⋅ YEW-SOON ONG ⋅ Mengmi Zhang
Exhibit Hall I #140
Pretend Benign: A Stealthy Adversarial Attack by Exploiting Vulnerabilities in Cooperative Perception Poster Session 5 & Exhibit Hall
Hongwei Lin ⋅ Dongyu Pan ⋅ Qiming Xia ⋅ Hai Wu ⋅ Cheng Wang ⋅ Siqi Shen ⋅ Chenglu Wen
Exhibit Hall I #14
What we need is explicit controllability: Training 3D gaze estimator using only facial images Poster Session 3 & Exhibit Hall
Tingwei Li ⋅ Jun Bao ⋅ Zhenzhong Kuang ⋅ Buyu Liu
Exhibit Hall I #132
SemiVisBooster: Boosting Semi-Supervised Learning for Fine-Grained Classification through Pseudo-Label Semantic Guidance Poster Session 1 & Exhibit Hall
Wenjin Zhang ⋅ Xinyu Li ⋅ Chenyang Gao ⋅ Ivan Marsic
Exhibit Hall I #104
OpenAnimals: Revisiting Person Re-Identification for Animals Towards Better Generalization Poster Session 3 & Exhibit Hall
Saihui Hou ⋅ Panjian Huang ⋅ Zengbin Wang ⋅ Yuan Liu ⋅ Zeyu Li ⋅ Man Zhang ⋅ Yongzhen Huang
Exhibit Hall I #411
Enhancing Prompt Generation with Adaptive Refinement for Camouflaged Object Detection Poster Session 5 & Exhibit Hall
Xuehan Chen ⋅ Guangyu Ren ⋅ Tianhong Dai ⋅ Tania Stathaki ⋅ Hengyan Liu
Exhibit Hall I #83
Hypergraph Clustering Network with Partial Attribute Imputation Poster Session 1 & Exhibit Hall
Qianqian Wang ⋅ Bowen Zhao ⋅ Zhengming Ding ⋅ Wei Feng ⋅ Quanxue Gao
Exhibit Hall I #248
Easy3D: A Simple Yet Effective Method for 3D Interactive Segmentation Poster Session 6 & Exhibit Hall with Coffee Break
Andrea Simonelli ⋅ Norman Müller ⋅ Peter Kontschieder
Exhibit Hall I #357
SAMPLE: Semantic Alignment through Temporal-Adaptive Multimodal Prompt Learning for Event-Based Open-Vocabulary Action Recognition Poster Session 3 & Exhibit Hall
Jing Wang ⋅ Rui Zhao ⋅ Ruiqin Xiong ⋅ Xingtao Wang ⋅ Xiaopeng Fan ⋅ Tiejun Huang
Exhibit Hall I #415
Object-centric Video Question Answering with Visual Grounding and Referring Poster Session 5 & Exhibit Hall
Haochen Wang ⋅ Qirui Chen ⋅ Cilin Yan ⋅ Jiayin Cai ⋅ Xiaolong Jiang ⋅ Yao Hu ⋅ Weidi Xie ⋅ Stratis Gavves
Exhibit Hall I #233
DSO: Aligning 3D Generators with Simulation Feedback for Physical Soundness Poster Session 2 & Exhibit Hall with Coffee Break
Ruining Li ⋅ Chuanxia Zheng ⋅ Christian Rupprecht ⋅ Andrea Vedaldi
Exhibit Hall I #165
EgoAgent: A Joint Predictive Agent Model in Egocentric Worlds Poster Session 2 & Exhibit Hall with Coffee Break
Lu Chen ⋅ Yizhou Wang ⋅ SHIXIANG TANG ⋅ Qianhong Ma ⋅ Tong He ⋅ Wanli Ouyang ⋅ Xiaowei Zhou ⋅ Hujun Bao ⋅ Sida Peng
Exhibit Hall I #183
VMBench: A Benchmark for Perception-Aligned Video Motion Generation Poster Session 3 & Exhibit Hall
Xinran Ling ⋅ Chen Zhu ⋅ Meiqi Wu ⋅ Hangyu Li ⋅ Xiaokun Feng ⋅ Cundian Yang ⋅ Aiming Hao ⋅ Jiashu Zhu ⋅ Jiahong Wu ⋅ Xiangxiang Chu
Exhibit Hall I #290
UAVScenes: A Multi-Modal Dataset for UAVs Poster Session 6 & Exhibit Hall with Coffee Break
Sijie Wang ⋅ Siqi Li ⋅ Yawei Zhang ⋅ Shangshu Yu ⋅ Shenghai Yuan ⋅ Rui She ⋅ Quanjiang Guo ⋅ JinXuan Zheng ⋅ Ong Howe ⋅ Leonrich Chandra ⋅ Shrivarshann Srijeyan ⋅ Aditya Sivadas ⋅ Toshan Aggarwal ⋅ Heyuan Liu ⋅ Hongming Zhang ⋅ CHEN CHUJIE ⋅ JIANG JUNYU ⋅ Lihua Xie ⋅ Wee Peng Tay
Exhibit Hall I #407
LIRA: Reasoning Reconstruction via Multimodal Large Language Models Poster Session 1 & Exhibit Hall
Zhen Zhou ⋅ Tong Wang ⋅ Yunkai Ma ⋅ Xiao Tan ⋅ Fengshui Jing
Exhibit Hall I #159
Move to Understand a 3D Scene: Bridging Visual Grounding and Exploration for Efficient and Versatile Embodied Navigation Poster Session 2 & Exhibit Hall with Coffee Break
ZIYU ZHU ⋅ Xilin Wang ⋅ Yixuan Li ⋅ Zhuofan Zhang ⋅ Xiaojian Ma ⋅ Yixin Chen ⋅ Baoxiong Jia ⋅ Wei Liang ⋅ Qian Yu ⋅ Zhidong Deng ⋅ Siyuan Huang ⋅ Qing Li
Exhibit Hall I #291
NavMorph: A Self-Evolving World Model for Vision-and-Language Navigation in Continuous Environments Poster Session 2 & Exhibit Hall with Coffee Break
Xuan Yao ⋅ Junyu Gao ⋅ Changsheng Xu
Exhibit Hall I #48
TF-TI2I: Training-Free Text-and-Image-to-Image Generation via Multi-Modal Implicit-Context Learning In Text-to-Image Models Poster Session 4 & Exhibit Hall with Coffee Break
Teng-Fang Hsiao ⋅ Bo-Kai Ruan ⋅ Yi-Lun Wu ⋅ Tzu-Ling Lin ⋅ Hong-Han Shuai
Exhibit Hall I #335
Exploiting Frequency Dynamics for Enhanced Multimodal Event-based Action Recognition Poster Session 2 & Exhibit Hall with Coffee Break
Meiqi Cao ⋅ Xiangbo Shu ⋅ Xin Jiang ⋅ Rui Yan ⋅ Yazhou Yao ⋅ Jinhui Tang
Exhibit Hall I #89
Compression of 3D Gaussian Splatting with Optimized Feature Planes and Standard Video Codecs Poster Session 6 & Exhibit Hall with Coffee Break
Soonbin Lee ⋅ Fangwen Shu ⋅ Yago Sanchez de la Fuente ⋅ Thomas Schierl ⋅ Cornelius Hellge
Exhibit Hall I #73
GeoProg3D: Compositional Visual Reasoning for City-Scale 3D Language Fields Poster Session 2 & Exhibit Hall with Coffee Break
Shunsuke Yasuki ⋅ Taiki Miyanishi ⋅ Nakamasa Inoue ⋅ Shuhei Kurita ⋅ Koya Sakamoto ⋅ Daichi Azuma ⋅ Masato Taki ⋅ Yutaka Matsuo
Exhibit Hall I #442
GazeGaussian: High-Fidelity Gaze Redirection with 3D Gaussian Splatting Poster Session 3 & Exhibit Hall
Xiaobao Wei ⋅ Peng Chen ⋅ Guangyu Li ⋅ Ming Lu ⋅ Hui Chen ⋅ Feng Tian
Exhibit Hall I #311
Boosting Adversarial Transferability via Negative Hessian Trace Regularization Poster Session 1 & Exhibit Hall
Yunfei Long ⋅ Zilin Tian ⋅ Liguo Zhang ⋅ Huosheng Xu
Exhibit Hall I #217
FB-Diff: Fourier Basis-guided Diffusion for Temporal Interpolation of 4D Medical Imaging Poster Session 6 & Exhibit Hall with Coffee Break
Xin You ⋅ Runze Yang ⋅ Chuyan Zhang ⋅ Zhongliang Jiang ⋅ JIE YANG ⋅ Nassir Navab
Exhibit Hall I #317
How Far are AI-generated Videos from Simulating the 3D Visual World: A Learned 3D Evaluation Approach Poster Session 3 & Exhibit Hall
Chirui CHANG ⋅ Jiahui Liu ⋅ Zhengzhe Liu ⋅ Xiaoyang Lyu ⋅ Yi-Hua Huang ⋅ Xin Tao ⋅ Pengfei Wan ⋅ Di ZHANG ⋅ Xiaojuan Qi
Exhibit Hall I #28
SIC: Similarity-Based Interpretable Image Classification with Neural Networks Poster Session 5 & Exhibit Hall
Tom Nuno Wolf ⋅ Emre Kavak ⋅ Fabian Bongratz ⋅ Christian Wachinger
Exhibit Hall I #419
3DRealCar: An In-the-wild RGB-D Car Dataset with 360-degree Views Poster Session 6 & Exhibit Hall with Coffee Break
Xiaobiao Du ⋅ Yida Wang ⋅ Haiyang Sun ⋅ Zhuojie Wu ⋅ Hongwei Sheng ⋅ Shuyun Wang ⋅ Jiaying Ying ⋅ Ming Lu ⋅ Tianqing Zhu ⋅ Kun Zhan ⋅ Xin Yu
Exhibit Hall I #171
Describe, Don’t Dictate: Semantic Image Editing with Natural Language Intent Poster Session 4 & Exhibit Hall with Coffee Break
En Ci ⋅ Shanyan Guan ⋅ Yanhao Ge ⋅ Yilin Zhang ⋅ Wei Li ⋅ Zhenyu Zhang ⋅ Jian Yang ⋅ Ying Tai
Exhibit Hall I #412
Event-based Tiny Object Detection: A Benchmark Dataset and Baselines Poster Session 2 & Exhibit Hall with Coffee Break
Nuo Chen ⋅ Chao Xiao ⋅ Yimian Dai ⋅ Shiman He ⋅ Miao Li ⋅ Wei An
Exhibit Hall I #205
Depth AnyEvent: A Cross-Modal Distillation Paradigm for Event-Based Monocular Depth Estimation Poster Session 4 & Exhibit Hall with Coffee Break
Luca Bartolomei ⋅ Enrico Mannocci ⋅ Fabio Tosi ⋅ Matteo Poggi ⋅ Stefano Mattoccia
Exhibit Hall I #458
EmotiCrafter: Text-to-Emotional-Image Generation based on Valence-Arousal Model Poster Session 4 & Exhibit Hall with Coffee Break
Shengqi Dang ⋅ Yi He ⋅ Long Ling ⋅ Ziqing Qian ⋅ Nanxuan Zhao ⋅ Nan Cao
Exhibit Hall I #31
LD-RPS: Zero-Shot Unified Image Restoration via Latent Diffusion Recurrent Posterior Sampling Poster Session 3 & Exhibit Hall
Li Huaqiu ⋅ Yong Wang ⋅ Tongwen Huang ⋅ Hailang Huang ⋅ Haoqian Wang ⋅ Xiangxiang Chu
Exhibit Hall I #346
Towards Stabilized and Efficient Diffusion Transformers through Long-Skip-Connections with Spectral Constraints Poster Session 4 & Exhibit Hall with Coffee Break
Guanjie Chen ⋅ Xinyu Zhao ⋅ Yucheng Zhou ⋅ Xiaoye Qu ⋅ Tianlong Chen ⋅ Yu Cheng
Exhibit Hall I #270
Not All Frame Features Are Equal: Video-to-4D Generation via Decoupling Dynamic-Static Features Poster Session 2 & Exhibit Hall with Coffee Break
Liying Yang ⋅ Chen Liu ⋅ Zhenwei Zhu ⋅ Ajian Liu ⋅ Hui Ma ⋅ Jian Nong ⋅ Yanyan Liang
Exhibit Hall I #233
Fuse Before Transfer: Knowledge Fusion for Heterogeneous Distillation Poster Session 1 & Exhibit Hall
Guopeng Li ⋅ Qiang Wang ⋅ Ke Yan ⋅ Shouhong Ding ⋅ Yuan Gao ⋅ Gui-Song Xia
Exhibit Hall I #320
CoSMIC: Continual Self-supervised Learning for Multi-Domain Medical Imaging via Conditional Mutual Information Maximization Poster Session 5 & Exhibit Hall
Yihang Liu ⋅ Ying Wen ⋅ Longzhen Yang ⋅ Lianghua He ⋅ Heng Tao Shen
Exhibit Hall I #307
Unsupervised Identification of Protein Compositions and Conformations via Implicit Content-Transformation Disentanglement Poster Session 2 & Exhibit Hall with Coffee Break
Mostofa Rafid Uddin ⋅ Jana Armouti ⋅ Min Xu
Exhibit Hall I #232
SplArt: Articulation Estimation and Part-Level Reconstruction with 3D Gaussian Splatting Poster Session 2 & Exhibit Hall with Coffee Break
Shengjie Lin ⋅ Jiading Fang ⋅ Muhammad Zubair Irshad ⋅ Vitor Campagnolo Guizilini ⋅ Rares Ambrus ⋅ Greg Shakhnarovich ⋅ Matthew Walter
Exhibit Hall I #359
Splat-based 3D Scene Reconstruction with Extreme Motion-blur Poster Session 6 & Exhibit Hall with Coffee Break
Hyeonjoong Jang ⋅ Dongyoung Choi ⋅ Donggun Kim ⋅ Woohyun Kang ⋅ Min H. Kim
Exhibit Hall I #165
Diffusion Curriculum: Synthetic-to-Real Data Curriculum via Image-Guided Diffusion Poster Session 1 & Exhibit Hall
Yijun Liang ⋅ Shweta Bhardwaj ⋅ Tianyi Zhou
Exhibit Hall I #151
Training-free and Adaptive Sparse Attention for Efficient Long Video Generation Poster Session 4 & Exhibit Hall with Coffee Break
yifei xia ⋅ Suhan Ling ⋅ Fangcheng Fu ⋅ Yujie Wang ⋅ Huixia Li ⋅ Xuefeng Xiao ⋅ Bin CUI
Exhibit Hall I #104
ForestFormer3D: A Unified Framework for End-to-End Segmentation of Forest LiDAR 3D Point Clouds Poster Session 6 & Exhibit Hall with Coffee Break
Binbin Xiang ⋅ Maciej Wielgosz ⋅ Stefano Puliti ⋅ Kamil Král ⋅ Martin Krůček ⋅ Azim Missarov ⋅ Rasmus Astrup
Exhibit Hall I #356
OV3D-CG: Open-vocabulary 3D Instance Segmentation with Contextual Guidance Poster Session 2 & Exhibit Hall with Coffee Break
Mingquan Zhou ⋅ Chen He ⋅ Ruiping Wang ⋅ Xilin Chen
Exhibit Hall I #27
AdsQA: Towards Advertisement Video Understanding Poster Session 5 & Exhibit Hall
Xinwei Long ⋅ Kai Tian ⋅ Peng Xu ⋅ Guoli Jia ⋅ Jingxuan Li ⋅ Sa Yang ⋅ Yihua Shao ⋅ Kaiyan Zhang ⋅ Che Jiang ⋅ Hao Xu ⋅ Yang Liu ⋅ Jiaheng Ma ⋅ Bowen Zhou
Exhibit Hall I #339
Memory-Efficient Generative Models via Product Quantization Poster Session 4 & Exhibit Hall with Coffee Break
Jie Shao ⋅ Hanxiao Zhang ⋅ Hao Yu ⋅ Jianxin Wu
Exhibit Hall I #190
ForgeLens: Data-Efficient Forgery Focus for Generalizable Forgery Image Detection Poster Session 4 & Exhibit Hall with Coffee Break
Yingjian Chen ⋅ Lei Zhang ⋅ Yakun Niu
Exhibit Hall I #131
Rethinking Discrete Tokens: Treating Them as Conditions for Continuous Autoregressive Image Synthesis Poster Session 4 & Exhibit Hall with Coffee Break
Peng Zheng ⋅ Junke Wang ⋅ Yi Chang ⋅ Yizhou Yu ⋅ Rui Ma ⋅ Zuxuan Wu
Exhibit Hall I #240
Multimodal Prompt Alignment for Facial Expression Recognition Poster Session 3 & Exhibit Hall
Fuyan Ma ⋅ Yiran He ⋅ Bin Sun ⋅ Shutao Li
Exhibit Hall I #243
CogCM: Cognition-Inspired Contextual Modeling for Audio-Visual Speech Enhancement Poster Session 5 & Exhibit Hall
Feixiang Wang ⋅ Shuang Yang ⋅ Shiguang Shan ⋅ Xilin Chen
Exhibit Hall I #149
Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition Poster Session 1 & Exhibit Hall
Zhisheng Zhong ⋅ Chengyao Wang ⋅ Yuqi Liu ⋅ Senqiao Yang ⋅ Longxiang Tang ⋅ Yuechen Zhang ⋅ Jingyao Li ⋅ Tianyuan Qu ⋅ Yanwei Li ⋅ Yukang Chen ⋅ Shaozuo Yu ⋅ WU Sitong ⋅ Eric Lo ⋅ Shu Liu ⋅ Jiaya Jia
Exhibit Hall I #343
AIM: Adaptive Inference of Multi-Modal LLMs via Token Merging and Pruning Poster Session 5 & Exhibit Hall
Yiwu Zhong ⋅ Zhuoming Liu ⋅ Yin Li ⋅ Liwei Wang
Exhibit Hall I #36
EDFFDNet: Towards Accurate and Efficient Unsupervised Multi-Grid Image Registration Poster Session 2 & Exhibit Hall with Coffee Break
Haokai Zhu ⋅ Bo Qu ⋅ Si-Yuan Cao ⋅ Runmin Zhang ⋅ Shujie Chen ⋅ Bailin Yang ⋅ Hui-liang Shen
Exhibit Hall I #8
Enhancing Mamba Decoder with Bidirectional Interaction in Multi-Task Dense Prediction Poster Session 4 & Exhibit Hall with Coffee Break
Mang Cao ⋅ Sanping Zhou ⋅ Yizhe Li ⋅ Ye Deng ⋅ Wenli Huang ⋅ Le Wang
Exhibit Hall I #375
Leveraging Debiased Cross-modal Attention Maps and Code-based Reasoning for Zero-shot Referring Expression Comprehension Poster Session 5 & Exhibit Hall
Juntao Chen ⋅ Wen Shen ⋅ Zhihua Wei ⋅ Lijun Sun ⋅ Hongyun Zhang
Exhibit Hall I #57
UST-SSM: Unified Spatio-Temporal State Space Models for Point Cloud Video Modeling Poster Session 2 & Exhibit Hall with Coffee Break
Peiming Li ⋅ Ziyi Wang ⋅ Yulin Yuan ⋅ Hong Liu ⋅ Xiangming Meng ⋅ Junsong Yuan ⋅ Mengyuan Liu
Exhibit Hall I #162
Improving Multimodal Learning via Imbalanced Learning Poster Session 1 & Exhibit Hall
Shicai Wei ⋅ Chunbo Luo ⋅ Yang Luo
Exhibit Hall I #204
SITE: towards Spatial Intelligence Thorough Evaluation Poster Session 2 & Exhibit Hall with Coffee Break
Wenqi Wang ⋅ Reuben Tan ⋅ Pengyue Zhu ⋅ Jianwei Yang ⋅ Zhengyuan Yang ⋅ Lijuan Wang ⋅ Andrey Kolobov ⋅ Jianfeng Gao ⋅ Boqing Gong
Exhibit Hall I #379
SHIFT: Smoothing Hallucinations by Information Flow Tuning for Multimodal Large Language Models Poster Session 1 & Exhibit Hall
Sudong Wang ⋅ Yunjian Zhang ⋅ Yao Zhu ⋅ Enci Liu ⋅ Jianing Li ⋅ Yanwei Liu ⋅ Xiangyang Ji
Exhibit Hall I #338
Stable Score Distillation Poster Session 4 & Exhibit Hall with Coffee Break
Haiming Zhu ⋅ Yangyang Xu ⋅ Chenshu Xu ⋅ Tingrui Shen ⋅ Wenxi Liu ⋅ Yong Du ⋅ Jun Yu ⋅ Shengfeng He
Exhibit Hall I #164
Synergistic Prompting for Robust Visual Recognition with Missing Modalities Poster Session 1 & Exhibit Hall
Zhihui Zhang ⋅ Luanyuan Dai ⋅ Qika Lin ⋅ Yunfeng Diao ⋅ Guangyin Jin ⋅ Yufei Guo ⋅ Jing Zhang ⋅ Xiaoshuai Hao
Exhibit Hall I #170
Hierarchical Visual Prompt Learning for Continual Video Instance Segmentation Poster Session 3 & Exhibit Hall
Jiahua Dong ⋅ Hui Yin ⋅ Wenqi Liang ⋅ Hanbin Zhao ⋅ Henghui Ding ⋅ Nicu Sebe ⋅ Salman Khan ⋅ Fahad Khan
Exhibit Hall I #172
Automated Red Teaming for Text-to-Image Models through Feedback-Guided Prompt Iteration with Vision-Language Models Poster Session 4 & Exhibit Hall with Coffee Break
Wei Xu ⋅ Kangjie Chen ⋅ Jiawei Qiu ⋅ Yuyang zhang ⋅ Run Wang ⋅ Jin Mao ⋅ Tianwei Zhang ⋅ Lina Wang
Exhibit Hall I #353
RAGD: Regional-Aware Diffusion Model for Text-to-Image Generation Poster Session 4 & Exhibit Hall with Coffee Break
Chen Zhennan ⋅ Yajie Li ⋅ Haofan Wang ⋅ Zhibo Chen ⋅ Zhengkai Jiang ⋅ Jun Li ⋅ Qian Wang ⋅ Jian Yang ⋅ Ying Tai
Exhibit Hall I #426
Enhancing Spatial Reasoning in Multimodal Large Language Models through Reasoning-based Segmentation Poster Session 2 & Exhibit Hall with Coffee Break
Zhenhua Ning ⋅ Zhuotao Tian ⋅ Shaoshuai Shi ⋅ Daojing He ⋅ Guangming Lu ⋅ Wenjie Pei ⋅ Li Jiang
Exhibit Hall I #266
Knowledge Distillation with Refined Logits Poster Session 1 & Exhibit Hall
Wujie Sun ⋅ Defang Chen ⋅ Siwei Lyu ⋅ Genlang Chen ⋅ Chun Chen ⋅ Can Wang
Exhibit Hall I #96
Dark-ISP: Enhancing RAW Image Processing for Low-Light Object Detection Poster Session 2 & Exhibit Hall with Coffee Break
Jiasheng Guo ⋅ Xin Gao ⋅ Yuxiang Yan ⋅ Guanghao Li ⋅ Jian Pu
Exhibit Hall I #428
BézierGS: Dynamic Urban Scene Reconstruction with Bézier Curve Gaussian Splatting Poster Session 6 & Exhibit Hall with Coffee Break
Zipei Ma ⋅ Junzhe Jiang ⋅ Yurui Chen ⋅ Li Zhang
Exhibit Hall I #77
Domain Generalizable Portrait Style Transfer Poster Session 4 & Exhibit Hall with Coffee Break
Xinbo Wang ⋅ Wenju Xu ⋅ Qing Zhang ⋅ Wei-Shi Zheng
Exhibit Hall I #87
PerLDiff: Controllable Street View Synthesis Using Perspective-Layout Diffusion Model Poster Session 6 & Exhibit Hall with Coffee Break
Jinhua Zhang ⋅ Hualian Sheng ⋅ Sijia Cai ⋅ Bing Deng ⋅ Qiao Liang ⋅ Wen Li ⋅ Ying Fu ⋅ Jieping Ye ⋅ Shuhang Gu
Exhibit Hall I #154
Diffusion Image Prior Poster Session 6 & Exhibit Hall with Coffee Break
Hamadi Chihaoui ⋅ Paolo Favaro
Exhibit Hall I #288
Text2VDM: Text to Vector Displacement Maps for Expressive and Interactive 3D Sculpting Poster Session 4 & Exhibit Hall with Coffee Break
Hengyu Meng ⋅ Duotun Wang ⋅ Zhijing Shao ⋅ Ligang Liu ⋅ Zeyu Wang
Exhibit Hall I #191
HERO: Human Reaction Generation from Videos Poster Session 3 & Exhibit Hall
Chengjun Yu ⋅ Wei Zhai ⋅ Yuhang Yang ⋅ Yang Cao ⋅ Zheng-Jun Zha
Exhibit Hall I #24
Towards Comprehensive Lecture Slides Understanding: Large-scale Dataset and Effective Method Poster Session 1 & Exhibit Hall
Enming Zhang ⋅ Yuzhe Li ⋅ Yuliang Liu ⋅ Yingying Zhu ⋅ Xiang Bai
Exhibit Hall I #418
A Unified Interpretation of Training-Time Out-of-Distribution Detection Poster Session 1 & Exhibit Hall
Xu Cheng ⋅ Xin Jiang ⋅ Zechao Li
Exhibit Hall I #194
VQ-SGen: A Vector Quantized Stroke Representation for Creative Sketch Generation Poster Session 4 & Exhibit Hall with Coffee Break
Jiawei Wang ⋅ Zhiming Cui ⋅ Changjian Li
Exhibit Hall I #424
G2PDiffusion: Cross-species Genotype-to-Phenotype Prediction via Evolutionary Diffusion Poster Session 5 & Exhibit Hall
Mengdi Liu ⋅ Zhangyang Gao ⋅ Hong Chang ⋅ Stan Li ⋅ Shiguang Shan ⋅ Xilin Chen
Exhibit Hall I #86
Mamba-3VL: Taming State Space Model for 3D Vision Language Learning Poster Session 2 & Exhibit Hall with Coffee Break
Yuan Wang ⋅ Yuxin Chen ⋅ Zhongang Qi ⋅ Lijun Liu ⋅ Jile Jiao ⋅ Xuetao Feng ⋅ Yujia Liang ⋅ Ying Shan ⋅ Zhipeng Zhang
Exhibit Hall I #117
Embodied Representation Alignment with Mirror Neurons Poster Session 3 & Exhibit Hall
Wentao Zhu ⋅ Zhining Zhang ⋅ Yuwei Ren ⋅ Yin Huang ⋅ Hao Xu ⋅ Yizhou Wang
Exhibit Hall I #183
Referring to Any Person Poster Session 5 & Exhibit Hall
Qing Jiang ⋅ Lin Wu ⋅ Zhaoyang Zeng ⋅ Tianhe Ren ⋅ Yuda Xiong ⋅ Yihao Chen ⋅ Liu Qin ⋅ Lei Zhang
Exhibit Hall I #175
Selective Contrastive Learning for Weakly Supervised Affordance Grounding Poster Session 2 & Exhibit Hall with Coffee Break
WonJun Moon ⋅ Hyun Seok Seong ⋅ Jae-Pil Heo
Exhibit Hall I #18
CoST: Efficient Collaborative Perception From Unified Spatiotemporal Perspective Poster Session 1 & Exhibit Hall
Zongheng Tang ⋅ Yi Liu ⋅ Yifan Sun ⋅ Yulu Gao ⋅ Jinyu Chen ⋅ Runsheng Xu ⋅ Si Liu
Exhibit Hall I #97
M2EIT: Multi-Domain Mixture of Experts for Robust Neural Inertial Tracking Poster Session 6 & Exhibit Hall with Coffee Break
Yan Li ⋅ Yang Xu ⋅ Changhao Chen ⋅ Zhongchen Shi ⋅ Wei Chen ⋅ Liang Xie ⋅ Hongbo Chen ⋅ Erwei Yin
Exhibit Hall I #336
MobileViCLIP: An Efficient Video-Text Model for Mobile Devices Poster Session 5 & Exhibit Hall
Min Yang ⋅ Zihan Jia ⋅ Zhilin Dai ⋅ Sheng Guo ⋅ Limin Wang
Exhibit Hall I #97
Task-Specific Zero-shot Quantization-Aware Training for Object Detection Poster Session 5 & Exhibit Hall
Changhao Li ⋅ Xinrui Chen ⋅ Ji Wang ⋅ Kang Zhao ⋅ Jianfei Chen
Exhibit Hall I #288
Bridging Domain Generalization to Multimodal Domain Generalization via Unified Representations Poster Session 5 & Exhibit Hall
Hai Huang ⋅ Yan Xia ⋅ Sashuai Zhou ⋅ Hanting Wang ⋅ Shulei Wang ⋅ Zhou Zhao
Exhibit Hall I #253
Addressing Representation Collapse in Vector Quantized Models with One Linear Layer Poster Session 5 & Exhibit Hall
Yongxin Zhu ⋅ Bocheng Li ⋅ Yifei Xin ⋅ Zhihua Xia ⋅ Linli Xu
Exhibit Hall I #297
DictAS: A Framework for Class-Generalizable Few-Shot Anomaly Segmentation via Dictionary Lookup Poster Session 5 & Exhibit Hall
Zhen Qu ⋅ Xian Tao ⋅ Xinyi Gong ⋅ ShiChen Qu ⋅ Xiaopei Zhang ⋅ Xingang Wang ⋅ Fei Shen ⋅ Zhengtao Zhang ⋅ Mukesh Prasad ⋅ Guiguang Ding
Exhibit Hall I #67
EVOLVE: Event-Guided Deformable Feature Transfer and Dual-Memory Refinement for Low-Light Video Object Segmentation Poster Session 3 & Exhibit Hall
Jong Hyeon Baek ⋅ Jiwon oh ⋅ Yeong Jun Koh
Exhibit Hall I #119
MATE: Motion-Augmented Temporal Consistency for Event-based Point Tracking Poster Session 2 & Exhibit Hall with Coffee Break
Han Han ⋅ Wei Zhai ⋅ Yang Cao ⋅ Bin Li ⋅ Zheng-Jun Zha
Exhibit Hall I #313
Asynchronous Event Error-Minimizing Noise for Safeguarding Event Dataset Poster Session 3 & Exhibit Hall
Ruofei WANG ⋅ Peiqi Duan ⋅ Boxin Shi ⋅ Renjie Wan
Exhibit Hall I #13
AG2aussian: Anchor-Graph Structured Gaussian Splatting for Instance-Level 3D Scene Understanding and Editing Poster Session 6 & Exhibit Hall with Coffee Break
Zhaonan Wang ⋅ Manyi Li ⋅ Changhe Tu
Exhibit Hall I #201
Vector Contrastive Learning For Pixel-Wise Pretraining In Medical Vision Poster Session 5 & Exhibit Hall
Yuting He ⋅ Shuo Li
Exhibit Hall I #3
InterGSEdit: Interactive 3D Gaussian Splatting Editing with 3D Geometry-Consistent Attention Prior Poster Session 6 & Exhibit Hall with Coffee Break
Minghao Wen ⋅ Shengjie Wu ⋅ Kangkan Wang ⋅ Dong Liang
Exhibit Hall I #135
CaO2: Rectifying Inconsistencies in Diffusion-Based Dataset Distillation Poster Session 1 & Exhibit Hall
Haoxuan Wang ⋅ Zhenghao Zhao ⋅ Junyi Wu ⋅ Yuzhang Shang ⋅ Gaowen Liu ⋅ Yan Yan
Exhibit Hall I #443
Differential-informed Sample Selection Accelerates Multimodal Contrastive Learning Poster Session 1 & Exhibit Hall
Zihua Zhao ⋅ Feng Hong ⋅ Mengxi Chen ⋅ Pengyi Chen ⋅ Benyuan Liu ⋅ Jiangchao Yao ⋅ Ya Zhang ⋅ Yanfeng Wang
Exhibit Hall I #270
InstaScene: Towards Complete 3D Instance Decomposition and Reconstruction from Cluttered Scenes Poster Session 2 & Exhibit Hall with Coffee Break
Zesong Yang ⋅ Bangbang Yang ⋅ Wenqi Dong ⋅ Chenxuan Cao ⋅ Liyuan Cui ⋅ Yuewen Ma ⋅ Zhaopeng Cui ⋅ Hujun Bao
Exhibit Hall I #259
Efficient Fine-Tuning of Large Models via Nested Low-Rank Adaptation Poster Session 5 & Exhibit Hall
Lujun Li ⋅ Cheng Lin ⋅ Dezhi Li ⋅ You-Liang Huang ⋅ Wei Li ⋅ Tianyu Wu ⋅ Jie Zou ⋅ Wei Xue ⋅ Sirui Han ⋅ Yike Guo
Exhibit Hall I #231
Dual-level Prototype Learning for Composite Degraded Image Restoration Poster Session 3 & Exhibit Hall
Zhongze Wang ⋅ Haitao Zhao ⋅ Lujian Yao ⋅ Jingchao Peng ⋅ Kaijie Zhao
Exhibit Hall I #378
Dynamic Reconstruction of Hand-Object Interaction with Distributed Force-aware Contact Representation Poster Session 2 & Exhibit Hall with Coffee Break
Zhenjun Yu ⋅ Wenqiang Xu ⋅ Pengfei Xie ⋅ Yutong Li ⋅ Brian Anthony ⋅ Zhuorui Zhang ⋅ Cewu Lu
Exhibit Hall I #336
Efficient Input-level Backdoor Defense on Text-to-Image Synthesis via Neuron Activation Variation Poster Session 4 & Exhibit Hall with Coffee Break
Shengfang ZHAI ⋅ Jiajun Li ⋅ Yue Liu ⋅ Huanran Chen ⋅ Zhihua Tian ⋅ Wenjie Qu ⋅ Qingni Shen ⋅ Ruoxi Jia ⋅ Yinpeng Dong ⋅ Jiaheng Zhang
Exhibit Hall I #28
Decoupled Multi-Predictor Optimization for Inference-Efficient Model Tuning Poster Session 1 & Exhibit Hall
Liwei Luo ⋅ Shuaitengyuan Li ⋅ Dongwei Ren ⋅ Qilong Wang ⋅ Pengfei Zhu ⋅ Qinghua Hu
Exhibit Hall I #337
Detection, Pose Estimation and Segmentation for Multiple Bodies: Closing the Virtuous Circle Poster Session 2 & Exhibit Hall with Coffee Break
Miroslav Purkrabek ⋅ Jiri Matas
Exhibit Hall I #374
ACAM-KD: Adaptive and Cooperative Attention Masking for Knowledge Distillation Poster Session 1 & Exhibit Hall
Qizhen Lan ⋅ Qing Tian
Exhibit Hall I #368
GReg: Geometry-Aware Region Refinement for Sign Language Video Generation Poster Session 4 & Exhibit Hall with Coffee Break
Tongkai Shi ⋅ Lianyu Hu ⋅ Fanhua Shang ⋅ Liqing Gao ⋅ Wei Feng
Exhibit Hall I #150
Unsupervised Part Discovery via Descriptor-Based Masked Image Restoration with Optimized Constraints Poster Session 2 & Exhibit Hall with Coffee Break
Jiahao Xia ⋅ Yike Wu ⋅ Wenjian Huang ⋅ Jianguo Zhang ⋅ Jian Zhang
Exhibit Hall I #343
NETracer: A Topology-Aware Iterative Tracing Approach for Tubular Structure Extraction Poster Session 5 & Exhibit Hall
Chao Liu ⋅ Yangbo Jiang ⋅ Nenggan Zheng
Exhibit Hall I #76
Benchmarking Multimodal CoT Reward Model Stepwise by Visual Program Poster Session 1 & Exhibit Hall
Minghe Gao ⋅ Xuqi Liu ⋅ Zhongqi Yue ⋅ Yang Wu ⋅ Shuang Chen ⋅ Juncheng Li ⋅ Siliang Tang ⋅ Fei Wu ⋅ Tat-Seng Chua ⋅ Yueting Zhuang
Exhibit Hall I #155
MotionCtrl: A Real-time Controllable Vision-Language-Motion Model Poster Session 3 & Exhibit Hall
Bin Cao ⋅ Sipeng Zheng ⋅ Ye Wang ⋅ Lujie Xia ⋅ Qianshan Wei ⋅ Qin Jin ⋅ Jing Liu ⋅ Zongqing Lu
Exhibit Hall I #211
Visual Relation Diffusion for Human-Object Interaction Detection Poster Session 5 & Exhibit Hall
Ping Cao ⋅ Yepeng Tang ⋅ Chunjie Zhang ⋅ Xiaolong Zheng ⋅ Chao Liang ⋅ Yunchao Wei ⋅ Yao Zhao
Exhibit Hall I #353
Pinco: Position-induced Consistent Adapter for Diffusion Transformer in Foreground-conditioned Inpainting Poster Session 4 & Exhibit Hall with Coffee Break
Guangben Lu ⋅ Yuzhen N/A ⋅ Zhimin Sun ⋅ Ran Yi ⋅ Yifan Qi ⋅ Yizhe Tang ⋅ Tianyi Wang ⋅ Lizhuang Ma ⋅ FangYuan Zou
Exhibit Hall I #35
VLR-Driver: Large Vision-Language-Reasoning Models for Embodied Autonomous Driving Poster Session 6 & Exhibit Hall with Coffee Break
Fanjie Kong ⋅ Yitong Li ⋅ Weihuang Chen ⋅ Chen Min ⋅ Yizhe Li ⋅ Zhiqiang Gao ⋅ Haoyang Li ⋅ Zhongyu Guo ⋅ Hongbin Sun
Exhibit Hall I #216
Vid-Group: Temporal Video Grounding Pretraining from Unlabeled Videos in the Wild Poster Session 5 & Exhibit Hall
Peijun Bao ⋅ Chenqi Kong ⋅ SIYUAN YANG ⋅ Zihao Shao ⋅ Xinghao Jiang ⋅ Boon Ng ⋅ Meng Er ⋅ Alex Kot
Exhibit Hall I #69
AcZeroTS: Active Learning for Zero-shot Tissue Segmentation in Pathology Images Poster Session 5 & Exhibit Hall
Jiao Tang ⋅ Junjie Zhou ⋅ Bo Qian ⋅ Peng Wan ⋅ Yingli Zuo ⋅ WEI SHAO ⋅ Daoqiang Zhang
Exhibit Hall I #349
OneGT: One-Shot Geometry-Texture Neural Rendering for Head Avatars Poster Session 3 & Exhibit Hall
Jinshu Chen ⋅ Bingchuan Li ⋅ Fan Zhang ⋅ Songtao Zhao ⋅ Qian HE
Exhibit Hall I #121
METEOR: Multi-Encoder Collaborative Token Pruning for Efficient Vision Language Models Poster Session 5 & Exhibit Hall
Yuchen Liu ⋅ Yaoming Wang ⋅ Bowen Shi ⋅ XIAOPENG ZHANG ⋅ Wenrui Dai ⋅ Chenglin Li ⋅ Hongkai Xiong ⋅ Qi Tian
Exhibit Hall I #159
Unsupervised Visible-Infrared Person Re-identification under Unpaired Settings Poster Session 3 & Exhibit Hall
Haoyu Yao ⋅ Bin Yang ⋅ Wenke Huang ⋅ Mang Ye ⋅ Bo Du
Exhibit Hall I #180
Adaptive Prompt Learning via Gaussian Outlier Synthesis for Out-of-distribution Detection Poster Session 1 & Exhibit Hall
Yongkang Zhang ⋅ Dongyu She ⋅ Zhong Zhou
Exhibit Hall I #299
Bring Your Rear Cameras for Egocentric 3D Human Pose Estimation Poster Session 2 & Exhibit Hall with Coffee Break
HIroyasu Akada ⋅ Jian Wang ⋅ Vladislav Golyanik ⋅ Christian Theobalt
Exhibit Hall I #420
AMDANet: Attention-Driven Multi-Perspective Discrepancy Alignment for RGB-Infrared Image Fusion and Segmentation Poster Session 3 & Exhibit Hall
Haifeng Zhong ⋅ Fan Tang ⋅ Zhuo Chen ⋅ Hyung Jin Chang ⋅ Yixing Gao
Exhibit Hall I #59
Lay2Story: Extending Diffusion Transformers for Layout-Togglable Story Generation Poster Session 4 & Exhibit Hall with Coffee Break
Ao Ma ⋅ Jiasong Feng ⋅ Ke Cao ⋅ Jing Wang ⋅ Yun Wang ⋅ Quanwei Zhang ⋅ Zhanjie Zhang
Exhibit Hall I #115
OCK: Unsupervised Dynamic Video Prediction with Object-Centric Kinematics Poster Session 3 & Exhibit Hall
YeonJi Song ⋅ Jaein Kim ⋅ Suhyung Choi ⋅ Jin-Hwa Kim ⋅ Byoung-Tak Zhang
Exhibit Hall I #127
Unraveling the Smoothness Properties of Diffusion Models: A Gaussian Mixture Perspective Poster Session 3 & Exhibit Hall
Yingyu Liang ⋅ Zhizhou Sha ⋅ Zhenmei Shi ⋅ Zhao Song ⋅ Mingda Wan ⋅ Yufa Zhou
Exhibit Hall I #134
S$^3$E: Self-Supervised State Estimation for Radar-Inertial System Poster Session 6 & Exhibit Hall with Coffee Break
Shengpeng Wang ⋅ Yulong Xie ⋅ Qing Liao ⋅ Wei Wang
Exhibit Hall I #190
Prompt Guidance and Human Proximal Perception for HOT Prediction with Regional Joint Loss Poster Session 5 & Exhibit Hall
Yuxiao Wang ⋅ Yu Lei ⋅ Zhenao WEI ⋅ WeiYing Xue ⋅ Xinyu Jiang ⋅ Nan Zhuang ⋅ Qi Liu
Exhibit Hall I #361
Scalable Image Tokenization with Index Backpropagation Quantization Poster Session 4 & Exhibit Hall with Coffee Break
Fengyuan Shi ⋅ Zhuoyan Luo ⋅ Yixiao Ge ⋅ Yujiu Yang ⋅ Ying Shan ⋅ Limin Wang
Exhibit Hall I #109
BVINet: Unlocking Blind Video Inpainting with Zero Annotations Poster Session 3 & Exhibit Hall
zhiliang wu ⋅ Kerui Chen ⋅ Kun Li ⋅ Hehe Fan ⋅ Yi Yang
Exhibit Hall I #379
Coupling the Generator with Teacher for Effective Data-Free Knowledge Distillation Poster Session 1 & Exhibit Hall
Xu Chen ⋅ Yang Li ⋅ Yahong Han ⋅ Guangquan Xu ⋅ Jialie Shen
Exhibit Hall I #195
Video Color Grading via Look-Up Table Generation Poster Session 4 & Exhibit Hall with Coffee Break
Seunghyun Shin ⋅ Dongmin Shin ⋅ Jisu Shin ⋅ Hae-Gon Jeon ⋅ Joon-Young Lee
Exhibit Hall I #408
Structure-Guided Diffusion Models for High-Fidelity Portrait Shadow Removal Poster Session 3 & Exhibit Hall
wanchang Yu ⋅ Qing Zhang ⋅ Rongjia Zheng ⋅ Wei-Shi Zheng
Exhibit Hall I #158
FreeDNA: Endowing Domain Adaptation of Diffusion-Based Dense Prediction with Training-Free Domain Noise Alignment Poster Session 1 & Exhibit Hall
Hang Xu ⋅ Jie Huang ⋅ Linjiang Huang ⋅ Dong Li ⋅ Yidi Liu ⋅ Feng Zhao
Exhibit Hall I #304
ProbMED: A Probabilistic Framework for Medical Multimodal Binding Poster Session 5 & Exhibit Hall
Yuan Gao ⋅ Sangwook Kim ⋅ Jianzhong You ⋅ Chris Mcintosh
Exhibit Hall I #34
You Are Your Own Best Teacher: Achieving Centralized-level Performance in Federated Learning under Heterogeneous and Long-tailed Data Poster Session 1 & Exhibit Hall
Shanshan Yan ⋅ Zexi Li ⋅ Chao Wu ⋅ Meng Pang ⋅ Yang Lu ⋅ Yan Yan ⋅ Hanzi Wang
Exhibit Hall I #253
A Tiny Change, A Giant Leap: Long-Tailed Class-Incremental Learning via Geometric Prototype Alignment Poster Session 1 & Exhibit Hall
xinyi lai ⋅ Luojun Lin ⋅ Weijie Chen ⋅ yuanlong yu
Exhibit Hall I #127
CountSE: Soft Exemplar Open-set Object Counting Poster Session 5 & Exhibit Hall
Shuai Liu ⋅ Peng Zhang ⋅ Shiwei Zhang ⋅ Wei Ke
Exhibit Hall I #163
Sparfels: Fast Reconstruction from Sparse Unposed Imagery Poster Session 6 & Exhibit Hall with Coffee Break
Shubhendu Jena ⋅ Amine Ouasfi ⋅ Mae Younes ⋅ Adnane Boukhayma
Exhibit Hall I #266
GaussianFlowOcc: Sparse and Weakly Supervised Occupancy Estimation using Gaussian Splatting and Temporal Flow Poster Session 6 & Exhibit Hall with Coffee Break
Simon Boeder ⋅ Fabian Gigengack ⋅ Benjamin Risse
Exhibit Hall I #21
Learning 4D Embodied World Models Poster Session 2 & Exhibit Hall with Coffee Break
Haoyu Zhen ⋅ Qiao Sun ⋅ Hongxin Zhang ⋅ Junyan Li ⋅ Siyuan Zhou ⋅ Yilun Du ⋅ Chuang Gan
Exhibit Hall I #30
MuGS: Multi-Baseline Generalizable Gaussian Splatting Reconstruction Poster Session 6 & Exhibit Hall with Coffee Break
Yaopeng Lou ⋅ Liao Shen ⋅ Tianqi Liu ⋅ Jiaqi Li ⋅ Zihao Huang ⋅ Huiqiang Sun ⋅ Zhiguo Cao
Exhibit Hall I #83
Region-Level Data Attribution for Text-to-Image Generative Models Poster Session 4 & Exhibit Hall with Coffee Break
Trong Bang Nguyen ⋅ Phi Le Nguyen ⋅ Simon Lucey ⋅ Minh Hoai
Exhibit Hall I #376
Trans-Adapter: A Plug-and-Play Framework for Transparent Image Inpainting Poster Session 4 & Exhibit Hall with Coffee Break
Yuekun Dai ⋅ Haitian Li ⋅ Shangchen Zhou ⋅ Chen Change Loy
Exhibit Hall I #12
Identity-aware Language Gaussian Splatting for Open-vocabulary 3D Semantic Segmentation Poster Session 5 & Exhibit Hall
SungMin Jang ⋅ Wonjun Kim
Exhibit Hall I #62
MolParser: End-to-end Visual Recognition of Molecule Structures in the Wild Poster Session 5 & Exhibit Hall
Xi Fang ⋅ Jiankun Wang ⋅ Xiaochen Cai ⋅ Shang Chien ⋅ Shuwen Yang ⋅ Haoyi Tao ⋅ Nan wang ⋅ Lin Yao ⋅ Linfeng Zhang ⋅ Guolin Ke
Exhibit Hall I #443
Fine-structure Preserved Real-world Image Super-resolution via Transfer VAE Training Poster Session 3 & Exhibit Hall
Qiaosi Yi ⋅ Shuai Li ⋅ Rongyuan Wu ⋅ Lingchen Sun ⋅ Yuhui WU ⋅ Lei Zhang
Exhibit Hall I #228
Ask and Remember: A Questions-Only Replay Strategy for Continual Visual Question Answering Poster Session 4 & Exhibit Hall with Coffee Break
Imad Eddine MAROUF ⋅ Enzo Tartaglione ⋅ Stéphane Lathuilière ⋅ Joost van de Weijer
Exhibit Hall I #307
Benefit From Seen: Enhancing Open-Vocabulary Object Detection by Bridging Visual and Textual Co-Occurrence Knowledge Poster Session 5 & Exhibit Hall
Yanqi Li ⋅ Jianwei Niu ⋅ Tao Ren
Exhibit Hall I #216
Towards Omnimodal Expressions and Reasoning in Referring Audio-Visual Segmentation Poster Session 5 & Exhibit Hall
Kaining Ying ⋅ Henghui Ding ⋅ Guangquan Jie ⋅ Yu-Gang Jiang
Exhibit Hall I #261
ImageGen-CoT: Enhancing Text-to-Image In-context Learning with Chain-of-Thought Reasoning Poster Session 4 & Exhibit Hall with Coffee Break
Jiaqi Liao ⋅ Zhengyuan Yang ⋅ Linjie Li ⋅ Dianqi Li ⋅ Kevin Lin ⋅ Yu Cheng ⋅ Lijuan Wang
Exhibit Hall I #222
CityGS-X: A Scalable Architecture for Efficient and Geometrically Accurate Large-Scale Scene Reconstruction Poster Session 6 & Exhibit Hall with Coffee Break
Yuanyuan Gao ⋅ Hao Li ⋅ Jiaqi Chen ⋅ Zhihang Zhong ⋅ Zhengyu Zou ⋅ Dingwen Zhang ⋅ Xiao Sun ⋅ Junwei Han
Exhibit Hall I #239
AIRA: Activation-Informed Low-Rank Adaptation for Large Models Poster Session 1 & Exhibit Hall
Lujun Li ⋅ Dezhi Li ⋅ Cheng Lin ⋅ Wei Li ⋅ Wei Xue ⋅ Sirui Han ⋅ Yike Guo
Exhibit Hall I #156
Robust Unfolding Network for HDR Imaging with Modulo Cameras Poster Session 6 & Exhibit Hall with Coffee Break
Zhile Chen ⋅ Hui Ji
Exhibit Hall I #47
Embodied Navigation with Auxiliary Task of Action Description Prediction Poster Session 2 & Exhibit Hall with Coffee Break
Haru Kondoh ⋅ Asako Kanezaki
Exhibit Hall I #188
IAP: Invisible Adversarial Patch Attack through Perceptibility-Aware Localization and Perturbation Optimization Poster Session 3 & Exhibit Hall
Subrat Kishore Dutta ⋅ Xiao Zhang
Exhibit Hall I #448
SweetTok: Semantic-Aware Spatial-Temporal Tokenizer for Compact Video Discretization Poster Session 5 & Exhibit Hall
Zhentao Tan ⋅ Ben Xue ⋅ Jian Jia ⋅ Junhao Wang ⋅ Wencai Ye ⋅ Shaoyun Shi ⋅ Sun Mingjie ⋅ Wenjin Wu ⋅ Quan Chen ⋅ Peng Jiang
Exhibit Hall I #352
Beyond Simple Edits: Composed Video Retrieval with Dense Modifications Poster Session 5 & Exhibit Hall
Omkar Thawakar ⋅ Dmitry Demidov ⋅ Ritesh Thawkar ⋅ Rao Anwer ⋅ Mubarak Shah ⋅ Fahad Khan ⋅ Salman Khan
Exhibit Hall I #59
GRAB: A Challenging GRaph Analysis Benchmark for Large Multimodal Models Poster Session 1 & Exhibit Hall
Jonathan Roberts ⋅ Kai Han ⋅ Samuel Albanie
Exhibit Hall I #146
Att-Adapter: A Robust and Precise Domain-Specific Multi-Attributes T2I Diffusion Adapter via Conditional Variational Autoencoder Poster Session 4 & Exhibit Hall with Coffee Break
Wonwoong Cho ⋅ Yan-Ying Chen ⋅ Matthew Klenk ⋅ David I. Inouye ⋅ Yanxia Zhang
Exhibit Hall I #69
REDUCIO! Generating 1K Video within 16 Seconds using Extremely Compressed Motion Latents Poster Session 4 & Exhibit Hall with Coffee Break
Rui Tian ⋅ Qi Dai ⋅ Jianmin Bao ⋅ Kai Qiu ⋅ Yifan Yang ⋅ Chong Luo ⋅ Zuxuan Wu ⋅ Yu-Gang Jiang
Exhibit Hall I #417
DAP-MAE: Domain-Adaptive Point Cloud Masked Autoencoder for Effective Cross-Domain Learning Poster Session 1 & Exhibit Hall
Ziqi Gao ⋅ Qiufu Li ⋅ Linlin Shen
Exhibit Hall I #324
AllGCD: Leveraging All Unlabeled Data for Generalized Category Discovery Poster Session 1 & Exhibit Hall
Xinzi Cao ⋅ Ke Chen ⋅ Feidiao Yang ⋅ Xiawu Zheng ⋅ Yutong Lu ⋅ Yonghong Tian
Exhibit Hall I #306
Towards Long-Horizon Vision-Language-Action System: Reasoning, Acting and Memory Poster Session 2 & Exhibit Hall with Coffee Break
Daixun Li ⋅ Yusi Zhang ⋅ Mingxiang Cao ⋅ donglai Liu ⋅ Weiying Xie ⋅ Tianlin Hui ⋅ Lunkai Lin ⋅ Zhiqiang Xie ⋅ Yunsong Li
Exhibit Hall I #171
UniFuse: A Unified All-in-One Framework for Multi-Modal Medical Image Fusion Under Diverse Degradations and Misalignments Poster Session 3 & Exhibit Hall
Dayong Su ⋅ Yafei Zhang ⋅ Huafeng Li ⋅ Jinxing Li ⋅ Yu Liu
Exhibit Hall I #399
3DGS-LM: Faster Gaussian-Splatting Optimization with Levenberg-Marquardt Poster Session 6 & Exhibit Hall with Coffee Break
Lukas Höllein ⋅ Aljaz Bozic ⋅ Michael Zollhöfer ⋅ Matthias Nießner
Exhibit Hall I #195
OmniVTON: Training-Free Universal Virtual Try-On Poster Session 4 & Exhibit Hall with Coffee Break
Zhaotong Yang ⋅ Yuhui Li ⋅ Shengfeng He ⋅ Xinzhe Li ⋅ Yangyang Xu ⋅ Junyu Dong ⋅ Yong Du
Exhibit Hall I #174
FreeCus: Free Lunch Subject-driven Customization in Diffusion Transformers Poster Session 4 & Exhibit Hall with Coffee Break
Yanbing Zhang ⋅ Zhe Wang ⋅ Qin Zhou ⋅ Mengping Yang
Exhibit Hall I #59
GLEAM: Learning Generalizable Exploration Policy for Active Mapping in Complex 3D Indoor Scene Poster Session 2 & Exhibit Hall with Coffee Break
Xiao Chen ⋅ Tai Wang ⋅ Quanyi Li ⋅ Tao Huang ⋅ Jiangmiao Pang ⋅ Tianfan Xue
Exhibit Hall I #50
CopyrightShield: Enhancing Diffusion Model Security Against Copyright Infringement Attacks Poster Session 4 & Exhibit Hall with Coffee Break
Zhixiang Guo ⋅ Siyuan Liang ⋅ Aishan Liu ⋅ Dacheng Tao
Exhibit Hall I #434
CA2C: A Prior-Knowledge-Free Approach for Robust Label Noise Learning via Asymmetric Co-learning and Co-training Poster Session 1 & Exhibit Hall
Mengmeng Sheng ⋅ Zeren Sun ⋅ Tianfei Zhou ⋅ Xiangbo Shu ⋅ Jinshan Pan ⋅ Yazhou Yao
Exhibit Hall I #77
TCFG: Truncated Classifier-Free Guidance for Efficient and Scalable Text-to-Image Acceleration Poster Session 4 & Exhibit Hall with Coffee Break
Xiaomeng Fu ⋅ Jia Li
Exhibit Hall I #351
Point Cloud Self-supervised Learning via 3D to Multi-view Masked Learner Poster Session 6 & Exhibit Hall with Coffee Break
Zhimin Chen ⋅ Xuewei Chen ⋅ Xiao Guo ⋅ Yingwei Li ⋅ Longlong Jing ⋅ Liang Yang ⋅ Bing Li
Exhibit Hall I #279
FICGen: Frequency-Inspired Contextual Disentanglement for Layout-driven Degraded Image Generation Poster Session 4 & Exhibit Hall with Coffee Break
Wenzhuang Wang ⋅ Yifan Zhao ⋅ Mingcan Ma ⋅ Ming Liu ⋅ Zhonglin Jiang ⋅ Yong Chen ⋅ Jia Li
Exhibit Hall I #404
MSA2: Multi-task Framework with Structure-aware and Style-adaptive Character Representation for Open-set Chinese Text Recognition Poster Session 5 & Exhibit Hall
Yangfu Li ⋅ Hongjian Zhan ⋅ Qi Liu ⋅ Li Sun ⋅ Yu-Jie Xiong ⋅ Yue Lu
Exhibit Hall I #311
Local Dense Logit Relations for Enhanced Knowledge Distillation Poster Session 1 & Exhibit Hall
Liuchi Xu ⋅ Kang Liu ⋅ Jinshuai Liu ⋅ Lu Wang ⋅ Lisheng XU ⋅ Jun Cheng
Exhibit Hall I #426
FW-Merging: Scaling Model Merging with Frank-Wolfe Optimization Poster Session 1 & Exhibit Hall
Hao Chen ⋅ Shell Xu Hu ⋅ Wayne Luk ⋅ Timothy Hospedales ⋅ Hongxiang Fan
Exhibit Hall I #315
HIS-GPT: Towards 3D Human-In-Scene Multimodal Understanding Poster Session 1 & Exhibit Hall
JIAHE ZHAO ⋅ RuiBing Hou ⋅ zejie tian ⋅ Hong Chang ⋅ Shiguang Shan
Exhibit Hall I #405
VisRL: Intention-Driven Visual Perception via Reinforced Reasoning Poster Session 1 & Exhibit Hall
Zhangquan Chen ⋅ Xufang Luo ⋅ Dongsheng Li
Exhibit Hall I #234
Soft Local Completeness: Rethinking Completeness in XAI Poster Session 5 & Exhibit Hall
Ziv Weiss Haddad ⋅ Oren Barkan ⋅ Yehonatan Elisha ⋅ Noam Koenigstein
Exhibit Hall I #377
Controllable Feature Whitening for Hyperparameter-Free Bias Mitigation Poster Session 1 & Exhibit Hall
Yooshin Cho ⋅ Hanbyel Cho ⋅ Janghyeon Lee ⋅ HyeongGwon Hong ⋅ Jaesung Ahn ⋅ Junmo Kim
Exhibit Hall I #427
UMDATrack: Unified Multi-Domain Adaptive Tracking Under Adverse Weather Conditions Poster Session 2 & Exhibit Hall with Coffee Break
Siyuan Yao ⋅ Rui Zhu ⋅ Ziqi Wang ⋅ Wenqi Ren ⋅ Yanyang Yan ⋅ Xiaochun Cao
Exhibit Hall I #135
KV-Edit: Training-Free Image Editing for Precise Background Preservation Poster Session 4 & Exhibit Hall with Coffee Break
Tianrui Zhu ⋅ Shiyi Zhang ⋅ Jiawei Shao ⋅ Yansong Tang
Exhibit Hall I #165
FusionPhys: A Flexible Framework for Fusing Complementary Sensing Modalities in Remote Physiological Measurement Poster Session 2 & Exhibit Hall with Coffee Break
Chenhang Ying ⋅ Huiyu Yang ⋅ Jieyi Ge ⋅ Zhaodong Sun ⋅ Xu Cheng ⋅ Kui Ren ⋅ Xiaobai Li
Exhibit Hall I #408
You Think, You ACT: The New Task of Arbitrary Text to Motion Generation Poster Session 3 & Exhibit Hall
Runqi Wang ⋅ Caoyuan Ma ⋅ Guopeng Li ⋅ Hanrui Xu ⋅ Yuke Li ⋅ Zheng Wang
Exhibit Hall I #189
DiffVSR: Revealing an Effective Recipe for Taming Robust Video Super-Resolution Against Complex Degradations Poster Session 4 & Exhibit Hall with Coffee Break
Xiaohui Li ⋅ Yihao Liu ⋅ Shuo Cao ⋅ Chen Ziyan ⋅ SHAOBIN ZHUANG ⋅ Xiangyu Chen ⋅ Yinan He ⋅ Yi Wang ⋅ Yu Qiao
Exhibit Hall I #40
End-to-End Multi-Modal Diffusion Mamba Poster Session 5 & Exhibit Hall
Chunhao Lu ⋅ Qiang Lu ⋅ Meichen Dong ⋅ Jake Luo
Exhibit Hall I #68
PanoLlama: Generating Endless and Coherent Panoramas with Next-Token-Prediction LLMs Poster Session 4 & Exhibit Hall with Coffee Break
Teng Zhou ⋅ Xiaoyu Zhang ⋅ Yongchuan Tang
Exhibit Hall I #42
Power of Cooperative Supervision: Multiple Teachers Framework for Advanced 3D Semi-Supervised Object Detection Poster Session 2 & Exhibit Hall with Coffee Break
Jin-Hee Lee ⋅ Jae-keun Lee ⋅ Jeseok Kim ⋅ Kwon Soon
Exhibit Hall I #185
Adapting In-Domain Few-Shot Segmentation to New Domains without Source Domain Retraining Poster Session 5 & Exhibit Hall
Qi Fan ⋅ Kaiqi Liu ⋅ Nian Liu ⋅ Hisham Cholakkal ⋅ Rao Anwer ⋅ Wenbin Li ⋅ Yang Gao
Exhibit Hall I #151
COVTrack: Continuous Open-Vocabulary Tracking via Adaptive Multi-Cue Fusion Poster Session 3 & Exhibit Hall
Zekun Qian ⋅ Ruize Han ⋅ Zhixiang Wang ⋅ Junhui Hou ⋅ Wei Feng
Exhibit Hall I #5
Dense Policy: Bidirectional Autoregressive Learning of Actions Poster Session 3 & Exhibit Hall
Yue Su ⋅ Xinyu Zhan ⋅ Hongjie Fang ⋅ Han Xue ⋅ Hao-Shu Fang ⋅ Yong-Lu Li ⋅ Cewu Lu ⋅ Lixin Yang
Exhibit Hall I #422
monoVLN: Bridging the Observation Gap between Monocular and Panoramic Vision and Language Navigation Poster Session 2 & Exhibit Hall with Coffee Break
Ren-Jie Lu ⋅ Yu Zhou ⋅ hao cheng ⋅ Jingke Meng ⋅ Wei-Shi Zheng
Exhibit Hall I #418
3D Mesh Editing using Masked LRMs Poster Session 2 & Exhibit Hall with Coffee Break
William Gao ⋅ Dilin Wang ⋅ Yuchen Fan ⋅ Aljaz Bozic ⋅ Tuur Stuyck ⋅ Zhengqin Li ⋅ Zhao Dong ⋅ Rakesh Ranjan ⋅ Nikolaos Sarafianos
Exhibit Hall I #200
DOGR: Towards Versatile Visual Document Grounding and Referring Poster Session 1 & Exhibit Hall
Yinan Zhou ⋅ Yuxin Chen ⋅ Haokun Lin ⋅ Yichen Wu ⋅ Shuyu Yang ⋅ Zhongang Qi ⋅ Chen Ma ⋅ Li Zhu
Exhibit Hall I #334
Supervised Exploratory Learning for Long-Tailed Visual Recognition Poster Session 1 & Exhibit Hall
Zhongquan Jian ⋅ Yanhao Chen ⋅ Wangyancheng Wangyancheng ⋅ Junfeng Yao ⋅ Meihong Wang ⋅ Qingqiang Wu
Exhibit Hall I #169
Membership Inference Attacks with False Discovery Rate Control Poster Session 1 & Exhibit Hall
Chenxu Zhao ⋅ Wei Qian ⋅ Aobo Chen ⋅ Mengdi Huai
Exhibit Hall I #106
ProbRes: Probabilistic Jump Diffusion for Open-World Egocentric Activity Recognition Poster Session 3 & Exhibit Hall
Sanjoy Kundu ⋅ Shanmukha Vellamcheti ⋅ Sathyanarayanan Aakur
Exhibit Hall I #389
MMAIF: Multi-task and Multi-degradation All-in-One for Image Fusion with Language Guidance Poster Session 3 & Exhibit Hall
Zihan Cao ⋅ Yu Zhong ⋅ Ziqi Wang ⋅ Liang-Jian Deng
Exhibit Hall I #164
Blind Video Super-Resolution based on Implicit Kernels Poster Session 3 & Exhibit Hall
Qiang Zhu ⋅ Yuxuan Jiang ⋅ Shuyuan Zhu ⋅ Fan Zhang ⋅ David Bull ⋅ Bing Zeng
Exhibit Hall I #91
TimeExpert: An Expert-Guided Video LLM for Video Temporal Grounding Poster Session 5 & Exhibit Hall
Zuhao Yang ⋅ Yingchen Yu ⋅ Yunqing Zhao ⋅ Shijian Lu ⋅ Song Bai
Exhibit Hall I #420
Kestrel: 3D Multimodal LLM for Part-Aware Grounded Description Poster Session 2 & Exhibit Hall with Coffee Break
Mahmoud Ahmed ⋅ Junjie Fei ⋅ Jian Ding ⋅ Eslam Abdelrahman ⋅ Mohamed Elhoseiny
Exhibit Hall I #371
DCHM: Depth-Consistent Human Modeling for Multiview Detection Poster Session 2 & Exhibit Hall with Coffee Break
Jiahao Ma ⋅ Tianyu Wang ⋅ Miaomiao Liu ⋅ David Ahmedt Aristizabal ⋅ Chuong Nguyen
Exhibit Hall I #255
Adversarial Robustness of Discriminative Self-Supervised Learning in Vision Poster Session 1 & Exhibit Hall
Ömer Veysel Çağatan ⋅ Ömer TAL ⋅ M. Emre Gursoy
Exhibit Hall I #210
HPSv3: Towards Wide-Spectrum Human Preference Score Poster Session 4 & Exhibit Hall with Coffee Break
Yuhang Ma ⋅ Keqiang Sun ⋅ Xiaoshi Wu ⋅ Hongsheng Li
Exhibit Hall I #19
Semantic Watermarking Reinvented: Enhancing Robustness and Generation Quality with Fourier Integrity Poster Session 4 & Exhibit Hall with Coffee Break
Sung Ju Lee ⋅ Nam Ik Cho
Exhibit Hall I #370
Dual-Process Image Generation Poster Session 4 & Exhibit Hall with Coffee Break
Grace Luo ⋅ Jonathan Granskog ⋅ Aleksander Holynski ⋅ Trevor Darrell
Exhibit Hall I #295
IntrinsicControlNet: Cross-distribution Image Generation with Real and Unreal Poster Session 6 & Exhibit Hall with Coffee Break
Jiayuan Lu ⋅ Rengan Xie ⋅ Zixuan Xie ⋅ Zhizhen Wu ⋅ Dianbing Xi ⋅ Qi Ye ⋅ Rui Wang ⋅ Hujun Bao ⋅ Yuchi Huo
Exhibit Hall I #251
Disentangling Instance and Scene Contexts for 3D Semantic Scene Completion Poster Session 6 & Exhibit Hall with Coffee Break
Enyu Liu ⋅ En Yu ⋅ Sijia Chen ⋅ Wenbing Tao
Exhibit Hall I #221
TrustMark: Robust Watermarking and Watermark Removal for Arbitrary Resolution Images Poster Session 4 & Exhibit Hall with Coffee Break
Tu Bui ⋅ Shruti Agarwal ⋅ John Collomosse
Exhibit Hall I #358
MeshMamba: State Space Models for Articulated 3D Mesh Generation and Reconstruction Poster Session 2 & Exhibit Hall with Coffee Break
Yusuke Yoshiyasu ⋅ Leyuan Sun ⋅ Ryusuke Sagawa
Exhibit Hall I #144
Domain-aware Category-level Geometry Learning Segmentation for 3D Point Clouds Poster Session 6 & Exhibit Hall with Coffee Break
Pei He ⋅ Lingling Li ⋅ Licheng Jiao ⋅ Ronghua Shang ⋅ Fang Liu ⋅ Shuang Wang ⋅ Xu Liu ⋅ wenping ma
Exhibit Hall I #347
Spatial-Temporal Aware Visuomotor Diffusion Policy Learning Poster Session 2 & Exhibit Hall with Coffee Break
Zhenyang Liu ⋅ Yikai Wang ⋅ Kuanning Wang ⋅ Longfei Liang ⋅ Xiangyang Xue ⋅ Yanwei Fu
Exhibit Hall I #197
GaussianReg: Rapid 2D/3D Registration for Emergency Surgery via Explicit 3D Modeling with Gaussian Primitives Poster Session 5 & Exhibit Hall
Weihao Yu ⋅ Xiaoqing Guo ⋅ Xinyu Liu ⋅ Yifan Liu ⋅ Hao Zheng ⋅ Yawen Huang ⋅ Yixuan Yuan
Exhibit Hall I #158
Learning Robust Image Watermarking with Lossless Cover Recovery Poster Session 4 & Exhibit Hall with Coffee Break
jiale chen ⋅ Wei Wang ⋅ Chongyang Shi ⋅ Li Dong ⋅ Xiping Hu
Exhibit Hall I #16
ArgoTweak: Towards Self-Updating HD Maps through Structured Priors Poster Session 2 & Exhibit Hall with Coffee Break
Lena Wild ⋅ Rafael Valencia ⋅ Patric Jensfelt
Exhibit Hall I #100
Event-aided Dense and Continuous Point Tracking: Everywhere and Anytime Poster Session 2 & Exhibit Hall with Coffee Break
Zhexiong Wan ⋅ Jianqin Luo ⋅ Yuchao Dai ⋅ Gim Hee Lee
Exhibit Hall I #274
Context-Aware Academic Emotion Dataset and Benchmark Poster Session 3 & Exhibit Hall
Luming Zhao ⋅ Jingwen Xuan ⋅ Jiamin Lou ⋅ Yonghui Yu ⋅ Wenwu Yang
Exhibit Hall I #362
FlowSeek: Optical Flow Made Easier with Depth Foundation Models and Motion Bases Poster Session 2 & Exhibit Hall with Coffee Break
Matteo Poggi ⋅ Fabio Tosi
Exhibit Hall I #60
TPG-INR: Target Prior-Guided Implicit 3D CT Reconstruction for Enhanced Sparse-view Imaging Poster Session 6 & Exhibit Hall with Coffee Break
QingleiCao QingleiCao ⋅ Ziyao Tang ⋅ Xiaoqin Tang
Exhibit Hall I #339
NATRA: Noise-Agnostic Framework for Trajectory Prediction with Noisy Observations Poster Session 6 & Exhibit Hall with Coffee Break
Rongqing Li ⋅ Changsheng Li ⋅ Ruilin Lv ⋅ Yuhang Li ⋅ Yang Gao ⋅ Xiaolu Zhang ⋅ JUN ZHOU
Exhibit Hall I #304
MS3D: High-Quality 3D Generation via Multi-Scale Representation Modeling Poster Session 6 & Exhibit Hall with Coffee Break
Guan Luo ⋅ Jianfeng Zhang
Exhibit Hall I #157
General Compression Framework for Efficient Transformer Object Tracking Poster Session 3 & Exhibit Hall
Lingyi Hong ⋅ Jinglun Li ⋅ Xinyu Zhou ⋅ Shilin Yan ⋅ Pinxue Guo ⋅ Kaixun Jiang ⋅ Zhaoyu Chen ⋅ Shuyong Gao ⋅ Runze Li ⋅ Xingdong Sheng ⋅ Wei Zhang ⋅ Hong Lu ⋅ Wenqiang Zhang
Exhibit Hall I #323
UniDxMD: Towards Unified Representation for Cross-Modal Unsupervised Domain Adaptation in 3D Semantic Segmentation Poster Session 5 & Exhibit Hall
Zhengyin Liang ⋅ Hui Yin ⋅ Min Liang ⋅ Qianqian Du ⋅ Ying Yang ⋅ Hua Huang
Exhibit Hall I #51
FedXDS: Leveraging Model Attribution Methods to counteract Data Heterogeneity in Federated Learning Poster Session 1 & Exhibit Hall
Maximilian Hoefler ⋅ Karsten Mueller ⋅ Wojciech Samek
Exhibit Hall I #429
Visual Textualization for Image Prompted Object Detection Poster Session 5 & Exhibit Hall
Yongjian Wu ⋅ Yang Zhou ⋅ Jiya Saiyin ⋅ Bingzheng Wei ⋅ Yan Xu
Exhibit Hall I #104
TerraMind: Large-Scale Generative Multimodality for Earth Observation Poster Session 2 & Exhibit Hall with Coffee Break
Johannes Jakubik ⋅ Felix Yang ⋅ Benedikt Blumenstiel ⋅ Erik Scheurer ⋅ Rocco Sedona ⋅ Stefano Maurogiovanni ⋅ Valerio Marsocci ⋅ Nikolaos Dionelis ⋅ Jente Bosmans ⋅ Niklas Kopp ⋅ Rahul Ramachandran ⋅ Paolo Fraccaro ⋅ Thomas Brunschwiler ⋅ Gabriele Cavallaro ⋅ Juan Moreno ⋅ Nicolas Longépé
Exhibit Hall I #221
LLaVA-SP: Enhancing Visual Representation with Visual Spatial Tokens for MLLMs Poster Session 5 & Exhibit Hall
Haoran Lou ⋅ Chunxiao Fan ⋅ Ziyan Liu ⋅ Yuexin Wu ⋅ Xinliang Wang
Exhibit Hall I #207
Deep Space Weather Model: Long-Range Solar Flare Prediction from Multi-Wavelength Images Poster Session 2 & Exhibit Hall with Coffee Break
Shunya Nagashima ⋅ Komei Sugiura
Exhibit Hall I #411
ZIM: Zero-Shot Image Matting for Anything Poster Session 5 & Exhibit Hall
Beomyoung Kim ⋅ Chanyong Shin ⋅ Joonhyun Jeong ⋅ Hyungsik Jung ⋅ Seyun Lee ⋅ Sewhan Chun ⋅ Dong-Hyun HWANG ⋅ Joonsang Yu
Exhibit Hall I #381
Fusion Meets Diverse Conditions: A High-diversity Benchmark and Baseline for UAV-based Multimodal Object Detection with Condition Cues Poster Session 6 & Exhibit Hall with Coffee Break
Chen Chen ⋅ Kangcheng Bin ⋅ Hu Ting ⋅ Jiahao Qi ⋅ Xingyue Liu ⋅ Tianpeng Liu ⋅ Zhen Liu ⋅ Yongxiang Liu ⋅ Ping Zhong
Exhibit Hall I #312
EmbodiedOcc: Embodied 3D Occupancy Prediction for Vision-based Online Scene Understanding Poster Session 6 & Exhibit Hall with Coffee Break
Yuqi Wu ⋅ Wenzhao Zheng ⋅ Sicheng Zuo ⋅ Yuanhui Huang ⋅ Jie Zhou ⋅ Jiwen Lu
Exhibit Hall I #159
TextSSR: Diffusion-based Data Synthesis for Scene Text Recognition Poster Session 4 & Exhibit Hall with Coffee Break
Xingsong Ye ⋅ Yongkun Du ⋅ Yunbo Tao ⋅ Zhineng Chen
Exhibit Hall I #247
Omegance: A Single Parameter for Various Granularities in Diffusion-Based Synthesis Poster Session 4 & Exhibit Hall with Coffee Break
Xinyu Hou ⋅ Zongsheng Yue ⋅ Xiaoming Li ⋅ Chen Change Loy
Exhibit Hall I #428
DAViD: Data-efficient and Accurate Vision Models from Synthetic Data Poster Session 2 & Exhibit Hall with Coffee Break
Fatemeh Saleh ⋅ Sadegh Aliakbarian ⋅ Charlie Hewitt ⋅ Lohit Petikam ⋅ Xiao Xian ⋅ Antonio Criminisi ⋅ Thomas J. Cashman ⋅ Tadas Baltrusaitis
Exhibit Hall I #31
RareCLIP: Rarity-aware Online Zero-shot Industrial Anomaly Detection Poster Session 5 & Exhibit Hall
Jianfang He ⋅ Min Cao ⋅ Silong Peng ⋅ Qiong Xie
Exhibit Hall I #438
A Visual Leap in CLIP Compositionality Reasoning through Generation of Counterfactual Sets Poster Session 5 & Exhibit Hall
Zexi Jia ⋅ Chuanwei Huang ⋅ Yeshuang Zhu ⋅ Hongyan Fei ⋅ Ying Deng ⋅ Zhiqiang Yuan ⋅ Jiapei Zhang ⋅ Jinchao Zhang ⋅ Jie Zhou
Exhibit Hall I #348
MOSCATO: Predicting Multiple Object State Change Through Actions Poster Session 3 & Exhibit Hall
Parnian Zameni ⋅ Yuhan Shen ⋅ Ehsan Elhamifar
Exhibit Hall I #151
Skip-Vision: Efficient and Scalable Acceleration of Vision-Language Models via Adaptive Token Skipping Poster Session 5 & Exhibit Hall
Weili Zeng ⋅ Ziyuan Huang ⋅ Kaixiang Ji ⋅ Yichao Yan
Exhibit Hall I #147
Temporal Rate Reduction Clustering for Human Motion Segmentation Poster Session 3 & Exhibit Hall
Xianghan Meng ⋅ Zhengyu Tong ⋅ Zhiyuan Huang ⋅ Chun-Guang Li
Exhibit Hall I #437
HFD-Teacher: High-Frequency Depth Distillation from Depth Foundation Models for Enhanced Depth Completion Poster Session 2 & Exhibit Hall with Coffee Break
Zhiyuan Yang ⋅ Anqi Cheng ⋅ Haiyue Zhu ⋅ Tianjiao Li ⋅ Pey Yuen Tao ⋅ Kezhi Mao
Exhibit Hall I #373
LeanVAE: An Ultra-Efficient Reconstruction VAE for Video Diffusion Models Poster Session 4 & Exhibit Hall with Coffee Break
Yu Cheng ⋅ Fajie Yuan
Exhibit Hall I #77
DreamDance: Animating Human Images by Enriching 3D Geometry Cues from 2D Poses Poster Session 3 & Exhibit Hall
Yatian Pang ⋅ Bin Zhu ⋅ Bin Lin ⋅ Mingzhe Zheng ⋅ Francis Tay ⋅ Ser-Nam Lim ⋅ Harry Yang ⋅ Li Yuan
Exhibit Hall I #381
Separation for Better Integration: Disentangling Edge and Motion in Event-based Deblurring Poster Session 3 & Exhibit Hall
Yufei Zhu ⋅ Hao Chen ⋅ Yongjian Deng ⋅ Wei You
Exhibit Hall I #445
AnimeGamer: Infinite Anime Life Simulation with Next Game State Prediction Poster Session 3 & Exhibit Hall
Junhao Cheng ⋅ Yuying Ge ⋅ Yixiao Ge ⋅ Jing Liao ⋅ Ying Shan
Exhibit Hall I #82
OmniPaint: Mastering Object-Oriented Editing via Disentangled Insertion-Removal Inpainting Poster Session 4 & Exhibit Hall with Coffee Break
Yongsheng Yu ⋅ Ziyun Zeng ⋅ Haitian Zheng ⋅ Jiebo Luo
Exhibit Hall I #235
Diversity-Enhanced Distribution Alignment for Dataset Distillation Poster Session 1 & Exhibit Hall
Hongcheng Li ⋅ Yucan Zhou ⋅ Xiaoyan Gu ⋅ Bo Li ⋅ Weiping Wang
Exhibit Hall I #348
Height-Fidelity Dense Global Fusion for Multi-modal 3D Object Detection Poster Session 6 & Exhibit Hall with Coffee Break
Hanshi Wang ⋅ Jin Gao ⋅ Weiming Hu ⋅ Zhipeng Zhang
Exhibit Hall I #188
SMSTracker: Tri-path Score Mask Sigma Fusion for Multi-Modal Tracking Poster Session 1 & Exhibit Hall
Sixian Chan ⋅ Zedong Li ⋅ Xiaoqin Zhang ⋅ Wenhao Li ⋅ Shijian Lu ⋅ Chunhua Shen
Exhibit Hall I #447
Two Losses, One Goal: Balancing Conflict Gradients for Semi-supervised Semantic Segmentation Poster Session 5 & Exhibit Hall
Rui Sun ⋅ Huayu Mai ⋅ Wangkai Li ⋅ Yujia Chen ⋅ Yuan Wang
Exhibit Hall I #52
Acknowledging Focus Ambiguity in Visual Questions Poster Session 1 & Exhibit Hall
Chongyan Chen ⋅ Yu-Yun Tseng ⋅ Zhuoheng Li ⋅ Anush Venkatesh ⋅ Danna Gurari
Exhibit Hall I #107
Guiding Noisy Label Conditional Diffusion Models with Score-based Discriminator Correction Poster Session 4 & Exhibit Hall with Coffee Break
Dat Cong ⋅ Hieu Tran ⋅ Hoang Thanh-Tung
Exhibit Hall I #349
Shape of Motion: 4D Reconstruction from a Single Video Poster Session 2 & Exhibit Hall with Coffee Break
Qianqian Wang ⋅ Vickie Ye ⋅ Hang Gao ⋅ Weijia Zeng ⋅ Jake Austin ⋅ Zhengqi Li ⋅ Angjoo Kanazawa
Exhibit Hall I #435
VSSD: Vision Mamba with Non-Causal State Space Duality Poster Session 3 & Exhibit Hall
Yuheng Shi ⋅ Mingjia Li ⋅ Minjing Dong ⋅ Chang Xu
Exhibit Hall I #77
EditCLIP: Representation Learning for Image Editing Poster Session 4 & Exhibit Hall with Coffee Break
Qian Wang ⋅ Aleksandar Cvejic ⋅ Abdelrahman Eldesokey ⋅ Peter Wonka
Exhibit Hall I #102
CorrCLIP: Reconstructing Patch Correlations in CLIP for Open-Vocabulary Semantic Segmentation Poster Session 6 & Exhibit Hall with Coffee Break
Dengke Zhang ⋅ Fagui Liu ⋅ Quan Tang
Exhibit Hall I #145
mmCooper: A Multi-agent Multi-stage Communication-efficient and Collaboration-robust Cooperative Perception Framework Poster Session 6 & Exhibit Hall with Coffee Break
Bingyi Liu ⋅ Jian Teng ⋅ Hongfei Xue ⋅ Enshu Wang ⋅ Chuanhui Zhu ⋅ Pu Wang ⋅ Libing Wu
Exhibit Hall I #354
FreqPDE: Rethinking Positional Depth Embedding for Multi-View 3D Object Detection Transformers Poster Session 6 & Exhibit Hall with Coffee Break
Junjie Zhang ⋅ Haisheng Su ⋅ Feixiang Song ⋅ Sanping Zhou ⋅ Wei Wu ⋅ Junchi Yan ⋅ Nanning Zheng
Exhibit Hall I #330
GestureLSM: Latent Shortcut based Co-Speech Gesture Generation with Spatial-Temporal Modeling Poster Session 3 & Exhibit Hall
Pinxin Liu ⋅ Luchuan Song ⋅ Junhua Huang ⋅ Haiyang Liu ⋅ Chenliang Xu
Exhibit Hall I #87
OccluGaussian: Occlusion-Aware Gaussian Splatting for Large Scene Reconstruction and Rendering Poster Session 6 & Exhibit Hall with Coffee Break
Shiyong Liu ⋅ Xiao Tang ⋅ Zhihao Li ⋅ Yingfan He ⋅ Chongjie Ye ⋅ Jianzhuang Liu ⋅ Binxiao Huang ⋅ Shunbo Zhou ⋅ Xiaofei Wu
Exhibit Hall I #186
MagShield: Towards Better Robustness in Sparse Inertial Motion Capture Under Magnetic Disturbances Poster Session 6 & Exhibit Hall with Coffee Break
Yunzhe Shao ⋅ Xinyu Yi ⋅ Lu Yin ⋅ Shihui Guo ⋅ Jun-Hai Yong ⋅ Feng Xu
Exhibit Hall I #414
Towards Cross-modal Backward-compatible Representation Learning for Vision-Language Models Poster Session 1 & Exhibit Hall
Young Kyun Jang ⋅ Ser-Nam Lim
Exhibit Hall I #161
ILLUME: Illuminating Your LLMs to See, Draw, and Self-Enhance Poster Session 5 & Exhibit Hall
Chunwei Wang ⋅ Guansong Lu ⋅ Junwei Yang ⋅ Runhui Huang ⋅ Jianhua Han ⋅ Lu Hou ⋅ Wei Zhang ⋅ Hang Xu
Exhibit Hall I #170
DeFSS: Image-to-Mask Denoising Learning for Few-shot Segmentation Poster Session 5 & Exhibit Hall
Zishu Qin ⋅ Junhao Xu ⋅ Weifeng Ge
Exhibit Hall I #227
Unveiling the Invisible: Reasoning Complex Occlusions Amodally with AURA Poster Session 5 & Exhibit Hall
Zhixuan Li ⋅ Hyunse Yoon ⋅ Sanghoon Lee ⋅ Weisi Lin
Exhibit Hall I #199
VehicleMAE: View-asymmetry Mutual Learning for Vehicle Re-identification Pre-training via Masked AutoEncoders Poster Session 1 & Exhibit Hall
Qi Wang ⋅ Zeyu Zhang ⋅ Dong Wang ⋅ Di Gai ⋅ Xin Xiong ⋅ Jiyang Xu ⋅ Ruihua Zhou
Exhibit Hall I #441
SuperEdit: Rectifying and Facilitating Supervision for Instruction-Based Image Editing Poster Session 4 & Exhibit Hall with Coffee Break
Ming Li ⋅ Xin Gu ⋅ Fan Chen ⋅ Xiaoying Xing ⋅ Longyin Wen ⋅ Chen Chen ⋅ Sijie Zhu
Exhibit Hall I #414
MagicCity: Geometry-Aware 3D City Generation from Satellite Imagery with Multi-View Consistency Poster Session 6 & Exhibit Hall with Coffee Break
Xingbo YAO ⋅ xuanmin Wang ⋅ Hao WU ⋅ Chengliang PING ⋅ ZHANG Doudou ⋅ Hui Xiong
Exhibit Hall I #57
RARE: Refine Any Registration of Pairwise Point Clouds via Zero-Shot Learning Poster Session 6 & Exhibit Hall with Coffee Break
Chengyu Zheng ⋅ Honghua Chen ⋅ Jin Huang ⋅ Mingqiang Wei
Exhibit Hall I #177
Multi-scenario Overlapping Text Segmentation with Depth Awareness Poster Session 4 & Exhibit Hall with Coffee Break
Yang Liu ⋅ Xudong Xie ⋅ Yuliang Liu ⋅ Xiang Bai
Exhibit Hall I #246
Dataset Distillation via Vision-Language Category Prototype Poster Session 1 & Exhibit Hall
YAWEN ZOU ⋅ Guang Li ⋅ Duo Su ⋅ Zi Wang ⋅ Jun YU ⋅ Chao Zhang
Exhibit Hall I #271
ConceptSplit: Decoupled Multi-Concept Personalization of Diffusion Models via Token-wise Adaptation and Attention Disentanglement Poster Session 4 & Exhibit Hall with Coffee Break
Habin Lim ⋅ Youngseob Won ⋅ Juwon Seo ⋅ Gyeong-Moon Park
Exhibit Hall I #339
ULTHO: Ultra-Lightweight yet Efficient Hyperparameter Optimization in Deep Reinforcement Learning Poster Session 1 & Exhibit Hall
Mingqi Yuan ⋅ Bo Li ⋅ Xin Jin ⋅ Wenjun Zeng
Exhibit Hall I #241
Backdoor Defense via Enhanced Splitting and Trap Isolation Poster Session 1 & Exhibit Hall
Hongrui Yu ⋅ Lu Qi ⋅ Wanyu Lin ⋅ Jian Chen ⋅ Hailong Sun ⋅ chengbin sun
Exhibit Hall I #152
Learning Hierarchical Line Buffer for Image Processing Poster Session 3 & Exhibit Hall
Jiacheng Li ⋅ Feiran Li ⋅ Daisuke Iso
Exhibit Hall I #106
ATAS: Any-to-Any Self-Distillation for Enhanced Open-Vocabulary Dense Prediction Poster Session 5 & Exhibit Hall
Soonwoo Cha ⋅ Jiwoo Song ⋅ Juan Yeo ⋅ Hyunbin Jin ⋅ Taesup Kim
Exhibit Hall I #55
MUSE-VL: Modeling Unified VLM through Semantic Discrete Encoding Poster Session 5 & Exhibit Hall
Rongchang Xie ⋅ Chen Du ⋅ Ping Song ⋅ Chang Liu
Exhibit Hall I #406
A Plug-and-Play Physical Motion Restoration Approach for In-the-Wild High-Difficulty Motions Poster Session 3 & Exhibit Hall
Youliang Zhang ⋅ Ronghui Li ⋅ Yachao Zhang ⋅ Liang Pan ⋅ Jingbo Wang ⋅ Yebin Liu ⋅ Xiu Li
Exhibit Hall I #310
Humans as Checkerboards: Calibrating Camera Motion Scale for World-Coordinate Human Mesh Recovery Poster Session 2 & Exhibit Hall with Coffee Break
Fengyuan Yang ⋅ Kerui Gu ⋅ Ha Linh Nguyen ⋅ Tze Ho Elden Tse ⋅ Angela Yao
Exhibit Hall I #98
D3: Training-Free AI-Generated Video Detection Using Second-Order Features Poster Session 3 & Exhibit Hall
Chende Zheng ⋅ Ruiqi suo ⋅ Chenhao Lin ⋅ Zhengyu Zhao ⋅ Le Yang ⋅ Shuai Liu ⋅ Minghui Yang ⋅ Cong Wang ⋅ Chao Shen
Exhibit Hall I #268
χ: Symmetry Understanding of 3D Shapes via Chirality Disentanglement Poster Session 6 & Exhibit Hall with Coffee Break
Weikang Wang ⋅ Tobias Weißberg ⋅ Nafie El Amrani ⋅ Florian Bernard
Exhibit Hall I #344
Dirichlet-Constrained Variational Codebook Learning for Temporally Coherent Video Face Restoration Poster Session 3 & Exhibit Hall
Baoyou Chen ⋅ Ce Liu ⋅ Weihao Yuan ⋅ Zilong Dong ⋅ Siyu Zhu
Exhibit Hall I #424
VideoAuteur: Towards Long Narrative Video Generation Poster Session 4 & Exhibit Hall with Coffee Break
Junfei Xiao ⋅ Feng Cheng ⋅ Lu Qi ⋅ Liangke Gui ⋅ Yang Zhao ⋅ Shanchuan Lin ⋅ Jiepeng Cen ⋅ Zhibei Ma ⋅ Alan Yuille ⋅ Lu Jiang
Exhibit Hall I #410
StreamMind: Unlocking Full Frame Rate Streaming Video Dialogue through Event-Gated Cognition Poster Session 3 & Exhibit Hall
Xin Ding ⋅ Hao Wu ⋅ Yifan Yang ⋅ Shiqi Jiang ⋅ Qianxi Zhang ⋅ Donglin Bai ⋅ Zhibo Chen ⋅ Ting Cao
Exhibit Hall I #325
ViT-Split: Unleashing the Power of Vision Foundation Models via Efficient Splitting Heads Poster Session 1 & Exhibit Hall
Yifan Li ⋅ Xin Li ⋅ Tianqin Li ⋅ Wenbin He ⋅ Yu Kong ⋅ Liu Ren
Exhibit Hall I #179
Robust and Efficient 3D Gaussian Splatting for Urban Scene Reconstruction Poster Session 6 & Exhibit Hall with Coffee Break
Zhensheng Yuan ⋅ Haozhi Huang ⋅ Zhen Xiong ⋅ Di Wang ⋅ Guanghua Yang
Exhibit Hall I #143
Neural Architecture Search Driven by Locally Guided Diffusion for Personalized Federated Learning Poster Session 1 & Exhibit Hall
PENG LIAO ⋅ Xilu Wang ⋅ Yaochu Jin ⋅ WenLi Du ⋅ Han Hu
Exhibit Hall I #395
Hierarchical 3D Scene Graphs Construction Outdoors Poster Session 6 & Exhibit Hall with Coffee Break
Jon Nyffeler ⋅ Federico Tombari ⋅ Daniel Barath
Exhibit Hall I #202
Cycle-Consistent Learning for Joint Layout-to-Image Generation and Object Detection Poster Session 2 & Exhibit Hall with Coffee Break
Xinhao Cai ⋅ Qiuxia Lai ⋅ Gensheng Pei ⋅ Xiangbo Shu ⋅ Yazhou Yao ⋅ Wenguan Wang
Exhibit Hall I #167
From Objects to Events: Unlocking Complex Visual Understanding in Object Detectors via LLM-guided Symbolic Reasoning Poster Session 5 & Exhibit Hall
Yuhui Zeng ⋅ Haoxiang Wu ⋅ Wenjie Nie ⋅ Xiawu Zheng ⋅ Guangyao Chen ⋅ Yunhang Shen ⋅ Jun Peng ⋅ Yonghong Tian ⋅ Rongrong Ji
Exhibit Hall I #429
StyleSRN: Scene Text Image Super-Resolution with Text Style Embedding Poster Session 4 & Exhibit Hall with Coffee Break
Shengrong Yuan ⋅ Runmin Wang ⋅ Ke Hao ⋅ Xu-Qi Ma ⋅ Changxin Gao ⋅ Li Liu ⋅ Nong Sang
Exhibit Hall I #364
Frequency-Guided Diffusion for Training-Free Text-Driven Image Translation Poster Session 4 & Exhibit Hall with Coffee Break
Zheng Gao ⋅ Jifei Song ⋅ Zhensong Zhang ⋅ Jiankang Deng ⋅ Ioannis Patras
Exhibit Hall I #413
Preacher: Paper-to-Video Agentic System Poster Session 4 & Exhibit Hall with Coffee Break
Jingwei Liu ⋅ Ling Yang ⋅ Hao Luo ⋅ Fan Wang ⋅ Hongyan Li ⋅ Mengdi Wang
Exhibit Hall I #214
Where am I? Cross-View Geo-localization with Natural Language Descriptions Poster Session 2 & Exhibit Hall with Coffee Break
Junyan Ye ⋅ Honglin Lin ⋅ Leyan Ou ⋅ Dairong Chen ⋅ Zihao Wang ⋅ Qi Zhu ⋅ Conghui He ⋅ Weijia Li
Exhibit Hall I #82
Frequency-Semantic Enhanced Variational Autoencoder for Zero-Shot Skeleton-based Action Recognition Poster Session 3 & Exhibit Hall
Wenhan Wu ⋅ Zhishuai Guo ⋅ Chen Chen ⋅ Hongfei Xue ⋅ Aidong Lu
Exhibit Hall I #105
How Do Multimodal Large Language Models Handle Complex Multimodal Reasoning? Placing Them in An Extensible Escape Game Poster Session 1 & Exhibit Hall
Ziyue Wang ⋅ Yurui Dong ⋅ Fuwen Luo ⋅ Minyuan Ruan ⋅ Zhili Cheng ⋅ Chi Chen ⋅ Peng Li ⋅ Yang Liu
Exhibit Hall I #451
Towards Human-like Virtual Beings: Simulating Human Behavior in 3D Scenes Poster Session 3 & Exhibit Hall
CHEN LIANG ⋅ Wenguan Wang ⋅ Yi Yang
Exhibit Hall I #69
Cross-Category Subjectivity Generalization for Style-Adaptive Sketch Re-ID Poster Session 5 & Exhibit Hall
Zechao Hu ⋅ Zhengwei Yang ⋅ Hao Li ⋅ Zheng Wang ⋅ Yixiong Zou
Exhibit Hall I #267
S3R-GS: Streamlining the Pipeline for Large-Scale Street Scene Reconstruction Poster Session 6 & Exhibit Hall with Coffee Break
Guangting Zheng ⋅ Jiajun Deng ⋅ Xiaomeng Chu ⋅ Yu Yuan ⋅ Houqiang Li ⋅ Yanyong Zhang
Exhibit Hall I #84
The Source Image is the Best Attention for Infrared and Visible Image Fusion Poster Session 3 & Exhibit Hall
Song Wang ⋅ Xie Han ⋅ Liqun Kuang ⋅ Boying Wang ⋅ Zhongyu Chen ⋅ Zherui Qiao ⋅ Fan Yang ⋅ Xiaoxia Liu ⋅ Bingyu Zhang ⋅ Zhixun Wang
Exhibit Hall I #331
WSI-LLaVA: A Multimodal Large Language Model for Whole Slide Image Poster Session 5 & Exhibit Hall
Yuci Liang ⋅ Xinheng Lyu ⋅ Meidan Ding ⋅ Wenting Chen ⋅ Xiaohan Xing ⋅ Jipeng Zhang ⋅ Sen Yang ⋅ Xiangjian He ⋅ Song Wu ⋅ Xiyue Wang ⋅ Linlin Shen
Exhibit Hall I #274
Exploiting Diffusion Prior for Task-driven Image Restoration Poster Session 3 & Exhibit Hall
Jaeha Kim ⋅ Junghun Oh ⋅ Kyoung Mu Lee
Exhibit Hall I #14
Video2BEV: Transforming Drone Videos to BEVs for Video-based Geo-localization Poster Session 6 & Exhibit Hall with Coffee Break
Hao Ju ⋅ Shaofei Huang ⋅ Si Liu ⋅ Zhedong Zheng
Exhibit Hall I #228
CLIPer: Hierarchically Improving Spatial Representation of CLIP for Open-Vocabulary Semantic Segmentation Poster Session 5 & Exhibit Hall
Lin Sun ⋅ Jiale Cao ⋅ Jin Xie ⋅ Xiaoheng Jiang ⋅ Yanwei Pang
Exhibit Hall I #321
Adaptive Articulated Object Manipulation On The Fly with Foundation Model Reasoning and Part Grounding Poster Session 3 & Exhibit Hall
Xiaojie Zhang ⋅ Yuanfei Wang ⋅ Ruihai Wu ⋅ Kunqi Xu ⋅ Yu Li ⋅ Liuyu Xiang ⋅ Hao Dong ⋅ Zhaofeng He
Exhibit Hall I #285
Scaling Laws for Native Multimodal Models Poster Session 1 & Exhibit Hall
Mustafa Shukor ⋅ Enrico Fini ⋅ Victor Guilherme Turrisi da Costa ⋅ Matthieu Cord ⋅ Joshua Susskind ⋅ Alaaeldin El-Nouby
Exhibit Hall I #227
Unlearning the Noisy Correspondence Makes CLIP More Robust Poster Session 1 & Exhibit Hall
Haochen Han ⋅ Alex Jinpeng Wang ⋅ Peijun Ye ⋅ Fangming Liu
Exhibit Hall I #424
KDA: Knowledge Diffusion Alignment with Enhanced Context for Video Temporal Grounding Poster Session 5 & Exhibit Hall
Ran Ran ⋅ Jiwei Wei ⋅ Shiyuan He ⋅ Zeyu Ma ⋅ Chaoning Zhang ⋅ Ning Xie ⋅ Yang Yang
Exhibit Hall I #331
VisNumBench: Evaluating Number Sense of Multimodal Large Language Models Poster Session 1 & Exhibit Hall
Tengjin Weng ⋅ Jingyi Wang ⋅ Wenhao Jiang ⋅ Zhong Ming
Exhibit Hall I #356
STEP-DETR: Advancing DETR-based Semi-Supervised Object Detection with Super Teacher and Pseudo-Label Guided Text Queries Poster Session 1 & Exhibit Hall
Tahira Shehzadi ⋅ Khurram Azeem Hashmi ⋅ Shalini Sarode ⋅ Didier Stricker ⋅ Muhammad Zeshan Afzal
Exhibit Hall I #283
MPG-SAM 2: Adapting SAM 2 with Mask Priors and Global Context for Referring Video Object Segmentation Poster Session 5 & Exhibit Hall
Fu Rong ⋅ Meng Lan ⋅ Qian Zhang ⋅ Lefei Zhang
Exhibit Hall I #392
VoxelKP: A Voxel-based Network Architecture for Human Keypoint Estimation in LiDAR Data Poster Session 6 & Exhibit Hall with Coffee Break
Jian Shi ⋅ Peter Wonka
Exhibit Hall I #343
``Principal Components" Enable A New Language of Images Poster Session 4 & Exhibit Hall with Coffee Break
Xin Wen ⋅ Bingchen Zhao ⋅ Ismail Elezi ⋅ Jiankang Deng ⋅ Xiaojuan Qi
Exhibit Hall I #168
VAFlow: Video-to-Audio Generation with Cross-Modality Flow Matching Poster Session 3 & Exhibit Hall
Xihua Wang ⋅ Xin Cheng ⋅ Yuyue Wang ⋅ Ruihua Song ⋅ Yunfeng Wang
Exhibit Hall I #167
Seeing the Trees for the Forest: Rethinking Weakly-Supervised Medical Visual Grounding Poster Session 5 & Exhibit Hall
Huy Ta ⋅ Duy Anh Huynh ⋅ Yutong Xie ⋅ Yuankai Qi ⋅ Qi Chen ⋅ Phi Le Nguyen ⋅ Sen Tran ⋅ Son Lam Phung ⋅ Anton Hengel ⋅ Zhibin Liao ⋅ Minh-Son To ⋅ Johan Verjans ⋅ Vu Phan
Exhibit Hall I #435
Beyond Blur: A Fluid Perspective on Generative Diffusion Models Poster Session 4 & Exhibit Hall with Coffee Break
Grzegorz Gruszczynski ⋅ Jakub Meixner ⋅ Michał Włodarczyk ⋅ Przemyslaw Musialski
Exhibit Hall I #280
Revisiting Adversarial Patch Defenses on Object Detectors: Unified Evaluation, Large-Scale Dataset, and New Insights Poster Session 5 & Exhibit Hall
Junhao Zheng ⋅ Jiahao Sun ⋅ Chenhao Lin ⋅ Zhengyu Zhao ⋅ Chen Ma ⋅ Chong Zhang ⋅ Cong Wang ⋅ Qian Wang ⋅ Chao Shen
Exhibit Hall I #346
Structure Matters: Revisiting Boundary Refinement in Video Object Segmentation Poster Session 3 & Exhibit Hall
Guanyi Qin ⋅ Ziyue Wang ⋅ Daiyun Shen ⋅ Haofeng Liu ⋅ Hantao Zhou ⋅ Junde Wu ⋅ Runze Hu ⋅ Yueming Jin
Exhibit Hall I #417
A View-consistent Sampling Method for Regularized Training of Neural Radiance Fields Poster Session 6 & Exhibit Hall with Coffee Break
Aoxiang Fan ⋅ Corentin Dumery ⋅ Nicolas Talabot ⋅ Pascal Fua
Exhibit Hall I #118
GGTalker: Talking Head Systhesis with Generalizable Gaussian Priors and Identity-Specific Adaptation Poster Session 3 & Exhibit Hall
Wentao Hu ⋅ Shunkai Li ⋅ Ziqiao Peng ⋅ Haoxian Zhang ⋅ Fan Shi ⋅ Xiaoqiang Liu ⋅ Pengfei Wan ⋅ Di ZHANG ⋅ Hui Tian
Exhibit Hall I #10
MonoFusion: Sparse-View 4D Reconstruction via Monocular Fusion Poster Session 2 & Exhibit Hall with Coffee Break
Zihan Wang ⋅ Jeff Tan ⋅ Tarasha Khurana ⋅ Neehar Peri ⋅ Deva Ramanan
Exhibit Hall I #305
One Last Attention for Your Vision-Language Model Poster Session 1 & Exhibit Hall
Liang Chen ⋅ Ghazi Shazan Ahmad ⋅ Tianjun Yao ⋅ Lingqiao Liu ⋅ Zhiqiang Shen
Exhibit Hall I #129
Zeroth-Order Fine-Tuning of LLMs in Random Subspaces Poster Session 1 & Exhibit Hall
Ziming Yu ⋅ Pan Zhou ⋅ Sike Wang ⋅ Jia Li ⋅ Mi Tian ⋅ Hua Huang
Exhibit Hall I #420
IFAdapter: Instance Feature Control for Grounded Text-to-Image Generation Poster Session 4 & Exhibit Hall with Coffee Break
YINWEI WU ⋅ Xianpan Zhou ⋅ bing ma ⋅ Xuefeng Su ⋅ Kai Ma ⋅ Xinchao Wang
Exhibit Hall I #101
Towards Scalable Spatial Intelligence via 2D-to-3D Data Lifting Poster Session 1 & Exhibit Hall
Xingyu Miao ⋅ Haoran Duan ⋅ Quanhao Qian ⋅ Jiuniu Wang ⋅ Yang Long ⋅ Ling Shao ⋅ Deli Zhao ⋅ Ran Xu ⋅ Gongjie Zhang
Exhibit Hall I #81
Balancing Conservatism and Aggressiveness: Prototype-Affinity Hybrid Network for Few-Shot Segmentation Poster Session 5 & Exhibit Hall
Tianyu Zou ⋅ Shengwu Xiong ⋅ Ruilin Yao ⋅ Yi Rong
Exhibit Hall I #71
EYE3:Turn Anything into Naked-eye 3D Poster Session 6 & Exhibit Hall with Coffee Break
Yingde Song ⋅ Zongyuan Yang ⋅ Baolin Liu ⋅ yongping xiong ⋅ Sai Chen ⋅ Lan Yi ⋅ Zhaohe Zhang ⋅ Xunbo Yu
Exhibit Hall I #303
C2MIL: Synchronizing Semantic and Topological Causalities in Multiple Instance Learning for Robust and Interpretable Survival Analysis Poster Session 5 & Exhibit Hall
Min Cen ⋅ Zhenfeng Zhuang ⋅ Yuzhe Zhang ⋅ Min Zeng ⋅ Baptiste Magnier ⋅ Lequan Yu ⋅ Hong Zhang ⋅ Liansheng Wang
Exhibit Hall I #430
CVPT: Cross Visual Prompt Tuning Poster Session 1 & Exhibit Hall
Lingyun Huang ⋅ Jianxu Mao ⋅ Junfei YI ⋅ Ziming Tao ⋅ Yaonan Wang
Exhibit Hall I #70
Enhancing Partially Relevant Video Retrieval with Hyperbolic Learning Poster Session 5 & Exhibit Hall
Jun Li ⋅ Jinpeng Wang ⋅ Chaolei Tan ⋅ Niu Lian ⋅ Long Chen ⋅ Yaowei Wang ⋅ Min zhang ⋅ Shu-Tao Xia ⋅ Bin Chen
Exhibit Hall I #309
MobileIE: An Extremely Lightweight and Effective ConvNet for Real-Time Image Enhancement on Mobile Devices Poster Session 5 & Exhibit Hall
HAILONG YAN ⋅ Ao Li ⋅ Xiangtao Zhang ⋅ Zhe Liu ⋅ Zenglin Shi ⋅ Ce Zhu ⋅ Le Zhang
Exhibit Hall I #201
Pi-GPS: Enhancing Geometry Problem Solving by Unleashing the Power of Diagrammatic Information Poster Session 1 & Exhibit Hall
Junbo Zhao ⋅ Ting Zhang ⋅ Jiayu Sun ⋅ Mi Tian ⋅ Hua Huang
Exhibit Hall I #135
FixTalk: Taming Identity Leakage for High-Quality Talking Head Generation in Extreme Cases Poster Session 1 & Exhibit Hall
Shuai Tan ⋅ Bill Gong ⋅ Bin Ji ⋅ Ye Pan
Exhibit Hall I #302
Serialization based Point Cloud Oversegmentation Poster Session 6 & Exhibit Hall with Coffee Break
chenghui Lu ⋅ Dilong Li ⋅ Jianlong Kwan ⋅ Ziyi Chen ⋅ Haiyan Guan
Exhibit Hall I #106
NeurOp-Diff: Continuous Remote Sensing Image Super-Resolution via Neural Operator Diffusion
Zihao Xu ⋅ Yuzhi Tang ⋅ Bowen Xu ⋅ Qingquan Li
#235
Di[M]O: Distilling Masked Diffusion Models into One-step Generator Poster Session 4 & Exhibit Hall with Coffee Break
Yuanzhi Zhu ⋅ Xi WANG ⋅ Stéphane Lathuilière ⋅ Vicky Kalogeiton
Exhibit Hall I #356
Reinforcement Learning-Guided Data Selection via Redundancy Assessment Poster Session 1 & Exhibit Hall
Suorong Yang ⋅ Peijia Li ⋅ Furao Shen ⋅ Jian Zhao
Exhibit Hall I #86
Φ-GAN:Physics-Inspired GAN for Generating SAR Images Under Limited Data Poster Session 6 & Exhibit Hall with Coffee Break
Xidan Zhang ⋅ Yihan Zhuang ⋅ Qian Guo ⋅ Haodong Yang ⋅ Xuelin Qian ⋅ Gong Cheng ⋅ Junwei Han ⋅ Zhongling Huang
Exhibit Hall I #419
One Perturbation is Enough: On Generating Universal Adversarial Perturbations against Vision-Language Pre-training Models Poster Session 1 & Exhibit Hall
Hao Fang ⋅ Jiawei Kong ⋅ Wenbo Yu ⋅ Bin Chen ⋅ Jiawei Li ⋅ Hao Wu ⋅ Shu-Tao Xia ⋅ Ke Xu
Exhibit Hall I #383
Recognizing Actions from Robotic View for Natural Human-Robot Interaction Poster Session 3 & Exhibit Hall
Ziyi Wang ⋅ Peiming Li ⋅ Hong Liu ⋅ Zhichao Deng ⋅ Can Wang ⋅ Jun Liu ⋅ Junsong Yuan ⋅ Mengyuan Liu
Exhibit Hall I #397
Addressing Text Embedding Leakage in Diffusion-based Image Editing Poster Session 4 & Exhibit Hall with Coffee Break
Sunung Mun ⋅ Jinhwan Nam ⋅ Sunghyun Cho ⋅ Jungseul Ok
Exhibit Hall I #148
Robin3D: Improving 3D Large Language Model via Robust Instruction Tuning Poster Session 1 & Exhibit Hall
Weitai Kang ⋅ Haifeng Huang ⋅ Yuzhang Shang ⋅ Mubarak Shah ⋅ Yan Yan
Exhibit Hall I #363
DDB: Diffusion Driven Balancing to Address Spurious Correlations Poster Session 4 & Exhibit Hall with Coffee Break
Aryan Yazdan Parast ⋅ Basim Azam ⋅ Naveed Akhtar
Exhibit Hall I #253
TurboVSR: Fantastic Video Upscalers and Where to Find Them Poster Session 4 & Exhibit Hall with Coffee Break
Zhongdao Wang ⋅ Guodongfang Zhao ⋅ Jingjing Ren ⋅ bailan feng ⋅ Shifeng Zhang ⋅ Wenbo Li
Exhibit Hall I #312
CoralSRT: Revisiting Coral Reef Semantic Segmentation by Feature Rectifying via Self-supervised Guidance Poster Session 5 & Exhibit Hall
Zheng Ziqiang ⋅ Wong Kwan ⋅ Binh-Son Hua ⋅ Jianbo Shi ⋅ Sai-Kit Yeung
Exhibit Hall I #16
Learning Dense Feature Matching via Lifting Single 2D Image to 3D Space Poster Session 2 & Exhibit Hall with Coffee Break
Yingping Liang ⋅ Yutao Hu ⋅ Wenqi Shao ⋅ Ying Fu
Exhibit Hall I #152
Diagnosing Pretrained Models for Out-of-distribution Detection Poster Session 1 & Exhibit Hall
Haipeng Xiong ⋅ Kai Xu ⋅ Angela Yao
Exhibit Hall I #166
Seeing 3D Through 2D Lenses: 3D Few-Shot Class-Incremental Learning via Cross-Modal Geometric Rectification Poster Session 2 & Exhibit Hall with Coffee Break
Tuo Xiang ⋅ Xuemiao Xu ⋅ Bangzhen Liu ⋅ Jinyi Li ⋅ Yong Li ⋅ Shengfeng He
Exhibit Hall I #164
CHORDS: Diffusion Sampling Accelerator with Multi-core Hierarchical ODE Solvers Poster Session 4 & Exhibit Hall with Coffee Break
Jiaqi Han ⋅ Haotian Ye ⋅ Puheng Li ⋅ Minkai Xu ⋅ James Zou ⋅ Stefano Ermon
Exhibit Hall I #431
RayGaussX: Accelerating Gaussian-Based Ray Marching for Real-Time and High-Quality Novel View Synthesis Poster Session 6 & Exhibit Hall with Coffee Break
Hugo Blanc ⋅ Jean-Emmanuel Deschaud ⋅ Alexis Paljic
Exhibit Hall I #275
Adversarial Training for Probabilistic Robustness Poster Session 1 & Exhibit Hall
YI ZHANG ⋅ Yuhang Chen ⋅ Zhen Chen ⋅ Wenjie Ruan ⋅ Xiaowei Huang ⋅ Siddartha Khastgir ⋅ Xingyu Zhao
Exhibit Hall I #149
Learning to See Inside Opaque Liquid Containers using Speckle Vibrometry Poster Session 2 & Exhibit Hall with Coffee Break
Matan Kichler ⋅ Shai Bagon ⋅ Mark Sheinin
Exhibit Hall I #417
Scaling Omni-modal Pretraining with Multimodal Context: Advancing Universal Representation Learning Across Modalities Poster Session 1 & Exhibit Hall
Yiyuan Zhang ⋅ Handong Li ⋅ Jing Liu ⋅ Xiangyu Yue
Exhibit Hall I #117
LightBSR: Towards Lightweight Blind Super-Resolution via Discriminative Implicit Degradation Representation Learning Poster Session 3 & Exhibit Hall
Jiang Yuan ⋅ ji ma ⋅ Bo Wang ⋅ Guanzhou Ke ⋅ Weiming Hu
Exhibit Hall I #181
INSTINCT: Instance-Level Interaction Architecture for Query-Based Collaborative Perception Poster Session 6 & Exhibit Hall with Coffee Break
yunjiang xu ⋅ Yupeng Ouyang ⋅ Lingzhi Li ⋅ Jin Wang ⋅ Benyuan Yang
Exhibit Hall I #70
SPD: Shallow Backdoor Protecting Deep Backdoor Against Backdoor Detection Poster Session 1 & Exhibit Hall
Shunjie Yuan ⋅ Xinghua Li ⋅ Xuelin Cao ⋅ Haiyan Zhang ⋅ Mengyao Zhu ⋅ Robert Deng
Exhibit Hall I #375
Efficient Multi-Person Motion Prediction by Lightweight Spatial and Temporal Interactions Poster Session 3 & Exhibit Hall
Yuanhong Zheng ⋅ Ruixuan Yu ⋅ Jian Sun
Exhibit Hall I #79
VISION-XL: High Definition Video Inverse Problem Solver using Latent Image Diffusion Models Poster Session 3 & Exhibit Hall
Taesung Kwon ⋅ Jong Ye
Exhibit Hall I #42
Rethinking DPO-style Diffusion Aligning Frameworks Poster Session 4 & Exhibit Hall with Coffee Break
XUN WU ⋅ Shaohan Huang ⋅ Lingjie Jiang ⋅ Furu Wei
Exhibit Hall I #304
Debiased Curriculum Adaptation for Safe Transfer Learning in Chest X-ray Classification Poster Session 5 & Exhibit Hall
Mingyang Liu ⋅ Xinyang Chen ⋅ Yang Shu ⋅ Xiucheng Li ⋅ Weili Guan ⋅ Liqiang Nie
Exhibit Hall I #264
PHATNet: A Physics-guided Haze Transfer Network for Domain-adaptive Real-world Image Dehazing Poster Session 2 & Exhibit Hall with Coffee Break
Fu-Jen Tsai ⋅ Yan-Tsung Peng ⋅ Yen-Yu Lin ⋅ Chia-Wen Lin
Exhibit Hall I #53
End-to-End Entity-Predicate Association Reasoning for Dynamic Scene Graph Generation Poster Session 4 & Exhibit Hall with Coffee Break
LiWei Wang ⋅ YanDuo Zhang ⋅ Tao Lu ⋅ Fang Liu ⋅ Huiqin Zhang ⋅ Jiayi Ma ⋅ Huabing Zhou
Exhibit Hall I #272
Breaking the Encoder Barrier for Seamless Video-Language Understanding Poster Session 5 & Exhibit Hall
Handong Li ⋅ Yiyuan Zhang ⋅ Longteng Guo ⋅ Xiangyu Yue ⋅ Jing Liu
Exhibit Hall I #318
CapeLLM: Support-Free Category-Agnostic Pose Estimation with Multimodal Large Language Models Poster Session 5 & Exhibit Hall
Junho Kim ⋅ Hyungjin Chung ⋅ Byung-Hoon Kim
Exhibit Hall I #290
GenFlowRL: Shaping Rewards with Generative Object-Centric Flow in Visual Reinforcement Learning Poster Session 3 & Exhibit Hall
Kelin Yu ⋅ Sheng Zhang ⋅ Harshit Soora ⋅ Furong Huang ⋅ Heng Huang ⋅ Pratap Tokekar ⋅ Ruohan Gao
Exhibit Hall I #299
Perceiving and Acting in First-Person: A Dataset and Benchmark for Egocentric Human-Object-Human Interactions Poster Session 3 & Exhibit Hall
Liang Xu ⋅ Chengqun Yang ⋅ Zili Lin ⋅ Fei Xu ⋅ Yifan Liu ⋅ Congsheng Xu ⋅ Yiyi Zhang ⋅ Jie Qin ⋅ Xingdong Sheng ⋅ Yunhui Liu ⋅ Xin Jin ⋅ Yichao Yan ⋅ Wenjun Zeng ⋅ Xiaokang Yang
Exhibit Hall I #239
Leveraging Panoptic Scene Graph for Evaluating Fine-Grained Text-to-Image Generation Poster Session 4 & Exhibit Hall with Coffee Break
Xueqing Deng ⋅ Linjie Yang ⋅ Qihang Yu ⋅ Chenglin Yang ⋅ Liang-Chieh (Jay) Chen
Exhibit Hall I #21
VisHall3D: Monocular Semantic Scene Completion from Reconstructing the Visible Regions to Hallucinating the Invisible Regions Poster Session 6 & Exhibit Hall with Coffee Break
Haoang Lu ⋅ Yuanqi Su ⋅ Xiaoning Zhang ⋅ Longjun Gao ⋅ Yu Xue ⋅ Le Wang
Exhibit Hall I #382
InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity Poster Session 3 & Exhibit Hall
Liming Jiang ⋅ Qing Yan ⋅ Yumin Jia ⋅ Zichuan Liu ⋅ Hao Kang ⋅ Xin Lu
Exhibit Hall I #84
Hierarchical Variational Test-Time Prompt Generation for Zero-Shot Generalization Poster Session 1 & Exhibit Hall
Zhaoyang Wu ⋅ Fang Liu ⋅ Licheng Jiao ⋅ Shuo Li ⋅ Lingling Li ⋅ Xu Liu ⋅ Puhua Chen ⋅ wenping ma
Exhibit Hall I #211
GaSLight: Gaussian Splats for Spatially-Varying Lighting in HDR Poster Session 6 & Exhibit Hall with Coffee Break
Christophe Bolduc ⋅ Yannick Hold-Geoffroy ⋅ Jean-Francois Lalonde
Exhibit Hall I #423
GUAVA: Generalizable Upper Body 3D Gaussian Avatar Poster Session 3 & Exhibit Hall
Dongbin Zhang ⋅ Yunfei Liu ⋅ Lijian Lin ⋅ Ye Zhu ⋅ Yang Li ⋅ Minghan Qin ⋅ Yu Li ⋅ Haoqian Wang
Exhibit Hall I #396
CO2-Net: A Physics-Informed Spatio-Temporal Model for Global Surface CO2 Reconstruction Poster Session 2 & Exhibit Hall with Coffee Break
Hao Zheng ⋅ Yuting Zheng ⋅ Hanbo Huang ⋅ Chaofan Sun ⋅ Enhui Liao ⋅ Lin Liu ⋅ Yi Han ⋅ Hao Zhou ⋅ Shiyu Liang
Exhibit Hall I #112
PoseAnchor: Robust Root Position Estimation for 3D Human Pose Estimation Poster Session 2 & Exhibit Hall with Coffee Break
Jun-Hee Kim ⋅ Jumin Han ⋅ Seong-Whan Lee
Exhibit Hall I #193
Semantic Alignment and Reinforcement for Data-Free Quantization of Vision Transformers Poster Session 3 & Exhibit Hall
Yunshan Zhong ⋅ Yuyao Zhou ⋅ Yuxin Zhang ⋅ Wanchen Sui ⋅ Shen Li ⋅ Yong Li ⋅ Fei Chao ⋅ Rongrong Ji
Exhibit Hall I #234
GS-LIVM: Real-Time Photo-Realistic LiDAR-Inertial-Visual Mapping with Gaussian Splatting Poster Session 6 & Exhibit Hall with Coffee Break
Yusen XIE ⋅ Zhenmin Huang ⋅ Jin Wu ⋅ Jun Ma
Exhibit Hall I #207
Salvaging the Overlooked: Leveraging Class-Aware Contrastive Learning for Multi-Class Anomaly Detection Poster Session 5 & Exhibit Hall
Lei Fan ⋅ Junjie Huang ⋅ Donglin Di ⋅ Anyang Su ⋅ Tianyou Song ⋅ Maurice Pagnucco ⋅ Yang Song
Exhibit Hall I #150
Boosting Multimodal Learning via Disentangled Gradient Learning Poster Session 5 & Exhibit Hall
Shicai Wei ⋅ Chunbo Luo ⋅ Yang Luo
Exhibit Hall I #289
Task Vector Quantization for Memory-Efficient Model Merging Poster Session 5 & Exhibit Hall
Youngeun Kim ⋅ Seunghwan Lee ⋅ Aecheon Jung ⋅ Bogon Ryu ⋅ Sungeun Hong
Exhibit Hall I #29
Weakly Supervised Visible-Infrared Person Re-Identification via Heterogeneous Expert Collaborative Consistency Learning Poster Session 3 & Exhibit Hall
Yafei Zhang ⋅ Lingqi Kong ⋅ Huafeng Li ⋅ Jie Wen
Exhibit Hall I #250
SurfaceSplat: Connecting Surface Reconstruction and Gaussian Splatting Poster Session 6 & Exhibit Hall with Coffee Break
Zihui Gao ⋅ Jia-Wang Bian ⋅ Guosheng Lin ⋅ Hao Chen ⋅ Chunhua Shen
Exhibit Hall I #368
The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single Transformer Poster Session 5 & Exhibit Hall
Weixian Lei ⋅ Jiacong Wang ⋅ Haochen Wang ⋅ Xiangtai Li ⋅ Jun Hao Liew ⋅ Jiashi Feng ⋅ Zilong Huang
Exhibit Hall I #91
CaliMatch: Adaptive Calibration for Improving Safe Semi-supervised Learning Poster Session 1 & Exhibit Hall
Jinsoo Bae ⋅ Seoung Bum Kim ⋅ Hyungrok Do
Exhibit Hall I #264
Amodal3R: Amodal 3D Reconstruction from Occluded 2D Images Poster Session 2 & Exhibit Hall with Coffee Break
Tianhao Wu ⋅ Chuanxia Zheng ⋅ Frank Guan ⋅ Andrea Vedaldi ⋅ Tat-Jen Cham
Exhibit Hall I #392
DC-AR: Efficient Masked Autoregressive Image Generation with Deep Compression Hybrid Tokenizer Poster Session 4 & Exhibit Hall with Coffee Break
Yecheng Wu ⋅ Han Cai ⋅ Junyu Chen ⋅ Zhuoyang Zhang ⋅ Enze Xie ⋅ Jincheng YU ⋅ Junsong Chen ⋅ Jinyi Hu ⋅ Yao Lu ⋅ Song Han
Exhibit Hall I #301
Language Decoupling with Fine-grained Knowledge Guidance for Referring Multi-object Tracking Poster Session 5 & Exhibit Hall
guangyao Li ⋅ Siping Zhuang ⋅ Yajun Jian ⋅ Yan Yan ⋅ Hanzi Wang
Exhibit Hall I #360
Neural Multi-View Self-Calibrated Photometric Stereo without Photometric Stereo Cues Poster Session 6 & Exhibit Hall with Coffee Break
Xu Cao ⋅ Takafumi Taketomi
Exhibit Hall I #273
Reminiscence Attack on Residuals: Exploiting Approximate Machine Unlearning for Privacy Poster Session 1 & Exhibit Hall
Yaxin Xiao ⋅ Qingqing Ye ⋅ Li Hu ⋅ Huadi Zheng ⋅ Haibo Hu ⋅ Zi Liang ⋅ Haoyang LI ⋅ JIAOYIJIE JIAOYIJIE
Exhibit Hall I #282
RayPose: Ray Bundling Diffusion for Template Views in Unseen 6D Object Pose Estimation Poster Session 2 & Exhibit Hall with Coffee Break
Junwen Huang ⋅ Shishir Reddy Vutukur ⋅ Peter Yu ⋅ Nassir Navab ⋅ Slobodan Ilic ⋅ Benjamin Busam
Exhibit Hall I #385
Hydra-NeXt: Robust Closed-Loop Driving with Open-Loop Training Poster Session 6 & Exhibit Hall with Coffee Break
Zhenxin Li ⋅ Shihao Wang ⋅ Shiyi Lan ⋅ Zhiding Yu ⋅ Zuxuan Wu ⋅ Jose M. Alvarez
Exhibit Hall I #250
CanFields: Consolidating Diffeomorphic Flows for Non-Rigid 4D Interpolation from Arbitrary-Length Sequences Poster Session 6 & Exhibit Hall with Coffee Break
Miaowei Wang ⋅ Changjian Li ⋅ Amir Vaxman
Exhibit Hall I #374
QR-LoRA: Efficient and Disentangled Fine-tuning via QR Decomposition for Customized Generation Poster Session 4 & Exhibit Hall with Coffee Break
Jiahui Yang ⋅ Yongjia Ma ⋅ Donglin Di ⋅ Hao Li ⋅ Chen Wei ⋅ Xie Yan ⋅ Jianxun Cui ⋅ Xun Yang ⋅ Wangmeng Zuo
Exhibit Hall I #259
X2I: Seamless Integration of Multimodal Understanding into Diffusion Transformer via Attention Distillation Poster Session 4 & Exhibit Hall with Coffee Break
jian ma ⋅ Qirong Peng ⋅ Xu Guo ⋅ Chen Chen ⋅ Haonan Lu ⋅ Zhenyu Yang
Exhibit Hall I #177
WaveMamba: Wavelet-Driven Mamba Fusion for RGB-Infrared Object Detection Poster Session 3 & Exhibit Hall
Haodong Zhu ⋅ Wenhao Dong ⋅ Linlin Yang ⋅ Hong Li ⋅ Yuguang Yang ⋅ Yangyang Ren ⋅ Qingcheng Zhu ⋅ Zichao Feng ⋅ CHANGBI LI ⋅ Shaohui Lin ⋅ Runqi Wang ⋅ Xiaoyan Luo ⋅ Baochang Zhang
Exhibit Hall I #114
Backdooring Self-Supervised Contrastive Learning by Noisy Alignment Poster Session 1 & Exhibit Hall
Tuo Chen ⋅ Jie Gui ⋅ Minjing Dong ⋅ Ju Jia ⋅ Lanting Fang ⋅ Jian liu
Exhibit Hall I #342
CounterPC: Counterfactual Feature Realignment for Unsupervised Domain Adaptation on Point Clouds Poster Session 6 & Exhibit Hall with Coffee Break
Feng Yang ⋅ Yichao Cao ⋅ Xiu Su ⋅ Dan Niu ⋅ Xuanpeng Li
Exhibit Hall I #4
Multidimensional Byte Pair Encoding: Shortened Sequences for Improved Visual Data Generation Poster Session 5 & Exhibit Hall
Tim Elsner ⋅ Paula Usinger ⋅ Julius Nehring-Wirxel ⋅ Gregor Kobsik ⋅ Victor Czech ⋅ Yanjiang He ⋅ Isaak Lim ⋅ Leif Kobbelt
Exhibit Hall I #142
Robust Dataset Condensation using Supervised Contrastive Learning Poster Session 1 & Exhibit Hall
Nicole Kim ⋅ Hwanjun Song
Exhibit Hall I #263
SCAN: Bootstrapping Contrastive Pre-training for Data Efficiency Poster Session 1 & Exhibit Hall
Yangyang Guo ⋅ Mohan Kankanhalli
Exhibit Hall I #340
IDEATOR: Jailbreaking and Benchmarking Large Vision-Language Models Using Themselves Poster Session 2 & Exhibit Hall with Coffee Break
Ruofan Wang ⋅ Juncheng Li ⋅ Yixu Wang ⋅ Bo Wang ⋅ Xiaosen Wang ⋅ Yan Teng ⋅ Yingchun Wang ⋅ Xingjun Ma ⋅ Yu-Gang Jiang
Exhibit Hall I #362
AccidentalGS: 3D Gaussian Splatting from Accidental Camera Motion Poster Session 6 & Exhibit Hall with Coffee Break
Mao Mao ⋅ Xujie Shen ⋅ Guyuan Chen ⋅ Boming Zhao ⋅ Jiarui Hu ⋅ Hujun Bao ⋅ Zhaopeng Cui
Exhibit Hall I #263
GEOBench-VLM: Benchmarking Vision-Language Models for Geospatial Tasks Poster Session 2 & Exhibit Hall with Coffee Break
Muhammad Danish ⋅ Muhammad Akhtar Munir ⋅ Syed Shah ⋅ Kartik Kuckreja ⋅ Fahad Khan ⋅ Paolo Fraccaro ⋅ Alexandre Lacoste ⋅ Salman Khan
Exhibit Hall I #198
Event-boosted Deformable 3D Gaussians for Dynamic Scene Reconstruction Poster Session 6 & Exhibit Hall with Coffee Break
Wenhao Xu ⋅ Wenming Weng ⋅ Yueyi Zhang ⋅ Ruikang Xu ⋅ Zhiwei Xiong
Exhibit Hall I #348
MagicMirror: ID-Preserved Video Generation in Video Diffusion Transformers Poster Session 3 & Exhibit Hall
Yuechen Zhang ⋅ YaoYang Liu ⋅ Bin Xia ⋅ Bohao PENG ⋅ Zexin Yan ⋅ Eric Lo ⋅ Jiaya Jia
Exhibit Hall I #420
MRGen: Segmentation Data Engine For Underrepresented MRI Modalities Poster Session 5 & Exhibit Hall
Haoning Wu ⋅ Ziheng Zhao ⋅ Ya Zhang ⋅ Yanfeng Wang ⋅ Weidi Xie
Exhibit Hall I #10
GAP: Gaussianize Any Point Clouds with Text Guidance Poster Session 6 & Exhibit Hall with Coffee Break
Weiqi Zhang ⋅ Junsheng Zhou ⋅ Haotian Geng ⋅ Wenyuan Zhang ⋅ Liang Han
Exhibit Hall I #87
DNF-Intrinsic: Deterministic Noise-Free Diffusion for Indoor Inverse Rendering Poster Session 3 & Exhibit Hall
Rongjia Zheng ⋅ Qing Zhang ⋅ Chengjiang Long ⋅ Wei-Shi Zheng
Exhibit Hall I #31
Cross-modal Ship Re-Identification via Optical and SAR Imagery: A Novel Dataset and Method Poster Session 2 & Exhibit Hall with Coffee Break
Han Wang ⋅ Shengyang Li ⋅ Jian Yang ⋅ Yuxuan Liu ⋅ Yixuan Lv ⋅ Zhuang Zhou
Exhibit Hall I #268
MoFRR: Mixture of Diffusion Models for Face Retouching Restoration Poster Session 3 & Exhibit Hall
Jiaxin Liu ⋅ Qichao Ying ⋅ Zhenxing Qian ⋅ Sheng Li ⋅ Runqi Zhang ⋅ Jian liu ⋅ Xinpeng Zhang
Exhibit Hall I #267
Adversarial Reconstruction Feedback for Robust Fine-grained Generalization Poster Session 1 & Exhibit Hall
Shijie Wang ⋅ Jian Shi ⋅ Haojie Li
Exhibit Hall I #284
Embodied Image Captioning: Self-supervised Learning Agents for Spatially Coherent Image Descriptions Poster Session 5 & Exhibit Hall
Tommaso Galliena ⋅ Tommaso Apicella ⋅ Stefano Rosa ⋅ Pietro Morerio ⋅ ALESSIO DEL BUE ⋅ Lorenzo Natale
Exhibit Hall I #428
Unified Adversarial Augmentation for Improving Palmprint Recognition Poster Session 3 & Exhibit Hall
Jianlong Jin ⋅ Chenglong Zhao ⋅ Ruixin Zhang ⋅ Sheng Shang ⋅ Yang Zhao ⋅ Jun Wang ⋅ Jingyun Zhang ⋅ Shouhong Ding ⋅ Wei Jia ⋅ Yunsheng Wu
Exhibit Hall I #390
Adding Additional Control to One-Step Diffusion with Joint Distribution Matching Poster Session 1 & Exhibit Hall
Yihong Luo ⋅ Tianyang Hu ⋅ Yifan Song ⋅ Jiacheng Sun ⋅ Zhenguo Li ⋅ Jing Tang
Exhibit Hall I #373
Enhancing Transferability of Targeted Adversarial Examples via Inverse Target Gradient Competition and Spatial Distance Stretching Poster Session 1 & Exhibit Hall
Zhankai Li ⋅ Weiping Wang ⋅ jie li ⋅ Shigeng Zhang ⋅ Yunan Hu ⋅ Song Guo
Exhibit Hall I #345
LDPose: Towards Inclusive Human Pose Estimation for Limb-Deficient Individuals in the Wild Poster Session 2 & Exhibit Hall with Coffee Break
Jiaying Ying ⋅ Heming Du ⋅ Kaihao Zhang ⋅ Lincheng Li ⋅ Xin Yu
Exhibit Hall I #454
OURO: A Self-Bootstrapped Framework for Enhancing Multimodal Scene Understanding Poster Session 4 & Exhibit Hall with Coffee Break
Tianrun Xu ⋅ Guanyu Chen ⋅ Ye Li ⋅ Xi Yuxin ⋅ Zeyu Mu ⋅ Ruichen Wang ⋅ Tianren Zhang ⋅ Haichuan Gao ⋅ Feng Chen
Exhibit Hall I #322
LEGION: Learning to Ground and Explain for Synthetic Image Detection Poster Session 4 & Exhibit Hall with Coffee Break
Hengrui Kang ⋅ Siwei Wen ⋅ Zichen Wen ⋅ Junyan Ye ⋅ Weijia Li ⋅ Peilin Feng ⋅ Baichuan Zhou ⋅ Bin Wang ⋅ Dahua Lin ⋅ Linfeng Zhang ⋅ Conghui He
Exhibit Hall I #389
SMP-Attack: Boosting the Transferability of Feature Importance-based Adversarial Attack with Semantics-aware Multi-granularity Patchout Poster Session 1 & Exhibit Hall
Wen Yang ⋅ Guodong Liu ⋅ Di Ming
Exhibit Hall I #417
Spatial-Temporal Forgery Trace based Forgery Image Identification Poster Session 4 & Exhibit Hall with Coffee Break
Yilin Wang ⋅ Zunlei Feng ⋅ Jiachi Wang ⋅ Hengrui Lou ⋅ Binjia Zhou ⋅ Jie Lei ⋅ Mingli Song ⋅ Yijun Bei
Exhibit Hall I #208
Towards Annotation-Free Evaluation: KPAScore for Human Keypoint Detection Poster Session 2 & Exhibit Hall with Coffee Break
Xiaoxiao Wang ⋅ Chunxiao Li ⋅ Peng Sun ⋅ Boming Miao ⋅ Yunjian Zhang ⋅ Yao Zhu
Exhibit Hall I #322
Ultra High-Resolution Image Inpainting with Patch-Based Content Consistency Adapter Poster Session 4 & Exhibit Hall with Coffee Break
JianHui Zhang ⋅ Shen Cheng ⋅ Qirui Sun ⋅ Jia Liu ⋅ Wang Luyang ⋅ chaoyu feng ⋅ Chen Fang ⋅ LEI LEI ⋅ Jue Wang ⋅ Shuaicheng Liu
Exhibit Hall I #201
PROL : Rehearsal Free Continual Learning in Streaming Data via Prompt Online Learning Poster Session 1 & Exhibit Hall
Muhammad Anwar Ma'sum ⋅ Mahardhika Pratama ⋅ Savitha Ramasamy ⋅ Lin Liu ⋅ H Habibullah ⋅ Ryszard Kowalczyk
Exhibit Hall I #225
Dual Domain Control via Active Learning for Remote Sensing Domain Incremental Object Detection Poster Session 1 & Exhibit Hall
Jiachen Sun ⋅ De Cheng ⋅ Xi Yang ⋅ Nannan Wang
Exhibit Hall I #354
SUV: Suppressing Undesired Video Content via Semantic Modulation Based on Text Embeddings Poster Session 4 & Exhibit Hall with Coffee Break
Xiang Lv ⋅ Mingwen Shao ⋅ Lingzhuang Meng ⋅ Chang Liu ⋅ Yecong Wan ⋅ Xinyuan Chen
Exhibit Hall I #333
Enpowering Your Pansharpening Models with Generalizability: Unified Distribution is All You Need Poster Session 3 & Exhibit Hall
Yongchuan Cui ⋅ Peng Liu ⋅ HUI ZHANG
Exhibit Hall I #174
DiMPLe - Disentangled Multi-Modal Prompt Learning: Enhancing Out-Of-Distribution Alignment with Invariant and Spurious Feature Separation Poster Session 1 & Exhibit Hall
Umaima Rahman ⋅ Mohammad Yaqub ⋅ Dwarikanath Mahapatra
Exhibit Hall I #145
Beyond Low-Rank Tuning: Model Prior-Guided Rank Allocation for Effective Transfer in Low-Data and Large-Gap Regimes. Poster Session 1 & Exhibit Hall
Chuyan Zhang ⋅ Kefan Wang ⋅ Yun Gu
Exhibit Hall I #310
OracleFusion: Assisting the Decipherment of Oracle Bone Script with Structurally Constrained Semantic Typography Poster Session 5 & Exhibit Hall
Li Caoshuo ⋅ Zengmao Ding ⋅ Xiaobin Hu ⋅ Bang Li ⋅ Donghao Luo ⋅ AndyPianWu AndyPianWu ⋅ Chaoyang Wang ⋅ Chengjie Wang ⋅ Taisong Jin ⋅ SevenShu SevenShu ⋅ Yunsheng Wu ⋅ Yongge Liu ⋅ Rongrong Ji
Exhibit Hall I #9
COSMO: Combination of Selective Memorization for Low-cost Vision-and-Language Navigation Poster Session 2 & Exhibit Hall with Coffee Break
Siqi Zhang ⋅ Yanyuan Qiao ⋅ Qunbo Wang ⋅ Zike Yan ⋅ Qi Wu ⋅ Zhihua Wei ⋅ Jing Liu
Exhibit Hall I #46
CoStoDet-DDPM: Collaborative Training of Stochastic and Deterministic Models Improves Surgical Workflow Anticipation and Recognition Poster Session 5 & Exhibit Hall
Kaixiang Yang ⋅ Xin Li ⋅ Qiang Li ⋅ Zhiwei Wang
Exhibit Hall I #371
MixA-Q: Revisiting Activation Sparsity for Vision Transformers from a Mixed-Precision Quantization Perspective Poster Session 5 & Exhibit Hall
Weitian Wang ⋅ Shubham rai ⋅ Cecilia De la Parra ⋅ Akash Kumar
Exhibit Hall I #219
MEGA: Memory-Efficient 4D Gaussian Splatting for Dynamic Scenes Poster Session 6 & Exhibit Hall with Coffee Break
XINJIE ZHANG ⋅ Zhening Liu ⋅ Yifan Zhang ⋅ Xingtong Ge ⋅ Dailan He ⋅ Tongda Xu ⋅ Yan Wang ⋅ Zehong Lin ⋅ Shuicheng YAN ⋅ Jun Zhang
Exhibit Hall I #300
TOGA: Temporally Grounded Open-Ended Video QA with Weak Supervision Poster Session 5 & Exhibit Hall
Ayush Gupta ⋅ Anirban Roy ⋅ Rama Chellappa ⋅ Nathaniel D. Bastian ⋅ Alvaro Velasquez ⋅ Susmit Jha
Exhibit Hall I #357
LaRender: Training-Free Occlusion Control in Image Generation via Latent Rendering Poster Session 5 & Exhibit Hall
Xiaohang Zhan ⋅ Dingming Liu
Exhibit Hall I #75
DynFaceRestore: Balancing Fidelity and Quality in Diffusion-Guided Blind Face Restoration with Dynamic Blur-Level Mapping and Guidance Poster Session 3 & Exhibit Hall
Huu Phu Do ⋅ Yu-Wei Chen ⋅ Yi-Cheng Liao ⋅ Chi-Wei Hsiao ⋅ Han-Yang Wang ⋅ Wei-Chen Chiu ⋅ Ching-Chun Huang
Exhibit Hall I #39
Generalized Few-Shot Point Cloud Segmentation via LLM-Assisted Hyper-Relation Matching Poster Session 5 & Exhibit Hall
Zhaoyang Li ⋅ Yuan Wang ⋅ Guoxin Xiong ⋅ Wangkai Li ⋅ Yuwen Pan ⋅ Tianzhu Zhang
Exhibit Hall I #308
Training-free Geometric Image Editing on Diffusion Models Poster Session 4 & Exhibit Hall with Coffee Break
Hanshen Zhu ⋅ Zhen Zhu ⋅ Kaile Zhang ⋅ Yiming Gong ⋅ Yuliang Liu ⋅ Xiang Bai
Exhibit Hall I #407
Boost 3D Reconstruction using Diffusion-based Monocular Camera Calibration Poster Session 2 & Exhibit Hall with Coffee Break
Junyuan Deng ⋅ Wei Yin ⋅ Xiaoyang Guo ⋅ Qian Zhang ⋅ Xiaotao Hu ⋅ Weiqiang Ren ⋅ XIAOXIAO LONG ⋅ Ping Tan
Exhibit Hall I #196
Monocular Facial Appearance Capture in the Wild Poster Session 3 & Exhibit Hall
Yingyan Xu ⋅ Kate Gadola ⋅ Prashanth Chandran ⋅ Sebastian Weiss ⋅ Markus Gross ⋅ Gaspard Zoss ⋅ Derek Bradley
Exhibit Hall I #195
Growing a Twig to Accelerate Large Vision-Language Models Poster Session 5 & Exhibit Hall
Zhenwei Shao ⋅ Mingyang Wang ⋅ Zhou Yu ⋅ Wenwen Pan ⋅ Yan Yang ⋅ Tao Wei ⋅ Hongyuan Zhang ⋅ Ning Mao ⋅ Chen Wei ⋅ Jun Yu
Exhibit Hall I #25
AMD: Adaptive Momentum and Decoupled Contrastive Learning Framework for Robust Long-Tail Trajectory Prediction Poster Session 6 & Exhibit Hall with Coffee Break
Bin Rao ⋅ Haicheng Liao ⋅ Yanchen Guan ⋅ Chengyue Wang ⋅ Bonan Wang ⋅ Jiaxun Zhang ⋅ Zhenning Li
Exhibit Hall I #398
FreeDance: Towards Harmonic Free-Number Group Dance Generation via a Unified Framework Poster Session 3 & Exhibit Hall
Yiwen Zhao ⋅ Yang Wang ⋅ Liting Wen ⋅ Hengyuan Zhang ⋅ Xingqun Qi
Exhibit Hall I #51
Deep Incomplete Multi-view Clustering with Distribution Dual-Consistency Recovery Guidance Poster Session 1 & Exhibit Hall
Jiaqi Jin ⋅ Siwei Wang ⋅ Zhibin Dong ⋅ Xihong Yang ⋅ Xinwang Liu ⋅ En Zhu ⋅ Kunlun He
Exhibit Hall I #87
Learning Visual Hierarchies in Hyperbolic Space for Image Retrieval Poster Session 3 & Exhibit Hall
Ziwei Wang ⋅ Sameera Ramasinghe ⋅ Chenchen Xu ⋅ Julien Monteil ⋅ Loris Bazzani ⋅ Thalaiyasingam Ajanthan
Exhibit Hall I #376
TemCoCo: Temporally Consistent Multi-modal Video Fusion with Visual-Semantic Collaboration Poster Session 3 & Exhibit Hall
Gong Meiqi ⋅ Hao Zhang ⋅ Xunpeng Yi ⋅ Linfeng Tang ⋅ Jiayi Ma
Exhibit Hall I #407
RetinexMCNet: A Memory Controller Dominated Network for Low-Light Video Enhancement Based on Retinex Poster Session 2 & Exhibit Hall with Coffee Break
Meiao Wang ⋅ Xuejing Kang ⋅ Yaxi Lu ⋅ Jie Xu
Exhibit Hall I #440
D2ST-Adapter: Disentangled-and-Deformable Spatio-Temporal Adapter for Few-shot Action Recognition Poster Session 3 & Exhibit Hall
Wenjie Pei ⋅ Qizhong Tan ⋅ Guangming Lu ⋅ Jiandong Tian ⋅ Jun Yu
Exhibit Hall I #123
Sliced Wasserstein Bridge for Open-Vocabulary Video Instance Segmentation Poster Session 3 & Exhibit Hall
Zheyun Qin ⋅ Deng Yu ⋅ Chuanchen Luo ⋅ Zhumin Chen
Exhibit Hall I #233
Frequency-Aware Autoregressive Modeling for Efficient High-Resolution Image Synthesis Poster Session 4 & Exhibit Hall with Coffee Break
Zhuokun Chen ⋅ Jugang Fan ⋅ Zhuowei Yu ⋅ Bohan Zhuang ⋅ Mingkui Tan
Exhibit Hall I #215
KinMo: Kinematic-aware Human Motion Understanding and Generation Poster Session 3 & Exhibit Hall
Pengfei Zhang ⋅ Pinxin Liu ⋅ Pablo Garrido ⋅ Hyeongwoo Kim ⋅ Bindita Chaudhuri
Exhibit Hall I #111
CODA: Repurposing Continuous VAEs for Discrete Tokenization Poster Session 4 & Exhibit Hall with Coffee Break
Zeyu Liu ⋅ Zanlin Ni ⋅ Yeguo Hua ⋅ Xin Deng ⋅ Xiao Ma ⋅ Cheng Zhong ⋅ Gao Huang
Exhibit Hall I #386
3D Gaussian Splatting Driven Multi-View Robust Physical Adversarial Camouflage Generation Poster Session 6 & Exhibit Hall with Coffee Break
Tianrui Lou ⋅ Xiaojun Jia ⋅ Siyuan Liang ⋅ Jiawei Liang ⋅ Ming Zhang ⋅ Yanjun Xiao ⋅ Xiaochun Cao
Exhibit Hall I #389
Head2Body: Body Pose Generation from Multi-sensory Head-mounted Inputs Poster Session 2 & Exhibit Hall with Coffee Break
Minh Tran ⋅ Hongda Mao ⋅ Qingshuang Chen ⋅ Yelin Kim
Exhibit Hall I #172
LLM-Assisted Semantic Guidance for Sparsely Annotated Remote Sensing Object Detection Poster Session 5 & Exhibit Hall
Wei Liao ⋅ Chunyan Xu ⋅ Chenxu Wang ⋅ Zhen Cui
Exhibit Hall I #256
From Reusing to Forecasting: Accelerating Diffusion Models with TaylorSeers Poster Session 4 & Exhibit Hall with Coffee Break
Jiacheng Liu ⋅ Chang Zou ⋅ Yuanhuiyi Lyu ⋅ Junjie Chen ⋅ Linfeng Zhang
Exhibit Hall I #92
DADM: Dual Alignment of Domain and Modality for Face Anti-spoofing Poster Session 3 & Exhibit Hall
Yang JingYi ⋅ Xun Lin ⋅ Zitong YU ⋅ Liepiao Zhang ⋅ Xin Liu ⋅ Hui Li ⋅ Xiaochen Yuan ⋅ Xiaochun Cao
Exhibit Hall I #192
Quantifying and Narrowing the Unknown: Interactive Text-to-Video Retrieval via Uncertainty Minimization Poster Session 5 & Exhibit Hall
Bingqing Zhang ⋅ Zhuo Cao ⋅ Heming Du ⋅ Yang Li ⋅ Xue Li ⋅ Jiajun Liu ⋅ Sen Wang
Exhibit Hall I #217
Gradient Decomposition and Alignment for Incremental Object Detection Poster Session 1 & Exhibit Hall
Wenlong Luo ⋅ Shizhou Zhang ⋅ De Cheng ⋅ Yinghui Xing ⋅ Guoqiang Liang ⋅ PENG WANG ⋅ Yanning Zhang
Exhibit Hall I #421
PacGDC: Label-Efficient Generalizable Depth Completion with Projection Ambiguity and Consistency Poster Session 2 & Exhibit Hall with Coffee Break
Haotian Wang ⋅ Aoran Xiao ⋅ Xiaoqin Zhang ⋅ Meng Yang ⋅ Shijian Lu
Exhibit Hall I #253
TruthPrInt: Mitigating Large Vision-Language Models Object Hallucination Via Latent Truthful-Guided Pre-Intervention Poster Session 2 & Exhibit Hall with Coffee Break
Jinhao Duan ⋅ Fei Kong ⋅ Hao Cheng ⋅ James Diffenderfer ⋅ Bhavya Kailkhura ⋅ Lichao Sun ⋅ Xiaofeng Zhu ⋅ Xiaoshuang Shi ⋅ Kaidi Xu
Exhibit Hall I #220
Adversarial Attention Perturbations for Large Object Detection Transformers Poster Session 1 & Exhibit Hall
Zachary Yahn ⋅ Selim Tekin ⋅ Fatih Ilhan ⋅ Sihao Hu ⋅ Tiansheng Huang ⋅ Yichang Xu ⋅ Margaret Loper ⋅ Ling Liu
Exhibit Hall I #294
MCAM: Multimodal Causal Analysis Model for Ego-Vehicle-Level Driving Video Understanding Poster Session 2 & Exhibit Hall with Coffee Break
Tongtong Cheng ⋅ Rongzhen Li ⋅ Yixin Xiong ⋅ Tao Zhang ⋅ Jing Wang ⋅ Kai Liu
Exhibit Hall I #43
When and Where do Data Poisons Attack Textual Inversion? Poster Session 4 & Exhibit Hall with Coffee Break
Jeremy Styborski ⋅ Mingzhi Lyu ⋅ Jiayou Lu ⋅ Nupur Kapur ⋅ Adams Kong
Exhibit Hall I #436
Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable Single-stage Image-to-3D Generation and Reconstruction Poster Session 6 & Exhibit Hall with Coffee Break
Yuanhao Cai ⋅ He Zhang ⋅ Kai Zhang ⋅ Yixun Liang ⋅ Mengwei Ren ⋅ Fujun Luan ⋅ Qing Liu ⋅ Soo Ye Kim ⋅ Jianming Zhang ⋅ Zhifei Zhang ⋅ Yuqian Zhou ⋅ YULUN ZHANG ⋅ Xiaokang Yang ⋅ Zhe Lin ⋅ Alan Yuille
Exhibit Hall I #32
SIMS: Simulating Stylized Human-Scene Interactions with Retrieval-Augmented Script Generation Poster Session 3 & Exhibit Hall
Wenjia Wang ⋅ Liang Pan ⋅ Zhiyang Dou ⋅ Jidong Mei ⋅ Zhouyingcheng Liao ⋅ Yifan Wu ⋅ Yuke Lou ⋅ Jingbo Wang ⋅ Lei Yang ⋅ Taku Komura
Exhibit Hall I #388
PBCAT: Patch-Based Composite Adversarial Training against Physically Realizable Attacks on Object Detection Poster Session 5 & Exhibit Hall
Xiao Li ⋅ Yiming Zhu ⋅ Yifan Huang ⋅ Wei Zhang ⋅ Yingzhe He ⋅ Jie Shi ⋅ Xiaolin Hu
Exhibit Hall I #436
SteerX: Creating Any Camera-Free 3D and 4D Scenes with Geometric Steering Poster Session 6 & Exhibit Hall with Coffee Break
Byeongjun Park ⋅ Hyojun Go ⋅ Hyelin Nam ⋅ Byung-Hoon Kim ⋅ Hyungjin Chung ⋅ Changick Kim
Exhibit Hall I #252
Engage for All: Making Ordinary Image Descriptions Appealing Again! Poster Session 4 & Exhibit Hall with Coffee Break
Yuyan Chen ⋅ Yifan Jiang ⋅ Li Zhou ⋅ Jinghan Cao ⋅ Yu Guan ⋅ Ming Yang ⋅ Qingpei Guo
Exhibit Hall I #427
Seam360GS: Seamless 360° Gaussian Splatting from Real-World Omnidirectional Images Poster Session 6 & Exhibit Hall with Coffee Break
Changha Shin ⋅ Woong Oh Cho ⋅ Seon Joo Kim
Exhibit Hall I #409
HiGarment: Cross-modal Harmony Based Diffusion Model for Flat Sketch to Realistic Garment Image Poster Session 4 & Exhibit Hall with Coffee Break
Junyi Guo ⋅ Jingxuan Zhang ⋅ Fangyu Wu ⋅ Huanda Lu ⋅ Qiufeng Wang ⋅ Wenmian Yang ⋅ ENG Gee LIM ⋅ Dongming Lu
Exhibit Hall I #350
AU-Blendshape for Fine-grained Stylized 3D Facial Expression Manipulation Poster Session 3 & Exhibit Hall
Hao Li ⋅ Ju Dai ⋅ Feng Zhou ⋅ Kaida Ning ⋅ Lei Li ⋅ Junjun Pan
Exhibit Hall I #245
LMM4LMM: Benchmarking and Evaluating Large-multimodal Image Generation with LMMs Poster Session 4 & Exhibit Hall with Coffee Break
Jiarui Wang ⋅ Huiyu Duan ⋅ Yu Zhao ⋅ Juntong Wang ⋅ Guangtao Zhai ⋅ Xiongkuo Min
Exhibit Hall I #233
BokehDiff: Neural Lens Blur with One-Step Diffusion Poster Session 2 & Exhibit Hall with Coffee Break
Chengxuan Zhu ⋅ Qingnan Fan ⋅ Qi Zhang ⋅ Jinwei Chen ⋅ Huaqi Zhang ⋅ Chao Xu ⋅ Boxin Shi
Exhibit Hall I #421
VCA: Video Curious Agent for Long Video Understanding Poster Session 5 & Exhibit Hall
Zeyuan Yang ⋅ Delin Chen ⋅ Xueyang Yu ⋅ Maohao Shen ⋅ Chuang Gan
Exhibit Hall I #35
Geometry Distributions Poster Session 1 & Exhibit Hall
Biao Zhang ⋅ Jing Ren ⋅ Peter Wonka
Exhibit Hall I #132
Social Debiasing for Fair Multi-modal LLMs Poster Session 1 & Exhibit Hall
Harry Cheng ⋅ Yangyang Guo ⋅ Qingpei Guo ⋅ Ming Yang ⋅ Tian Gan ⋅ Weili Guan ⋅ Liqiang Nie
Exhibit Hall I #157
Hierarchy-Aware Pseudo Word Learning with Text Adaptation for Zero-Shot Composed Image Retrieval Poster Session 5 & Exhibit Hall
Zhe Li ⋅ Lei Zhang ⋅ Zheren Fu ⋅ Kun Zhang ⋅ Zhendong Mao
Exhibit Hall I #423
GaussianUpdate: Continual 3D Gaussian Splatting Update for Changing Environments Poster Session 6 & Exhibit Hall with Coffee Break
Lin Zeng ⋅ Boming Zhao ⋅ Jiarui Hu ⋅ Xujie Shen ⋅ Ziqiang Dang ⋅ Hujun Bao ⋅ Zhaopeng Cui
Exhibit Hall I #103
DALIP: Distribution Alignment-based Language-Image Pre-Training for Domain-Specific Data Poster Session 1 & Exhibit Hall
Junjie Wu ⋅ Jiangtao Xie ⋅ Zhaolin Zhang ⋅ Qilong Wang ⋅ Qinghua Hu ⋅ Peihua Li ⋅ Sen Xu
Exhibit Hall I #190
Long-term Traffic Simulation with Interleaved Autoregressive Motion and Scenario Generation Poster Session 6 & Exhibit Hall with Coffee Break
Xiuyu Yang ⋅ Shuhan Tan ⋅ Philipp Kraehenbuehl
Exhibit Hall I #55
Perspective-Invariant 3D Object Detection Poster Session 6 & Exhibit Hall with Coffee Break
Alan Liang ⋅ Lingdong Kong ⋅ Dongyue Lu ⋅ Youquan Liu ⋅ Jian Fang ⋅ Huaici Zhao ⋅ Wei Tsang Ooi
Exhibit Hall I #291
Probabilistic Inertial Poser (ProbIP): Uncertainty-aware Human Motion Modeling from Sparse Inertial Sensors Poster Session 6 & Exhibit Hall with Coffee Break
Min Kim ⋅ Younho Jeon ⋅ Sungho Jo
Exhibit Hall I #112
ARMO: Autoregressive Rigging for Multi-Category Objects Poster Session 2 & Exhibit Hall with Coffee Break
mingze sun ⋅ Shiwei Mao ⋅ Keyi Chen ⋅ Yurun Chen ⋅ Shunlin Lu ⋅ Jingbo Wang ⋅ Junting Dong ⋅ Ruqi Huang
Exhibit Hall I #254
Harnessing Vision Foundation Models for High-Performance, Training-Free Open Vocabulary Segmentation Poster Session 5 & Exhibit Hall
Yuheng Shi ⋅ Minjing Dong ⋅ Chang Xu
Exhibit Hall I #347
Aligning Constraint Generation with Design Intent in Parametric CAD Poster Session 2 & Exhibit Hall with Coffee Break
Evan Casey ⋅ Tianyu Zhang ⋅ Shu Ishida ⋅ John Thompson ⋅ Amir Khasahmadi ⋅ Joseph Lambourne ⋅ Pradeep Kumar Jayaraman ⋅ Karl Willis
Exhibit Hall I #338
Golden Noise for Diffusion Models: A Learning Framework Poster Session 4 & Exhibit Hall with Coffee Break
zikai zhou ⋅ Shitong Shao ⋅ Lichen Bai ⋅ Shufei Zhang ⋅ zhiqiang xu ⋅ Bo Han ⋅ Zeke Xie
Exhibit Hall I #268
STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution Poster Session 4 & Exhibit Hall with Coffee Break
Rui Xie ⋅ Yinhong Liu ⋅ Penghao Zhou ⋅ Chen Zhao ⋅ Jun Zhou ⋅ Kai Zhang ⋅ Zhenyu Zhang ⋅ Jian Yang ⋅ Zhenheng Yang ⋅ Ying Tai
Exhibit Hall I #213
Vision-Language Interactive Relation Mining for Open-Vocabulary Scene Graph Generation Poster Session 4 & Exhibit Hall with Coffee Break
Yukuan Min ⋅ Muli Yang ⋅ Jinhao Zhang ⋅ Yuxuan Wang ⋅ Aming WU ⋅ Cheng Deng
Exhibit Hall I #179
OrderChain: Towards General Instruct-Tuning for Stimulating the Ordinal Understanding Ability of MLLM Poster Session 1 & Exhibit Hall
Jinhong Wang ⋅ Shuo Tong ⋅ Jintai CHEN ⋅ Jian liu ⋅ Dongqi Tang ⋅ Weiqiang Wang ⋅ Wentong Li ⋅ Hongxia Xu ⋅ Danny Chen ⋅ Jian Wu
Exhibit Hall I #323
Joint Learning of Pose Regression and Denoising Diffusion with Score Scaling Sampling for Category-level 6D Pose Estimation Poster Session 2 & Exhibit Hall with Coffee Break
Seunghyun Lee ⋅ Tae-Kyun Kim
Exhibit Hall I #68
LHM: Large Animatable Human Reconstruction Model for Single Image to 3D in Seconds Poster Session 3 & Exhibit Hall
Lingteng Qiu ⋅ Xiaodong Gu ⋅ Peihao Li ⋅ Qi Zuo ⋅ Weichao Shen ⋅ Junfei Zhang ⋅ Kejie Qiu ⋅ Weihao Yuan ⋅ Guanying Chen ⋅ Zilong Dong ⋅ Liefeng Bo
Exhibit Hall I #394
UniVG: A Generalist Diffusion Model for Unified Image Generation and Editing Poster Session 4 & Exhibit Hall with Coffee Break
Tsu-Jui Fu ⋅ Yusu Qian ⋅ Chen Chen ⋅ Wenze Hu ⋅ Zhe Gan ⋅ Yinfei Yang
Exhibit Hall I #217
Distilling Diffusion Models to Efficient 3D LiDAR Scene Completion Poster Session 2 & Exhibit Hall with Coffee Break
shengyuan zhang ⋅ An Zhao ⋅ Ling Yang ⋅ Zejian Li ⋅ Chenye Meng ⋅ Haoran Xu ⋅ Tianrun Chen ⋅ AnYang Wei ⋅ Perry GU ⋅ Lingyun Sun
Exhibit Hall I #225
FOLDER: Accelerating Multi-Modal Large Language Models with Enhanced Performance Poster Session 5 & Exhibit Hall
Haicheng Wang ⋅ Zhemeng Yu ⋅ Gabriele Spadaro ⋅ Chen Ju ⋅ Victor Quétu ⋅ Shuai Xiao ⋅ Enzo Tartaglione
Exhibit Hall I #359
ViSpeak: Visual Instruction Feedback in Streaming Videos Poster Session 5 & Exhibit Hall
Shenghao Fu ⋅ Qize Yang ⋅ Yuan-Ming Li ⋅ Yi-Xing Peng ⋅ Kun-Yu Lin ⋅ Xihan Wei ⋅ Jian-Fang Hu ⋅ Xiaohua Xie ⋅ Wei-Shi Zheng
Exhibit Hall I #185
FedAGC: Federated Continual Learning with Asymmetric Gradient Correction Poster Session 1 & Exhibit Hall
Chengchao Zhang ⋅ Fanhua Shang ⋅ Hongying Liu ⋅ Liang Wan ⋅ Wei Feng
Exhibit Hall I #357
MoGA: 3D Generative Avatar Prior for Monocular Gaussian Avatar Reconstruction Poster Session 3 & Exhibit Hall
Zijian Dong ⋅ Longteng Duan ⋅ Jie Song ⋅ Michael Black ⋅ Andreas Geiger
Exhibit Hall I #312
ATCTrack: Aligning Target-Context Cues with Dynamic Target States for Robust Vision-Language Tracking Poster Session 5 & Exhibit Hall
Xiaokun Feng ⋅ Shiyu Hu ⋅ Xuchen Li ⋅ Dailing Zhang ⋅ Meiqi Wu ⋅ Jing Zhang ⋅ Xiaotang Chen ⋅ Kaiqi Huang
Exhibit Hall I #5
Federated Representation Angle Learning Poster Session 1 & Exhibit Hall
Liping Yi ⋅ Han Yu ⋅ Gang Wang ⋅ xiaoguang Liu ⋅ Xiaoxiao Li
Exhibit Hall I #115
MotionShot: Adaptive Motion Transfer across Arbitrary Objects for Text-to-Video Generation Poster Session 3 & Exhibit Hall
Yanchen Liu ⋅ Yanan SUN ⋅ Zhening Xing ⋅ Junyao Gao ⋅ Kai Chen ⋅ Wenjie Pei
Exhibit Hall I #175
GroundFlow: A Plug-in Module for Temporal Reasoning on 3D Point Cloud Sequential Grounding Poster Session 6 & Exhibit Hall with Coffee Break
Zijun Lin ⋅ Shuting He ⋅ Cheston Tan ⋅ Bihan Wen
Exhibit Hall I #391
Enhancing Adversarial Transferability by Balancing Exploration and Exploitation with Gradient-Guided Sampling Poster Session 1 & Exhibit Hall
Zenghao Niu ⋅ Weicheng Xie ⋅ Siyang Song ⋅ Zitong YU ⋅ Feng Liu ⋅ Linlin Shen
Exhibit Hall I #361
CWNet: Causal Wavelet Network for Low-Light Image Enhancement Poster Session 2 & Exhibit Hall with Coffee Break
Tongshun Zhang ⋅ Pingping Liu ⋅ Yubing Lu ⋅ Mengen Cai ⋅ Zijian Zhang ⋅ Zhe Zhang ⋅ Qiuzhan Zhou
Exhibit Hall I #354
InterSyn: Interleaved Learning for Dynamic Motion Synthesis in the Wild Poster Session 3 & Exhibit Hall
Yiyi Ma ⋅ Yuanzhi Liang ⋅ Xiu Li ⋅ Chi Zhang ⋅ Xuelong Li
Exhibit Hall I #266
GeoDistill: Geometry-Guided Self-Distillation for Weakly Supervised Cross-View Localization Poster Session 6 & Exhibit Hall with Coffee Break
Shaowen Tong ⋅ Zimin Xia ⋅ Alexandre Alahi ⋅ Xuming He ⋅ Yujiao Shi
Exhibit Hall I #60
BlinkTrack: Feature Tracking over 80 FPS via Events and Images Poster Session 2 & Exhibit Hall with Coffee Break
Yichen Shen ⋅ Yijin Li ⋅ Shuo Chen ⋅ Guanglin Li ⋅ Zhaoyang Huang ⋅ Hujun Bao ⋅ Zhaopeng Cui ⋅ Guofeng Zhang
Exhibit Hall I #402
DICE: Staleness-Centric Optimizations for Parallel Diffusion MoE Inference Poster Session 4 & Exhibit Hall with Coffee Break
Jiajun Luo ⋅ Lizhuo Luo ⋅ Jianru Xu ⋅ Jiajun Song ⋅ Rongwei Lu ⋅ Chen Tang ⋅ Zhi Wang
Exhibit Hall I #55
AerialVG: A Challenging Benchmark for Aerial Visual Grounding by Exploring Positional Relations Poster Session 2 & Exhibit Hall with Coffee Break
Junli Liu ⋅ Qizhi Chen ⋅ Zhigang Wang ⋅ Yiwen Tang ⋅ Yiting Zhang ⋅ Chi Yan ⋅ Dong Wang ⋅ Xuelong Li ⋅ Bin Zhao
Exhibit Hall I #15
The Curse of Conditions: Analyzing and Improving Optimal Transport for Conditional Flow-Based Generation Poster Session 4 & Exhibit Hall with Coffee Break
Ho Kei Cheng ⋅ Alex Schwing
Exhibit Hall I #94
Diffusion-based Source-biased Model for Single Domain Generalized Object Detection Poster Session 1 & Exhibit Hall
Han Jiang ⋅ Wenfei Yang ⋅ Tianzhu Zhang ⋅ Yongdong Zhang
Exhibit Hall I #137
ReconDreamer++: Harmonizing Generative and Reconstructive Models for Driving Scene Representation Poster Session 6 & Exhibit Hall with Coffee Break
Guosheng Zhao ⋅ Xiaofeng Wang ⋅ Chaojun Ni ⋅ Zheng Zhu ⋅ Wenkang Qin ⋅ Guan Huang ⋅ Xingang Wang
Exhibit Hall I #193
VLRMBench: A Comprehensive and Challenging Benchmark for Vision-Language Reward Models Poster Session 1 & Exhibit Hall
JIACHENG RUAN ⋅ Wenzhen Yuan ⋅ Xian Gao ⋅ Ye Guo ⋅ Daoxin Zhang ⋅ Zhe Xu ⋅ Yao Hu ⋅ Ting Liu ⋅ yuzhuo fu
Exhibit Hall I #292
Measuring the Impact of Rotation Equivariance on Aerial Object Detection Poster Session 2 & Exhibit Hall with Coffee Break
Xiuyu Wu ⋅ Xinhao Wang ⋅ Xiubin Zhu ⋅ Lan Yang ⋅ Jiyuan Liu ⋅ Xingchen Hu
Exhibit Hall I #216
Enhanced Pansharpening via Quaternion Spatial-Spectral Interactions Poster Session 3 & Exhibit Hall
Dong Li ⋅ Chunhui Luo ⋅ Yuanfei Bao ⋅ Gang Yang ⋅ Jie Xiao ⋅ Xueyang Fu ⋅ Zheng-Jun Zha
Exhibit Hall I #85
Monocular Semantic Scene Completion via Masked Recurrent Networks Poster Session 6 & Exhibit Hall with Coffee Break
Xuzhi Wang ⋅ Xinran Wu ⋅ Song Wang ⋅ Lingdong Kong ⋅ Ziping Zhao
Exhibit Hall I #9
Client2Vec: Improving Federated Learning by Distribution Shifts Aware Client Indexing Poster Session 1 & Exhibit Hall
Yongxin Guo ⋅ Lin Wang ⋅ Xiaoying Tang ⋅ Tao Lin
Exhibit Hall I #126
OpenRSD: Towards Open-prompts for Object Detection in Remote Sensing Images Poster Session 2 & Exhibit Hall with Coffee Break
Ziyue Huang ⋅ Yongchao Feng ⋅ Ziqi Liu ⋅ Shuai Yang ⋅ Qingjie Liu ⋅ Yunhong Wang
Exhibit Hall I #317
InsViE-1M: Effective Instruction-based Video Editing with Elaborate Dataset Construction Poster Session 4 & Exhibit Hall with Coffee Break
Yuhui WU ⋅ Liyi Chen ⋅ Ruibin Li ⋅ Shihao Wang ⋅ Chenxi Xie ⋅ Lei Zhang
Exhibit Hall I #173
PathDiff: Histopathology Image Synthesis with Unpaired Text and Mask Conditions Poster Session 5 & Exhibit Hall
Mahesh Bhosale ⋅ Abdul Wasi ⋅ Yuanhao Zhai ⋅ Yunjie Tian ⋅ Samuel Border ⋅ Nan Xi ⋅ Pinaki Sarder ⋅ Junsong Yuan ⋅ David Doermann ⋅ Xuan Gong
Exhibit Hall I #246
PhysRig: Differentiable Physics-Based Skinning and Rigging Framework for Realistic Articulated Object Modeling Poster Session 2 & Exhibit Hall with Coffee Break
Hao Zhang ⋅ Haolan Xu ⋅ Chun Feng ⋅ Varun Jampani ⋅ Narendra Ahuja
Exhibit Hall I #150
From Gaze to Movement: Predicting Visual Attention for Autonomous Driving Human-Machine Interaction based on Programmatic Imitation Learning Poster Session 6 & Exhibit Hall with Coffee Break
Yexin Huang ⋅ Yongbin Lin ⋅ Lishengsa Yue ⋅ Zhihong Yao ⋅ Jie Wang
Exhibit Hall I #136
ALOcc: Adaptive Lifting-based 3D Semantic Occupancy and Cost Volume-based Flow Predictions Poster Session 1 & Exhibit Hall
Dubing Chen ⋅ Jin Fang ⋅ Wencheng Han ⋅ Xinjing Cheng ⋅ Junbo Yin ⋅ Cheng-zhong Xu ⋅ Fahad Khan ⋅ Jianbing Shen
Exhibit Hall I #389
Optical Model-Driven Sharpness Mapping for Autofocus in Small Depth-of-Field and Severe Defocus Scenarios Poster Session 2 & Exhibit Hall with Coffee Break
Chen-Liang Fan ⋅ Mingpei Cao ⋅ Chih-Chien Hung ⋅ Yuesheng Zhu
Exhibit Hall I #131
HyPiDecoder: Hybrid Pixel Decoder for Efficient Segmentation and Detection Poster Session 5 & Exhibit Hall
Fengzhe Zhou ⋅ Humphrey Shi
Exhibit Hall I #215
UniCombine: Unified Multi-Conditional Combination with Diffusion Transformer Poster Session 4 & Exhibit Hall with Coffee Break
Haoxuan Wang ⋅ Jinlong Peng ⋅ Qingdong He ⋅ Hao Yang ⋅ Ying Jin ⋅ Jiafu Wu ⋅ Xiaobin Hu ⋅ Yanjie Pan ⋅ Zhenye Gan ⋅ Mingmin Chi ⋅ Bo Peng ⋅ Yabiao Wang
Exhibit Hall I #330
MMAD: Multi-label Micro-Action Detection in Videos Poster Session 3 & Exhibit Hall
Kun Li ⋅ pengyu Liu ⋅ Dan Guo ⋅ Fei Wang ⋅ zhiliang wu ⋅ Hehe Fan ⋅ Meng Wang
Exhibit Hall I #305
MP-HSIR: A Multi-Prompt Framework for Universal Hyperspectral Image Restoration Poster Session 3 & Exhibit Hall
Zhehui Wu ⋅ Yong Chen ⋅ Naoto Yokoya ⋅ Wei He
Exhibit Hall I #283
Perceive, Understand and Restore: Real-World Image Super-Resolution with Autoregressive Multimodal Generative Models Poster Session 4 & Exhibit Hall with Coffee Break
Hongyang Wei ⋅ Shuaizheng Liu ⋅ Chun Yuan ⋅ Lei Zhang
Exhibit Hall I #359
Learning to Generalize without Bias for Open-Vocabulary Action Recognition Poster Session 3 & Exhibit Hall
Yating Yu ⋅ Congqi Cao ⋅ Yifan Zhang ⋅ Yanning Zhang
Exhibit Hall I #263
Semantic Equitable Clustering: A Simple and Effective Strategy for Clustering Vision Tokens Poster Session 1 & Exhibit Hall
Qihang Fan ⋅ Huaibo Huang ⋅ Mingrui Chen ⋅ Ran He
Exhibit Hall I #374
TikZero: Zero-Shot Text-Guided Graphics Program Synthesis Poster Session 4 & Exhibit Hall with Coffee Break
Jonas Belouadi ⋅ Eddy Ilg ⋅ Margret Keuper ⋅ Hideki Tanaka ⋅ Masao Utiyama ⋅ Raj Dabre ⋅ Steffen Eger ⋅ Simone Paolo Ponzetto
Exhibit Hall I #278
SAUCE: Selective Concept Unlearning in Vision-Language Models with Sparse Autoencoders Poster Session 1 & Exhibit Hall
Jiahui Geng ⋅ Qing Li
Exhibit Hall I #279
Training-Free Industrial Defect Generation with Diffusion Models Poster Session 5 & Exhibit Hall
Ruyi Xu ⋅ Yen-Tzu Chiu ⋅ Tai-I Chen ⋅ Oscar Chew ⋅ Yung-Yu Chuang ⋅ Wen-Huang Cheng
Exhibit Hall I #413
Feature Decomposition-Recomposition in Large Vision-Language Model for Few-Shot Class-Incremental Learning Poster Session 1 & Exhibit Hall
Zongyao Xue ⋅ Meina Kan ⋅ Shiguang Shan ⋅ Xilin Chen
Exhibit Hall I #291
When Schrödinger Bridge Meets Real-World Image Dehazing with Unpaired Training Poster Session 2 & Exhibit Hall with Coffee Break
Yunwei Lan ⋅ Zhigao Cui ⋅ Xin Luo ⋅ Chang Liu ⋅ Nian Wang ⋅ Menglin Zhang ⋅ Yanzhao Su ⋅ Dong Liu
Exhibit Hall I #351
TRCE: Towards Reliable Malicious Concept Erasure in Text-to-Image Diffusion Models Poster Session 4 & Exhibit Hall with Coffee Break
Ruidong Chen ⋅ honglin guo ⋅ Lanjun Wang ⋅ Chenyu Zhang ⋅ Weizhi Nie ⋅ Anan Liu
Exhibit Hall I #388
MagicMotion: Controllable Video Generation with Dense-to-Sparse Trajectory Guidance Poster Session 3 & Exhibit Hall
Quanhao Li ⋅ Zhen Xing ⋅ Rui Wang ⋅ Hui Zhang ⋅ Qi Dai ⋅ Zuxuan Wu
Exhibit Hall I #198
Ross3D: Reconstructive Visual Instruction Tuning with 3D-Awareness Poster Session 2 & Exhibit Hall with Coffee Break
Haochen Wang ⋅ Yucheng Zhao ⋅ Tiancai Wang ⋅ Haoqiang Fan ⋅ Xiangyu Zhang ⋅ Zhaoxiang Zhang
Exhibit Hall I #400
SEAL: Semantic Aware Image Watermarking Poster Session 4 & Exhibit Hall with Coffee Break
Kasra Arabi ⋅ R. Teal Witter ⋅ Chinmay Hegde ⋅ Niv Cohen
Exhibit Hall I #124
ArchiSet: Benchmarking Editable and Consistent Single-View 3D Reconstruction of Buildings with Specific Window-to-Wall Ratios Poster Session 6 & Exhibit Hall with Coffee Break
Jun Yin ⋅ Pengyu Zeng ⋅ Licheng Shen ⋅ Miao Zhang ⋅ Jing Zhong ⋅ Yuxing Han ⋅ Shuai Lu
Exhibit Hall I #122
SkySense V2: A Unified Foundation Model for Multi-modal Remote Sensing Poster Session 2 & Exhibit Hall with Coffee Break
Yingying Zhang ⋅ Lixiang Ru ⋅ Kang Wu ⋅ Lei Yu ⋅ Lei Liang ⋅ Yansheng Li ⋅ Jingdong Chen
Exhibit Hall I #388
DMesh++: An Efficient Differentiable Mesh for Complex Shapes Poster Session 6 & Exhibit Hall with Coffee Break
Sanghyun Son ⋅ Matheus Gadelha ⋅ Yang Zhou ⋅ Matthew Fisher ⋅ Zexiang Xu ⋅ Yi-Ling Qiao ⋅ Ming Lin ⋅ Yi Zhou
Exhibit Hall I #181
Advancing Textual Prompt Learning with Anchored Attributes Poster Session 1 & Exhibit Hall
Zheng Li ⋅ Yibing Song ⋅ Ming-Ming Cheng ⋅ Xiang Li ⋅ jian Yang
Exhibit Hall I #336
Meta-Unlearning on Diffusion Models: Preventing Relearning Unlearned Concepts Poster Session 1 & Exhibit Hall
Hongcheng Gao ⋅ Tianyu Pang ⋅ Chao Du ⋅ Taihang Hu ⋅ Zhijie Deng ⋅ Min Lin
Exhibit Hall I #193
AR-1-to-3: Single Image to Consistent 3D Object via Next-View Prediction Poster Session 6 & Exhibit Hall with Coffee Break
Xuying Zhang ⋅ Yupeng Zhou ⋅ Kai Wang ⋅ Yikai Wang ⋅ Zhen Li ⋅ Daquan Zhou ⋅ Shaohui Jiao ⋅ Qibin Hou ⋅ Ming-Ming Cheng
Exhibit Hall I #151
TR-PTS: Task-Relevant Parameter and Token Selection for Efficient Tuning Poster Session 1 & Exhibit Hall
Siqi Luo ⋅ Haoran Yang ⋅ Yi Xin ⋅ Mingyang Yi ⋅ Guangyang Wu ⋅ Guangtao Zhai ⋅ Xiaohong Liu
Exhibit Hall I #409
Benchmarking Multimodal Large Language Models Against Image Corruptions Poster Session 2 & Exhibit Hall with Coffee Break
Xinkuan Qiu ⋅ Meina Kan ⋅ Yongbin Zhou ⋅ Shiguang Shan
Exhibit Hall I #375
Harnessing Uncertainty-aware Bounding Boxes for Unsupervised 3D Object Detection Poster Session 2 & Exhibit Hall with Coffee Break
Ruiyang Zhang ⋅ Hu Zhang ⋅ Zhedong Zheng
Exhibit Hall I #396
DexH2R: A Benchmark for Dynamic Dexterous Grasping in Human-to-Robot Handover Poster Session 3 & Exhibit Hall
Youzhuo Wang ⋅ jiayi ye ⋅ Chuyang Xiao ⋅ Yiming Zhong ⋅ Heng Tao ⋅ Hang Yu ⋅ Yumeng Liu ⋅ Jingyi Yu ⋅ Yuexin Ma
Exhibit Hall I #254
Latent Expression Generation for Referring Image Segmentation and Grounding Poster Session 5 & Exhibit Hall
Seonghoon Yu ⋅ Junbeom Hong ⋅ Joonseok Lee ⋅ Jeany Son
Exhibit Hall I #146
LiON-LoRA: Rethinking LoRA Fusion to Unify Controllable Spatial and Temporal Generation for Video Diffusion Poster Session 3 & Exhibit Hall
Yisu Zhang ⋅ Chenjie Cao ⋅ Chaohui Yu ⋅ Jianke Zhu
Exhibit Hall I #430
BASIC: Boosting Visual Alignment with Intrinsic Refined Embeddings in Multimodal Large Language Models Poster Session 5 & Exhibit Hall
Jianting Tang ⋅ Yubo Wang ⋅ Haoyu Cao ⋅ Linli Xu
Exhibit Hall I #73
Bridging Diffusion Models and 3D Representations: A 3D Consistent Super-Resolution Framework Poster Session 3 & Exhibit Hall
Yi-Ting Chen ⋅ Ting-Hsuan Liao ⋅ Pengsheng Guo ⋅ Alex Schwing ⋅ Jia-Bin Huang
Exhibit Hall I #328
Deterministic Object Pose Confidence Region Estimation Poster Session 4 & Exhibit Hall with Coffee Break
Jinghao Wang ⋅ Zhang Li ⋅ Zi Wang ⋅ Banglei Guan ⋅ Yang Shang ⋅ Qifeng Yu
Exhibit Hall I #383
Online Language Splatting Poster Session 6 & Exhibit Hall with Coffee Break
Saimouli Katragadda ⋅ Cho-Ying Wu ⋅ Yuliang Guo ⋅ Xinyu Huang ⋅ Guoquan Huang ⋅ Liu Ren
Exhibit Hall I #111
JailbreakDiffBench: A Comprehensive Benchmark for Jailbreaking Diffusion Models Poster Session 4 & Exhibit Hall with Coffee Break
Xiaolong Jin ⋅ Zixuan Weng ⋅ Hanxi Guo ⋅ Chenlong Yin ⋅ Siyuan Cheng ⋅ Guangyu Shen ⋅ Xiangyu Zhang
Exhibit Hall I #149
UIPro: Unleashing Superior Interaction Capability For GUI Agents Poster Session 1 & Exhibit Hall
Hongxin Li ⋅ Jingran Su ⋅ Jingfan CHEN ⋅ Zheng Ju ⋅ Yuntao Chen ⋅ Li Qing ⋅ Zhaoxiang Zhang
Exhibit Hall I #143
SALAD -- Semantics-Aware Logical Anomaly Detection Poster Session 5 & Exhibit Hall
Matic Fučka ⋅ Vitjan Zavrtanik ⋅ Danijel Skocaj
Exhibit Hall I #191
FineMotion: A Dataset and Benchmark with both Spatial and Temporal Annotation for Fine-grained Motion Generation and Editing Poster Session 3 & Exhibit Hall
Bizhu Wu ⋅ Jinheng Xie ⋅ Meidan Ding ⋅ Zhe Kong ⋅ Jianfeng Ren ⋅ Ruibin Bai ⋅ Rong Qu ⋅ Linlin Shen
Exhibit Hall I #360
FiffDepth: Feed-forward Transformation of Diffusion-Based Generators for Detailed Depth Estimation Poster Session 2 & Exhibit Hall with Coffee Break
Yunpeng Bai ⋅ Qixing Huang
Exhibit Hall I #94
Perception-as-Control: Fine-grained Controllable Image Animation with 3D-aware Motion Representation Poster Session 3 & Exhibit Hall
Yingjie Chen ⋅ Yifang Men ⋅ Yuan Yao ⋅ Miaomiao Cui ⋅ Liefeng Bo
Exhibit Hall I #412
OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models Poster Session 3 & Exhibit Hall
gaojie lin ⋅ Jianwen Jiang ⋅ Jiaqi Yang ⋅ Zerong Zheng ⋅ Chao Liang ⋅ ZHANG YUAN ⋅ Jingtu Li
Exhibit Hall I #361
Knowledge Transfer from Interaction Learning Poster Session 1 & Exhibit Hall
Yilin Gao ⋅ Kangyi Chen ⋅ Zhongxing Peng ⋅ Hengjie Lu ⋅ Shugong Xu
Exhibit Hall I #333
WIR3D: Visually-Informed and Geometry-Aware 3D Shape Abstraction Poster Session 4 & Exhibit Hall with Coffee Break
Richard Liu ⋅ Daniel Fu ⋅ Noah Tan ⋅ Itai Lang ⋅ Rana Hanocka
Exhibit Hall I #384
GSV3D: Gaussian Splatting-based Geometric Distillation with Stable Video Diffusion for Single-Image 3D Object Generation Poster Session 2 & Exhibit Hall with Coffee Break
Ye Tao ⋅ jiawei zhang ⋅ Yahao Shi ⋅ Dongqing Zou ⋅ Bin Zhou
Exhibit Hall I #257
Multi-modal Segment Anything Model for Camouflaged Scene Segmentation Poster Session 5 & Exhibit Hall
Guangyu Ren ⋅ Hengyan Liu ⋅ Michalis Lazarou ⋅ Tania Stathaki
Exhibit Hall I #8
Shot-by-Shot: Film-Grammar-Aware Training-Free Audio Description Generation Poster Session 4 & Exhibit Hall with Coffee Break
Junyu Xie ⋅ Tengda Han ⋅ Max Bain ⋅ Arsha Nagrani ⋅ Eshika Khandelwal ⋅ Gül Varol ⋅ Weidi Xie ⋅ Andrew Zisserman
Exhibit Hall I #155
MagicColor: Multi-instance Sketch Colorization Poster Session 4 & Exhibit Hall with Coffee Break
yinhan Zhang ⋅ Yue Ma ⋅ Bingyuan Wang ⋅ Qifeng Chen ⋅ Zeyu Wang
Exhibit Hall I #30
Synthesizing Near-Boundary OOD Samples for Out-of-Distribution Detection Poster Session 1 & Exhibit Hall
Jinglun Li ⋅ Kaixun Jiang ⋅ Zhaoyu Chen ⋅ Bo Lin ⋅ Yao Tang ⋅ Weifeng Ge ⋅ Wenqiang Zhang
Exhibit Hall I #422
Cassic: Towards Content-Adaptive State-Space Models for Learned Image Compression Poster Session 4 & Exhibit Hall with Coffee Break
Shiyu Qin ⋅ Jinpeng Wang ⋅ Yimin Zhou ⋅ Bin Chen ⋅ Tianci Luo ⋅ Baoyi An ⋅ Tao Dai ⋅ Shu-Tao Xia ⋅ Yaowei Wang
Exhibit Hall I #80
Know Your Attention Maps: Class-specific Token Masking for Weakly Supervised Semantic Segmentation Poster Session 5 & Exhibit Hall
Joëlle Hanna ⋅ Damian Borth
Exhibit Hall I #373
Can We Achieve Efficient Diffusion Without Self-Attention? Distilling Self-Attention into Convolutions Poster Session 4 & Exhibit Hall with Coffee Break
ZiYi Dong ⋅ Chengxing Zhou ⋅ Weijian Deng ⋅ Pengxu Wei ⋅ Xiangyang Ji ⋅ Liang Lin
Exhibit Hall I #241
Ultra-Precision 6DoF Pose Estimation Using 2-D Interpolated Discrete Fourier Transform Poster Session 2 & Exhibit Hall with Coffee Break
Guowei Shi ⋅ Zian Mao ⋅ Peisen Huang
Exhibit Hall I #72
RealGeneral: Unifying Visual Generation via Temporal In-Context Learning with Video Models Poster Session 4 & Exhibit Hall with Coffee Break
Yijing Lin ⋅ Mengqi Huang ⋅ Shuhan Zhuang ⋅ Zhendong Mao
Exhibit Hall I #10
PixelStitch: Structure-Preserving Pixel-Wise Bidirectional Warps for Unsupervised Image Stitching Poster Session 6 & Exhibit Hall with Coffee Break
Hengzhe Jin ⋅ Lang Nie ⋅ Chunyu Lin ⋅ Xiaomei Feng ⋅ Yao Zhao
Exhibit Hall I #328
A Differentiable Wave Optics Model for End-to-End Computational Imaging System Optimization Poster Session 6 & Exhibit Hall with Coffee Break
Chi-Jui Ho ⋅ Yash Belhe ⋅ Steve Rotenberg ⋅ Ravi Ramamoorthi ⋅ Tzu-Mao Li ⋅ Nicholas Antipa
Exhibit Hall I #320
Towards a Universal Image Degradation Model via Content-Degradation Disentanglement Poster Session 3 & Exhibit Hall
Wenbo Yang ⋅ Zhongling Wang ⋅ Zhou Wang
Exhibit Hall I #279
Intra-view and Inter-view Correlation Guided Multi-view Novel Class Discovery Poster Session 1 & Exhibit Hall
Xinhang Wan ⋅ Jiyuan Liu ⋅ Qian Qu ⋅ Suyuan Liu ⋅ Chuyu Zhang ⋅ Fangdi Wang ⋅ Xinwang Liu ⋅ En Zhu ⋅ Kunlun He
Exhibit Hall I #385
HUST: High-Fidelity Unbiased Skin Tone Estimation via Texture Quantization Poster Session 3 & Exhibit Hall
Zimin Ran ⋅ Xingyu Ren ⋅ Xiang An ⋅ Kaicheng Yang ⋅ Ziyong Feng ⋅ Jing Yang ⋅ Rolandos Alexandros Potamias ⋅ Linchao Zhu ⋅ Jiankang Deng
Exhibit Hall I #332
One Polyp Identifies All: One-Shot Polyp Segmentation with SAM via Cascaded Priors and Iterative Prompt Evolution Poster Session 5 & Exhibit Hall
Xinyu Mao ⋅ Xiaohan Xing ⋅ Fei MENG ⋅ Jianbang LIU ⋅ Fan BAI ⋅ Qiang Nie ⋅ Max Meng
Exhibit Hall I #410
FDPT: Federated Discrete Prompt Tuning for Black-Box Visual-Language Models Poster Session 1 & Exhibit Hall
Jiaqi Wu ⋅ Simin Chen ⋅ Jing Tang ⋅ Yuzhe YANG ⋅ Yiming Chen ⋅ Lixu Wang ⋅ Song Lin ⋅ Zehua Wang ⋅ Wei Chen ⋅ Zijian Tian
Exhibit Hall I #224
Leveraging the Power of MLLMs for Gloss-Free Sign Language Translation Poster Session 5 & Exhibit Hall
Jungeun Kim ⋅ Hyeongwoo Jeon ⋅ Jongseong Bae ⋅ Ha Young Kim
Exhibit Hall I #117
CATP-LLM: Empowering Large Language Models for Cost-Aware Tool Planning Poster Session 2 & Exhibit Hall with Coffee Break
Duo Wu ⋅ Jinghe Wang ⋅ Yuan Meng ⋅ Yanning Zhang ⋅ Le Sun ⋅ Zhi Wang
Exhibit Hall I #346
Dynamic Group Detection using VLM-augmented Temporal Groupness Graph Poster Session 3 & Exhibit Hall
Kaname Yokoyama ⋅ Chihiro Nakatani ⋅ Norimichi Ukita
Exhibit Hall I #43
CanonSwap: High-Fidelity and Consistent Video Face Swapping via Canonical Space Modulation Poster Session 3 & Exhibit Hall
Xiangyang Luo ⋅ Ye Zhu ⋅ Yunfei Liu ⋅ Lijian Lin ⋅ Cong Wan ⋅ Zijian Cai ⋅ Yu Li ⋅ Shao-Lun Huang
Exhibit Hall I #6
MedVSR: Medical Video Super-Resolution with Cross State-Space Propagation Poster Session 3 & Exhibit Hall
Xinyu Liu ⋅ Guolei Sun ⋅ Cheng Wang ⋅ Yixuan Yuan ⋅ Ender Konukoglu
Exhibit Hall I #160
Learning Deblurring Texture Prior from Unpaired Data with Diffusion Model Poster Session 3 & Exhibit Hall
Chengxu Liu ⋅ Lu Qi ⋅ Jinshan Pan ⋅ Xueming Qian ⋅ Ming-Hsuan Yang
Exhibit Hall I #395
Top2Pano: Learning to Generate Indoor Panoramas from Top-Down View Poster Session 6 & Exhibit Hall with Coffee Break
Zitong Zhang ⋅ Suranjan Gautam ⋅ Rui Yu
Exhibit Hall I #365
ConformalSAM: Unlocking the Potential of Foundational Segmentation Models in Semi-Supervised Semantic Segmentation with Conformal Prediction Poster Session 5 & Exhibit Hall
Danhui Chen ⋅ Ziquan Liu ⋅ Chuxi Yang ⋅ Dan Wang ⋅ Yan Yan ⋅ Yi Xu ⋅ Xiangyang Ji
Exhibit Hall I #398
Underwater Visual SLAM with Depth Uncertainty and Medium Modeling Poster Session 1 & Exhibit Hall
Rui Liu ⋅ Sheng Fan ⋅ Wenguan Wang ⋅ Yi Yang
Exhibit Hall I #83
Generalization-Preserved Learning: Closing the Backdoor to Catastrophic Forgetting in Continual Deepfake Detection Poster Session 1 & Exhibit Hall
Xueyi Zhang ⋅ Peiyin Zhu ⋅ Chengwei Zhang ⋅ Zhiyuan Yan ⋅ Jikang Cheng ⋅ Mingrui Lao ⋅ Siqi Cai ⋅ Yanming Guo
Exhibit Hall I #353
Open-Vocabulary Octree-Graph for 3D Scene Understanding Poster Session 2 & Exhibit Hall with Coffee Break
Zhigang Wang ⋅ Yifei Su ⋅ Chenhui Li ⋅ Dong Wang ⋅ Yan Huang ⋅ Xuelong Li ⋅ Bin Zhao
Exhibit Hall I #189
LangBridge: Interpreting Image as a Combination of Language Embeddings Poster Session 5 & Exhibit Hall
Jiaqi Liao ⋅ Yuwei Niu ⋅ Fanqing Meng ⋅ Hao Li ⋅ Changyao Tian ⋅ Yinuo Du ⋅ Yuwen Xiong ⋅ Dianqi Li ⋅ Xizhou Zhu ⋅ Li Yuan ⋅ Jifeng Dai ⋅ Yu Cheng
Exhibit Hall I #372
IGD: Instructional Graphic Design with Multimodal Layer Generation Poster Session 4 & Exhibit Hall with Coffee Break
Yadong Qu ⋅ Shancheng Fang ⋅ Yuxin Wang ⋅ Xiaorui Wang ⋅ Zhineng Chen ⋅ Hongtao Xie ⋅ Yongdong Zhang
Exhibit Hall I #320
ResGS: Residual Densification of 3D Gaussian for Efficient Detail Recovery Poster Session 6 & Exhibit Hall with Coffee Break
Yanzhe Lyu ⋅ Kai Cheng ⋅ Kang Xin ⋅ Xuejin Chen
Exhibit Hall I #325
DASH: 4D Hash Encoding with Self-Supervised Decomposition for Real-Time Dynamic Scene Rendering Poster Session 6 & Exhibit Hall with Coffee Break
Jie Chen ⋅ Zhangchi Hu ⋅ Peixi Wu ⋅ Huyue Zhu ⋅ Hebei Li ⋅ Xiaoyan Sun
Exhibit Hall I #158
Bridging the Gap Between Ideal and Real-world Evaluation: Benchmarking AI-Generated Image Detection in Challenging Scenarios Poster Session 5 & Exhibit Hall
Chunxiao Li ⋅ Xiaoxiao Wang ⋅ Meiling Li ⋅ Boming Miao ⋅ Peng Sun ⋅ Yunjian Zhang ⋅ Xiangyang Ji ⋅ Yao Zhu
Exhibit Hall I #54
You Share Beliefs, I Adapt: Progressive Heterogeneous Collaborative Perception Poster Session 6 & Exhibit Hall with Coffee Break
hao si ⋅ Ehsan Javanmardi ⋅ Manabu Tsukada
Exhibit Hall I #270
Learning Normal Flow Directly From Events Poster Session 2 & Exhibit Hall with Coffee Break
Dehao Yuan ⋅ Levi Burner ⋅ Jiayi Wu ⋅ Minghui Liu ⋅ Jingxi Chen ⋅ Yiannis Aloimonos ⋅ Cornelia Fermuller
Exhibit Hall I #277
UniMLVG: Unified Framework for Multi-view Long Video Generation with Comprehensive Control Capabilities for Autonomous Driving Poster Session 6 & Exhibit Hall with Coffee Break
Rui Chen ⋅ Zehuan Wu ⋅ Yichen Liu ⋅ Yuxin Guo ⋅ Jingcheng Ni ⋅ Haifeng Xia ⋅ Siyu Xia
Exhibit Hall I #69
EVT: Efficient View Transformation for Multi-Modal 3D Object Detection Poster Session 6 & Exhibit Hall with Coffee Break
Yongjin Lee ⋅ Hyeon-Mun Jeong ⋅ Yurim Jeon ⋅ Sanghyun Kim
Exhibit Hall I #185
An Inversion-based Measure of Memorization for Diffusion Models Poster Session 4 & Exhibit Hall with Coffee Break
Zhe Ma ⋅ Qingming Li ⋅ Xuhong Zhang ⋅ Tianyu Du ⋅ Ruixiao Lin ⋅ Zonghui Wang ⋅ Shouling Ji ⋅ Wenzhi CHEN
Exhibit Hall I #198
TLB-VFI: Temporal-Aware Latent Brownian Bridge Diffusion for Video Frame Interpolation Poster Session 4 & Exhibit Hall with Coffee Break
Zonglin Lyu ⋅ Chen Chen
Exhibit Hall I #130
Face Retouching with Diffusion Data Generation and Spectral Restorement Poster Session 3 & Exhibit Hall
Zhidan Xu ⋅ Xiaoqin Zhang ⋅ Shijian Lu
Exhibit Hall I #444
HumanSAM: Classifying Human-centric Forgery Videos in Human Spatial, Appearance, and Motion Anomaly Poster Session 3 & Exhibit Hall
Chang Liu ⋅ Yunfan Ye ⋅ Fan Zhang ⋅ Qingyang Zhou ⋅ Yuchuan Luo ⋅ Zhiping Cai
Exhibit Hall I #380
OcRFDet: Object-Centric Radiance Fields for Multi-View 3D Object Detection in Autonomous Driving Poster Session 6 & Exhibit Hall with Coffee Break
Mingqian Ji ⋅ Jian Yang ⋅ Shanshan Zhang
Exhibit Hall I #20
Contrastive Flow Matching Poster Session 1 & Exhibit Hall
George Stoica ⋅ Vivek Ramanujan ⋅ Xiang Fan ⋅ Ali Farhadi ⋅ Ranjay Krishna ⋅ Judy Hoffman
Exhibit Hall I #103
Class Token as Proxy: Optimal Transport-assisted Proxy Learning for Weakly Supervised Semantic Segmentation Poster Session 5 & Exhibit Hall
Jian Wang ⋅ Tianhong Dai ⋅ Bingfeng Zhang ⋅ Siyue Yu ⋅ ENG Gee LIM ⋅ Jimin XIAO
Exhibit Hall I #173
Neural Compression for 3D Geometry Sets Poster Session 6 & Exhibit Hall with Coffee Break
Siyu Ren ⋅ Junhui Hou ⋅ Weiyao Lin ⋅ Wenping Wang
Exhibit Hall I #54
Learnable Logit Adjustment for Imbalanced Semi-Supervised Learning under Class Distribution Mismatch Poster Session 1 & Exhibit Hall
lee hyuck ⋅ Taemin Park ⋅ Heeyoung Kim
Exhibit Hall I #245
SMARTIES: Spectrum-Aware Multi-Sensor Auto-Encoder for Remote Sensing Images Poster Session 2 & Exhibit Hall with Coffee Break
Gencer Sumbul ⋅ Chang Xu ⋅ Emanuele Dalsasso ⋅ Devis Tuia
Exhibit Hall I #51
Dataset Ownership Verification for Pre-trained Masked Models Poster Session 1 & Exhibit Hall
Yuechen Xie ⋅ Jie Song ⋅ Yicheng Shan ⋅ Xiaoyan Zhang ⋅ Yuanyu Wan ⋅ Shengxuming Zhang ⋅ Jiarui Duan ⋅ Mingli Song
Exhibit Hall I #289
CARL: Causality-guided Architecture Representation Learning for an Interpretable Performance Predictor Poster Session 5 & Exhibit Hall
Han Ji ⋅ Yuqi Feng ⋅ Jiahao Fan ⋅ Yanan Sun
Exhibit Hall I #302
From Holistic to Localized: Local Enhanced Adapters for Efficient Visual Instruction Fine-Tuning Poster Session 1 & Exhibit Hall
Pengkun Jiao ⋅ Bin Zhu ⋅ Jingjing Chen ⋅ Chong-Wah Ngo ⋅ Yu-Gang Jiang
Exhibit Hall I #251
DiffPCI: Large Motion Point Cloud frame Interpolation with Diffusion Model Poster Session 6 & Exhibit Hall with Coffee Break
tianyu zhang ⋅ Haobo Jiang ⋅ jian Yang ⋅ Jin Xie
Exhibit Hall I #254
GLEAM: Enhanced Transferable Adversarial Attacks for Vision-Language Pre-training Models via Global-Local Transformations Poster Session 1 & Exhibit Hall
Yunqi Liu ⋅ Xiaohui Cui ⋅ Ouyang Xue
Exhibit Hall I #148
Disentangled World Models: Learning to Transfer Semantic Knowledge from Distracting Videos for Reinforcement Learning Poster Session 1 & Exhibit Hall
Qi Wang ⋅ Zhipeng Zhang ⋅ Baao Xie ⋅ Xin Jin ⋅ Yunbo Wang ⋅ Shiyu Wang ⋅ Liaomo Zheng ⋅ Xiaokang Yang ⋅ Wenjun Zeng
Exhibit Hall I #239
Stepping Out of Similar Semantic Space for Open-Vocabulary Segmentation Poster Session 5 & Exhibit Hall
Yong Liu ⋅ Song-Li Wu ⋅ Sule Bai ⋅ Jiahao Wang ⋅ Yitong Wang ⋅ Yansong Tang
Exhibit Hall I #269
MultiModal Action Conditioned Video Simulation Poster Session 3 & Exhibit Hall
Yichen Li ⋅ Antonio Torralba
Exhibit Hall I #393
ClearSight: Human Vision-Inspired Solutions for Event-Based Motion Deblurring Poster Session 2 & Exhibit Hall with Coffee Break
Xiaopeng LIN ⋅ Yulong Huang ⋅ Hongwei Ren ⋅ Zunchang Liu ⋅ Hongxiang Huang ⋅ Yue Zhou ⋅ Haotian FU ⋅ Bojun Cheng
Exhibit Hall I #230
PBFG: A New Physically-Based Dataset and Removal of Lens Flares and Glares Poster Session 2 & Exhibit Hall with Coffee Break
Jie Zhu ⋅ Sungkil Lee
Exhibit Hall I #40
Correspondence as Video: Test-Time Adaption on SAM2 for Reference Segmentation in the Wild Poster Session 2 & Exhibit Hall with Coffee Break
Haoran Wang ⋅ Zekun Li ⋅ Jian Zhang ⋅ Lei Qi ⋅ Yinghuan Shi
Exhibit Hall I #296
An Information-Theoretic Regularizer for Lossy Neural Image Compression Poster Session 4 & Exhibit Hall with Coffee Break
ZHANG YINGWEN ⋅ Meng Wang ⋅ Xihua Sheng ⋅ Peilin CHEN ⋅ Junru Li ⋅ Li Zhang ⋅ Shiqi Wang
Exhibit Hall I #64
Knowledge-Guided Part Segmentation Poster Session 2 & Exhibit Hall with Coffee Break
Xuejian Gou ⋅ Fang Liu ⋅ Licheng Jiao ⋅ Shuo Li ⋅ Lingling Li ⋅ Hao Wang ⋅ Xu Liu ⋅ Puhua Chen ⋅ wenping ma
Exhibit Hall I #44
ASGS: Single-Domain Generalizable Open-Set Object Detection via Adaptive Subgraph Searching Poster Session 5 & Exhibit Hall
Yuxuan Yuan ⋅ Luyao Tang ⋅ Chaoqi Chen ⋅ Yixin Chen ⋅ Yue Huang ⋅ Xinghao Ding
Exhibit Hall I #105
DADet: Safeguarding Image Conditional Diffusion Models against Adversarial and Backdoor Attacks via Diffusion Anomaly Detection Poster Session 4 & Exhibit Hall with Coffee Break
Hongwei Yu ⋅ Xinlong Ding ⋅ Jiawei Li ⋅ Jinlong Wang ⋅ Yudong Zhang ⋅ Rongquan Wang ⋅ Huimin Ma ⋅ Jiansheng Chen
Exhibit Hall I #242
Unbiased Region-Language Alignment for Open-Vocabulary Dense Prediction Poster Session 5 & Exhibit Hall
Yunheng Li ⋅ Yuxuan Li ⋅ Quan-Sheng Zeng ⋅ Wenhai Wang ⋅ Qibin Hou ⋅ Ming-Ming Cheng
Exhibit Hall I #378
Rethinking Layered Graphic Design Generation with a Top-Down Approach Poster Session 4 & Exhibit Hall with Coffee Break
Jingye Chen ⋅ Zhaowen Wang ⋅ Nanxuan Zhao ⋅ Li Zhang ⋅ Difan Liu ⋅ Jimei Yang ⋅ Qifeng Chen
Exhibit Hall I #189
LEGO-Maker: A Semantic-Driven Algorithm for Text-to-3D Generation Poster Session 4 & Exhibit Hall with Coffee Break
Yifei Zhang ⋅ Lei Chen
Exhibit Hall I #23
ReME: A Data-Centric Framework for Training-Free Open-Vocabulary Segmentation Poster Session 5 & Exhibit Hall
Xiwei Xuan ⋅ Ziquan Deng ⋅ Kwan-Liu Ma
Exhibit Hall I #109
MonoMobility: Zero-Shot 3D Mobility Analysis from Monocular Videos Poster Session 2 & Exhibit Hall with Coffee Break
Hongyi Zhou ⋅ Xiaogang Wang ⋅ Yulan Guo ⋅ Kai Xu
Exhibit Hall I #355
PCR-GS: COLMAP-Free 3D Gaussian Splatting via Pose Co-Regularizations Poster Session 6 & Exhibit Hall with Coffee Break
YU WEI ⋅ Jiahui Zhang ⋅ Xiaoqin Zhang ⋅ Ling Shao ⋅ Shijian Lu
Exhibit Hall I #172
Performing Defocus Deblurring by Modeling its Formation Process Poster Session 2 & Exhibit Hall with Coffee Break
Zhengbo Zhang ⋅ Lin Geng Foo ⋅ Hossein Rahmani ⋅ Jun Liu ⋅ De Wen Soh
Exhibit Hall I #71
CasP: Improving Semi-Dense Feature Matching Pipeline Leveraging Cascaded Correspondence Priors for Guidance Poster Session 6 & Exhibit Hall with Coffee Break
Peiqi Chen ⋅ Lei Yu ⋅ Yi Wan ⋅ Yingying Pei ⋅ Xinyi Liu ⋅ YongxiangYao YongxiangYao ⋅ Yingying Zhang ⋅ Lixiang Ru ⋅ Liheng Zhong ⋅ Jingdong Chen ⋅ Ming Yang ⋅ Yongjun Zhang
Exhibit Hall I #322
OmniDiff: A Comprehensive Benchmark for Fine-grained Image Difference Captioning Poster Session 5 & Exhibit Hall
Yuan Liu ⋅ Saihui Hou ⋅ Saijie Hou ⋅ Jiabao Du ⋅ Shibei Meng ⋅ Yongzhen Huang
Exhibit Hall I #152
Toward Long-Tailed Online Anomaly Detection through Class-Agnostic Concepts Poster Session 5 & Exhibit Hall
Chiao-An Yang ⋅ Kuan-Chuan Peng ⋅ Raymond A. Yeh
Exhibit Hall I #341
PLMP - Point-Line Minimal Problems for Projective SfM Poster Session 2 & Exhibit Hall with Coffee Break
Kim Kiehn ⋅ Albin Ahlbäck ⋅ Kathlén Kohn
Exhibit Hall I #333
More Reliable Pseudo-labels, Better Performance: A Generalized Approach to Single Positive Multi-label Learning Poster Session 1 & Exhibit Hall
Luong Tran ⋅ Thieu Vo ⋅ Anh Nguyen ⋅ Sang Dinh ⋅ Van Nguyen
Exhibit Hall I #118
SpiLiFormer: Enhancing Spiking Transformers with Lateral Inhibition Poster Session 5 & Exhibit Hall
Zeqi Zheng ⋅ Yanchen Huang ⋅ Yingchao Yu ⋅ Zizheng Zhu ⋅ Junfeng Tang ⋅ Zhaofei Yu ⋅ Yaochu Jin
Exhibit Hall I #444
SparseVILA: Decoupling Visual Sparsity for Efficient VLM Inference Poster Session 5 & Exhibit Hall
Samir Khaki ⋅ Junxian Guo ⋅ Jiaming Tang ⋅ Shang Yang ⋅ Yukang Chen ⋅ Konstantinos Plataniotis ⋅ Yao Lu ⋅ Song Han ⋅ Zhijian Liu
Exhibit Hall I #375
Active Perception Meets Rule-Guided RL: A Two-Phase Approach for Precise Object Navigation in Complex Environments Poster Session 2 & Exhibit Hall with Coffee Break
Liang Qin ⋅ Min Wang ⋅ Peiwei Li ⋅ Wengang Zhou ⋅ Houqiang Li
Exhibit Hall I #243
INTER: Mitigating Hallucination in Large Vision-Language Models by Interaction Guidance Sampling Poster Session 1 & Exhibit Hall
Xin Dong ⋅ Shichao Dong ⋅ Jin Wang ⋅ Jing Huang ⋅ Li Zhou ⋅ Zenghui Sun ⋅ Lihua Jing ⋅ Jinsong Lan ⋅ Xiaoyong Zhu ⋅ Bo Zheng
Exhibit Hall I #233
UNIS: A Unified Framework for Achieving Unbiased Neural Implicit Surfaces in Volume Rendering Poster Session 6 & Exhibit Hall with Coffee Break
Junkai Deng ⋅ Hanting Niu ⋅ Jiaze Li ⋅ Fei Hou ⋅ Ying He
Exhibit Hall I #284
Loss Functions for Predictor-based Neural Architecture Search Poster Session 1 & Exhibit Hall
Han Ji ⋅ Yuqi Feng ⋅ Jiahao Fan ⋅ Yanan Sun
Exhibit Hall I #144
Advancing Text-to-3D Generation with Linearized Lookahead Variational Score Distillation Poster Session 4 & Exhibit Hall with Coffee Break
Yu Lei ⋅ Bingde Liu ⋅ Qingsong Xie ⋅ Haonan Lu ⋅ Zhijie Deng
Exhibit Hall I #448
Decoding Correlation-Induced Misalignment in the Stable Diffusion Workflow for Text-to-Image Generation Poster Session 4 & Exhibit Hall with Coffee Break
Yunze Tong ⋅ Fengda Zhang ⋅ Didi Zhu ⋅ Jun Xiao ⋅ Kun Kuang
Exhibit Hall I #317
Steering Guidance for Personalized Text-to-Image Diffusion Models Poster Session 4 & Exhibit Hall with Coffee Break
Sunghyun Park ⋅ Seokeon Choi ⋅ Hyoungwoo Park ⋅ Sungrack Yun
Exhibit Hall I #97
ONLY: One-Layer Intervention Sufficiently Mitigates Hallucinations in Large Vision-Language Models Poster Session 1 & Exhibit Hall
Zifu Wan ⋅ Ce Zhang ⋅ Silong Yong ⋅ Martin Ma ⋅ Simon Stepputtis ⋅ Louis-Philippe Morency ⋅ Deva Ramanan ⋅ Katia Sycara ⋅ Yaqi Xie
Exhibit Hall I #298
M-SpecGene: Generalized Foundation Model for RGBT Multispectral Vision Poster Session 2 & Exhibit Hall with Coffee Break
Kailai Zhou ⋅ Fuqiang Yang ⋅ Shixian Wang ⋅ Bihan Wen ⋅ Chongde Zi ⋅ Linsen Chen ⋅ Qiu Shen ⋅ Xun Cao
Exhibit Hall I #267
SpatialCrafter: Unleashing the Imagination of Video Diffusion Models for Scene Reconstruction from Limited Observations Poster Session 6 & Exhibit Hall with Coffee Break
Songchun Zhang ⋅ Huiyao Xu ⋅ Sitong Guo ⋅ Zhongwei Xie ⋅ Hujun Bao ⋅ Weiwei Xu ⋅ Changqing Zou
Exhibit Hall I #297
Seeing Through Deepfakes: A Human-Inspired Framework for Multi-Face Detection Poster Session 3 & Exhibit Hall
Juan Hu ⋅ Shaojing Fan ⋅ Terence Sim
Exhibit Hall I #425
Snakes and Ladders: Two Steps Up for VideoMamba Poster Session 5 & Exhibit Hall
Hui Lu ⋅ Albert Ali Salah ⋅ Ronald Poppe
Exhibit Hall I #415
Efficient Visual Place Recognition Through Multimodal Semantic Knowledge Integration Poster Session 2 & Exhibit Hall with Coffee Break
Sitao Zhang ⋅ Hongda Mao ⋅ Qingshuang Chen ⋅ Yelin Kim
Exhibit Hall I #54
COME: Dual Structure-Semantic Learning with Collaborative MoE for Universal Lesion Detection Across Heterogeneous Ultrasound Datasets Poster Session 5 & Exhibit Hall
Lingyu Chen ⋅ Yawen Zeng ⋅ Yue Wang ⋅ Peng Wan ⋅ Guo-chen Ning ⋅ Hongen Liao ⋅ Daoqiang Zhang ⋅ Fang Chen
Exhibit Hall I #154
Hybrid Layout Control for Diffusion Transformer: Fewer Annotations, Superior Aesthetics Poster Session 4 & Exhibit Hall with Coffee Break
Keming Wu ⋅ Junwen Chen ⋅ Zhanhao Liang ⋅ Yinuo Wang ⋅ Ji Li ⋅ Chao Zhang ⋅ Bin Wang ⋅ Yuhui Yuan
Exhibit Hall I #291
PLAN: Proactive Low-Rank Allocation for Continual Learning Poster Session 1 & Exhibit Hall
XIEQUN WANG ⋅ Zhan Zhuang ⋅ Yu Zhang
Exhibit Hall I #268
Leveraging Spatial Invariance to Boost Adversarial Transferability Poster Session 1 & Exhibit Hall
Zihan Zhou ⋅ LI LI ⋅ Yanli Ren ⋅ Chuan Qin ⋅ Guorui Feng
Exhibit Hall I #125
AnyPortal: Zero-Shot Consistent Video Background Replacement Poster Session 4 & Exhibit Hall with Coffee Break
Wenshuo Gao ⋅ Xicheng Lan ⋅ Shuai Yang
Exhibit Hall I #394
Textured 3D Regenerative Morphing with 3D Diffusion Prior Poster Session 4 & Exhibit Hall with Coffee Break
Songlin Yang ⋅ Yushi LAN ⋅ Honghua Chen ⋅ Xingang Pan
Exhibit Hall I #26
Inference-Time Diffusion Model Distillation Poster Session 1 & Exhibit Hall
Geon Yeong Park ⋅ Sang Wan Lee ⋅ Jong Ye
Exhibit Hall I #377
Marigold-DC: Zero-Shot Monocular Depth Completion with Guided Diffusion Poster Session 2 & Exhibit Hall with Coffee Break
Massimiliano Viola ⋅ Kevin Qu ⋅ Nando Metzger ⋅ Bingxin Ke ⋅ Alexander Becker ⋅ Konrad Schindler ⋅ Anton Obukhov
Exhibit Hall I #32
Transformer-based Tooth Alignment Prediction with Occlusion and Collision Constraints Poster Session 6 & Exhibit Hall with Coffee Break
DongZhenXing DongZhenXing ⋅ Jiazhou Chen
Exhibit Hall I #40
EC-Flow: Enabling Versatile Robotic Manipulation from Action-Unlabeled Videos via Embodiment-Centric Flow Poster Session 3 & Exhibit Hall
Yixiang Chen ⋅ Peiyan Li ⋅ Yan Huang ⋅ Jiabing Yang ⋅ Kehan Chen ⋅ Liang Wang
Exhibit Hall I #184
CityNav: A Large-Scale Dataset for Real-World Aerial Navigation Poster Session 2 & Exhibit Hall with Coffee Break
Jungdae Lee ⋅ Taiki Miyanishi ⋅ Shuhei Kurita ⋅ Koya Sakamoto ⋅ Daichi Azuma ⋅ Yutaka Matsuo ⋅ Nakamasa Inoue
Exhibit Hall I #84
Scene Graph Guided Generation: Enable Accurate Relations Generation in Text-to-Image Models via Textural Rectification Poster Session 4 & Exhibit Hall with Coffee Break
Guibao SHEN ⋅ Luozhou Wang ⋅ Jiantao Lin ⋅ Wenhang Ge ⋅ CHAOZHE ZHANG ⋅ Xin Tao ⋅ Di ZHANG ⋅ Pengfei Wan ⋅ Guangyong Chen ⋅ Yijun Li ⋅ Ying-Cong Chen
Exhibit Hall I #51
ReMP-AD: Retrieval-enhanced Multi-modal Prompt Fusion for Few-Shot Industrial Visual Anomaly Detection Poster Session 5 & Exhibit Hall
Hongchi Ma ⋅ Guanglei Yang ⋅ Debin Zhao ⋅ Yanli JI ⋅ Wangmeng Zuo
Exhibit Hall I #58
GMMamba: Group Masking Mamba for Whole Slide Image Classification Poster Session 3 & Exhibit Hall
Tingting Zheng ⋅ Hongxun Yao ⋅ Kui Jiang ⋅ Yi Xiao ⋅ Sicheng Zhao
Exhibit Hall I #301
TimeFormer: Capturing Temporal Relationships of Deformable 3D Gaussians for Robust Reconstruction Poster Session 2 & Exhibit Hall with Coffee Break
Dadong Jiang ⋅ Zhi Hou ⋅ Zhihui Ke ⋅ Xianghui Yang ⋅ Xiaobo Zhou ⋅ Tie Qiu
Exhibit Hall I #348
Hierarchy UGP: Hierarchy Unified Gaussian Primitive for Large-Scale Dynamic Scene Reconstruction Poster Session 6 & Exhibit Hall with Coffee Break
Hongyang Sun ⋅ Qinglin Yang ⋅ Jiawei Wang ⋅ Zhen Xu ⋅ Chen Liu ⋅ Yida Wang ⋅ Kun Zhan ⋅ Hujun Bao ⋅ Xiaowei Zhou ⋅ Sida Peng
Exhibit Hall I #149
SciVid: Cross-Domain Evaluation of Video Models in Scientific Applications Poster Session 5 & Exhibit Hall
Yana Hasson ⋅ Pauline Luc ⋅ Liliane Momeni ⋅ Maks Ovsjanikov ⋅ Guillaume Le Moing ⋅ Alina Kuznetsova ⋅ Ira Ktena ⋅ Jennifer J. Sun ⋅ Skanda Koppula ⋅ Dilara Gokay ⋅ Joseph Heyward ⋅ Etienne Pot ⋅ Andrew Zisserman
Exhibit Hall I #187
Backdoor Mitigation by Distance-Driven Detoxification Poster Session 1 & Exhibit Hall
Shaokui Wei ⋅ Jiayin Liu ⋅ Hongyuan Zha
Exhibit Hall I #419
Democratizing High-Fidelity Co-Speech Gesture Video Generation Poster Session 3 & Exhibit Hall
Xu Yang ⋅ Shaoli Huang ⋅ Shenbo Xie ⋅ Xuelin Chen ⋅ Yifei Liu ⋅ Changxing Ding
Exhibit Hall I #403
UnrealZoo: Enriching Photo-realistic Virtual Worlds for Embodied AI Poster Session 2 & Exhibit Hall with Coffee Break
Fangwei Zhong ⋅ Kui Wu ⋅ Churan Wang ⋅ Hao Chen ⋅ Hai Ci ⋅ Zhoujun Li ⋅ Yizhou Wang
Exhibit Hall I #69
Region-based Cluster Discrimination for Visual Representation Learning Poster Session 1 & Exhibit Hall
Yin Xie ⋅ Kaicheng Yang ⋅ Xiang An ⋅ Kun Wu ⋅ Yongle Zhao ⋅ Weimo Deng ⋅ Zimin Ran ⋅ Yumeng Wang ⋅ Ziyong Feng ⋅ Roy Miles ⋅ Ismail Elezi ⋅ Jiankang Deng
Exhibit Hall I #162
CMB-ML: A Cosmic Microwave Background Dataset for the Oldest Possible Computer Vision Task Poster Session 2 & Exhibit Hall with Coffee Break
James Amato ⋅ Yunan Xie ⋅ Leonel Medina-Varela ⋅ Ammar Aljerwi ⋅ Adam McCutcheon ⋅ T. Rippentrop ⋅ Kristian Gonzalez ⋅ Jacques Delabrouille ⋅ Mustapha Ishak ⋅ Nicholas Ruozzi
Exhibit Hall I #413
Adapt Foundational Segmentation Models with Heterogeneous Searching Space Poster Session 5 & Exhibit Hall
Li Yi ⋅ Jie Hu ⋅ Songan Zhang ⋅ GUANNAN JIANG
Exhibit Hall I #336
Think Twice: Test-Time Reasoning for Robust CLIP Zero-Shot Classification Poster Session 1 & Exhibit Hall
Shenyu Lu ⋅ Zhaoying Pan ⋅ Xiaoqian Wang
Exhibit Hall I #269
Rethinking Detecting Salient and Camouflaged Objects in Unconstrained Scenes Poster Session 5 & Exhibit Hall
Zhangjun Zhou ⋅ Yiping Li ⋅ Chunlin Zhong ⋅ Jianuo Huang ⋅ Jialun Pei ⋅ Hua Li ⋅ He Tang
Exhibit Hall I #242
Counting Stacked Objects Poster Session 5 & Exhibit Hall
Corentin Dumery ⋅ Noa Ette ⋅ Aoxiang Fan ⋅ Ren Li ⋅ Jingyi Xu ⋅ Hieu Le ⋅ Pascal Fua
Exhibit Hall I #74
Joint Self-Supervised Video Alignment and Action Segmentation Poster Session 3 & Exhibit Hall
Ali Shah Ali ⋅ Syed Ahmed Mahmood ⋅ Mubin Saeed ⋅ Andrey Konin ⋅ Zeeshan Zia ⋅ Quoc-Huy Tran
Exhibit Hall I #76
TrackAny3D: Transferring Pretrained 3D Models for Category-unified 3D Point Cloud Tracking Poster Session 6 & Exhibit Hall with Coffee Break
Mengmeng Wang ⋅ Haonan Wang ⋅ Yulong Li ⋅ Xiangjie Kong ⋅ Jiaxin Du ⋅ Feng Xia ⋅ Guojiang Shen
Exhibit Hall I #340
Allowing Oscillation Quantization: Overcoming Solution Space Limitation in Low Bit-Width Quantization Poster Session 5 & Exhibit Hall
Weiying Xie ⋅ Zihan Meng ⋅ Jitao Ma ⋅ Wenjin Guo ⋅ Haowei Li ⋅ Haonan Qin ⋅ Leyuan Fang ⋅ Yunsong Li
Exhibit Hall I #451
MOVE: Motion-Guided Few-Shot Video Object Segmentation Poster Session 3 & Exhibit Hall
Kaining Ying ⋅ Hengrui Hu ⋅ Henghui Ding
Exhibit Hall I #154
SDFormer: Vision-based 3D Semantic Scene Completion via SAM-assisted Dual-channel Voxel Transformer Poster Session 6 & Exhibit Hall with Coffee Break
Yujie Xue ⋅ Huilong Pi ⋅ Jiapeng Zhang ⋅ Qin Yunchuan ⋅ Zhuo Tang ⋅ Kenli Li ⋅ Ruihui Li
Exhibit Hall I #204
Enhancing Numerical Prediction of MLLMs with Soft Labeling Poster Session 1 & Exhibit Hall
Pei Wang ⋅ Zhaowei Cai ⋅ Hao Yang ⋅ Davide Modolo ⋅ Ashwin Swaminathan
Exhibit Hall I #318
TopoTTA: Topology-Enhanced Test-Time Adaptation for Tubular Structure Segmentation Poster Session 5 & Exhibit Hall
Jiale Zhou ⋅ Wenhan Wang ⋅ Shikun Li ⋅ Xiaolei Qu ⋅ Xin Guo ⋅ Yizhong Liu ⋅ Wenzhong Tang ⋅ Xun Lin ⋅ Yefeng Zheng
Exhibit Hall I #405
RealCam-I2V: Real-World Image-to-Video Generation with Interactive Complex Camera Control Poster Session 6 & Exhibit Hall with Coffee Break
Teng Li ⋅ Guangcong Zheng ⋅ Rui Jiang ⋅ Shuigenzhan Shuigenzhan ⋅ Tao Wu ⋅ Yehao Lu ⋅ Yining Lin ⋅ Chuanyun Deng ⋅ Yepan Xiong ⋅ Min Chen ⋅ Lin Cheng ⋅ Xi Li
Exhibit Hall I #392
ReAL-AD: Towards Human-Like Reasoning in End-to-End Autonomous Driving Poster Session 6 & Exhibit Hall with Coffee Break
Yuhang Lu ⋅ Jiadong Tu ⋅ Yuexin Ma ⋅ Xinge Zhu
Exhibit Hall I #296
TAD-E2E: A Large-scale End-to-end Autonomous Driving Dataset Poster Session 6 & Exhibit Hall with Coffee Break
Chang Liu ⋅ mingxuzhu mingxuzhu ⋅ Zheyuan Zhang ⋅ Linna Song ⋅ xiao zhao ⋅ Luo Qingliang ⋅ Qi Wang ⋅ Chufan Guo ⋅ Kuifeng Su
Exhibit Hall I #182
MotionDiff: Training-free Zero-shot Interactive Motion Editing via Flow-assisted Multi-view Diffusion Poster Session 3 & Exhibit Hall
Yikun Ma ⋅ Yiqing Li ⋅ Jiawei Wu ⋅ Xing Luo ⋅ Zhi Jin
Exhibit Hall I #421
FPEM: Face Prior Enhanced Facial Attractiveness Prediction for Live Videos with Face Retouching Poster Session 3 & Exhibit Hall
Hui Li ⋅ Xiaoyu Ren ⋅ Hongjiu Yu ⋅ Ying Chen ⋅ Kai Li ⋅ L Wang ⋅ Xiongkuo Min ⋅ Huiyu Duan ⋅ Guangtao Zhai ⋅ Xu Liu
Exhibit Hall I #136
VAGUE: Visual Contexts Clarify Ambiguous Expressions Poster Session 1 & Exhibit Hall
Heejeong Nam ⋅ Jinwoo Ahn ⋅ Keummin Ka ⋅ Jiwan Chung ⋅ Youngjae Yu
Exhibit Hall I #136
Overcoming Dual Drift for Continual Long-Tailed Visual Question Answering Poster Session 1 & Exhibit Hall
Feifei Zhang ⋅ Zhihao Wang ⋅ Xi Zhang ⋅ Changsheng Xu
Exhibit Hall I #414
Photolithography Overlay Map Generation with Implicit Knowledge Distillation Diffusion Transformer Poster Session 4 & Exhibit Hall with Coffee Break
YuanFu Yang ⋅ Hsiu-Hui Hsiao
Exhibit Hall I #37
Does Your Vision-Language Model Get Lost in the Long Video Sampling Dilemma? Poster Session 5 & Exhibit Hall
Tianyuan Qu ⋅ Longxiang Tang ⋅ Bohao PENG ⋅ Senqiao Yang ⋅ Bei Yu ⋅ Jiaya Jia
Exhibit Hall I #103
What's Making That Sound Right Now? Video-centric Audio-Visual Localization Poster Session 5 & Exhibit Hall
hahyeon choi ⋅ Junhoo Lee ⋅ Nojun Kwak
Exhibit Hall I #28
STD-GS: Exploring Frame-Event Interaction for SpatioTemporal-Disentangled Gaussian Splatting to Reconstruct High-Dynamic Scene Poster Session 6 & Exhibit Hall with Coffee Break
Hanyu Zhou ⋅ Haonan Wang ⋅ Haoyue Liu ⋅ Yuxing Duan ⋅ Luxin Yan ⋅ Gim Hee Lee
Exhibit Hall I #8
Robust Machine Unlearning for Quantized Neural Networks via Adaptive Gradient Reweighting with Similar Labels Poster Session 5 & Exhibit Hall
Yujia Tong ⋅ Yuze Wang ⋅ Jingling Yuan ⋅ Chuang Hu
Exhibit Hall I #77
Zero-Shot Vision Encoder Grafting via LLM Surrogates Poster Session 1 & Exhibit Hall
Kaiyu Yue ⋅ Vasu Singla ⋅ Menglin Jia ⋅ John Kirchenbauer ⋅ Rifaa Qadri ⋅ Zikui Cai ⋅ Abhinav Bhatele ⋅ Furong Huang ⋅ Tom Goldstein
Exhibit Hall I #400
OV-SCAN: Semantically Consistent Alignment for Novel Object Discovery in Open-Vocabulary 3D Object Detection Poster Session 2 & Exhibit Hall with Coffee Break
Adrian Chow ⋅ Evelien Riddell ⋅ Yimu Wang ⋅ Sean Sedwards ⋅ Krzysztof Czarnecki
Exhibit Hall I #279
FullDiT: Video Generative Foundation Models with Multimodal Control via Full Attention Poster Session 4 & Exhibit Hall with Coffee Break
Xuan Ju ⋅ Weicai Ye ⋅ Quande Liu ⋅ Qiulin Wang ⋅ Xintao Wang ⋅ Pengfei Wan ⋅ Di ZHANG ⋅ Kun Gai ⋅ Qiang Xu
Exhibit Hall I #81
SC-Lane: Slope-aware and Consistent Road Height Estimation Framework for 3D Lane Detection Poster Session 6 & Exhibit Hall with Coffee Break
Chaesong Park ⋅ Eunbin Seo ⋅ JihyeonHwang JihyeonHwang ⋅ Jongwoo Lim
Exhibit Hall I #355
Exploring the Visual Feature Space for Multimodal Neural Decoding Poster Session 1 & Exhibit Hall
Weihao Xia ⋅ Cengiz Oztireli
Exhibit Hall I #410
RAGNet: Large-scale Reasoning-based Affordance Segmentation Benchmark towards General Grasping Poster Session 3 & Exhibit Hall
Dongming Wu ⋅ Yanping Fu ⋅ Saike Huang ⋅ Yingfei Liu ⋅ Fan Jia ⋅ Nian Liu ⋅ Feng Dai ⋅ Tiancai Wang ⋅ Rao Anwer ⋅ Fahad Khan ⋅ Jianbing Shen
Exhibit Hall I #186
GeoMan: Temporally Consistent Human Geometry Estimation using Image-to-Video Diffusion Poster Session 2 & Exhibit Hall with Coffee Break
Gwanghyun Kim ⋅ Xueting Li ⋅ Ye Yuan ⋅ Koki Nagano ⋅ Tianye Li ⋅ Jan Kautz ⋅ Se Young Chun ⋅ Umar Iqbal
Exhibit Hall I #229
Stereo Any Video: Temporally Consistent Stereo Matching Poster Session 5 & Exhibit Hall
Junpeng Jing ⋅ Weixun Luo ⋅ Ye Mao ⋅ Krystian Mikolajczyk
Exhibit Hall I #98
Keyframe-oriented Vision Token Pruning: Enhancing Efficiency of Large Vision Language Models on Long-Form Video Processing Poster Session 5 & Exhibit Hall
Yudong Liu ⋅ Jingwei Sun ⋅ Yueqian Lin ⋅ Jingyang Zhang ⋅ Ming Yin ⋅ Qinsi Wang ⋅ Jianyi Zhang ⋅ Hai Li ⋅ Yiran Chen
Exhibit Hall I #95
When Confidence Fails: Revisiting Pseudo-Label Selection in Semi-supervised Semantic Segmentation Poster Session 5 & Exhibit Hall
Pan Liu ⋅ Jinshi Liu
Exhibit Hall I #194
Bridging Local Inductive Bias and Long-Range Dependencies with Pixel-Mamba for End-to-end Whole Slide Image Analysis Poster Session 5 & Exhibit Hall
Zhongwei Qiu ⋅ Hanqing Chao ⋅ Tiancheng Lin ⋅ Wanxing Chang ⋅ Zijiang Yang ⋅ Wenpei Jiao ⋅ Yixuan Shen ⋅ Yunshuo Zhang ⋅ Yelin Yang ⋅ Wenbin Liu ⋅ Hui Jiang ⋅ Yun Bian ⋅ Ke Yan ⋅ Dakai Jin ⋅ Le Lu
Exhibit Hall I #276
Neuroverse3D: Developing In-Context Learning Universal Model for Neuroimaging in 3D Poster Session 5 & Exhibit Hall
Jiesi Hu ⋅ Hanyang Peng ⋅ Yanwu Yang ⋅ Xutao Guo ⋅ Yang Shang ⋅ Pengcheng Shi ⋅ Chenfei Ye ⋅ Ting Ma
Exhibit Hall I #180
Heavy Labels Out! Dataset Distillation with Label Space Lightening Poster Session 2 & Exhibit Hall with Coffee Break
Ruonan Yu ⋅ Songhua Liu ⋅ Zigeng Chen ⋅ Jingwen Ye ⋅ Xinchao Wang
Exhibit Hall I #226
Taming Flow Matching with Unbalanced Optimal Transport into Fast Pansharpening Poster Session 1 & Exhibit Hall
Zihan Cao ⋅ Yu Zhong ⋅ Liang-Jian Deng
Exhibit Hall I #258
Revisiting Pool-based Prompt Learning for Few-shot Class-incremental Learning Poster Session 1 & Exhibit Hall
Yongwei Jiang ⋅ Yixiong Zou ⋅ Yuhua Li ⋅ Ruixuan Li
Exhibit Hall I #114
CarGait: Cross-Attention based Re-ranking for Gait recognition Poster Session 3 & Exhibit Hall
Gavriel Habib ⋅ Noa Barzilay ⋅ Or Shimshi ⋅ Rami Ben-Ari ⋅ Nir Darshan
Exhibit Hall I #177
Incremental Few-Shot Semantic Segmentation via Multi-Level Switchable Visual Prompts Poster Session 5 & Exhibit Hall
Maoxian Wan ⋅ Kaige Li ⋅ Qichuan Geng ⋅ Weimin Shi ⋅ Zhong Zhou
Exhibit Hall I #404
ZeroKey: Point-Level Reasoning and Zero-Shot 3D Keypoint Detection from Large Language Models Poster Session 5 & Exhibit Hall
Bingchen Gong ⋅ Diego Gomez ⋅ Abdullah Hamdi ⋅ Abdelrahman Eldesokey ⋅ Ahmed Abdelreheem ⋅ Peter Wonka ⋅ Maks Ovsjanikov
Exhibit Hall I #214
SVIP: Semantically Contextualized Visual Patches for Zero-Shot Learning Poster Session 1 & Exhibit Hall
Zhi Chen ⋅ Zecheng Zhao ⋅ Jingcai Guo ⋅ Jingjing Li ⋅ Zi Huang
Exhibit Hall I #311
Flash-VStream: Efficient Real-Time Understanding for Long Video Streams Poster Session 5 & Exhibit Hall
Haoji Zhang ⋅ Yiqin Wang ⋅ Yansong Tang ⋅ Yong Liu ⋅ Jiashi Feng ⋅ Xiaojie Jin
Exhibit Hall I #118
MR-FIQA: Face Image Quality Assessment with Multi-Reference Representations from Synthetic Data Generation Poster Session 3 & Exhibit Hall
Fu-Zhao Ou ⋅ Chongyi Li ⋅ Shiqi Wang ⋅ Sam Kwong
Exhibit Hall I #274
Learnable Fractional Reaction-Diffusion Dynamics for Under-Display ToF Imaging and Beyond Poster Session 2 & Exhibit Hall with Coffee Break
Xin Qiao ⋅ Matteo Poggi ⋅ Xing Wei ⋅ Pengchao Deng ⋅ Yanhui Zhou ⋅ Stefano Mattoccia
Exhibit Hall I #99
Gait-X: Exploring X modality for Generalized Gait Recognition Poster Session 3 & Exhibit Hall
Zengbin Wang ⋅ Saihui Hou ⋅ Junjie Li ⋅ Xu Liu ⋅ Chunshui Cao ⋅ Yongzhen Huang ⋅ Siye Wang ⋅ Man Zhang
Exhibit Hall I #308
Scendi Score: Prompt‑Aware Diversity Evaluation via Schur Complement of CLIP Embeddings Poster Session 4 & Exhibit Hall with Coffee Break
Azim Ospanov ⋅ Mohammad Jalali ⋅ Farzan Farnia
Exhibit Hall I #195
Discretized Gaussian Representation for Tomographic Reconstruction Poster Session 6 & Exhibit Hall with Coffee Break
Shaokai Wu ⋅ Yuxiang Lu ⋅ Yapan Guo ⋅ Wei Ji ⋅ Suizhi Huang ⋅ Fengyu Yang ⋅ Shalayiding Sirejiding ⋅ Qichen He ⋅ Jing Tong ⋅ Yanbiao Ji ⋅ Yue Ding ⋅ Hongtao Lu
Exhibit Hall I #33
Wave-MambaAD: Wavelet-driven State Space Model for Multi-class Unsupervised Anomaly Detection Poster Session 5 & Exhibit Hall
Qiao Zhang ⋅ Mingwen Shao ⋅ Xinyuan Chen ⋅ Xiang Lv ⋅ Kai Xu
Exhibit Hall I #101
3D Test-time Adaptation via Graph Spectral Driven Point Shift Poster Session 6 & Exhibit Hall with Coffee Break
Xin Wei ⋅ Qin Yang ⋅ Yijie Fang ⋅ Mingrui Zhu ⋅ Nannan Wang
Exhibit Hall I #197
Task-Decoupled Bézier Surface Constraint for Uneven Low-Light Image Enhancement Poster Session 2 & Exhibit Hall with Coffee Break
Xingxiang Zhou ⋅ Xiangdong Su ⋅ Haoran Zhang ⋅ Wei Chen ⋅ Guanglai Gao
Exhibit Hall I #173
EMoTive: Event-guided Trajectory Modeling for 3D Motion Estimation Poster Session 2 & Exhibit Hall with Coffee Break
Zengyu Wan ⋅ Wei Zhai ⋅ Yang Cao ⋅ Zheng-Jun Zha
Exhibit Hall I #406
Text-to-Any-Skeleton Motion Generation Without Retargeting Poster Session 3 & Exhibit Hall
Qingyuan Liu ⋅ Ke Lv ⋅ Kun Dong ⋅ Jian Xue ⋅ Zehai Niu ⋅ Jinbao Wang
Exhibit Hall I #275
Completing 3D Partial Assemblies with View-Consistent 2D-3D Correspondence Poster Session 2 & Exhibit Hall with Coffee Break
Weihao Wang ⋅ Yu Lan ⋅ Mingyu You ⋅ Bin He
Exhibit Hall I #256
Aligning Global Semantics and Local Textures in Generative Video Enhancement Poster Session 4 & Exhibit Hall with Coffee Break
Zhikai Chen ⋅ Fuchen Long ⋅ Zhaofan Qiu ⋅ Ting Yao ⋅ Wengang Zhou ⋅ Jiebo Luo ⋅ Tao Mei
Exhibit Hall I #210
Simulating Dual-Pixel Images From Ray Tracing For Depth Estimation Poster Session 6 & Exhibit Hall with Coffee Break
Fengchen He ⋅ Dayang Zhao ⋅ Hao Xu ⋅ Tingwei Quan ⋅ Shaoqun zeng
Exhibit Hall I #132
Robust 3D-Masked Part-level Editing in 3D Gaussian Splatting with Regularized Score Distillation Sampling Poster Session 2 & Exhibit Hall with Coffee Break
Hayeon Kim ⋅ Ji Ha Jang ⋅ Se Young Chun
Exhibit Hall I #45
Learning Robust Stereo Matching in the Wild with Selective Mixture-of-Experts Poster Session 5 & Exhibit Hall
Yun Wang ⋅ Longguang Wang ⋅ Chenghao Zhang ⋅ Yongjian Zhang ⋅ Zhanjie Zhang ⋅ Ao Ma ⋅ Chenyou Fan ⋅ Tin Lun Lam ⋅ Junjie Hu
Exhibit Hall I #137
Global-Aware Monocular Semantic Scene Completion with State Space Models Poster Session 6 & Exhibit Hall with Coffee Break
Shijie Li ⋅ Zhongyao Cheng ⋅ Rong Li ⋅ Shuai Li ⋅ Juergen Gall ⋅ Xun Xu ⋅ Xulei Yang
Exhibit Hall I #80
DIMO: Diverse 3D Motion Generation for Arbitrary Objects Poster Session 3 & Exhibit Hall
Linzhan Mou ⋅ Jiahui Lei ⋅ Chen Wang ⋅ Lingjie Liu ⋅ Kostas Daniilidis
Exhibit Hall I #410
GTR: Guided Thought Reinforcement Prevents Thought Collapse in RL-based VLM Agent Training Poster Session 4 & Exhibit Hall with Coffee Break
Tong Wei ⋅ Yijun Yang ⋅ Junliang Xing ⋅ Yuanchun Shi ⋅ Zongqing Lu ⋅ Deheng Ye
Exhibit Hall I #379
LoRAverse: A Submodular Framework to Retrieve Diverse Adapters for Diffusion Models Poster Session 4 & Exhibit Hall with Coffee Break
Mert Sonmezer ⋅ Matthew Zheng ⋅ Pinar Yanardag
Exhibit Hall I #286
Autoregressive Denoising Score Matching is a Good Video Anomaly Detector Poster Session 3 & Exhibit Hall
hanwen Zhang ⋅ Congqi Cao ⋅ Qinyi Lv ⋅ Lingtong Min ⋅ Yanning Zhang
Exhibit Hall I #193
MagicDrive-V2: High-Resolution Long Video Generation for Autonomous Driving with Adaptive Control Poster Session 6 & Exhibit Hall with Coffee Break
Ruiyuan Gao ⋅ Kai Chen ⋅ Bo Xiao ⋅ Lanqing HONG ⋅ Zhenguo Li ⋅ Qiang Xu
Exhibit Hall I #329
PVChat: Personalized Video Chat with One-Shot Learning Poster Session 5 & Exhibit Hall
YUFEI SHI ⋅ Weilong Yan ⋅ Gang Xu ⋅ Yumeng Li ⋅ Yucheng Chen ⋅ ZhenXi Li ⋅ Fei Yu ⋅ Ming Li ⋅ Si Yong Yeo
Exhibit Hall I #332
AIM: Amending Inherent Interpretability via Self-Supervised Masking Poster Session 1 & Exhibit Hall
Eyad Alshami ⋅ Shashank Agnihotri ⋅ Bernt Schiele ⋅ Margret Keuper
Exhibit Hall I #85
From Panels to Prose: Generating Literary Narratives from Comics Poster Session 5 & Exhibit Hall
Ragav Sachdeva ⋅ Andrew Zisserman
Exhibit Hall I #193
MVGBench: a Comprehensive Benchmark for Multi-view Generation Models Poster Session 2 & Exhibit Hall with Coffee Break
Xianghui Xie ⋅ Jan Lenssen ⋅ Gerard Pons-Moll
Exhibit Hall I #299
A Constrained Optimization Approach for Gaussian Splatting from Coarsely-posed Images and Noisy Lidar Point Clouds Poster Session 1 & Exhibit Hall
Jizong Peng ⋅ Tze Ho Elden Tse ⋅ Kai Xu ⋅ Wenchao Gao ⋅ Angela Yao
Exhibit Hall I #273
Conditional Visual Autoregressive Modeling for Pathological Image Restoration Poster Session 4 & Exhibit Hall with Coffee Break
Ziyi Liu ⋅ Zhe Xu ⋅ Jiabo MA ⋅ Wenqiang Li ⋅ Ruixuan Wang ⋅ Bo Du ⋅ Hao Chen
Exhibit Hall I #281
Text-IRSTD: Leveraging Semantic Text to Promote Infrared Small Target Detection in Complex Scenes Poster Session 3 & Exhibit Hall
Feng Huang ⋅ Shuyuan Zheng ⋅ Zhaobing Qiu ⋅ Huanxian Liu ⋅ huanxin Bai ⋅ Liqiong Chen
Exhibit Hall I #58
Amodal Depth Anything: Amodal Depth Estimation in the Wild Poster Session 2 & Exhibit Hall with Coffee Break
Zhenyu Li ⋅ Mykola Lavreniuk ⋅ Jian Shi ⋅ Shariq Bhat ⋅ Peter Wonka
Exhibit Hall I #436
SEGA: A Stepwise Evolution Paradigm for Content-Aware Layout Generation with Design Prior Poster Session 4 & Exhibit Hall with Coffee Break
Bo Zhao ⋅ Haoran Wang ⋅ Jinghui Wang ⋅ Hanzhang Wang ⋅ Huan Yang ⋅ Wei Ji ⋅ Hao Liu ⋅ Xinyan Xiao
Exhibit Hall I #425
RobustSplat: Decoupling Densification and Dynamics for Transient-Free 3DGS Poster Session 6 & Exhibit Hall with Coffee Break
Chuanyu Fu ⋅ Yuqi Zhang ⋅ Kunbin Yao ⋅ Guanying Chen ⋅ Yuan Xiong ⋅ Chuan Huang ⋅ Shuguang Cui ⋅ Xiaochun Cao
Exhibit Hall I #233
High-Resolution Spatiotemporal Modeling with Global-Local State Space Models for Video-Based Human Pose Estimation Poster Session 2 & Exhibit Hall with Coffee Break
Runyang Feng ⋅ Hyung Jin Chang ⋅ Tze Ho Elden Tse ⋅ Boeun Kim ⋅ Yi Chang ⋅ Yixing Gao
Exhibit Hall I #367
MCOP: Multi-UAV Collaborative Occupancy Prediction Poster Session 6 & Exhibit Hall with Coffee Break
Zefu Lin ⋅ Wenbo Chen ⋅ Xiaojuan Jin ⋅ Yuran Yang ⋅ Lue Fan ⋅ YIXIN ZHANG ⋅ Yufeng Zhang ⋅ Zhaoxiang Zhang
Exhibit Hall I #244
Bayesian-Inspired Space-Time Superpixels Poster Session 2 & Exhibit Hall with Coffee Break
Kent Gauen ⋅ Stanley Chan
Exhibit Hall I #34
From Easy to Hard: Progressive Active Learning Framework for Infrared Small Target Detection with Single Point Supervision Poster Session 1 & Exhibit Hall
Chuang Yu ⋅ Jinmiao Zhao ⋅ Yunpeng Liu ⋅ Sicheng Zhao ⋅ Yimian Dai ⋅ Xiangyu Yue
Exhibit Hall I #238
Mitigating Catastrophic Overfitting in Fast Adversarial Training via Label Information Elimination Poster Session 1 & Exhibit Hall
Chao Pan ⋅ Ke Tang ⋅ Li Qing ⋅ Xin Yao
Exhibit Hall I #276
Consistency Trajectory Matching for One-Step Generative Super-Resolution Poster Session 3 & Exhibit Hall
Weiyi You ⋅ Mingyang Zhang ⋅ Leheng Zhang ⋅ Xingyu Zhou ⋅ Kexuan Shi ⋅ Shuhang Gu
Exhibit Hall I #258
MotionLab: Unified Human Motion Generation and Editing via the Motion-Condition-Motion Paradigm Poster Session 3 & Exhibit Hall
Ziyan Guo ⋅ Zeyu HU ⋅ Na Zhao ⋅ De Wen Soh
Exhibit Hall I #363
Scoring, Remember, and Reference: Catching Camouflaged Objects in Videos Poster Session 3 & Exhibit Hall
Yuang Feng ⋅ Shuyong Gao ⋅ Fuzhen Yan ⋅ Yicheng Song ⋅ Lingyi Hong ⋅ Junjie Hu ⋅ Wenqiang Zhang
Exhibit Hall I #286
Geometric Alignment and Prior Modulation for View-Guided Point Cloud Completion on Unseen Categories Poster Session 6 & Exhibit Hall with Coffee Break
Jingqiao Xiu ⋅ Yicong Li ⋅ Na Zhao ⋅ Han Fang ⋅ Xiang Wang ⋅ Angela Yao
Exhibit Hall I #262
FRET: Feature Redundancy Elimination for Test Time Adaptation Poster Session 1 & Exhibit Hall
Linjing You ⋅ Jiabao Lu ⋅ Xiayuan Huang ⋅ Xiangli Nie
Exhibit Hall I #192
Motion-2-to-3: Leveraging 2D Motion Data for 3D Motion Generations Poster Session 3 & Exhibit Hall
Ruoxi Guo ⋅ Huaijin Pi ⋅ Zehong Shen ⋅ Qing Shuai ⋅ zechenhu zechenhu ⋅ Zhumei Wang ⋅ Yajiao Dong ⋅ Ruizhen Hu ⋅ Taku Komura ⋅ Sida Peng ⋅ Xiaowei Zhou
Exhibit Hall I #405
AnyBimanual: Transferring Unimanual Policy for General Bimanual Manipulation Poster Session 3 & Exhibit Hall
Guanxing Lu ⋅ Tengbo Yu ⋅ Haoyuan Deng ⋅ Season Chen ⋅ Yansong Tang ⋅ Ziwei Wang
Exhibit Hall I #344
SPA: Efficient User-Preference Alignment against Uncertainty in Medical Image Segmentation Poster Session 5 & Exhibit Hall
Jiayuan Zhu ⋅ Junde Wu ⋅ Cheng Ouyang ⋅ Konstantinos Kamnitsas ⋅ Alison Noble
Exhibit Hall I #370
Signs as Tokens: A Retrieval-Enhanced Multilingual Sign Language Generator Poster Session 5 & Exhibit Hall
Ronglai Zuo ⋅ Rolandos Alexandros Potamias ⋅ Evangelos Ververas ⋅ Jiankang Deng ⋅ Stefanos Zafeiriou
Exhibit Hall I #379
A₀ : An Affordance-Aware Hierarchical Model for General Robotic Manipulation Poster Session 3 & Exhibit Hall
Rongtao Xu ⋅ Jian Zhang ⋅ Minghao Guo ⋅ Youpeng Wen ⋅ Haoting Yang ⋅ Min Lin ⋅ Jianzheng Huang ⋅ Zhe Li ⋅ Kaidong Zhang ⋅ Liqiong Wang ⋅ Yuxuan Kuang ⋅ Meng Cao ⋅ Feng Zheng ⋅ Xiaodan Liang
Exhibit Hall I #329
FVGen: Accelerating Novel-View Synthesis with Adversarial Video Diffusion Distillation Poster Session 6 & Exhibit Hall with Coffee Break
Wenbin Teng ⋅ Gonglin Chen ⋅ Haiwei Chen ⋅ Yajie Zhao
Exhibit Hall I #131
PVMamba: Parallelizing Vision Mamba via Dynamic State Aggregation Poster Session 3 & Exhibit Hall
Fei Xie ⋅ Zhongdao Wang ⋅ Weijia Zhang ⋅ Chao Ma
Exhibit Hall I #20
Controllable and Expressive One-Shot Video Head Swapping Poster Session 3 & Exhibit Hall
Chaonan Ji ⋅ Jinwei Qi ⋅ Peng Zhang ⋅ Bang Zhang ⋅ Liefeng Bo
Exhibit Hall I #22
When Pixel Difference Patterns Meet ViT: PiDiViT for Few-Shot Object Detection Poster Session 5 & Exhibit Hall
Hongliang Zhou ⋅ Yongxiang Liu ⋅ Canyu Mo ⋅ Weijie Li ⋅ Bowen Peng ⋅ Li Liu
Exhibit Hall I #422
Boosting Adversarial Transferability via Residual Perturbation Attack Poster Session 1 & Exhibit Hall
Jinjia Peng ⋅ Zeze Tao ⋅ Huibing Wang ⋅ Meng Wang ⋅ Yang Wang
Exhibit Hall I #110
What Makes for Text to 360-degree Panorama Generation with Stable Diffusion? Poster Session 4 & Exhibit Hall with Coffee Break
Jinhong Ni ⋅ Chang-Bin Zhang ⋅ Qiang Zhang ⋅ Jing Zhang
Exhibit Hall I #160
Learning Normals of Noisy Points by Local Gradient-Aware Surface Filtering Poster Session 6 & Exhibit Hall with Coffee Break
Qing Li ⋅ Huifang Feng ⋅ Xun Gong ⋅ Liang Han
Exhibit Hall I #396
ORION: A Holistic End-to-End Autonomous Driving Framework by Vision-Language Instructed Action Generation Poster Session 6 & Exhibit Hall with Coffee Break
Haoyu Fu ⋅ Diankun Zhang ⋅ Zongchuang Zhao ⋅ Jianfeng Cui ⋅ DINGKANG LIANG ⋅ Chong Zhang ⋅ Dingyuan Zhang ⋅ Hongwei Xie ⋅ BING WANG ⋅ Xiang Bai
Exhibit Hall I #10
Learning Pixel-adaptive Multi-layer Perceptrons for Real-time Image Enhancement Poster Session 3 & Exhibit Hall
Junyu Lou ⋅ Xiaorui Zhao ⋅ Kexuan Shi ⋅ Shuhang Gu
Exhibit Hall I #386
Kaleidoscopic Background Attack: Disrupting Pose Estimation with Multi-Fold Radial Symmetry Textures Poster Session 6 & Exhibit Hall with Coffee Break
Xinlong Ding ⋅ Hongwei Yu ⋅ Jiawei Li ⋅ Feifan Li ⋅ Yu Shang ⋅ Bochao Zou ⋅ Huimin Ma ⋅ Jiansheng Chen
Exhibit Hall I #364
CULTURE3D: A Large-Scale and Diverse Dataset of Cultural Landmarks and Terrains for Gaussian-Based Scene Rendering Poster Session 6 & Exhibit Hall with Coffee Break
xinyi zheng ⋅ Steve Zhang ⋅ Weizhe Lin ⋅ Fan Zhang ⋅ Walterio Mayol-Cuevas ⋅ Yunze Liu ⋅ Junxiao Shen
Exhibit Hall I #418
NullSwap: Proactive Identity Cloaking Against Deepfake Face Swapping Poster Session 3 & Exhibit Hall
Tianyi Wang ⋅ Shuaicheng Niu ⋅ Harry Cheng ⋅ xiao zhang ⋅ Yinglong Wang
Exhibit Hall I #74
Forensic-MoE: Exploring Comprehensive Synthetic Image Detection Traces with Mixture of Experts Poster Session 4 & Exhibit Hall with Coffee Break
Mingqi Fang ⋅ Ziguang Li ⋅ Lingyun Yu ⋅ Quanwei Yang ⋅ Hongtao Xie ⋅ Yongdong Zhang
Exhibit Hall I #276
RoboTron-Mani: All-in-One Multimodal Large Model for Robotic Manipulation Poster Session 3 & Exhibit Hall
Feng yan ⋅ Fanfan Liu ⋅ Yiyang Huang ⋅ ZechaoGuan ZechaoGuan ⋅ Liming Zheng ⋅ Yufeng Zhong ⋅ Chengjian Feng ⋅ Lin Ma
Exhibit Hall I #348
Information-Bottleneck Driven Binary Neural Network for Change Detection Poster Session 2 & Exhibit Hall with Coffee Break
Kaijie Yin ⋅ Zhiyuan Zhang ⋅ Shu Kong ⋅ Tian Gao ⋅ Cheng-zhong Xu ⋅ Hui Kong
Exhibit Hall I #202
Entropy-Adaptive Diffusion Policy Optimization with Dynamic Step Alignment Poster Session 1 & Exhibit Hall
Renye Yan ⋅ Jikang Cheng ⋅ Yaozhong Gan ⋅ Shikun Sun ⋅ You Wu ⋅ Yunfan Yang ⋅ Ling Liang ⋅ JinLong Lin ⋅ Yeshuang Zhu ⋅ Jie Zhou ⋅ Jinchao Zhang ⋅ Junliang Xing ⋅ Yimao Cai ⋅ Ru Huang
Exhibit Hall I #174
Time-Aware Auto White Balance in Mobile Photography Poster Session 2 & Exhibit Hall with Coffee Break
Mahmoud Afifi ⋅ Luxi Zhao ⋅ Abhijith Punnappurath ⋅ Mohamed Abdelsalam ⋅ Ran Zhang ⋅ Michael Brown
Exhibit Hall I #2
ViewSRD: 3D Visual Grounding via Structured Multi-View Decomposition Poster Session 2 & Exhibit Hall with Coffee Break
Ronggang Huang ⋅ Haoxin Yang ⋅ Yan Cai ⋅ Xuemiao Xu ⋅ Huaidong Zhang ⋅ Shengfeng He
Exhibit Hall I #441
Physical Degradation Model-Guided Interferometric Hyperspectral Reconstruction with Unfolding Transformer Poster Session 3 & Exhibit Hall
Yuansheng Li ⋅ Yunhao Zou ⋅ Linwei Chen ⋅ Ying Fu
Exhibit Hall I #358
VPR-Cloak: A First Look at Privacy Cloak Against Visual Place Recognition Poster Session 2 & Exhibit Hall with Coffee Break
Shuting Dong ⋅ Mingzhi Chen ⋅ Feng Lu ⋅ Hao Yu ⋅ Guanghao Li ⋅ Zhe Wu ⋅ Ming Tang ⋅ Chun Yuan
Exhibit Hall I #204
Evidential Knowledge Distillation Poster Session 1 & Exhibit Hall
Liangyu Xiang ⋅ Junyu Gao ⋅ Changsheng Xu
Exhibit Hall I #259
Pruning All-Rounder: Rethinking and Improving Inference Efficiency for Large Vision Language Models Poster Session 5 & Exhibit Hall
Wei Suo ⋅ Ji Ma ⋅ Mengyang Sun ⋅ Lin Wu ⋅ PENG WANG ⋅ Yanning Zhang
Exhibit Hall I #42
Align Your Rhythm: Generating Highly Aligned Dance Poses with Gating-Enhanced Rhythm-Aware Feature Representation Poster Session 3 & Exhibit Hall
Congyi Fan ⋅ Jian Guan ⋅ Xuanjia Zhao ⋅ Dongli Xu ⋅ Youtian Lin ⋅ Tong Ye ⋅ Pengming Feng ⋅ Haiwei Pan
Exhibit Hall I #300
HOMO-Feature: Cross-Arbitrary-Modal Image Matching with Homomorphism of Organized Major Orientation Poster Session 3 & Exhibit Hall
Chenzhong Gao ⋅ Wei Li ⋅ Desheng Weng
Exhibit Hall I #49
OCSplats: Observation Completeness Quantification and Label Noise Separation in 3DGS Poster Session 6 & Exhibit Hall with Coffee Break
Han Ling ⋅ Yinghui Sun ⋅ Xian Xu ⋅ Quansen Sun
Exhibit Hall I #92
GSOT3D: Towards Generic 3D Single Object Tracking in the Wild Poster Session 2 & Exhibit Hall with Coffee Break
Yifan Jiao ⋅ Yunhao Li ⋅ Junhua Ding ⋅ Qing Yang ⋅ Song Fu ⋅ Heng Fan ⋅ Libo Zhang
Exhibit Hall I #42
GWM: Towards Scalable Gaussian World Models for Robotic Manipulation Poster Session 2 & Exhibit Hall with Coffee Break
Guanxing Lu ⋅ Baoxiong Jia ⋅ Puhao Li ⋅ Yixin Chen ⋅ Ziwei Wang ⋅ Yansong Tang ⋅ Siyuan Huang
Exhibit Hall I #399
Dynamic-DINO: Fine-Grained Mixture of Experts Tuning for Real-time Open-Vocabulary Object Detection Poster Session 5 & Exhibit Hall
Yehao Lu ⋅ Minghe Weng ⋅ Zekang Xiao ⋅ Rui Jiang ⋅ Wei Su ⋅ Guangcong Zheng ⋅ Luping Luping ⋅ Xi Li
Exhibit Hall I #99
WAVE: Warp-Based View Guidance for Consistent Novel View Synthesis Using a Single Image Poster Session 3 & Exhibit Hall
Jiwoo Park ⋅ Tae Choi ⋅ Youngjun Jun ⋅ Seong Jae Hwang
Exhibit Hall I #179
DanceEditor: Towards Iterative Editable Music-driven Dance Generation with Open-Vocabulary Descriptions Poster Session 3 & Exhibit Hall
Hengyuan Zhang ⋅ Zhe Li ⋅ Xingqun Qi ⋅ Mengze Li ⋅ Muyi Sun ⋅ Siye Wang ⋅ Man Zhang ⋅ Sirui Han
Exhibit Hall I #202
TAG-WM: Tamper-Aware Generative Image Watermarking via Diffusion Inversion Sensitivity Poster Session 4 & Exhibit Hall with Coffee Break
Yuzhuo Chen ⋅ Zehua Ma ⋅ Han Fang ⋅ Weiming Zhang ⋅ Nenghai Yu
Exhibit Hall I #176
HORT: Monocular Hand-held Objects Reconstruction with Transformers Poster Session 2 & Exhibit Hall with Coffee Break
Zerui Chen ⋅ Rolandos Alexandros Potamias ⋅ Shizhe Chen ⋅ Cordelia Schmid
Exhibit Hall I #96
Multimodal Latent Diffusion Model for Complex Sewing Pattern Generation Poster Session 4 & Exhibit Hall with Coffee Break
Shengqi Liu ⋅ Yuhao Cheng ⋅ Zhuo Chen ⋅ Xingyu Ren ⋅ Wenhan Zhu ⋅ Lincheng Li ⋅ Mengxiao Bi ⋅ Xiaokang Yang ⋅ Yichao Yan
Exhibit Hall I #264
Enhancing Reward Models for High-quality Image Generation: Beyond Text-Image Alignment Poster Session 4 & Exhibit Hall with Coffee Break
ying ba ⋅ Tianyu Zhang ⋅ Yalong Bai ⋅ Wenyi Mo ⋅ Tao Liang ⋅ Bing Su ⋅ Ji-Rong Wen
Exhibit Hall I #397
Lightweight and Fast Real-time Image Enhancement via Decomposition of the Spatial-aware Lookup Tables Poster Session 3 & Exhibit Hall
Wontae Kim ⋅ Keuntek Lee ⋅ Nam Ik Cho
Exhibit Hall I #178
Diffusion-based 3D Hand Motion Recovery with Intuitive Physics Poster Session 2 & Exhibit Hall with Coffee Break
Yufei Zhang ⋅ Zijun Cui ⋅ Jeffrey Kephart ⋅ Qiang Ji
Exhibit Hall I #214
HumanOLAT: A Large-Scale Dataset for Full-Body Human Relighting and Novel-View Synthesis Poster Session 6 & Exhibit Hall with Coffee Break
Timo Teufel ⋅ xilong zhou ⋅ Umar Iqbal ⋅ Pramod Rao ⋅ Pulkit Gera ⋅ Jan Kautz ⋅ Vladislav Golyanik ⋅ Christian Theobalt
Exhibit Hall I #424
Devil is in the Uniformity: Exploring Diverse Learners within Transformer for Image Restoration Poster Session 3 & Exhibit Hall
Shihao Zhou ⋅ Dayu Li ⋅ Jinshan Pan ⋅ Juncheng Zhou ⋅ Jinglei Shi ⋅ Jufeng Yang
Exhibit Hall I #216
Tensor-aggregated LoRA in Federated Fine-tuning Poster Session 1 & Exhibit Hall
Zhixuan Li ⋅ Binqian Xu ⋅ Xiangbo Shu ⋅ Jiachao Zhang ⋅ Yazhou Yao ⋅ Guo-Sen Xie ⋅ Jinhui Tang
Exhibit Hall I #91
LVAgent: Long Video Understanding by Multi-Round Dynamical Collaboration of MLLM Agents Poster Session 5 & Exhibit Hall
Boyu Chen ⋅ Zhengrong Yue ⋅ Siran Chen ⋅ Zikang Wang ⋅ Yang Liu ⋅ Peng Li ⋅ Yali Wang
Exhibit Hall I #41
Aligning Vision to Language: Annotation-Free Multimodal Knowledge Graph Construction for Enhanced LLMs Reasoning Poster Session 1 & Exhibit Hall
Junming Liu ⋅ Siyuan Meng ⋅ Yanting Gao ⋅ Song Mao ⋅ Pinlong Cai ⋅ Guohang Yan ⋅ Yirong Chen ⋅ Zilin Bian ⋅ DING WANG ⋅ Botian Shi
Exhibit Hall I #84
EMatch: A Unified Framework for Event-based Optical Flow and Stereo Matching Poster Session 2 & Exhibit Hall with Coffee Break
Pengjie Zhang ⋅ Lin Zhu ⋅ Xiao Wang ⋅ Lizhi Wang ⋅ Hua Huang
Exhibit Hall I #78
Liberated-GS: 3D Gaussian Splatting Independent from SfM Point Clouds Poster Session 6 & Exhibit Hall with Coffee Break
Weihong Pan ⋅ Xiaoyu Zhang ⋅ Hongjia Zhai ⋅ Xiaojun Xiang ⋅ Hanqing Jiang ⋅ Guofeng Zhang
Exhibit Hall I #189
Unlocking the Potential of Diffusion Priors in Blind Face Restoration Poster Session 3 & Exhibit Hall
Yunqi Miao ⋅ Zhiyu Qu ⋅ Mingqi Gao ⋅ Changrui Chen ⋅ Jifei Song ⋅ Jungong Han ⋅ Jiankang Deng
Exhibit Hall I #327
DyWA: Dynamics-adaptive World Action Model for Generalizable Non-prehensile Manipulation Poster Session 3 & Exhibit Hall
Jiangran Lyu ⋅ Ziming Li ⋅ Xuesong Shi ⋅ Chaoyi Xu ⋅ Yizhou Wang ⋅ He Wang
Exhibit Hall I #99
Self-Supervised Sparse Sensor Fusion for Long Range Perception Poster Session 6 & Exhibit Hall with Coffee Break
Edoardo Palladin ⋅ Samuel Brucker ⋅ Filippo Ghilotti ⋅ Praveen Narayanan ⋅ Mario Bijelic ⋅ Felix Heide
Exhibit Hall I #268
Joint Asymmetric Loss for Learning with Noisy Labels Poster Session 1 & Exhibit Hall
Jialiang Wang ⋅ Xianming Liu ⋅ Xiong Zhou ⋅ Gangfeng Hu ⋅ Deming Zhai ⋅ Junjun Jiang ⋅ Xiangyang Ji
Exhibit Hall I #176
SAME: Learning Generic Language-Guided Visual Navigation with State-Adaptive Mixture of Experts Poster Session 2 & Exhibit Hall with Coffee Break
Gengze Zhou ⋅ Yicong Hong ⋅ Zun Wang ⋅ Chongyang Zhao ⋅ Mohit Bansal ⋅ Qi Wu
Exhibit Hall I #261
Implicit Counterfactual Learning for Audio-Visual Segmentation Poster Session 5 & Exhibit Hall
Mingfeng Zha ⋅ Tianyu Li ⋅ Guoqing Wang ⋅ Peng Wang ⋅ Yangyang Wu ⋅ Yang Yang ⋅ Heng Tao Shen
Exhibit Hall I #240
DC-ControlNet: Decoupling Inter- and Intra-Element Conditions in Image Generation with Diffusion Models Poster Session 4 & Exhibit Hall with Coffee Break
hongji yang ⋅ Wencheng Han ⋅ Yucheng Zhou ⋅ Jianbing Shen
Exhibit Hall I #401
STaR: Seamless Spatial-Temporal Aware Motion Retargeting with Penetration and Consistency Constraints Poster Session 3 & Exhibit Hall
Xiaohang Yang ⋅ Qing Wang ⋅ Jiahao Yang ⋅ Gregory Slabaugh ⋅ Shanxin Yuan
Exhibit Hall I #277
FALCON: Resolving Visual Redundancy and Fragmentation in High-resolution Multimodal Large Language Models via Visual Registers Poster Session 5 & Exhibit Hall
Renshan Zhang ⋅ Rui Shao ⋅ Gongwei Chen ⋅ Miao Zhang ⋅ Kaiwen Zhou ⋅ Weili Guan ⋅ Liqiang Nie
Exhibit Hall I #351
Visual-Oriented Fine-Grained Knowledge Editing for MultiModal Large Language Models Poster Session 1 & Exhibit Hall
Zhen Zeng ⋅ Leijiang Gu ⋅ Xun Yang ⋅ Zhangling Duan ⋅ Zenglin Shi ⋅ Meng Wang
Exhibit Hall I #229
Competitive Distillation: A Simple Learning Strategy for Improving Visual Classification Poster Session 1 & Exhibit Hall
Daqian Shi ⋅ Xiaolei Diao ⋅ Xu Chen ⋅ Cedric John
Exhibit Hall I #275
AIComposer: Any Style and Content Image Composition via Feature Integration Poster Session 4 & Exhibit Hall with Coffee Break
Haowen Li ⋅ Zhenfeng Fan ⋅ Zhang Wen ⋅ Zhengzhou Zhu ⋅ Yunjin Li
Exhibit Hall I #187
Rethink Sparse Signals for Pose-guided Text-to-image Generation Poster Session 4 & Exhibit Hall with Coffee Break
Wenjie Xuan ⋅ Jing Zhang ⋅ Juhua Liu ⋅ Bo Du ⋅ Dacheng Tao
Exhibit Hall I #96
VPO: Aligning Text-to-Video Generation Models with Prompt Optimization Poster Session 4 & Exhibit Hall with Coffee Break
Jiale Cheng ⋅ Ruiliang Lyu ⋅ Xiaotao Gu ⋅ Xiao Liu ⋅ Jiazheng Xu ⋅ Yida Lu ⋅ Jiayan Teng ⋅ Zhuoyi Yang ⋅ Yuxiao Dong ⋅ Jie Tang ⋅ Hongning Wang ⋅ Minlie Huang
Exhibit Hall I #70
Stylized-Face: A Million-level Stylized Face Dataset for Face Recognition Poster Session 3 & Exhibit Hall
Zhengyuan Peng ⋅ Jianqing Xu ⋅ Yuge Huang ⋅ Jinkun Hao ⋅ Shouhong Ding ⋅ zhizhong zhang ⋅ Xin TAN ⋅ Lizhuang Ma
Exhibit Hall I #287
Uncover Treasures in DCT: Advancing JPEG Quality Enhancement by Exploiting Latent Correlations Poster Session 4 & Exhibit Hall with Coffee Break
jing Yang ⋅ Qunliang Xing ⋅ Mai Xu ⋅ Minglang Qiao
Exhibit Hall I #260
From One to More: Contextual Part Latents for 3D Generation Poster Session 2 & Exhibit Hall with Coffee Break
Shaocong Dong ⋅ Lihe Ding ⋅ Xiao Chen ⋅ Yaokun Li ⋅ Yuxin WANG ⋅ Yucheng Wang ⋅ Qi WANG ⋅ Jaehyeok Kim ⋅ Chenjian Gao ⋅ Zhanpeng Huang ⋅ Zibin Wang ⋅ Tianfan Xue ⋅ Dan Xu
Exhibit Hall I #301
Single-Scanline Relative Pose Estimation for Rolling Shutter Cameras Poster Session 2 & Exhibit Hall with Coffee Break
Petr Hruby ⋅ Marc Pollefeys
Exhibit Hall I #199
Unified Multi-Agent Trajectory Modeling with Masked Trajectory Diffusion Poster Session 6 & Exhibit Hall with Coffee Break
songru Yang ⋅ Zhenwei Shi ⋅ Zhengxia Zou
Exhibit Hall I #274
SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion Poster Session 5 & Exhibit Hall
Ahmed Nassar ⋅ Matteo Omenetti ⋅ Maksym Lysak ⋅ Nikolaos Livathinos ⋅ Christoph Auer ⋅ Lucas Morin ⋅ Rafael Teixeira de Lima ⋅ Yusik Kim ⋅ A. Said Gurbuz ⋅ Michele Dolfi ⋅ Peter Staar
Exhibit Hall I #203
Beyond Walking: A Large-Scale Image-Text Benchmark for Text-based Person Anomaly Search Poster Session 3 & Exhibit Hall
Shuyu Yang ⋅ Yaxiong Wang ⋅ Li Zhu ⋅ Zhedong Zheng
Exhibit Hall I #162
Images as Noisy Labels: Unleashing the Potential of the Diffusion Model for Open-Vocabulary Semantic Segmentation Poster Session 5 & Exhibit Hall
Fan Li ⋅ Xuanbin Wang ⋅ Xuan Wang ⋅ Zhaoxiang Zhang ⋅ yuelei xu
Exhibit Hall I #417
ContextFace: Generating Facial Expressions from Emotional Contexts Poster Session 3 & Exhibit Hall
minjung kim ⋅ Minsang Kim ⋅ Seung Jun Baek
Exhibit Hall I #129
Agreement aware and dissimilarity oriented GLOM Poster Session 5 & Exhibit Hall
Ru Zeng ⋅ Yan Song ⋅ Yang ZHANG ⋅ yanlinghu yanlinghu ⋅ Hui Yu
Exhibit Hall I #426
The Best of Both Worlds: Integrating Language Models and Diffusion Models for Video Generation Poster Session 4 & Exhibit Hall with Coffee Break
Aoxiong Yin ⋅ Kai Shen ⋅ Yichong Leng ⋅ Xu Tan ⋅ Xinyu Zhou ⋅ Juncheng Li ⋅ Siliang Tang
Exhibit Hall I #67
LLaFEA: Frame-Event Complementary Fusion for Fine-Grained Spatiotemporal Understanding in LMMs Poster Session 5 & Exhibit Hall
Hanyu Zhou ⋅ Gim Hee Lee
Exhibit Hall I #235
Bridging Class Imbalance and Partial Labeling via Spectral-Balanced Energy Propagation for Skeleton-based Action Recognition Poster Session 3 & Exhibit Hall
Yandan Wang ⋅ Chenqi Guo ⋅ Yinglong Ma ⋅ Jiangyan Chen ⋅ Yuan Gao ⋅ Weiming Dong
Exhibit Hall I #15
MeasureXpert: Automatic Anthropometric Measurement Extraction from Two Unregistered, Partial, Posed, and Dressed Body Scans Poster Session 2 & Exhibit Hall with Coffee Break
Ran Zhao ⋅ Xinxin Dai ⋅ Pengpeng Hu ⋅ Vasile Palade ⋅ Adrian Munteanu
Exhibit Hall I #430
ForeSight: Multi-View Streaming Joint Object Detection and Trajectory Forecasting Poster Session 6 & Exhibit Hall with Coffee Break
Sandro Papais ⋅ Letian Wang ⋅ Brian Cheong ⋅ Steven Waslander
Exhibit Hall I #71
DeRIS: Decoupling Perception and Cognition for Enhanced Referring Image Segmentation through Loopback Synergy Poster Session 5 & Exhibit Hall
Ming Dai ⋅ Wenxuan Cheng ⋅ Jiang-Jiang Liu ⋅ Sen Yang ⋅ Wenxiao Cai ⋅ Yanpeng Sun ⋅ Wankou Yang
Exhibit Hall I #13
ResidualViT for Efficient Temporally Dense Video Encoding Poster Session 5 & Exhibit Hall
Mattia Soldan ⋅ Fabian Caba Heilbron ⋅ Bernard Ghanem ⋅ Josef Sivic ⋅ Bryan Russell
Exhibit Hall I #236
VideoOrion: Tokenizing Object Dynamics in Videos Poster Session 5 & Exhibit Hall
Yicheng Feng ⋅ Yijiang Li ⋅ Wanpeng Zhang ⋅ Sipeng Zheng ⋅ Hao Luo ⋅ Zihao Yue ⋅ Zongqing Lu
Exhibit Hall I #56
SeaS: Few-shot Industrial Anomaly Image Generation with Separation and Sharing Fine-tuning Poster Session 5 & Exhibit Hall
Zhewei Dai ⋅ Shilei Zeng ⋅ Haotian Liu ⋅ Xurui Li ⋅ Feng Xue ⋅ Yu Zhou
Exhibit Hall I #315
MoSiC: Optimal-Transport Motion Trajectory for Dense Self-Supervised Learning Poster Session 2 & Exhibit Hall with Coffee Break
Mohammadreza Salehi ⋅ Shashanka Venkataramanan ⋅ Ioana Simion ⋅ Stratis Gavves ⋅ Cees Snoek ⋅ Yuki Asano
Exhibit Hall I #142
Exploring Weather-aware Aggregation and Adaptation for Semantic Segmentation under Adverse Conditions Poster Session 3 & Exhibit Hall
Yuwen Pan ⋅ Rui Sun ⋅ Wangkai Li ⋅ Tianzhu Zhang
Exhibit Hall I #371
MUG: Pseudo Labeling Augmented Audio-Visual Mamba Network for Audio-Visual Video Parsing Poster Session 5 & Exhibit Hall
Langyu Wang ⋅ Langyu Wang ⋅ Yingying Chen ⋅ Yiyuan Zhang ⋅ Ming Tang ⋅ Jinqiao Wang
Exhibit Hall I #80
Randomized Autoregressive Visual Generation Poster Session 4 & Exhibit Hall with Coffee Break
Qihang Yu ⋅ Ju He ⋅ Xueqing Deng ⋅ Xiaohui Shen ⋅ Liang-Chieh (Jay) Chen
Exhibit Hall I #340
CleanPose: Category-Level Object Pose Estimation via Causal Learning and Knowledge Distillation Poster Session 2 & Exhibit Hall with Coffee Break
Xiao Lin ⋅ Yun Peng ⋅ Liuyi Wang ⋅ xianyou zhong ⋅ Minghao Zhu ⋅ Jingwei Yang ⋅ Yi Feng ⋅ Chengju Liu ⋅ Qijun Chen
Exhibit Hall I #91
Unsupervised RGB-D Point Cloud Registration for Scenes with Low Overlap and Photometric Inconsistency Poster Session 6 & Exhibit Hall with Coffee Break
yejun Shou ⋅ Haocheng Wang ⋅ Lingfeng Shen ⋅ Qian Zheng ⋅ Gang Pan ⋅ Yanlong Cao
Exhibit Hall I #14
Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment Poster Session 4 & Exhibit Hall with Coffee Break
Lijie Liu ⋅ Tianxiang Ma ⋅ Bingchuan Li ⋅ Zhuowei Chen ⋅ Jiawei Liu ⋅ Gen Li ⋅ SiYu Zhou ⋅ Qian HE ⋅ Xinglong Wu
Exhibit Hall I #6
TokensGen: Harnessing Condensed Tokens for Long Video Generation Poster Session 4 & Exhibit Hall with Coffee Break
Wenqi Ouyang ⋅ Zeqi Xiao ⋅ Danni Yang ⋅ Yifan Zhou ⋅ Shuai Yang ⋅ Lei Yang ⋅ Jianlou Si ⋅ Xingang Pan
Exhibit Hall I #318
Gradient-Reweighted Adversarial Camouflage for Physical Object Detection Evasion Poster Session 3 & Exhibit Hall
Jiawei Liang ⋅ Siyuan Liang ⋅ Tianrui Lou ⋅ Ming Zhang ⋅ liwenjin liwenjin ⋅ Dunqiu fan ⋅ Xiaochun Cao
Exhibit Hall I #364
Progressive Homeostatic and Plastic Prompt Tuning for Audio-Visual Multi-Task Incremental Learning Poster Session 1 & Exhibit Hall
Jiong Yin ⋅ Liang Li ⋅ Jiehua Zhang ⋅ Yuhan Gao ⋅ Chenggang Yan ⋅ Xichun Sheng
Exhibit Hall I #183
MixA: A Mixed Attention approach with Stable Lightweight Linear Attention to enhance Efficiency of Vision Transformers at the Edge Poster Session 5 & Exhibit Hall
Sabbir Ahmed ⋅ Jingtao Li ⋅ Weiming Zhuang ⋅ Chen Chen ⋅ Lingjuan Lyu
Exhibit Hall I #129
Transparent Vision: A Theory of Hierarchical Invariant Representations Poster Session 1 & Exhibit Hall
Shuren Qi ⋅ Yushu Zhang ⋅ CHAO WANG ⋅ Zhihua Xia ⋅ Xiaochun Cao ⋅ FENGLEI FAN
Exhibit Hall I #319
AutoPrompt: Automated Red-Teaming of Text-to-Image Models via LLM-Driven Adversarial Prompts Poster Session 4 & Exhibit Hall with Coffee Break
Yufan Liu ⋅ Wanqian Zhang ⋅ Huashan Chen ⋅ Lin Wang ⋅ Xiaojun Jia ⋅ Zheng Lin ⋅ Weiping Wang
Exhibit Hall I #256
Q-Frame: Query-aware Frame Selection and Multi-Resolution Adaptation for Video-LLMs Poster Session 5 & Exhibit Hall
Shaojie Zhang ⋅ Jiahui Yang ⋅ Jianqin Yin ⋅ Zhenbo Luo ⋅ Jian Luan
Exhibit Hall I #211
LongAnimation: Long Animation Generation with Dynamic Global-Local Memory Poster Session 3 & Exhibit Hall
Nan Chen ⋅ Mengqi Huang ⋅ Yihao Meng ⋅ Zhendong Mao
Exhibit Hall I #3
OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation Poster Session 4 & Exhibit Hall with Coffee Break
Junyuan Zhang ⋅ Qintong Zhang ⋅ Bin Wang ⋅ Linke Ouyang ⋅ Zichen Wen ⋅ Ying Li ⋅ Ka-Ho Chow ⋅ Conghui He ⋅ Wentao Zhang
Exhibit Hall I #245
Efficient Event Camera Data Pretraining with Adaptive Prompt Fusion Poster Session 2 & Exhibit Hall with Coffee Break
Quanmin Liang ⋅ Qiang Li ⋅ Shuai Liu ⋅ Xinzi Cao ⋅ Jinyi Lu ⋅ Feidiao Yang ⋅ Wei Zhang ⋅ Kai Huang ⋅ Yonghong Tian
Exhibit Hall I #342
Lightweight Gradient-Aware Upscaling of 3D Gaussian Splatting Images Poster Session 6 & Exhibit Hall with Coffee Break
Simon Niedermayr ⋅ Christoph Neuhauser ⋅ Rüdiger Westermann
Exhibit Hall I #109
RoBridge: A Hierarchical Architecture Bridging Cognition and Execution for General Robotic Manipulation Poster Session 3 & Exhibit Hall
Kaidong Zhang ⋅ Rongtao Xu ⋅ Ren Pengzhen ⋅ Junfan Lin ⋅ Hefeng Wu ⋅ Liang Lin ⋅ Xiaodan Liang
Exhibit Hall I #432
SEGS-SLAM: Structure-enhanced 3D Gaussian Splatting SLAM with Appearance Embedding Poster Session 6 & Exhibit Hall with Coffee Break
Tianci Wen ⋅ Zhiang Liu ⋅ Yongchun Fang
Exhibit Hall I #326
BoxDreamer: Dreaming Box Corners for Generalizable Object Pose Estimation Poster Session 2 & Exhibit Hall with Coffee Break
Yuanhong Yu ⋅ Xingyi He ⋅ Chen Zhao ⋅ Junhao Yu ⋅ Jiaqi Yang ⋅ Ruizhen Hu ⋅ Yujun Shen ⋅ Xing Zhu ⋅ Xiaowei Zhou ⋅ Sida Peng
Exhibit Hall I #409
Looking in the Mirror: A Faithful Counterfactual Explanation Method for Interpreting Deep Image Classification Models Poster Session 1 & Exhibit Hall
Townim Chowdhury ⋅ Vu Phan ⋅ Kewen Liao ⋅ Nanyu Dong ⋅ Minh-Son To ⋅ Anton Hengel ⋅ Johan Verjans ⋅ Zhibin Liao
Exhibit Hall I #203
FLSeg: Enhancing Privacy and Robustness in Federated Learning under Heterogeneous Data via Model Segmentation Poster Session 1 & Exhibit Hall
Zichun Su ⋅ Zhi Lu ⋅ Yutong Wu ⋅ Renfei Shen ⋅ Songfeng Lu
Exhibit Hall I #364
Self-Calibrating Gaussian Splatting for Large Field-of-View Reconstruction Poster Session 6 & Exhibit Hall with Coffee Break
Youming Deng ⋅ Wenqi Xian ⋅ Guandao Yang ⋅ Leonidas Guibas ⋅ Gordon Wetzstein ⋅ Steve Marschner ⋅ Paul Debevec
Exhibit Hall I #38
Trial-Oriented Visual Rearrangement Poster Session 2 & Exhibit Hall with Coffee Break
Yuyi Liu ⋅ Xinhang Song ⋅ Tianliang Qi ⋅ Shuqiang Jiang
Exhibit Hall I #282
MSQ: Memory-Efficient Bit Sparsification Quantization Poster Session 5 & Exhibit Hall
Seokho Han ⋅ Seoyeon Yoon ⋅ Jinhee Kim ⋅ Dongwei Wang ⋅ Kang Jeon ⋅ Huanrui Yang ⋅ Jong Hwan Ko
Exhibit Hall I #195
SuMa: A Subspace Mapping Approach for Robust and Effective Concept Erasure in Text-to-Image Diffusion Models Poster Session 4 & Exhibit Hall with Coffee Break
Kien Nguyen ⋅ Anh Tran ⋅ Cuong Pham
Exhibit Hall I #450
LVFace: Progressive Cluster Optimization for Large Vision Models in Face Recognition Poster Session 3 & Exhibit Hall
Jinghan You ⋅ Shanglin Li ⋅ Yuanrui Sun ⋅ Jiangchuanwei Wei ⋅ Mingyu Guo ⋅ Chao Feng ⋅ Jiao Ran
Exhibit Hall I #173
Timestep-Aware Diffusion Model for Extreme Image Rescaling Poster Session 4 & Exhibit Hall with Coffee Break
Ce Wang ⋅ Zhenyu Hu ⋅ Wanjie Sun ⋅ Zhenzhong Chen
Exhibit Hall I #66
Recovering Parametric Scenes from Very Few Time-of-Flight Pixels Poster Session 6 & Exhibit Hall with Coffee Break
Carter Sifferman ⋅ Yiquan Li ⋅ Yiming Li ⋅ Fangzhou Mu ⋅ Michael Gleicher ⋅ Mohit Gupta ⋅ Yin Li
Exhibit Hall I #315
SRefiner: Soft-Braid Attention for Multi-Agent Trajectory Refinement Poster Session 1 & Exhibit Hall
Liwen Xiao ⋅ Zhiyu Pan ⋅ Zhicheng Wang ⋅ Zhiguo Cao ⋅ Wei Li
Exhibit Hall I #82
Generating Physically Stable and Buildable Brick Structures from Text Poster Session 4 & Exhibit Hall with Coffee Break
Ava Pun ⋅ Kangle Deng ⋅ Ruixuan Liu ⋅ Deva Ramanan ⋅ Changliu Liu ⋅ Jun-Yan Zhu
Exhibit Hall I #306
An Empirical Study of Autoregressive Pre-training from Videos Poster Session 4 & Exhibit Hall with Coffee Break
Jathushan Rajasegaran ⋅ Ilija Radosavovic ⋅ Rahul Ravishankar ⋅ Yossi Gandelsman ⋅ Christoph Feichtenhofer ⋅ Jitendra Malik
Exhibit Hall I #405
Rethinking Few Shot CLIP Benchmarks: A Critical Analysis in the Inductive Setting Poster Session 1 & Exhibit Hall
Alexey Kravets ⋅ Da Chen ⋅ Vinay Namboodiri
Exhibit Hall I #172
TACO: Taming Diffusion for in-the-wild Video Amodal Completion Poster Session 3 & Exhibit Hall
Ruijie Lu ⋅ Yixin Chen ⋅ Yu Liu ⋅ Jiaxiang Tang ⋅ Junfeng Ni ⋅ Diwen Wan ⋅ Gang Zeng ⋅ Siyuan Huang
Exhibit Hall I #342
STI-Bench: Are MLLMs Ready for Precise Spatial-Temporal World Understanding? Poster Session 2 & Exhibit Hall with Coffee Break
Yun Li ⋅ Yiming Zhang ⋅ Tao Lin ⋅ Xiangrui Liu ⋅ Wenxiao Cai ⋅ Zheng Liu ⋅ Bo Zhao
Exhibit Hall I #56
Debiased Teacher for Day-to-Night Domain Adaptive Object Detection Poster Session 1 & Exhibit Hall
Yiming Cui ⋅ Liang Li ⋅ Haibing YIN ⋅ Yuhan Gao ⋅ Yaoqi Sun ⋅ Chenggang Yan
Exhibit Hall I #237
Towards Effective Foundation Model Adaptation for Extreme Cross-Domain Few-Shot Learning Poster Session 1 & Exhibit Hall
Fei Zhou ⋅ Peng Wang ⋅ Lei Zhang ⋅ Wei Wei ⋅ Chen Ding ⋅ Guosheng Lin ⋅ Yanning Zhang
Exhibit Hall I #430
SpikePack: Enhanced Information Flow in Spiking Neural Networks with High Hardware Compatibility Poster Session 5 & Exhibit Hall
Guobin Shen ⋅ Jindong Li ⋅ Tenglong Li ⋅ Dongcheng Zhao ⋅ Yi Zeng
Exhibit Hall I #338
AV-Flow: Transforming Text to Audio-Visual Human-like Interactions Poster Session 3 & Exhibit Hall
Aggelina Chatziagapi ⋅ Louis-Philippe Morency ⋅ Hongyu Gong ⋅ Michael Zollhöfer ⋅ Dimitris Samaras ⋅ Alexander Richard
Exhibit Hall I #402
Stronger, Steadier & Superior: Geometric Consistency in Depth VFM Forges Domain Generalized Semantic Segmentation Poster Session 2 & Exhibit Hall with Coffee Break
Siyu Chen ⋅ Ting Han ⋅ Changshe Zhang ⋅ Xin Luo ⋅ Meiliu Wu ⋅ Guorong Cai ⋅ Jinhe Su
Exhibit Hall I #308
Efficient Adaptation of Pre-trained Vision Transformer underpinned by Approximately Orthogonal Fine-Tuning Strategy Poster Session 1 & Exhibit Hall
Yiting Yang ⋅ Hao Luo ⋅ Yuan Sun ⋅ Qingsen Yan ⋅ Haokui Zhang ⋅ Wei Dong ⋅ Guoqing Wang ⋅ Peng Wang ⋅ Yang Yang ⋅ Heng Tao Shen
Exhibit Hall I #458
FA: Forced Prompt Learning of Vision-Language Models for Out-of-Distribution Detection Poster Session 1 & Exhibit Hall
Xinhua Lu ⋅ Runhe Lai ⋅ Yanqi Wu ⋅ Kanghao Chen ⋅ Wei-Shi Zheng ⋅ Ruixuan Wang
Exhibit Hall I #100
Compression-Aware One-Step Diffusion Model for JPEG Artifact Removal Poster Session 4 & Exhibit Hall with Coffee Break
Jinpei Guo ⋅ Zheng Chen ⋅ Wenbo Li ⋅ Yong Guo ⋅ YULUN ZHANG
Exhibit Hall I #4
ConstStyle: Robust Domain Generalization with Unified Style Transformation Poster Session 1 & Exhibit Hall
Nam Duong Tran ⋅ Nam Nguyen Phuong ⋅ Hieu Pham ⋅ Phi Le Nguyen ⋅ My Thai
Exhibit Hall I #293
CCL-LGS: Contrastive Codebook Learning for 3D Language Gaussian Splatting Poster Session 2 & Exhibit Hall with Coffee Break
Lei Tian ⋅ Xiaomin Li ⋅ Liqian Ma ⋅ Hao Yin ⋅ Zirui Zheng ⋅ Hefei Huang ⋅ Taiqing Li ⋅ Huchuan Lu ⋅ Xu Jia
Exhibit Hall I #453
DreamActor-M1: Holistic, Expressive and Robust Human Image Animation with Hybrid Guidance Poster Session 3 & Exhibit Hall
Yuxuan Luo ⋅ Zhengkun Rong ⋅ Lizhen Wang ⋅ Longhao Zhang ⋅ Tianshu Hu
Exhibit Hall I #97
Learning Few-Step Diffusion Models by Trajectory Distribution Matching Poster Session 4 & Exhibit Hall with Coffee Break
Yihong Luo ⋅ Tianyang Hu ⋅ Jiacheng Sun ⋅ Yujun Cai ⋅ Jing Tang
Exhibit Hall I #271
Aether: Geometric-Aware Unified World Modeling Poster Session 2 & Exhibit Hall with Coffee Break
Haoyi Zhu ⋅ Yifan Wang ⋅ Jianjun Zhou ⋅ Wenzheng Chang ⋅ Yang Zhou ⋅ Zizun Li ⋅ Junyi Chen ⋅ Chunhua Shen ⋅ Jiangmiao Pang ⋅ Tong He
Exhibit Hall I #331
ConsistentCity: Semantic Flow-guided Occupancy DiT for Temporally Consistent Driving Scene Synthesis Poster Session 6 & Exhibit Hall with Coffee Break
Benjin Zhu ⋅ Xiaogang Wang ⋅ Hongsheng Li
Exhibit Hall I #161
CLOT: Closed Loop Optimal Transport for Unsupervised Action Segmentation Poster Session 3 & Exhibit Hall
Elena Bueno-Benito ⋅ Mariella Dimiccoli
Exhibit Hall I #66
Dual-Temporal Exemplar Representation Network for Video Semantic Segmentation Poster Session 3 & Exhibit Hall
Xiaolong Xu ⋅ Lei Zhang ⋅ Jiayi Li ⋅ Lituan Wang ⋅ Yifan Guan ⋅ Yu Yan ⋅ Leyi Zhang ⋅ Hao Song
Exhibit Hall I #71
Gaussian Variation Field Diffusion for High-fidelity Video-to-4D Synthesis Poster Session 3 & Exhibit Hall
Bowen Zhang ⋅ Sicheng Xu ⋅ Chuxin Wang ⋅ Jiaolong Yang ⋅ Feng Zhao ⋅ Dong Chen ⋅ Baining Guo
Exhibit Hall I #236
Unified Open-World Segmentation with Multi-Modal Prompts Poster Session 5 & Exhibit Hall
Yang Liu ⋅ Yufei Yin ⋅ Chenchen Jing ⋅ Muzhi Zhu ⋅ Hao Chen ⋅ Yuling Xi ⋅ Bo Feng ⋅ Hao Wang ⋅ Shiyu Li ⋅ Chunhua Shen
Exhibit Hall I #165
Neurons: Emulating the Human Visual Cortex Improves Fidelity and Interpretability in fMRI-to-Video Reconstruction Poster Session 4 & Exhibit Hall with Coffee Break
Haonan Wang ⋅ Qixiang ZHANG ⋅ Lehan Wang ⋅ Xuanqi Huang ⋅ Xiaomeng Li
Exhibit Hall I #334
Outdoor Monocular SLAM with Global Scale-Consistent 3D Gaussian Pointmaps Poster Session 6 & Exhibit Hall with Coffee Break
Chong Cheng ⋅ Sicheng Yu ⋅ Zijian Wang ⋅ Yifan Zhou ⋅ Hao Wang
Exhibit Hall I #125
LayerAnimate: Layer-level Control for Animation Poster Session 3 & Exhibit Hall
Yuxue Yang ⋅ Lue Fan ⋅ Zuzeng Lin ⋅ Feng Wang ⋅ Zhaoxiang Zhang
Exhibit Hall I #81
AnimateAnyMesh: A Feed-Forward 4D Foundation Model for Text-Driven Universal Mesh Animation Poster Session 3 & Exhibit Hall
zijie wu ⋅ Chaohui Yu ⋅ Fan Wang ⋅ Xiang Bai
Exhibit Hall I #335
Rethinking the Embodied Gap in Vision-and-Language Navigation: A Holistic Study of Physical and Visual Disparities Poster Session 2 & Exhibit Hall with Coffee Break
Liuyi Wang ⋅ Xinyuan Xia ⋅ Hui Zhao ⋅ Hanqing Wang ⋅ Tai Wang ⋅ Yilun Chen ⋅ Chengju Liu ⋅ Qijun Chen ⋅ Jiangmiao Pang
Exhibit Hall I #416
SuperEvent: Cross-Modal Learning of Event-based Keypoint Detection for SLAM Poster Session 2 & Exhibit Hall with Coffee Break
Yannick Burkhardt ⋅ Simon Schaefer ⋅ Stefan Leutenegger
Exhibit Hall I #366
RogSplat: Robust Gaussian Splatting via Generative Priors Poster Session 6 & Exhibit Hall with Coffee Break
Hanyang Kong ⋅ Xingyi Yang ⋅ Xinchao Wang
Exhibit Hall I #97
From Imitation to Innovation: The Emergence of AI's Unique Artistic Styles and the Challenge of Copyright Protection Poster Session 4 & Exhibit Hall with Coffee Break
Zexi Jia ⋅ Chuanwei Huang ⋅ Hongyan Fei ⋅ Yeshuang Zhu ⋅ Zhiqiang Yuan ⋅ Ying Deng ⋅ Jiapei Zhang ⋅ Jinchao Zhang ⋅ Jie Zhou
Exhibit Hall I #393
Intra-modal and Cross-modal Synchronization for Audio-visual Deepfake Detection and Temporal Localization Poster Session 3 & Exhibit Hall
Ashutosh Anshul ⋅ Shreyas Gopal ⋅ Deepu Rajan ⋅ Eng Chng
Exhibit Hall I #359
MinCD-PnP: Learning 2D-3D Correspondences with Approximate Blind PnP Poster Session 6 & Exhibit Hall with Coffee Break
Pei An ⋅ Jiaqi Yang ⋅ Muyao Peng ⋅ You Yang ⋅ Qiong Liu ⋅ Xiaolin Wu ⋅ Liangliang Nan
Exhibit Hall I #174
SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation Poster Session 3 & Exhibit Hall
Shiqi Huang ⋅ Shuting He ⋅ Huaiyuan Qin ⋅ Bihan Wen
Exhibit Hall I #241
Mind the Cost of Scaffold! Benign Clients May Even Become Accomplices of Backdoor Attack Poster Session 1 & Exhibit Hall
Xingshuo Han ⋅ Xuanye Zhang ⋅ Xiang Lan ⋅ Haozhao Wang ⋅ Shengmin Xu ⋅ Shen Ren ⋅ Jason Zeng ⋅ Ming Wu ⋅ Michael Heinrich ⋅ Tianwei Zhang
Exhibit Hall I #140
How To Make Your Cell Tracker Say "I dunno!" Poster Session 2 & Exhibit Hall with Coffee Break
Richard D Paul ⋅ Johannes Seiffarth ⋅ David Rügamer ⋅ Hanno Scharr ⋅ Katharina Nöh
Exhibit Hall I #178
InstructSeg: Unifying Instructed Visual Segmentation with Multi-modal Large Language Models Poster Session 5 & Exhibit Hall
Cong Wei ⋅ Yujie Zhong ⋅ yingsen zeng ⋅ Haoxian Tan ⋅ Yong Liu ⋅ Hongfa Wang ⋅ Yujiu Yang
Exhibit Hall I #37
Bilateral Collaboration with Large Vision-Language Models for Open Vocabulary Human-Object Interaction Detection Poster Session 5 & Exhibit Hall
Yupeng Hu ⋅ Changxing Ding ⋅ Chang Sun ⋅ Shaoli Huang ⋅ Xiangmin Xu
Exhibit Hall I #31
Open-ended Hierarchical Streaming Video Understanding with Vision Language Models Poster Session 5 & Exhibit Hall
Hyolim Kang ⋅ Yunsu Park ⋅ Youngbeom Yoo ⋅ Yeeun Choi ⋅ Seon Joo Kim
Exhibit Hall I #87
V2M4: 4D Mesh Animation Reconstruction from a Single Monocular Video Poster Session 3 & Exhibit Hall
Jianqi Chen ⋅ Biao Zhang ⋅ Xiangjun Tang ⋅ Peter Wonka
Exhibit Hall I #155
CoTMR: Chain-of-Thought Multi-Scale Reasoning for Training-Free Zero-Shot Composed Image Retrieval Poster Session 5 & Exhibit Hall
Zelong Sun ⋅ Dong Jing ⋅ Zhiwu Lu
Exhibit Hall I #270
Towards a 3D Transfer-based Black-box Attack via Critical Feature Guidance Poster Session 6 & Exhibit Hall with Coffee Break
Shuchao Pang ⋅ Zhenghan Chen ⋅ Shen Zhang ⋅ Liming Lu ⋅ Siyuan Liang ⋅ Anan Du ⋅ Yongbin Zhou
Exhibit Hall I #211
DepthSync: Diffusion Guidance-Based Depth Synchronization for Scale- and Geometry-Consistent Video Depth Estimation Poster Session 2 & Exhibit Hall with Coffee Break
Yue-Jiang Dong ⋅ Wang Zhao ⋅ Jiale Xu ⋅ Ying Shan ⋅ Song-Hai Zhang
Exhibit Hall I #37
A Token-level Text Image Foundation Model for Document Understanding Poster Session 5 & Exhibit Hall
Tongkun Guan ⋅ Zining Wang ⋅ Pei Fu ⋅ Zhentao Guo ⋅ Wei Shen ⋅ Kai zhou ⋅ Tiezhu Yue ⋅ Chen Duan ⋅ Hao Sun ⋅ Qianyi Jiang ⋅ Junfeng Luo ⋅ Xiaokang Yang
Exhibit Hall I #322
Learning 3D Object Spatial Relationships from Pre-trained 2D Diffusion Models Poster Session 2 & Exhibit Hall with Coffee Break
Sangwon Baik ⋅ Hyeonwoo Kim ⋅ Hanbyul Joo
Exhibit Hall I #320
MonoMVSNet: Monocular Priors Guided Multi-View Stereo Network Poster Session 6 & Exhibit Hall with Coffee Break
Jianfei Jiang ⋅ Qiankun Liu ⋅ Haochen Yu ⋅ Hongyuan Liu ⋅ Liyong Wang ⋅ Jiansheng Chen ⋅ Huimin Ma
Exhibit Hall I #298
Instance-Level Video Depth in Groups Beyond Occlusions Poster Session 2 & Exhibit Hall with Coffee Break
Yuan Liang ⋅ Yang Zhou ⋅ Ziming Sun ⋅ Tianyi Xiang ⋅ Guiqing Li ⋅ Shengfeng He
Exhibit Hall I #241
Flow Stochastic Segmentation Networks Poster Session 3 & Exhibit Hall
Fabio De Sousa Ribeiro ⋅ Omar Todd ⋅ Charles Jones ⋅ Avinash Kori ⋅ Raghav Mehta ⋅ Ben Glocker
Exhibit Hall I #447
Animate Anyone 2: High-Fidelity Character Image Animation with Environment Affordance Poster Session 3 & Exhibit Hall
Li Hu ⋅ wang yuan ⋅ Zhen Shen ⋅ Xin Gao ⋅ Dechao Meng ⋅ Li'an Zhuo ⋅ Peng Zhang ⋅ Bang Zhang ⋅ Liefeng Bo
Exhibit Hall I #19
Future-Aware Interaction Network For Motion Forecasting Poster Session 2 & Exhibit Hall with Coffee Break
Shijie Li ⋅ Chunyu Liu ⋅ Xun Xu ⋅ Si Yong Yeo ⋅ Xulei Yang
Exhibit Hall I #234
ScanEdit: Hierarchically-Guided Functional 3D Scan Editing Poster Session 6 & Exhibit Hall with Coffee Break
Mohamed El Amine Boudjoghra ⋅ Ivan Laptev ⋅ Angela Dai
Exhibit Hall I #231
Latent Diffusion Models with Masked AutoEncoders Poster Session 4 & Exhibit Hall with Coffee Break
Junho Lee ⋅ Jeongwoo Shin ⋅ Hyungwook Choi ⋅ Joonseok Lee
Exhibit Hall I #243
DreamCube: RGB-D Panorama Generation via Multi-plane Synchronization Poster Session 6 & Exhibit Hall with Coffee Break
Yukun Huang ⋅ Yanning Zhou ⋅ Jianan Wang ⋅ Kaiyi Huang ⋅ Xihui Liu
Exhibit Hall I #19
From Enhancement to Understanding: Build a Generalized Bridge for Low-light Vision via Semantically Consistent Unsupervised Fine-tuning Poster Session 3 & Exhibit Hall
Sen Wang ⋅ Shao Zeng ⋅ Tianjun Gu ⋅ zhizhong zhang ⋅ Ruixin Zhang ⋅ Shouhong Ding ⋅ Jingyun Zhang ⋅ Jun Wang ⋅ Xin TAN ⋅ Yuan Xie ⋅ Lizhuang Ma
Exhibit Hall I #357
Inpaint4Drag: Repurposing Inpainting Models for Drag-Based Image Editing via Bidirectional Warping Poster Session 4 & Exhibit Hall with Coffee Break
Jingyi Lu ⋅ Kai Han
Exhibit Hall I #328
MotionAgent: Fine-grained Controllable Video Generation via Motion Field Agent Poster Session 3 & Exhibit Hall
Xinyao Liao ⋅ Xianfang Zeng ⋅ Liao Wang ⋅ Gang YU ⋅ Guosheng Lin ⋅ Chi Zhang
Exhibit Hall I #122
Unified Video Generation via Next-Set Prediction in Continuous Domain Poster Session 4 & Exhibit Hall with Coffee Break
Zhanzhou Feng ⋅ Qingpei Guo ⋅ Xinyu Xiao ⋅ Ruihan Xu ⋅ Ming Yang ⋅ Shiliang Zhang
Exhibit Hall I #435
PropVG: End-to-End Proposal-Driven Visual Grounding with Multi-Granularity Discrimination Poster Session 2 & Exhibit Hall with Coffee Break
Ming Dai ⋅ Wenxuan Cheng ⋅ Jiedong Zhuang ⋅ Jiang-Jiang Liu ⋅ Hongshen Zhao ⋅ Zhenhua Feng ⋅ Wankou Yang
Exhibit Hall I #191
LazyMAR: Accelerating Masked Autoregressive Models via Feature Caching Poster Session 4 & Exhibit Hall with Coffee Break
Feihong Yan ⋅ qingyan wei ⋅ Jiayi Tang ⋅ Jiajun Li ⋅ Yulin Wang ⋅ Xuming Hu ⋅ Huiqi Li ⋅ Linfeng Zhang
Exhibit Hall I #62
Visual Intention Grounding for Egocentric Assistants Poster Session 1 & Exhibit Hall
Pengzhan Sun ⋅ Junbin Xiao ⋅ Tze Ho Elden Tse ⋅ Yicong Li ⋅ Arjun Akula ⋅ Angela Yao
Exhibit Hall I #231
Omni-scene Perception-oriented Point Cloud Geometry Enhancement for Coordinate Quantization Poster Session 6 & Exhibit Hall with Coffee Break
Wang Liu ⋅ Wei Gao
Exhibit Hall I #127
MVQA: Mamba with Unified Sampling for Efficient Video Quality Assessment Poster Session 4 & Exhibit Hall with Coffee Break
Yachun Mi ⋅ Yu Li ⋅ Weicheng Meng ⋅ Chaofeng Chen ⋅ Chen Hui ⋅ Shaohui Liu
Exhibit Hall I #346
INS-MMBench: A Comprehensive Benchmark for Evaluating LVLMs' Performance in Insurance Poster Session 2 & Exhibit Hall with Coffee Break
Chenwei Lin ⋅ Hanjia Lyu ⋅ Xian Xu ⋅ Jiebo Luo
Exhibit Hall I #377
FreeFlux: Understanding and Exploiting Layer-Specific Roles in RoPE-Based MMDiT for Versatile Image Editing Poster Session 4 & Exhibit Hall with Coffee Break
Tianyi Wei ⋅ Yifan Zhou ⋅ Dongdong Chen ⋅ Xingang Pan
Exhibit Hall I #178
PriOr-Flow: Enhancing Primitive Panoramic Optical Flow with Orthogonal View Poster Session 2 & Exhibit Hall with Coffee Break
Longliang Liu ⋅ Miaojie Feng ⋅ Junda Cheng ⋅ Jijun Xiang ⋅ Xuan Zhu ⋅ Xin Yang
Exhibit Hall I #29
Retinex-MEF: Retinex-based Glare Effects Aware Unsupervised Multi-Exposure Image Fusion Poster Session 2 & Exhibit Hall with Coffee Break
Haowen Bai ⋅ Jiangshe Zhang ⋅ Zixiang Zhao ⋅ Lilun Deng ⋅ Yukun Cui ⋅ Shuang Xu
Exhibit Hall I #209
Zero-Shot Composed Image Retrieval via Dual-Stream Instruction-Aware Distillation Poster Session 5 & Exhibit Hall
Wenliang Zhong ⋅ Rob Barton ⋅ Weizhi An ⋅ Feng Jiang ⋅ Hehuan Ma ⋅ Yuzhi Guo ⋅ Abhishek Dan ⋅ Shioulin Sam ⋅ Karim Bouyarmane ⋅ Junzhou Huang
Exhibit Hall I #226