ICCV 2025 Accepted Papers
|
Real3D: Towards Scaling Large Reconstruction Models with Real Images
Poster Session 2 & Exhibit Hall with Coffee Break
Hanwen Jiang ⋅ Qixing Huang ⋅ Georgios Pavlakos
|
Exhibit Hall I #76 | |
|
Enhancing Few-Shot Vision-Language Classification with Large Multimodal Model Features
Poster Session 1 & Exhibit Hall
Chancharik Mitra ⋅ Brandon Huang ⋅ Tianning Chai ⋅ Zhiqiu Lin ⋅ Assaf Arbelle ⋅ Rogerio Feris ⋅ Leonid Karlinsky ⋅ Trevor Darrell ⋅ Deva Ramanan ⋅ Roei Herzig
|
Exhibit Hall I #254 | |
|
ContraGS: Codebook-Condensed and Trainable Gaussian Splatting for Fast, Memory-Efficient Reconstruction
Poster Session 6 & Exhibit Hall with Coffee Break
Sankeerth Durvasula ⋅ Sharanshangar Muhunthan ⋅ Zain Moustafa ⋅ Richard Chen ⋅ Ruofan Liang ⋅ Yushi Guan ⋅ Nilesh Ahuja ⋅ Nilesh Jain ⋅ Selvakumar Panneer ⋅ Nandita Vijaykumar
|
Exhibit Hall I #406 | |
|
ARIG: Autoregressive Interactive Head Generation for Real-time Conversations
Poster Session 3 & Exhibit Hall
Ying Guo ⋅ Xi Liu ⋅ Cheng Zhen ⋅ Pengfei Yan ⋅ Xiaoming Wei
|
Exhibit Hall I #278 | |
|
VALLR: Visual ASR Language Model for Lip Reading
Poster Session 1 & Exhibit Hall
Marshall Thomas ⋅ Edward Fish ⋅ Richard Bowden
|
Exhibit Hall I #262 | |
|
FREE-Merging: Fourier Transform for Efficient Model Merging
Poster Session 1 & Exhibit Hall
Shenghe Zheng ⋅ Hongzhi Wang
|
Exhibit Hall I #359 | |
|
Chimera: Improving Generalist Model with Domain-Specific Experts
Poster Session 1 & Exhibit Hall
Tianshuo Peng ⋅ Mingsheng Li ⋅ Jiakang Yuan ⋅ Hongbin Zhou ⋅ Renqiu Xia ⋅ Renrui Zhang ⋅ LEI BAI ⋅ Song Mao ⋅ Bin Wang ⋅ Aojun Zhou ⋅ Botian Shi ⋅ Tao Chen ⋅ Bo Zhang ⋅ Xiangyu Yue
|
Exhibit Hall I #278 | |
|
Why LVLMs Are More Prone to Hallucinations in Longer Responses: The Role of Context
Poster Session 1 & Exhibit Hall
Ge Zheng ⋅ Jiaye Qian ⋅ Jiajin Tang ⋅ Sibei Yang
|
Exhibit Hall I #384 | |
|
Any-SSR: How Recursive Least Squares Works in Continual Learning of Large Language Model
Poster Session 1 & Exhibit Hall
Kai Tong ⋅ Kang Pan ⋅ Xiao Zhang ⋅ Erli Meng ⋅ Run He ⋅ Yawen Cui ⋅ Nuoyan Guo ⋅ Huiping Zhuang
|
Exhibit Hall I #281 | |
|
Audio-visual Controlled Video Diffusion with Masked Selective State Spaces Modeling for Natural Talking Head Generation
Poster Session 3 & Exhibit Hall
Fating Hong ⋅ Zunnan Xu ⋅ Zixiang Zhou ⋅ Jun Zhou ⋅ Xiu Li ⋅ Qin Lin ⋅ Qinglin Lu ⋅ Dan Xu
|
Exhibit Hall I #240 | |
|
ImHead: A Large-scale Implicit Morphable Model for Localized Head Modeling
Poster Session 3 & Exhibit Hall
Rolandos Alexandros Potamias ⋅ Stathis Galanakis ⋅ Jiankang Deng ⋅ Athanasios Papaioannou ⋅ Stefanos Zafeiriou
|
Exhibit Hall I #18 | |
|
PathFinder: A Multi-Modal Multi-Agent System for Medical Diagnostic Decision-Making Applied to Histopathology
Poster Session 5 & Exhibit Hall
Fatemeh Ghezloo ⋅ Saygin Seyfioglu ⋅ Rustin Soraki ⋅ Wisdom Ikezogwo ⋅ Beibin Li ⋅ Tejoram Vivekanandan ⋅ Joann Elmore ⋅ Ranjay Krishna ⋅ Linda Shapiro
|
Exhibit Hall I #342 | |
|
SAS: Segment Any 3D Scene with Integrated 2D Priors
Poster Session 2 & Exhibit Hall with Coffee Break
Zhuoyuan Li ⋅ Jiahao Lu ⋅ Jiacheng Deng ⋅ Hanzhi Chang ⋅ Lifan Wu ⋅ Yanzhe Liang ⋅ Tianzhu Zhang
|
Exhibit Hall I #310 | |
|
GloPER: Unsupervised Animal Pattern Extraction from Local Reconstruction
Poster Session 2 & Exhibit Hall with Coffee Break
Bowen Chen ⋅ Yun Sing Koh ⋅ Gillian Dobbie
|
Exhibit Hall I #140 | |
|
StrandHead: Text to Hair-Disentangled 3D Head Avatars Using Human-Centric Priors
Poster Session 3 & Exhibit Hall
Xiaokun Sun ⋅ Zeyu Cai ⋅ Ying Tai ⋅ Jian Yang ⋅ Zhenyu Zhang
|
Exhibit Hall I #320 | |
|
Creation-MMBench: Assessing Context-Aware Creative Intelligence in MLLMs
Poster Session 1 & Exhibit Hall
Xinyu Fang ⋅ Zhijian Chen ⋅ Kai Lan ⋅ Lixin Ma ⋅ Shengyuan Ding ⋅ Yingji Liang ⋅ Xiangyu Zhao ⋅ Farong Wen ⋅ Zicheng Zhang ⋅ Guofeng Zhang ⋅ Haodong Duan ⋅ Kai Chen ⋅ Dahua Lin
|
Exhibit Hall I #32 | |
|
Can Knowledge be Transferred from Unimodal to Multimodal? Investigating the Transitivity of Multimodal Knowledge Editing
Poster Session 1 & Exhibit Hall
Lingyong Fang ⋅ Xinzhong Wang ⋅ Depeng depeng wang ⋅ Zongru Wu ⋅ Ya Guo ⋅ Huijia Zhu ⋅ Zhuosheng Zhang ⋅ Gongshen Liu
|
Exhibit Hall I #226 | |
|
Token Activation Map to Visually Explain Multimodal LLMs
Poster Session 1 & Exhibit Hall
Yi Li ⋅ Hualiang Wang ⋅ Xinpeng Ding ⋅ Haonan Wang ⋅ Xiaomeng Li
|
Exhibit Hall I #380 | |
|
Probabilistic Prototype Calibration of Vision-language Models for Generalized Few-shot Semantic Segmentation
Poster Session 5 & Exhibit Hall
Jie Liu ⋅ Jiayi Shen ⋅ Pan Zhou ⋅ Jan-Jakob Sonke ⋅ Stratis Gavves
|
Exhibit Hall I #126 | |
|
Learning Interpretable Queries for Explainable Image Classification with Information Pursuit
Poster Session 1 & Exhibit Hall
Stefan Kolek ⋅ Aditya Chattopadhyay ⋅ Kwan Ho Ryan Chan ⋅ Hector Andrade Loarca ⋅ Gitta Kutyniok ⋅ Rene Vidal
|
Exhibit Hall I #367 | |
|
Long-Tailed Classification with Multi-Granularity Semantics
Poster Session 1 & Exhibit Hall
Yuting Liu ⋅ Liu Yang ⋅ Yu Wang
|
Exhibit Hall I #401 | |
|
VideoLLaMB: Long Streaming Video Understanding with Recurrent Memory Bridges
Poster Session 5 & Exhibit Hall
Yuxuan Wang ⋅ Yiqi Song ⋅ Cihang Xie ⋅ Yang Liu ⋅ Zilong Zheng
|
Exhibit Hall I #409 | |
|
Auto-Regressive Transformation for Image Alignment
Poster Session 3 & Exhibit Hall
Kanggeon Lee ⋅ Soochahn Lee ⋅ Kyoung Mu Lee
|
Exhibit Hall I #336 | |
|
LMM-Det: Make Large Multimodal Models Excel in Object Detection
Poster Session 1 & Exhibit Hall
Jincheng Li ⋅ Chunyu Xie ⋅ Ji Ao ⋅ Dawei Leng ⋅ Yuhui Yin
|
Exhibit Hall I #19 | |
|
Attention to the Burtiness in Visual Prompt Tuning!
Poster Session 1 & Exhibit Hall
Yuzhu Wang ⋅ Manni Duan ⋅ Shu Kong
|
Exhibit Hall I #398 | |
|
Diffusion-Based Imaginative Coordination for Bimanual Manipulation
Poster Session 3 & Exhibit Hall
Huilin Xu ⋅ Jian Ding ⋅ Jiakun Xu ⋅ Ruixiang Wang ⋅ Jun Chen ⋅ Jinjie Mai ⋅ Yanwei Fu ⋅ Bernard Ghanem ⋅ Feng Xu ⋅ Mohamed Elhoseiny
|
Exhibit Hall I #137 | |
|
Learning Neural Scene Representation from iToF Imaging
Poster Session 6 & Exhibit Hall with Coffee Break
Wenjie Chang ⋅ Hanzhi Chang ⋅ Yueyi Zhang ⋅ Wenfei Yang ⋅ Tianzhu Zhang
|
Exhibit Hall I #310 | |
|
ChartCap: Mitigating Hallucination of Dense Chart Captioning
Junyoung Lim ⋅ Jaewoo Ahn ⋅ Gunhee Kim
|
Exhibit Hall I #298 | |
|
MultiVerse: A Multi-Turn Conversation Benchmark for Evaluating Large Vision and Language Models
Poster Session 1 & Exhibit Hall
Young-Jun Lee ⋅ Byung-Kwan Lee ⋅ Jianshu Zhang ⋅ Yechan Hwang ⋅ Byungsoo Ko ⋅ Han-Gyu Kim ⋅ Dongyu Yao ⋅ Xuankun Rong ⋅ Eojin Joo ⋅ Seung-Ho Han ⋅ Bowon Ko ⋅ Ho-Jin Choi
|
Exhibit Hall I #57 | |
|
Causal Disentanglement and Cross-Modal Alignment for Enhanced Few-Shot Learning
Poster Session 1 & Exhibit Hall
Tianjiao Jiang ⋅ Zhen Zhang ⋅ Yuhang Liu ⋅ Javen Qinfeng Shi
|
Exhibit Hall I #74 | |
|
Weakly-Supervised Learning of Dense Functional Correspondences
Poster Session 2 & Exhibit Hall with Coffee Break
Stefan Stojanov ⋅ Linan Zhao ⋅ Yunzhi Zhang ⋅ Daniel Yamins ⋅ Jiajun Wu
|
Exhibit Hall I #184 | |
|
PERSONA: Personalized Whole-Body 3D Avatar with Pose-Driven Deformations from a Single Image
Poster Session 3 & Exhibit Hall
Geonhee Sim ⋅ Gyeongsik Moon
|
Exhibit Hall I #251 | |
|
Unsupervised Visual Chain-of-Thought Reasoning via Preference Optimization
Poster Session 1 & Exhibit Hall
Kesen Zhao ⋅ Beier Zhu ⋅ Qianru Sun ⋅ Hanwang Zhang
|
Exhibit Hall I #209 | |
|
Staining and Locking Computer Vision Models Without Retraining
Poster Session 1 & Exhibit Hall
Oliver Sutton ⋅ Qinghua Zhou ⋅ George Leete ⋅ Alexander Gorban ⋅ Ivan Tyukin
|
Exhibit Hall I #213 | |
|
Test-Time Prompt Tuning for Zero-Shot Depth Completion
Chanhwi Jeong ⋅ Inhwan Bae ⋅ Jin-Hwi Park ⋅ Hae-Gon Jeon
|
Exhibit Hall I #415 | |
|
TITAN: Query-Token based Domain Adaptive Adversarial Learning
Poster Session 1 & Exhibit Hall
Tajamul Ashraf ⋅ Janibul Bashir
|
Exhibit Hall I #14 | |
|
Dual Reciprocal Learning of Language-based Human Motion Understanding and Generation
Poster Session 2 & Exhibit Hall with Coffee Break
CHEN LIANG ⋅ Zhicheng Shi ⋅ Wenguan Wang ⋅ Yi Yang
|
Exhibit Hall I #115 | |
|
StolenLoRA: Exploring LoRA Extraction Attacks via Synthetic Data
Yixu Wang ⋅ Yan Teng ⋅ Yingchun Wang ⋅ Xingjun Ma
|
Exhibit Hall I #15 | |
|
MMReason: An Open-Ended Multi-Modal Multi-Step Reasoning Benchmark for MLLMs Toward AGI
Poster Session 1 & Exhibit Hall
Huanjin Yao ⋅ Jiaxing Huang ⋅ Yawen Qiu ⋅ Michael K. Chen ⋅ Wenzheng Liu ⋅ Wei Zhang ⋅ wenjie zeng ⋅ Xikun ZHANG ⋅ Jingyi Zhang ⋅ YuXin Song ⋅ Wenhao Wu ⋅ Dacheng Tao
|
Exhibit Hall I #16 | |
|
PlaceIt3D: Language-Guided Object Placement in Real 3D Scenes
Poster Session 2 & Exhibit Hall with Coffee Break
Ahmed Abdelreheem ⋅ Filippo Aleotti ⋅ Jamie Watson ⋅ Zawar Qureshi ⋅ Abdelrahman Eldesokey ⋅ Peter Wonka ⋅ Gabriel Brostow ⋅ Sara Vicente ⋅ Guillermo Garcia-Hernando
|
Exhibit Hall I #154 | |
|
Know "No" Better: A Data-Driven Approach for Enhancing Negation Awareness in CLIP
Poster Session 1 & Exhibit Hall
Junsung Park ⋅ Jungbeom Lee ⋅ Jongyoon Song ⋅ Sangwon Yu ⋅ Dahuin Jung ⋅ Sungroh Yoon
|
Exhibit Hall I #260 | |
|
Motion Synthesis with Sparse and Flexible Keyjoint Control
Poster Session 3 & Exhibit Hall
Inwoo Hwang ⋅ Jinseok Bae ⋅ Donggeun Lim ⋅ Young Min Kim
|
Exhibit Hall I #303 | |
|
StruMamba3D: Exploring Structural Mamba for Self-supervised Point Cloud Representation Learning
Poster Session 6 & Exhibit Hall with Coffee Break
Chuxin Wang ⋅ Yixin Zha ⋅ Wenfei Yang ⋅ Tianzhu Zhang
|
Exhibit Hall I #370 | |
|
TRACE: Learning 3D Gaussian Physical Dynamics from Multi-view Videos
Poster Session 2 & Exhibit Hall with Coffee Break
Jinxi Li ⋅ Ziyang Song ⋅ Bo Yang
|
Exhibit Hall I #357 | |
|
D-Attn: Decomposed Attention for Large Vision-and-Language Model
Poster Session 5 & Exhibit Hall
Chia-Wen Kuo ⋅ Sijie Zhu ⋅ Fan Chen ⋅ Xiaohui Shen ⋅ Longyin Wen
|
Exhibit Hall I #388 | |
|
InfoBridge: Balanced Multimodal Integration through Conditional Dependency Modeling
Poster Session 1 & Exhibit Hall
Chenxin Li ⋅ Yifan Liu ⋅ Panwang Pan ⋅ Hengyu Liu ⋅ Xinyu Liu ⋅ Wuyang Li ⋅ Cheng Wang ⋅ Weihao Yu ⋅ Yiyang LIN ⋅ Yixuan Yuan
|
Exhibit Hall I #27 | |
|
Closed-Loop Transfer for Weakly-supervised Affordance Grounding
Poster Session 2 & Exhibit Hall with Coffee Break
Jiajin Tang ⋅ Zhengxuan Wei ⋅ Ge Zheng ⋅ Sibei Yang
|
Exhibit Hall I #423 | |
|
Exploring View Consistency for Scene-Adaptive Low-Light Light Field Image Enhancement
Shuo Zhang ⋅ Chen Gao ⋅ Youfang Lin
|
Exhibit Hall I #217 | |
|
Neuromanifold-Regularized KANs for Shape-fair Feature Representations
Poster Session 3 & Exhibit Hall
Mazlum Arslan ⋅ Weihong Guo ⋅ Shuo Li
|
Exhibit Hall I #262 | |
|
Training-Free Class Purification for Open-Vocabulary Semantic Segmentation
Poster Session 5 & Exhibit Hall
Qi Chen ⋅ Lingxiao Yang ⋅ Yun Chen ⋅ Nailong Zhao ⋅ Jianhuang Lai ⋅ Jie Shao ⋅ Xiaohua Xie
|
Exhibit Hall I #314 | |
|
Perspective-Aware Reasoning in Vision-Language Models via Mental Imagery Simulation
Poster Session 2 & Exhibit Hall with Coffee Break
Yuseung Lee ⋅ Jihyeon Je ⋅ Chanho Park ⋅ Mikaela Uy ⋅ Leonidas Guibas ⋅ Minhyuk Sung
|
Exhibit Hall I #397 | |
|
Erasing More Than Intended? How Concept Erasure Degrades the Generation of Non-Target Concepts
Poster Session 4 & Exhibit Hall with Coffee Break
Ibtihel Amara ⋅ Ahmed Imtiaz Humayun ⋅ Ivana Kajic ⋅ Zarana Parekh ⋅ Natalie Harris ⋅ Sarah Young ⋅ Chirag Nagpal ⋅ Najoung Kim ⋅ Junfeng He ⋅ Cristina Vasconcelos ⋅ Deepak Ramachandran ⋅ Golnoosh Farnadi ⋅ Katherine Heller ⋅ Mohammad Havaei ⋅ Negar Rostamzadeh
|
Exhibit Hall I #145 | |
|
Colors See Colors Ignore: Clothes Changing ReID with Color Disentanglement
Poster Session 4 & Exhibit Hall with Coffee Break
Priyank Pathak ⋅ Yogesh Rawat
|
Exhibit Hall I #183 | |
|
MonoSOWA: Scalable monocular 3D Object detector Without human Annotations
Poster Session 2 & Exhibit Hall with Coffee Break
Jan Skvrna ⋅ Lukas Neumann
|
Exhibit Hall I #244 | |
|
MA-CIR: A Multimodal Arithmetic Benchmark for Composed Image Retrieval
Poster Session 5 & Exhibit Hall
Jaeseok Byun ⋅ Young Kyun Jang ⋅ Seokhyeon Jeong ⋅ Donghyun Kim ⋅ Taesup Moon
|
Exhibit Hall I #143 | |
|
Revisiting Point Cloud Completion: Are We Ready For The Real-World?
Poster Session 6 & Exhibit Hall with Coffee Break
Stuti Pathak ⋅ Prashant Kumar ⋅ Dheeraj Baiju ⋅ Nicholus Mboga ⋅ Gunther Steenackers ⋅ Rudi Penne
|
Exhibit Hall I #63 | |
|
Clink! Chop! Thud! - Learning Object Sounds from Real-World Interactions
Poster Session 3 & Exhibit Hall
Mengyu Yang ⋅ Yiming Chen ⋅ Haozheng Pei ⋅ Siddhant Agarwal ⋅ Arun Vasudevan ⋅ James Hays
|
Exhibit Hall I #428 | |
|
UnZipLoRA: Separating Content and Style from a Single Image
Chang Liu ⋅ Viraj Shah ⋅ Aiyu Cui ⋅ Svetlana Lazebnik
|
Exhibit Hall I #181 | |
|
Learning Precise Affordances from Egocentric Videos for Robotic Manipulation
Poster Session 3 & Exhibit Hall
Li ⋅ Nikolaos Tsagkas ⋅ Jifei Song ⋅ Ruaridh Mon-Williams ⋅ Sethu Vijayakumar ⋅ Kun Shao ⋅ Laura Sevilla-Lara
|
Exhibit Hall I #53 | |
|
Learning Visual Proxy for Compositional Zero-Shot Learning
Poster Session 1 & Exhibit Hall
Shiyu Zhang ⋅ Cheng Yan ⋅ Yang Liu ⋅ Chenchen Jing ⋅ Lei Zhou ⋅ Wenjun Wang
|
Exhibit Hall I #257 | |
|
SC-Captioner: Improving Image Captioning with Self-Correction by Reinforcement Learning
Poster Session 5 & Exhibit Hall
Lin Zhang ⋅ Xianfang Zeng ⋅ Kangcong Li ⋅ Gang YU ⋅ Tao Chen
|
Exhibit Hall I #316 | |
|
Principles of Visual Tokens for Efficient Video Understanding
Poster Session 5 & Exhibit Hall
Xinyue Hao ⋅ Li ⋅ Shreyank Gowda ⋅ Robert Fisher ⋅ Jonathan Huang ⋅ Anurag Arnab ⋅ Laura Sevilla-Lara
|
Exhibit Hall I #135 | |
|
DuoCLR: Dual-Surrogate Contrastive Learning for Skeleton-based Human Action Segmentation
Poster Session 3 & Exhibit Hall
Haitao Tian
|
Exhibit Hall I #354 | |
|
NuiScene: Exploring Efficient Generation of Unbounded Outdoor Scenes
Poster Session 6 & Exhibit Hall with Coffee Break
Han-Hung Lee ⋅ Qinghong Han ⋅ Angel Chang
|
Exhibit Hall I #173 | |
|
Progressive Artwork Outpainting via Latent Diffusion Models
Poster Session 4 & Exhibit Hall with Coffee Break
Dae-Young Song ⋅ Jung-Jae Yu ⋅ Donghyeon Cho
|
Exhibit Hall I #48 | |
|
Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generation
Poster Session 4 & Exhibit Hall with Coffee Break
Sucheng Ren ⋅ Qihang Yu ⋅ Ju He ⋅ Xiaohui Shen ⋅ Alan Yuille ⋅ Liang-Chieh (Jay) Chen
|
Exhibit Hall I #85 | |
|
CObL: Toward Zero-Shot Ordinal Layering without User Prompting
Aneel Damaraju ⋅ Dean Hazineh ⋅ Todd Zickler
|
Exhibit Hall I #294 | |
|
Hierarchical Material Recognition from Local Appearance
Matthew Beveridge ⋅ Shree Nayar
|
Exhibit Hall I #295 | |
|
Event-guided Unified Framework for Low-light Video Enhancement, Frame Interpolation, and Deblurring
Poster Session 2 & Exhibit Hall with Coffee Break
Taewoo Kim ⋅ Kuk-Jin Yoon
|
Exhibit Hall I #330 | |
|
DisenQ: Disentangling Q-Former for Activity-Biometrics
Shehreen Azad ⋅ Yogesh Rawat
|
Exhibit Hall I #330 | |
|
PS-Mamba: Spatial-Temporal Graph Mamba for Pose Sequence Refinement
Poster Session 2 & Exhibit Hall with Coffee Break
Haoye Dong ⋅ Gim Hee Lee
|
Exhibit Hall I #334 | |
|
SliderSpace: Decomposing the Visual Capabilities of Diffusion Models
Poster Session 4 & Exhibit Hall with Coffee Break
Rohit Gandikota ⋅ Zongze Wu ⋅ Richard Zhang ⋅ David Bau ⋅ Eli Shechtman ⋅ Nicholas Kolkin
|
Exhibit Hall I #105 | |
|
GIViC: Generative Implicit Video Compression
Poster Session 4 & Exhibit Hall with Coffee Break
Ge Gao ⋅ Siyue Teng ⋅ Tianhao Peng ⋅ Fan Zhang ⋅ David Bull
|
Exhibit Hall I #237 | |
|
Aligning Moments in Time using Video Queries
Poster Session 5 & Exhibit Hall
Yogesh Kumar ⋅ Uday Agarwal ⋅ Manish Gupta ⋅ Anand Mishra
|
Exhibit Hall I #39 | |
|
ReFlex: Text-Guided Editing of Real Images in Rectified Flow via Mid-Step Feature Extraction and Attention Adaptation
Jimyeong Kim ⋅ Jungwon Park ⋅ Yeji Song ⋅ Nojun Kwak ⋅ Wonjong Rhee
|
Exhibit Hall I #100 | |
|
Streamlining Image Editing with Layered Diffusion Brushes
Poster Session 4 & Exhibit Hall with Coffee Break
Peyman Gholami ⋅ Robert Xiao
|
Exhibit Hall I #238 | |
|
MDD: A Dataset for Text-and-Music Conditioned Duet Dance Generation
Poster Session 3 & Exhibit Hall
Prerit Gupta ⋅ Jason Alexander Fotso-Puepi ⋅ Zhengyuan Li ⋅ Jay Mehta ⋅ Aniket Bera
|
Exhibit Hall I #369 | |
|
GECO: Geometrically Consistent Embedding with Lightspeed Inference
Poster Session 2 & Exhibit Hall with Coffee Break
Regine Hartwig ⋅ Dominik Muhle ⋅ Riccardo Marin ⋅ Daniel Cremers
|
Exhibit Hall I #403 | |
|
Removing Cost Volumes from Optical Flow Estimators
Poster Session 1 & Exhibit Hall
Simon Kiefhaber ⋅ Stefan Roth ⋅ Simone Schaub-Meyer
|
Exhibit Hall I #76 | |
|
HouseTour: A Virtual Real Estate A(I)gent
Poster Session 4 & Exhibit Hall with Coffee Break
Ata Çelen ⋅ Iro Armeni ⋅ Daniel Barath ⋅ Marc Pollefeys
|
Exhibit Hall I #275 | |
|
Scheduling Weight Transitions for Quantization-Aware Training
Poster Session 5 & Exhibit Hall
Junghyup Lee ⋅ Jeimin Jeon ⋅ Dohyung Kim ⋅ Bumsub Ham
|
Exhibit Hall I #345 | |
|
p-MoD: Building Mixture-of-Depths MLLMs via Progressive Ratio Decay
Poster Session 1 & Exhibit Hall
Jun Zhang ⋅ Desen Meng ⋅ Zhengming Zhang ⋅ Zhenpeng Huang ⋅ Tao Wu ⋅ Limin Wang
|
Exhibit Hall I #344 | |
|
Event-based Visual Vibrometry
Poster Session 6 & Exhibit Hall with Coffee Break
Xinyu Zhou ⋅ Peiqi Duan ⋅ Yeliduosi Xiaokaiti ⋅ Chao Xu ⋅ Boxin Shi
|
Exhibit Hall I #75 | |
|
Mobile Video Diffusion
Poster Session 4 & Exhibit Hall with Coffee Break
Haitam Ben Yahia ⋅ Denis Korzhenkov ⋅ Ioannis Lelekas ⋅ Amir Ghodrati ⋅ Amir Habibian
|
Exhibit Hall I #437 | |
|
LeGrad: An Explainability Method for Vision Transformers via Feature Formation Sensitivity
Poster Session 5 & Exhibit Hall
Walid Bousselham ⋅ Angie Boggust ⋅ Sofian Chaybouti ⋅ Hendrik Strobelt ⋅ Hilde Kuehne
|
Exhibit Hall I #50 | |
|
ConsNoTrainLoRA: Data-driven Weight Initialization of Low-rank Adapters using Constraints
Poster Session 1 & Exhibit Hall
Debasmit Das ⋅ Hyoungwoo Park ⋅ Munawar Hayat ⋅ Seokeon Choi ⋅ Sungrack Yun ⋅ Fatih Porikli
|
Exhibit Hall I #37 | |
|
Self-Calibrated Variance-Stabilizing Transformations for Real-World Image Denoising
Poster Session 3 & Exhibit Hall
Sébastien Herbreteau ⋅ Michael Unser
|
Exhibit Hall I #45 | |
|
Multi-modal Identity Extraction
Poster Session 3 & Exhibit Hall
Ryan Webster ⋅ Teddy Furon
|
Exhibit Hall I #73 | |
|
ViT-Linearizer: Distilling Quadratic Knowledge into Linear-Time Vision Models
Poster Session 5 & Exhibit Hall
Guoyizhe Wei ⋅ Rama Chellappa
|
Exhibit Hall I #89 | |
|
XTrack: Multimodal Training Boosts RGB-X Video Object Trackers
Poster Session 2 & Exhibit Hall with Coffee Break
Yuedong Tan ⋅ Zongwei Wu ⋅ Yuqian Fu ⋅ Zhuyun Zhou ⋅ Guolei Sun ⋅ Eduard Zamfir ⋅ Chao Ma ⋅ Danda Pani Paudel ⋅ Luc Gool ⋅ Radu Timofte
|
Exhibit Hall I #66 | |
|
FaceXFormer: A Unified Transformer for Facial Analysis
Poster Session 3 & Exhibit Hall
Kartik Narayan ⋅ Vibashan VS ⋅ Rama Chellappa ⋅ Vishal Patel
|
Exhibit Hall I #128 | |
|
Laboring on less labors: RPCA Paradigm for Pan-sharpening
Poster Session 3 & Exhibit Hall
honghui xu ⋅ Chuangjie Fang ⋅ Yibin Wang ⋅ Jie Wu ⋅ Jianwei Zheng
|
Exhibit Hall I #130 | |
|
Riemannian-Geometric Fingerprints of Generative Models
Hae Jin Song ⋅ Laurent Itti
|
Exhibit Hall I #133 | |
|
LDIP: Long Distance Information Propagation for Video Super-Resolution
Poster Session 3 & Exhibit Hall
Michael Bernasconi ⋅ Abdelaziz Djelouah ⋅ Yang Zhang ⋅ Markus Gross ⋅ Christopher Schroers
|
Exhibit Hall I #145 | |
|
Multi-identity Human Image Animation with Structural Video Diffusion
Poster Session 3 & Exhibit Hall
Zhenzhi Wang ⋅ Yixuan Li ⋅ yanhong zeng ⋅ Yuwei Guo ⋅ Dahua Lin ⋅ Tianfan Xue ⋅ Bo Dai
|
Exhibit Hall I #182 | |
|
FedDifRC: Unlocking the Potential of Text-to-Image Diffusion Models in Heterogeneous Federated Learning
Poster Session 1 & Exhibit Hall
Huan Wang ⋅ Haoran Li ⋅ Huaming Chen ⋅ Jun Yan ⋅ Jiahua Shi ⋅ Jun Shen
|
Exhibit Hall I #346 | |
|
LoRA-FAIR: Federated LoRA Fine-Tuning with Aggregation and Initialization Refinement
Poster Session 1 & Exhibit Hall
Jieming Bian ⋅ Lei Wang ⋅ Letian Zhang ⋅ Jie Xu
|
Exhibit Hall I #347 | |
|
Category-Specific Selective Feature Enhancement for Long-Tailed Multi-Label Image Classification
Poster Session 1 & Exhibit Hall
Ruiqi Du ⋅ Xu Tang ⋅ Xiangrong Zhang ⋅ Jingjing Ma
|
Exhibit Hall I #349 | |
|
Meta-Learning Dynamic Center Distance: Hard Sample Mining for Learning with Noisy Labels
Poster Session 1 & Exhibit Hall
Chenyu Mu ⋅ Yijun Qu ⋅ Jiexi Yan ⋅ Erkun Yang ⋅ Cheng Deng
|
Exhibit Hall I #29 | |
|
FreeMorph: Tuning-Free Generalized Image Morphing with Diffusion Model
Poster Session 4 & Exhibit Hall with Coffee Break
Yukang Cao ⋅ Chenyang Si ⋅ Jinghao Wang ⋅ Ziwei Liu
|
Exhibit Hall I #310 | |
|
Registration beyond Points: General Affine Subspace Alignment via Geodesic Distance on Grassmann Manifold
Jaeho Shin ⋅ Hyeonjae Gil ⋅ Junwoo Jang ⋅ Maani Ghaffari ⋅ Ayoung Kim
|
Exhibit Hall I #350 | |
|
DM-EFS: Dynamically Multiplexed Expanded Features Set Form for Robust and Efficient Small Object Detection
Poster Session 5 & Exhibit Hall
Aashish Sharma
|
Exhibit Hall I #447 | |
|
Inverse Image-Based Rendering for Light Field Generation from Single Images
Hyunjun Jung ⋅ Hae-Gon Jeon
|
Exhibit Hall I #2 | |
|
PossLoss: A Reliable and Sensitive Facial Landmark Detection Loss Function
Poster Session 6 & Exhibit Hall with Coffee Break
Qikui Zhu
|
Exhibit Hall I #13 | |
|
DAA*: Deep Angular A Star for Image-based Path Planning
Poster Session 6 & Exhibit Hall with Coffee Break
Zhiwei Xu
|
Exhibit Hall I #53 | |
|
Pseudo-SD: Pseudo Controlled Stable Diffusion for Semi-Supervised and Cross-Domain Semantic Segmentation
Poster Session 5 & Exhibit Hall
Dong Zhao ⋅ Qi Zang ⋅ Shuang Wang ⋅ Nicu Sebe ⋅ Zhun Zhong
|
Exhibit Hall I #244 | |
|
Frequency-Dynamic Attention Modulation For Dense Prediction
Poster Session 5 & Exhibit Hall
Linwei Chen ⋅ Lin Gu ⋅ Ying Fu
|
Exhibit Hall I #265 | |
|
Memory-Efficient 4-bit Preconditioned Stochastic Optimization
Poster Session 5 & Exhibit Hall
Jingyang Li ⋅ Kuangyu Ding ⋅ Kim-chuan Toh ⋅ Pan Zhou
|
Exhibit Hall I #266 | |
|
Hierarchical Cross-modal Prompt Learning for Vision-Language Models
Poster Session 1 & Exhibit Hall
Hao Zheng ⋅ Shunzhi Yang ⋅ Zhuoxin He ⋅ Jinfeng Yang ⋅ Zhenhua Huang
|
Exhibit Hall I #171 | |
|
Boosting Domain Generalized and Adaptive Detection with Diffusion Models: Fitness, Generalization, and Transferability
Boyong He ⋅ Yuxiang Ji ⋅ Zhuoyue Tan ⋅ Liaoni Wu
|
Exhibit Hall I #173 | |
|
ObjectRelator: Enabling Cross-View Object Relation Understanding Across Ego-Centric and Exo-Centric Perspectives
Yuqian Fu ⋅ Runze Wang ⋅ Bin Ren ⋅ Guolei Sun ⋅ Biao Gong ⋅ Yanwei Fu ⋅ Danda Pani Paudel ⋅ Xuanjing Huang ⋅ Luc Gool
|
Exhibit Hall I #141 | |
|
Target Bias Is All You Need: Zero-Shot Debiasing of Vision-Language Models with Bias Corpus
Poster Session 1 & Exhibit Hall
Taeuk Jang ⋅ Hoin Jung ⋅ Xiaoqian Wang
|
Exhibit Hall I #175 | |
|
Long-Context State-Space Video World Models
Poster Session 2 & Exhibit Hall with Coffee Break
Ryan Po ⋅ Yotam Nitzan ⋅ Richard Zhang ⋅ Berlin Chen ⋅ Tri Dao ⋅ Eli Shechtman ⋅ Gordon Wetzstein ⋅ Xun Huang
|
Exhibit Hall I #349 | |
|
PASG: A Closed-Loop Framework for Automated Geometric Primitive Extraction and Semantic Anchoring in Robotic Manipulation
Poster Session 2 & Exhibit Hall with Coffee Break
Zhihao ZHU ⋅ Yifan Zheng ⋅ Siyu Pan ⋅ Yaohui Jin ⋅ Yao Mu
|
Exhibit Hall I #369 | |
|
DiST-4D: Disentangled Spatiotemporal Diffusion with Metric Depth for 4D Driving Scene Generation
Poster Session 6 & Exhibit Hall with Coffee Break
Jiazhe Guo ⋅ Yikang Ding ⋅ Xiwu Chen ⋅ Shuo Chen ⋅ Bohan Li ⋅ Yingshuang Zou ⋅ Xiaoyang Lyu ⋅ Feiyang Tan ⋅ Xiaojuan Qi ⋅ Zhiheng Li ⋅ Hao Zhao
|
Exhibit Hall I #243 | |
|
Fine-grained Spatiotemporal Grounding on Egocentric Videos
Poster Session 2 & Exhibit Hall with Coffee Break
Shuo LIANG ⋅ Yiwu Zhong ⋅ Zi-Yuan Hu ⋅ Yeyao Tao ⋅ Liwei Wang
|
Exhibit Hall I #410 | |
|
VRBench: A Benchmark for Multi-Step Reasoning in Long Narrative Videos
Poster Session 5 & Exhibit Hall
Jiashuo Yu ⋅ Yue Wu ⋅ Meng Chu ⋅ Zhifei Ren ⋅ Zizheng Huang ⋅ Pei Chu ⋅ Ruijie Zhang ⋅ Yinan He ⋅ Qirui Li ⋅ Songze Li ⋅ Zhenxiang Li ⋅ Zhongying Tu ⋅ Conghui He ⋅ Yu Qiao ⋅ Yali Wang ⋅ Yi Wang ⋅ Limin Wang
|
Exhibit Hall I #174 | |
|
Improving SAM for Camouflaged Object Detection via Dual Stream Adapters
Poster Session 5 & Exhibit Hall
Jiaming Liu ⋅ Linghe Kong ⋅ Guihai Chen
|
Exhibit Hall I #197 | |
|
FedMeNF: Privacy-Preserving Federated Meta-Learning for Neural Fields
Poster Session 1 & Exhibit Hall
Junhyeog Yun ⋅ Minui Hong ⋅ Gunhee Kim
|
Exhibit Hall I #196 | |
|
Sparsity Outperforms Low-Rank Projections in Few-Shot Adaptation
Poster Session 1 & Exhibit Hall
Nairouz Mrabah ⋅ Nicolas Richet ⋅ Ismail Ayed ⋅ Eric Granger
|
Exhibit Hall I #290 | |
|
DWIM: Towards Tool-aware Visual Reasoning via Discrepancy-aware Workflow Generation & Instruct-Masking Tuning
Poster Session 1 & Exhibit Hall
Fucai Ke ⋅ Vijay Kumar b g ⋅ Xingjian Leng ⋅ Zhixi Cai ⋅ Zaid Khan ⋅ Weiqing Wang ⋅ Pari Delir Haghighi ⋅ Hamid Rezatofighi ⋅ Manmohan Chandraker
|
Exhibit Hall I #314 | |
|
GT-Mean Loss: A Simple Yet Effective Solution for Brightness Mismatch in Low-Light Image Enhancement
Poster Session 2 & Exhibit Hall with Coffee Break
Jingxi Liao ⋅ Shijie Hao ⋅ Richang Hong ⋅ Meng Wang
|
Exhibit Hall I #102 | |
|
Trust but Verify: Programmatic VLM Evaluation in the Wild
Poster Session 1 & Exhibit Hall
Viraj Prabhu ⋅ Senthil Purushwalkam ⋅ An Yan ⋅ Caiming Xiong ⋅ Ran Xu
|
Exhibit Hall I #301 | |
|
MemDistill: Distilling LiDAR Knowledge into Memory for Camera-Only 3D Object Detection
Poster Session 2 & Exhibit Hall with Coffee Break
Donghyeon Kwon ⋅ Youngseok Yoon ⋅ Hyeongseok Son ⋅ Suha Kwak
|
Exhibit Hall I #170 | |
|
HRScene: How Far Are VLMs from Effective High-Resolution Image Understanding?
Poster Session 5 & Exhibit Hall
Yusen Zhang ⋅ Wenliang Zheng ⋅ Aashrith Madasu ⋅ Peng Shi ⋅ Ryo Kamoi ⋅ Hao Zhou ⋅ Zhuoyang Zou ⋅ Shu Zhao ⋅ Sarkar Snigdha Sarathi Das ⋅ Vipul Gupta ⋅ Xiaoxin Lu ⋅ Nan Zhang ⋅ Ranran Zhang ⋅ Avitej Iyer ⋅ Renze Lou ⋅ Wenpeng Yin ⋅ Rui Zhang
|
Exhibit Hall I #293 | |
|
DrivingGPT: Unifying Driving World Modeling and Planning with Multi-modal Autoregressive Transformers
Poster Session 6 & Exhibit Hall with Coffee Break
Yuntao Chen ⋅ Yuqi Wang ⋅ Zhaoxiang Zhang
|
Exhibit Hall I #209 | |
|
GenieBlue: Integrating both Linguistic and Multimodal Capabilities for Large Language Models on Mobile Devices
Poster Session 1 & Exhibit Hall
Xudong LU ⋅ Yinghao Chen ⋅ Renshou Wu ⋅ Haohao Gao ⋅ Xi Chen ⋅ Xue Yang ⋅ Xiangyu Zhao ⋅ Aojun Zhou ⋅ Fangyuan Li ⋅ Yafei Wen ⋅ Xiaoxin Chen ⋅ shuai ren ⋅ Hongsheng Li
|
Exhibit Hall I #393 | |
|
Task-Aware Prompt Gradient Projection for Parameter-Efficient Tuning Federated Class-Incremental Learning
Poster Session 1 & Exhibit Hall
Hualong Ke ⋅ Yachao Zhang ⋅ Jiangming Shi ⋅ FangyongWang FangyongWang ⋅ Yuan Xie ⋅ Yanyun Qu
|
Exhibit Hall I #242 | |
|
Parameter-Efficient Adaptation of Geospatial Foundation Models through Embedding Deflection
Poster Session 2 & Exhibit Hall with Coffee Break
Romain Thoreau ⋅ Valerio Marsocci ⋅ Dawa Derksen
|
Exhibit Hall I #429 | |
|
Multimodal LLM Guided Exploration and Active Mapping using Fisher Information
Poster Session 2 & Exhibit Hall with Coffee Break
Wen Jiang ⋅ BOSHU LEI ⋅ Katrina Ashton ⋅ Kostas Daniilidis
|
Exhibit Hall I #35 | |
|
Orchid: Image Latent Diffusion for Joint Appearance and Geometry Generation
Poster Session 6 & Exhibit Hall with Coffee Break
Akshay Krishnan ⋅ Xinchen Yan ⋅ Vincent Casser ⋅ Abhijit Kundu
|
Exhibit Hall I #337 | |
|
CLIPSym: Delving into Symmetry Detection with CLIP
Poster Session 5 & Exhibit Hall
Tinghan Yang ⋅ Md Ashiqur Rahman ⋅ Raymond A. Yeh
|
Exhibit Hall I #113 | |
|
UDC-VIT: A Real-World Video Dataset for Under-Display Cameras
Kyusu Ahn ⋅ JiSoo Kim ⋅ Sangik Lee ⋅ HyunGyu Lee ⋅ Byeonghyun Ko ⋅ Chanwoo Park ⋅ Jaejin Lee
|
Exhibit Hall I #89 | |
|
ScenePainter: Semantically Consistent Perpetual 3D Scene Generation with Concept Relation Alignment
Poster Session 6 & Exhibit Hall with Coffee Break
Chong Xia ⋅ Shengjun Zhang ⋅ Fangfu Liu ⋅ Chang Liu ⋅ Khodchaphun Hirunyaratsameewong ⋅ Yueqi Duan
|
Exhibit Hall I #394 | |
|
Temperature in Cosine-based Softmax Loss
Poster Session 5 & Exhibit Hall
Takumi Kobayashi
|
Exhibit Hall I #224 | |
|
SceneSplat: Gaussian Splatting-based Scene Understanding with Vision-Language Pretraining
Poster Session 2 & Exhibit Hall with Coffee Break
Yue Li ⋅ Qi Ma ⋅ Runyi Yang ⋅ Huapeng Li ⋅ Mengjiao Ma ⋅ Bin Ren ⋅ Nikola Popovic ⋅ Nicu Sebe ⋅ Ender Konukoglu ⋅ Theo Gevers ⋅ Luc Gool ⋅ Martin R. Oswald ⋅ Danda Pani Paudel
|
Exhibit Hall I #146 | |
|
Understanding Museum Exhibits using Vision-Language Reasoning
Poster Session 1 & Exhibit Hall
Ada-Astrid Balauca ⋅ Sanjana Garai ⋅ Stefan Balauca ⋅ Rasesh Shetty ⋅ Naitik Agrawal ⋅ Dhwanil Shah ⋅ Yuqian Fu ⋅ Xi Wang ⋅ Kristina Toutanova ⋅ Danda Pani Paudel ⋅ Luc Gool
|
Exhibit Hall I #202 | |
|
Neural Solver of Dichromatic Reflection Model for Specular Highlight Removal
Poster Session 2 & Exhibit Hall with Coffee Break
Gang Fu
|
Exhibit Hall I #208 | |
|
Correspondence-Free Fast and Robust Spherical Point Pattern Registration
Poster Session 6 & Exhibit Hall with Coffee Break
Anik Sarker ⋅ Alan Asbeck
|
Exhibit Hall I #331 | |
|
SILO: Solving Inverse Problems with Latent Operators
Poster Session 3 & Exhibit Hall
Ron Raphaeli ⋅ Sean Man ⋅ Michael Elad
|
Exhibit Hall I #52 | |
|
Geminio: Language-Guided Gradient Inversion Attacks in Federated Learning
Poster Session 1 & Exhibit Hall
Junjie Shan ⋅ Ziqi Zhao ⋅ Jialin Lu ⋅ Rui Zhang ⋅ SM Yiu ⋅ Ka-Ho Chow
|
Exhibit Hall I #250 | |
|
SD2Actor: Continuous State Decomposition via Diffusion Embeddings for Robotic Manipulation
Poster Session 3 & Exhibit Hall
lijiayi jiayi
|
Exhibit Hall I #352 | |
|
Imbalance in Balance: Online Concept Balancing in Generation Models
Poster Session 4 & Exhibit Hall with Coffee Break
Yukai Shi ⋅ Jiarong Ou ⋅ Rui Chen ⋅ Haotian Yang ⋅ Jiahao Wang ⋅ Xin Tao ⋅ Pengfei Wan ⋅ Di ZHANG ⋅ Kun Gai
|
Exhibit Hall I #244 | |
|
Progressive Distribution Bridging: Unsupervised Adaptation for Large-scale Pre-trained Models via Adaptive Auxiliary Data
Poster Session 1 & Exhibit Hall
Weinan He ⋅ Yixin Zhang ⋅ Zilei Wang
|
Exhibit Hall I #305 | |
|
Efficient Concertormer for Image Deblurring and Beyond
Poster Session 3 & Exhibit Hall
Pin-Hung Kuo ⋅ Jinshan Pan ⋅ Shao-Yi Chien ⋅ Ming-Hsuan Yang
|
Exhibit Hall I #439 | |
|
TryOn-Refiner: Conditional Rectified-flow-based TryOn Refiner for More Accurate Detail Reconstruction
Poster Session 4 & Exhibit Hall with Coffee Break
Wen Qian
|
Exhibit Hall I #73 | |
|
ASCENT: Annotation-free Self-supervised Contrastive Embeddings for 3D Neuron Tracking in Fluorescence Microscopy
Poster Session 3 & Exhibit Hall
Haejun Han ⋅ Hang Lu
|
Exhibit Hall I #440 | |
|
IntroStyle: Training-Free Introspective Style Attribution using Diffusion Features
Poster Session 4 & Exhibit Hall with Coffee Break
Anand Kumar ⋅ Jiteng Mu ⋅ Nuno Vasconcelos
|
Exhibit Hall I #2 | |
|
From Sharp to Blur: Unsupervised Domain Adaptation for 2D Human Pose Estimation Under Extreme Motion Blur Using Event Cameras
Poster Session 2 & Exhibit Hall with Coffee Break
Youngho Kim ⋅ Hoonhee Cho ⋅ Kuk-Jin Yoon
|
Exhibit Hall I #412 | |
|
One Object, Multiple Lies: A Benchmark for Cross-task Adversarial Attack on Unified Vision-Language Models
Poster Session 1 & Exhibit Hall
Jiale Zhao ⋅ XINYANG JIANG ⋅ Junyao Gao ⋅ Yuhao Xue ⋅ Cairong Zhao
|
Exhibit Hall I #8 | |
|
ObjectMate: A Recurrence Prior for Object Insertion and Subject-Driven Generation
Daniel Winter ⋅ Asaf Shul ⋅ Matan Cohen ⋅ Dana Berman ⋅ Yael Pritch ⋅ Alex Rav-Acha ⋅ Yedid Hoshen
|
Exhibit Hall I #132 | |
|
LLM Thought Divergence and Convergence for Dialogue-Based Image Generation Control
Poster Session 4 & Exhibit Hall with Coffee Break
Hui Li
|
Exhibit Hall I #309 | |
|
RALoc: Enhancing Outdoor LiDAR Localization via Rotation Awareness
Yuyang Yang ⋅ Wen Li ⋅ Sheng Ao ⋅ Qingshan Xu ⋅ Shangshu Yu ⋅ guo yu ⋅ Yin Zhou ⋅ Siqi Shen ⋅ Cheng Wang
|
Exhibit Hall I #307 | |
|
Soft Separation and Distillation: Toward Global Uniformity in Federated Unsupervised Learning
Poster Session 1 & Exhibit Hall
Hung-Chieh Fang ⋅ Hsuan-Tien Lin ⋅ Irwin King ⋅ Yifei Zhang
|
Exhibit Hall I #274 | |
|
HOLa: Zero-Shot HOI Detection with Low-Rank Decomposed VLM Feature Adaptation
Poster Session 1 & Exhibit Hall
Qinqian Lei ⋅ Bo Wang ⋅ Robby Tan
|
Exhibit Hall I #165 | |
|
Instruction-Grounded Visual Projectors for Continual Learning of Generative Vision-Language Models
Poster Session 1 & Exhibit Hall
Hyundong Jin ⋅ Hyung Jin Chang ⋅ Eunwoo Kim
|
Exhibit Hall I #322 | |
|
Spatial Preference Rewarding for MLLMs Spatial Understanding
Poster Session 1 & Exhibit Hall
Han Qiu ⋅ Peng Gao ⋅ Lewei Lu ⋅ Xiaoqin Zhang ⋅ Ling Shao ⋅ Shijian Lu
|
Exhibit Hall I #58 | |
|
Generative Zoo
Poster Session 2 & Exhibit Hall with Coffee Break
Tomasz Niewiadomski ⋅ Anastasios Yiannakidis ⋅ Hanz Cuevas Velasquez ⋅ Soubhik Sanyal ⋅ Michael Black ⋅ Silvia Zuffi ⋅ Peter Kulits
|
Exhibit Hall I #327 | |
|
Towards Adversarial Robustness via Debiased High-Confidence Logit Alignment
Poster Session 1 & Exhibit Hall
Kejia Zhang ⋅ Juanjuan Weng ⋅ Zhiming Luo ⋅ Shaozi Li
|
Exhibit Hall I #256 | |
|
Learning Null Geodesics for Gravitational Lensing Rendering in General Relativity
Poster Session 6 & Exhibit Hall with Coffee Break
Mingyuan Sun ⋅ Zheng Fang ⋅ Jiaxu Wang ⋅ Kun-Yi Zhang ⋅ Qiang Zhang ⋅ Renjing Xu
|
Exhibit Hall I #363 | |
|
St4RTrack: Simultaneous 4D Reconstruction and Tracking in the World
Poster Session 2 & Exhibit Hall with Coffee Break
Haiwen Feng ⋅ Junyi Zhang ⋅ Qianqian Wang ⋅ Yufei Ye ⋅ Pengcheng Yu ⋅ Michael Black ⋅ Trevor Darrell ⋅ Angjoo Kanazawa
|
Exhibit Hall I #328 | |
|
Describe Anything: Detailed Localized Image and Video Captioning
Poster Session 5 & Exhibit Hall
Long Lian ⋅ Yifan Ding ⋅ Yunhao Ge ⋅ Sifei Liu ⋅ Hanzi Mao ⋅ Boyi Li ⋅ Marco Pavone ⋅ Ming-Yu Liu ⋅ Trevor Darrell ⋅ Adam Yala ⋅ Yin Cui
|
Exhibit Hall I #184 | |
|
Stable-Sim2Real: Exploring Simulation of Real-Captured 3D Data with Two-Stage Depth Diffusion
Mutian Xu ⋅ Chongjie Ye ⋅ Haolin Liu ⋅ Yushuang Wu ⋅ Jiahao Chang ⋅ Xiaoguang Han
|
Exhibit Hall I #240 | |
|
Multi-Modal Multi-Task Unified Embedding Model (M3T-UEM): A Task-Adaptive Representation Learning Framework
Poster Session 5 & Exhibit Hall
Rohan Sharma ⋅ Changyou Chen ⋅ Feng-Ju Chang ⋅ Seongjun Yun ⋅ Xiaohu Xie ⋅ Rui Meng ⋅ Dehong Xu ⋅ Alejandro Mottini ⋅ qingjun cui
|
Exhibit Hall I #280 | |
|
AHCPTQ: Accurate and Hardware-Compatible Post-Training Quantization for Segment Anything Model
Poster Session 5 & Exhibit Hall
Wenlun Zhang ⋅ Yunshan Zhong ⋅ Shimpei Ando ⋅ Kentaro Yoshioka
|
Exhibit Hall I #243 | |
|
MVTrajecter: Multi-View Pedestrian Tracking with Trajectory Motion Cost and Trajectory Appearance Cost
Poster Session 3 & Exhibit Hall
Taiga Yamane ⋅ Ryo Masumura ⋅ Satoshi Suzuki ⋅ Shota Orihashi
|
Exhibit Hall I #309 | |
|
InvRGB+L: Inverse Rendering of Complex Scenes with Unified Color and LiDAR Reflectance Modeling
Poster Session 6 & Exhibit Hall with Coffee Break
Xiaoxue Chen ⋅ Bhargav Chandaka ⋅ Chih-Hao Lin ⋅ Ya-Qin Zhang ⋅ David Forsyth ⋅ Hao Zhao ⋅ Shenlong Wang
|
Exhibit Hall I #238 | |
|
Is Visual in-Context Learning for Compositional Medical Tasks within Reach?
Poster Session 1 & Exhibit Hall
Simon Reiß ⋅ Zdravko Marinov ⋅ Alexander Jaus ⋅ Constantin Seibold ⋅ M. Sarfraz ⋅ Erik Rodner ⋅ Rainer Stiefelhagen
|
Exhibit Hall I #243 | |
|
Moto: Latent Motion Token as the Bridging Language for Learning Robot Manipulation from Videos
Poster Session 5 & Exhibit Hall
Yi Chen ⋅ Yuying Ge ⋅ Weiliang Tang ⋅ Yizhuo Li ⋅ Yixiao Ge ⋅ Mingyu Ding ⋅ Ying Shan ⋅ Xihui Liu
|
Exhibit Hall I #229 | |
|
Effective Training Data Synthesis for Improving MLLM Chart Understanding
Poster Session 1 & Exhibit Hall
Yuwei Yang ⋅ Zeyu Zhang ⋅ Yunzhong Hou ⋅ Zhuowan Li ⋅ Gaowen Liu ⋅ Ali Payani ⋅ Yuan-Sen Ting ⋅ Liang Zheng
|
Exhibit Hall I #244 | |
|
SpinMeRound: Consistent Multi-View Identity Generation Using Diffusion Models
Poster Session 3 & Exhibit Hall
Stathis Galanakis ⋅ Alexandros Lattas ⋅ Stylianos Moschoglou ⋅ Bernhard Kainz ⋅ Stefanos Zafeiriou
|
Exhibit Hall I #409 | |
|
Heuristic-Induced Multimodal Risk Distribution Jailbreak Attack for Multimodal Large Language Models
Poster Session 1 & Exhibit Hall
Ma Teng ⋅ Xiaojun Jia ⋅ Ranjie Duan ⋅ Xinfeng Li ⋅ Yihao Huang ⋅ Xiaoshuang Jia ⋅ Zhixuan Chu ⋅ Wenqi Ren
|
Exhibit Hall I #247 | |
|
Improving Rectified Flow with Boundary Conditions
Poster Session 4 & Exhibit Hall with Coffee Break
Xixi Hu ⋅ Runlong Liao ⋅ Bo Liu ⋅ Keyang Xu ⋅ Yeqing Li ⋅ Eugene Ie ⋅ Hongliang Fei ⋅ qiang liu
|
Exhibit Hall I #316 | |
|
Active Learning Meets Foundation Models: Fast Remote Sensing Data Annotation for Object Detection
Poster Session 2 & Exhibit Hall with Coffee Break
Marvin Burges ⋅ Philipe Dias ⋅ Dalton Lunga ⋅ Carson Woody ⋅ Sarah Walters
|
Exhibit Hall I #97 | |
|
Safeguarding Vision-Language Models: Mitigating Vulnerabilities to Gaussian Noise in Perturbation-based Attacks
Poster Session 1 & Exhibit Hall
Jiawei Wang ⋅ Yushen Zuo ⋅ Yuanjun Chai ⋅ Zhendong Liu ⋅ Yicheng Fu ⋅ Yichun Feng ⋅ Kin Man Lam
|
Exhibit Hall I #255 | |
|
Optimal Transport for Brain-Image Alignment: Unveiling Redundancy and Synergy in Neural Information Processing
Poster Session 5 & Exhibit Hall
Yang Xiao ⋅ Wang Lu ⋅ Jie Ji ⋅ Ruimeng Ye ⋅ Li ⋅ Xiaolong Ma ⋅ Bo Hui
|
Exhibit Hall I #60 | |
|
TrafficLoc: Localizing Traffic Surveillance Cameras in 3D Scenes
Poster Session 6 & Exhibit Hall with Coffee Break
Yan Xia ⋅ Yunxiang Lu ⋅ Rui Song ⋅ Oussema Dhaouadi ⋅ Joao F. Henriques ⋅ Daniel Cremers
|
Exhibit Hall I #383 | |
|
Intervening in Black Box: Concept Bottleneck Model for Enhancing Human Neural Network Mutual Understanding
Poster Session 1 & Exhibit Hall
Nuoye Xiong ⋅ Anqi Dong ⋅ Ning Wang ⋅ Cong Hua ⋅ Guangming Zhu ⋅ Lin Mei ⋅ peiyi shen ⋅ zhang liang
|
Exhibit Hall I #261 | |
|
Resolving Token-Space Gradient Conflicts: Token Space Manipulation for Transformer-Based Multi-Task Learning
Poster Session 1 & Exhibit Hall
Wooseong Jeong ⋅ Kuk-Jin Yoon
|
Exhibit Hall I #266 | |
|
DisCoPatch: Taming Adversarially-driven Batch Statistics for Improved Out-of-Distribution Detection
Poster Session 1 & Exhibit Hall
Francisco Caetano ⋅ Christiaan Viviers ⋅ Luis Zavala-Mondragón ⋅ Peter H.N. De With ⋅ Fons van der Sommen
|
Exhibit Hall I #267 | |
|
Scaling and Taming Adversarial Training with Synthetic Data
Poster Session 1 & Exhibit Hall
Juntao Wu ⋅ Xianting Huang ⋅ Yu Chen ⋅ Shuai Pang ⋅ Ke Wang
|
Exhibit Hall I #272 | |
|
DOLLAR: Few-Step Video Generation via Distillation and Latent Reward Optimization
Poster Session 4 & Exhibit Hall with Coffee Break
Zihan Ding ⋅ Chi Jin ⋅ Difan Liu ⋅ Haitian Zheng ⋅ Krishna Kumar Singh ⋅ Qiang Zhang ⋅ Yan Kang ⋅ Zhe Lin ⋅ Yuchen Liu
|
Exhibit Hall I #294 | |
|
Generative Adversarial Diffusion
Poster Session 4 & Exhibit Hall with Coffee Break
U-Chae Jun ⋅ Jaeeun Ko ⋅ Jiwoo Kang
|
Exhibit Hall I #182 | |
|
Music Grounding by Short Video
Poster Session 5 & Exhibit Hall
Zijie Xin ⋅ Minquan Wang ⋅ Jingyu Liu ⋅ Quan Chen ⋅ Ye Ma ⋅ Peng Jiang ⋅ Xirong Li
|
Exhibit Hall I #234 | |
|
Fewer Denoising Steps or Cheaper Per-Step Inference: Towards Compute-Optimal Diffusion Model Deployment
Poster Session 1 & Exhibit Hall
Zhenbang Du ⋅ Yonggan Fu ⋅ Lifu Wang ⋅ Jiayi Qian ⋅ Xiao Luo ⋅ Yingyan Celine Lin
|
Exhibit Hall I #277 | |
|
Your Text Encoder Can Be An Object-Level Watermarking Controller
Poster Session 4 & Exhibit Hall with Coffee Break
Naresh Kumar Devulapally ⋅ Mingzhen Huang ⋅ Vishal Asnani ⋅ Shruti Agarwal ⋅ Siwei Lyu ⋅ Vishnu Lokhande
|
Exhibit Hall I #162 | |
|
Enhanced Event-based Dense Stereo via Cross-Sensor Knowledge Distillation
Poster Session 2 & Exhibit Hall with Coffee Break
Haihao Zhang ⋅ Yunjian Zhang ⋅ Jianing Li ⋅ Lin Zhu ⋅ Meng Lv ⋅ Yao Zhu ⋅ Yanwei Liu ⋅ Xiangyang Ji
|
Exhibit Hall I #39 | |
|
PlugMark: A Plug-in Zero-Watermarking Framework for Diffusion Models
Poster Session 4 & Exhibit Hall with Coffee Break
Pengzhen Chen ⋅ Yanwei Liu ⋅ Xiaoyan Gu ⋅ Enci Liu ⋅ Zhuoyi Shang ⋅ Xiangyang Ji ⋅ Wu Liu
|
Exhibit Hall I #234 | |
|
GauUpdate: New Object Insertion in 3D Gaussian Fields with Consistent Global Illumination
Poster Session 6 & Exhibit Hall with Coffee Break
Chengwei REN ⋅ Fan Zhang ⋅ Liangchao Xu ⋅ Liang Pan ⋅ Ziwei Liu ⋅ Wenping Wang ⋅ Xiao-Ping Zhang ⋅ Yuan Liu
|
Exhibit Hall I #380 | |
|
Diff2I2P: Differentiable Image-to-Point Cloud Registration with Diffusion Prior
Poster Session 6 & Exhibit Hall with Coffee Break
Juncheng Mu ⋅ Chengwei REN ⋅ Weixiang Zhang ⋅ Liang Pan ⋅ Xiao-Ping Zhang ⋅ Yue Gao
|
Exhibit Hall I #101 | |
|
EgoMusic-driven Human Dance Motion Estimation with Skeleton Mamba
Poster Session 3 & Exhibit Hall
Quang Nguyen ⋅ Nhat Le ⋅ Baoru Huang ⋅ Minh VU ⋅ Chengcheng Tang ⋅ Van Nguyen ⋅ Ngan Le ⋅ Thieu Vo ⋅ Anh Nguyen
|
Exhibit Hall I #190 | |
|
CT-ScanGaze: A Dataset and Baselines for 3D Volumetric Scanpath Modeling
Trong-Thang Pham ⋅ AKASH AWASTHI ⋅ Saba Khan ⋅ Esteban Marti ⋅ Tien-Phat Nguyen ⋅ Khoa Vo ⋅ Minh Tran ⋅ Ngoc Son Nguyen ⋅ Cuong Van ⋅ Yuki Ikebe ⋅ Anh Nguyen ⋅ Anh Nguyen ⋅ Zhigang Deng ⋅ Carol Wu ⋅ Hien Nguyen ⋅ Ngan Le
|
Exhibit Hall I #181 | |
|
Not Only Vision: Evolve Visual Speech Recognition via Peripheral Information
Poster Session 1 & Exhibit Hall
Zhaoxin Yuan ⋅ Shuang Yang ⋅ Shiguang Shan ⋅ Xilin Chen
|
Exhibit Hall I #285 | |
|
KOEnsAttack: Towards Efficient Data-Free Black-Box Adversarial Attacks via Knowledge-Orthogonalized Substitute Ensembles
Poster Session 1 & Exhibit Hall
Chaoyong Yang ⋅ Jia-Li Yin ⋅ Bin Chen ⋅ Zhaozhe Hu ⋅ Xiaolei Liu ⋅ Wei Lin
|
Exhibit Hall I #286 | |
|
CLIP-Adapted Region-to-Text Learning for Generative Open-Vocabulary Semantic Segmentation
Poster Session 5 & Exhibit Hall
Jiannan Ge ⋅ Lingxi Xie ⋅ Hongtao Xie ⋅ Pandeng Li ⋅ Sun-Ao Liu ⋅ XIAOPENG ZHANG ⋅ Qi Tian ⋅ Yongdong Zhang
|
Exhibit Hall I #397 | |
|
LV-MAE: Learning Long Video Representations through Masked-Embedding Autoencoders
Poster Session 5 & Exhibit Hall
Ilan Naiman ⋅ Emanuel Baruch Baruch ⋅ Oron Anschel ⋅ Alon Shoshan ⋅ Igor Kviatkovsky ⋅ Manoj Aggarwal ⋅ Gerard Medioni
|
Exhibit Hall I #148 | |
|
PanSt3R: Multi-view Consistent Panoptic Segmentation
Poster Session 2 & Exhibit Hall with Coffee Break
Lojze Zust ⋅ Yohann Cabon ⋅ Juliette Marrie ⋅ Leonid Antsfeld ⋅ Boris Chidlovskii ⋅ Jerome Revaud ⋅ Gabriela Csurka
|
Exhibit Hall I #79 | |
|
Guiding Diffusion-Based Articulated Object Generation by Partial Point Cloud Alignment and Physical Plausibility Constraints
Jens U. Kreber ⋅ Joerg Stueckler
|
Exhibit Hall I #296 | |
|
GARF: Learning Generalizable 3D Reassembly for Real-World Fractures
Poster Session 2 & Exhibit Hall with Coffee Break
Sihang Li ⋅ Zeyu Jiang ⋅ Grace Chen ⋅ Chenyang Xu ⋅ Siqi Tan ⋅ Xue Wang ⋅ Irving Fang ⋅ Kristof Zyskowski ⋅ Shannon McPherron ⋅ Radu Iovita ⋅ Chen Feng ⋅ Jing Zhang
|
Exhibit Hall I #64 | |
|
PhysSplat: Efficient Physics Simulation for 3D Scenes via MLLM-Guided Gaussian Splatting
Poster Session 2 & Exhibit Hall with Coffee Break
Haoyu Zhao ⋅ Hao Wang ⋅ Xingyue Zhao ⋅ Hao Fei ⋅ Hongqiu Wang ⋅ Chengjiang Long ⋅ Hua Zou
|
Exhibit Hall I #21 | |
|
Derm1M: A Million-scale Vision-Language Dataset Aligned with Clinical Ontology Knowledge for Dermatology
Siyuan Yan ⋅ Ming Hu ⋅ Yiwen Jiang ⋅ Xieji Li ⋅ Hao Fei ⋅ Philipp Tschandl ⋅ Harald Kittler ⋅ Zongyuan Ge
|
Exhibit Hall I #252 | |
|
Where, What, Why: Towards Explainable Driver Attention Prediction
Yuchen Zhou ⋅ Jiayu Tang ⋅ Xiaoyan Xiao ⋅ Yueyao Lin ⋅ Linkai Liu ⋅ Zipeng Guo ⋅ Hao Fei ⋅ Xiaobo Xia ⋅ Chao Gou
|
Exhibit Hall I #246 | |
|
TRNAS: A Training-Free Robust Neural Architecture Search
Poster Session 1 & Exhibit Hall
Yeming Yang ⋅ Qingling Zhu ⋅ Jianping Luo ⋅ Ka-Chun Wong ⋅ Qiuzhen Lin ⋅ Jianqiang Li
|
Exhibit Hall I #212 | |
|
Dynamic Multimodal Prototype Learning in Vision-Language Models
Poster Session 1 & Exhibit Hall
Xingyu Zhu ⋅ Shuo Wang ⋅ Beier Zhu ⋅ Miaoge Li ⋅ Yunfan Li ⋅ Junfeng Fang ⋅ Zhicai Wang ⋅ Dongsheng Wang ⋅ Hanwang Zhang
|
Exhibit Hall I #230 | |
|
CAP: Evaluation of Persuasive and Creative Image Generation
Poster Session 4 & Exhibit Hall with Coffee Break
Aysan Aghazadeh ⋅ Adriana Kovashka
|
Exhibit Hall I #199 | |
|
SummDiff: Generative Modeling of Video Summarization with Diffusion
Kwanseok Kim ⋅ Jaehoon Hahm ⋅ Sumin Kim ⋅ Jinhwan Sul ⋅ Byung-Hak Kim ⋅ Joonseok Lee
|
Exhibit Hall I #20 | |
|
PersonalVideo: High ID-Fidelity Video Customization without Dynamic and Semantic Degradation
Poster Session 4 & Exhibit Hall with Coffee Break
Hengjia Li ⋅ Haonan Qiu ⋅ Shiwei Zhang ⋅ Xiang Wang ⋅ Yujie Wei ⋅ Zekun Li ⋅ Yingya Zhang ⋅ Boxi Wu ⋅ Deng Cai
|
Exhibit Hall I #433 | |
|
StreamGS: Online Generalizable Gaussian Splatting Reconstruction for Unposed Image Streams
Poster Session 6 & Exhibit Hall with Coffee Break
Yang LI ⋅ Jinglu Wang ⋅ Lei Chu ⋅ Xiao Li ⋅ Shiu-hong Kao ⋅ Ying-Cong Chen ⋅ Yan Lu
|
Exhibit Hall I #107 | |
|
Unleashing Vecset Diffusion Model for Fast Shape Generation
Zeqiang Lai ⋅ Zhao Yunfei ⋅ Zibo Zhao ⋅ Haolin Liu ⋅ Fu-Yun Wang ⋅ Huiwen Shi ⋅ Xianghui Yang ⋅ Qingxiang Lin ⋅ Jingwei Huang ⋅ Lliu Yuhong ⋅ Jie Jiang ⋅ Chunchao Guo ⋅ Xiangyu Yue
|
Exhibit Hall I #232 | |
|
Auto-Regressively Generating Multi-View Consistent Images
Poster Session 1 & Exhibit Hall
JiaKui Hu ⋅ Yuxiao Yang ⋅ Jialun Liu ⋅ Jinbo Wu ⋅ Chen Zhao ⋅ Yanye Lu
|
Exhibit Hall I #235 | |
|
MagicID: Hybrid Preference Optimization for ID-Consistent and Dynamic-Preserved Video Customization
Poster Session 3 & Exhibit Hall
Hengjia Li ⋅ Lifan Jiang ⋅ Xi Xiao ⋅ Tianyang Wang ⋅ Hongwei Yi ⋅ Boxi Wu ⋅ Deng Cai
|
Exhibit Hall I #257 | |
|
CA-I2P: Channel-Adaptive Registration Network with Global Optimal Selection
Poster Session 6 & Exhibit Hall with Coffee Break
Zhixin Cheng ⋅ Jiacheng Deng ⋅ Xinjun Li ⋅ Xiaotian Yin ⋅ Bohao Liao ⋅ Baoqun Yin ⋅ Wenfei Yang ⋅ Tianzhu Zhang
|
Exhibit Hall I #292 | |
|
Towards Performance Consistency in Multi-Level Model Collaboration
Poster Session 1 & Exhibit Hall
Qi Li ⋅ Runpeng Yu ⋅ Xinchao Wang
|
Exhibit Hall I #236 | |
|
Visual Interestingness Decoded: How GPT-4o Mirrors Human Interests
Poster Session 4 & Exhibit Hall with Coffee Break
Fitim Abdullahu ⋅ Helmut Grabner
|
Exhibit Hall I #43 | |
|
FuXi-RTM: A Physics-Guided Prediction Framework with Radiative Transfer Modeling
qiusheng huang ⋅ Xiaohui Zhong ⋅ Xu Fan ⋅ Hao Li
|
Exhibit Hall I #360 | |
|
DMQ: Dissecting Outliers of Diffusion Models for Post-Training Quantization
Poster Session 4 & Exhibit Hall with Coffee Break
Dongyeun Lee ⋅ jiwan hur ⋅ Hyounguk Shon ⋅ Jae Young Lee ⋅ Junmo Kim
|
Exhibit Hall I #347 | |
|
Democratizing Text-to-Image Masked Generative Models with Compact Text-Aware One-Dimensional Tokens
Poster Session 4 & Exhibit Hall with Coffee Break
Dongwon Kim ⋅ Ju He ⋅ Qihang Yu ⋅ Chenglin Yang ⋅ Xiaohui Shen ⋅ Suha Kwak ⋅ Liang-Chieh (Jay) Chen
|
Exhibit Hall I #341 | |
|
Understanding Personal Concept in Open-Vocabulary Semantic Segmentation
Poster Session 5 & Exhibit Hall
Sunghyun Park ⋅ Jungsoo Lee ⋅ Shubhankar Borse ⋅ Munawar Hayat ⋅ Sungha Choi ⋅ Kyuwoong Hwang ⋅ Fatih Porikli
|
Exhibit Hall I #15 | |
|
DuoLoRA : Cycle-consistent and Rank-disentangled Content-Style Personalization
Poster Session 4 & Exhibit Hall with Coffee Break
Aniket Roy ⋅ Shubhankar Borse ⋅ Shreya Kadambi ⋅ Debasmit Das ⋅ Shweta Mahajan ⋅ Risheek Garrepalli ⋅ Hyojin Park ⋅ Ankita Nayak ⋅ Rama Chellappa ⋅ Munawar Hayat ⋅ Fatih Porikli
|
Exhibit Hall I #47 | |
|
ObjectGS: Object-aware Scene Reconstruction and Scene Understanding via Gaussian Splatting
Poster Session 2 & Exhibit Hall with Coffee Break
Ruijie Zhu ⋅ Mulin Yu ⋅ Linning Xu ⋅ Lihan Jiang ⋅ Yixuan Li ⋅ Tianzhu Zhang ⋅ Jiangmiao Pang ⋅ Bo Dai
|
Exhibit Hall I #314 | |
|
Video-T1: Test-time Scaling for Video Generation
Poster Session 4 & Exhibit Hall with Coffee Break
Fangfu Liu ⋅ Hanyang Wang ⋅ Yimo Cai ⋅ Kaiyan Zhang ⋅ Xiaohang Zhan ⋅ Yueqi Duan
|
Exhibit Hall I #362 | |
|
Reflect-DiT: Inference-Time Scaling for Text-to-Image Diffusion Transformers via In-Context Reflection
Poster Session 4 & Exhibit Hall with Coffee Break
Shufan Li ⋅ Konstantinos Kallidromitis ⋅ Akash Gokul ⋅ Arsh Koneru ⋅ Yusuke Kato ⋅ Kazuki Kozuka ⋅ Aditya Grover
|
Exhibit Hall I #72 | |
|
Discovering Divergent Representations between Text-to-Image Models
Poster Session 4 & Exhibit Hall with Coffee Break
Lisa Dunlap ⋅ Trevor Darrell ⋅ Joseph Gonzalez ⋅ Fabian Caba Heilbron ⋅ Josef Sivic ⋅ Bryan Russell
|
Exhibit Hall I #252 | |
|
VRM: Knowledge Distillation via Virtual Relation Matching
Weijia Zhang ⋅ Fei Xie ⋅ Weidong Cai ⋅ Chao Ma
|
Exhibit Hall I #249 | |
|
SKALD: Learning-Based Shot Assembly for Coherent Multi-Shot Video Creation
Poster Session 4 & Exhibit Hall with Coffee Break
Chen Yi Lu ⋅ Mehrab Tanjim ⋅ Ishita Dasgupta ⋅ Somdeb Sarkhel ⋅ Gang Wu ⋅ Saayan Mitra ⋅ Somali Chaterji
|
Exhibit Hall I #284 | |
|
CoDa-4DGS: Dynamic Gaussian Splatting with Context and Deformation Awareness for Autonomous Driving
Poster Session 6 & Exhibit Hall with Coffee Break
Rui Song ⋅ Chenwei Liang ⋅ Yan Xia ⋅ Walter Zimmer ⋅ Hu Cao ⋅ Holger Caesar ⋅ Andreas Festag ⋅ Alois Knoll
|
Exhibit Hall I #319 | |
|
GDKVM: Echocardiography Video Segmentation via Spatiotemporal Key-Value Memory with Gated Delta Rule
Poster Session 3 & Exhibit Hall
Rui Wang ⋅ Yimu Sun ⋅ Jingxing Guo ⋅ Huisi Wu ⋅ Jing Qin
|
Exhibit Hall I #205 | |
|
ImageGem: In-the-wild Generative Image Interaction Dataset for Generative Model Personalization
Poster Session 4 & Exhibit Hall with Coffee Break
Yuanhe Guo ⋅ Linxi Xie ⋅ Zhuoran Chen ⋅ Kangrui Yu ⋅ Ryan Po ⋅ Guandao Yang ⋅ Gordon Wetzstein ⋅ Hongyi Wen
|
Exhibit Hall I #449 | |
|
One Trajectory, One Token: Grounded Video Tokenization via Panoptic Sub-object Trajectory
Chenhao Zheng ⋅ Jieyu Zhang ⋅ Mohammadreza Salehi ⋅ Ziqi Gao ⋅ Vishnu Iyengar ⋅ Norimasa Kobori ⋅ Quan Kong ⋅ Ranjay Krishna
|
Exhibit Hall I #317 | |
|
LGA-Net: Learning Local and Global Affinities for Sparse Scribble based Image Colorization
Poster Session 2 & Exhibit Hall with Coffee Break
Hongjin Lyu ⋅ Bo Li ⋅ Paul Rosin ⋅ Yu-Kun Lai
|
Exhibit Hall I #293 | |
|
Backdoor Attacks on Neural Networks via One-Bit Flip
Poster Session 1 & Exhibit Hall
Xiang Li ⋅ Lannan Luo ⋅ Qiang Zeng
|
Exhibit Hall I #406 | |
|
Gaze-Language Alignment for Zero-Shot Prediction of Visual Search Targets from Human Gaze Scanpaths
Poster Session 1 & Exhibit Hall
Sounak Mondal ⋅ Naveen Sendhilnathan ⋅ Ting Zhang ⋅ Yue Liu ⋅ Michael Proulx ⋅ Michael Iuzzolino ⋅ Chuan Qin ⋅ Tanya Jonker
|
Exhibit Hall I #252 | |
|
O-MaMa: Learning Object Mask Matching between Egocentric and Exocentric Views
Poster Session 2 & Exhibit Hall with Coffee Break
Lorenzo Mur-Labadia ⋅ Maria Santos-Villafranca ⋅ Jesus Bermudez-cameo ⋅ Alejandro Perez-Yus ⋅ Ruben Martinez-Cantin ⋅ Jose Guerrero
|
Exhibit Hall I #176 | |
|
SAM Encoder Breach by Adversarial Simplicial Complex Triggers Downstream Model Failures
Poster Session 3 & Exhibit Hall
Yi Qin ⋅ Rui Wang ⋅ Tao Huang ⋅ Tong Xiao ⋅ Liping Jing
|
Exhibit Hall I #57 | |
|
HiMTok: Learning Hierarchical Mask Tokens for Image Segmentation with Large Multimodal Model
Poster Session 5 & Exhibit Hall
Tao Wang ⋅ Changxu Cheng ⋅ Lingfeng Wang ⋅ Senda Chen ⋅ Wuyue Zhao
|
Exhibit Hall I #327 | |
|
Semi-supervised Concept Bottleneck Models
Poster Session 1 & Exhibit Hall
Lijie Hu ⋅ Tianhao Huang ⋅ Huanyi Xie ⋅ Xilin Gong ⋅ Chenyang Ren ⋅ Zhengyu Hu ⋅ Lu Yu ⋅ Ping Ma ⋅ Di Wang
|
Exhibit Hall I #191 | |
|
Normal and Abnormal Pathology Knowledge-Augmented Vision-Language Model for Anomaly Detection in Pathology Images
Poster Session 5 & Exhibit Hall
Jinsol Song ⋅ Jiamu Wang ⋅ Anh Nguyen ⋅ Keunho Byeon ⋅ Sangjeong Ahn ⋅ Sung Hak Lee ⋅ Jin Tae Kwak
|
Exhibit Hall I #212 | |
|
DuET: Dual Incremental Object Detection via Exemplar-Free Task Arithmetic
Poster Session 1 & Exhibit Hall
Munish Monga ⋅ Vishal Chudasama ⋅ Pankaj Wasnik ⋅ Biplab Banerjee
|
Exhibit Hall I #288 | |
|
WINS: Winograd Structured Pruning for Fast Winograd Convolution
Cheonjun Park ⋅ Hyunjae Oh ⋅ Mincheol Park ⋅ Hyunchan Moon ⋅ Minsik Kim ⋅ Suhyun Kim ⋅ Myung Kuk Yoon ⋅ Won Woo Ro
|
Exhibit Hall I #252 | |
|
ART: Adaptive Relation Tuning for Generalized Relation Prediction
Poster Session 4 & Exhibit Hall with Coffee Break
Gopika Sudhakaran ⋅ Hikaru Shindo ⋅ Patrick Schramowski ⋅ Simone Schaub-Meyer ⋅ Kristian Kersting ⋅ Stefan Roth
|
Exhibit Hall I #136 | |
|
Feed-Forward SceneDINO for Unsupervised Semantic Scene Completion
Poster Session 2 & Exhibit Hall with Coffee Break
Aleksandar Jevtić ⋅ Christoph Reich ⋅ Felix Wimbauer ⋅ Oliver Hahn ⋅ Christian Rupprecht ⋅ Stefan Roth ⋅ Daniel Cremers
|
Exhibit Hall I #166 | |
|
DISTIL: Data-Free Inversion of Suspicious Trojan Inputs via Latent Diffusion
Poster Session 1 & Exhibit Hall
Hossein Mirzaei ⋅ Zeinab Taghavi ⋅ Sepehr Rezaee ⋅ Masoud Hadi ⋅ Moein Madadi ⋅ Mackenzie Mathis
|
Exhibit Hall I #295 | |
|
Factorized Learning for Temporally Grounded Video-Language Models
Poster Session 5 & Exhibit Hall
Wenzheng Zeng ⋅ Difei Gao ⋅ Mike Zheng Shou ⋅ Hwee Tou Ng
|
Exhibit Hall I #84 | |
|
FedPall: Prototype-based Adversarial and Collaborative Learning for Federated Learning with Feature Drift
Poster Session 1 & Exhibit Hall
yong zhang ⋅ Feng Liang ⋅ Guanghu Yuan ⋅ Min Yang ⋅ Chengming Li ⋅ Xiping Hu
|
Exhibit Hall I #287 | |
|
Multimodal LLMs as Customized Reward Models for Text-to-Image Generation
Poster Session 4 & Exhibit Hall with Coffee Break
Shijie Zhou ⋅ Ruiyi Zhang ⋅ Huaisheng Zhu ⋅ Branislav Kveton ⋅ Yufan Zhou ⋅ Jiuxiang Gu ⋅ Jian Chen ⋅ Changyou Chen
|
Exhibit Hall I #455 | |
|
MissRAG: Addressing the Missing Modality Challenge in Multimodal Large Language Models
Poster Session 1 & Exhibit Hall
Vittorio Pipoli ⋅ Alessia Saporita ⋅ Federico Bolelli ⋅ Marcella Cornia ⋅ Lorenzo Baraldi ⋅ Costantino Grana ⋅ Rita Cucchiara ⋅ Elisa Ficarra
|
Exhibit Hall I #297 | |
|
FinMMR: Make Financial Numerical Reasoning More Multimodal, Comprehensive, and Challenging
Poster Session 1 & Exhibit Hall
Zichen Tang ⋅ Haihong E ⋅ Jiacheng Liu ⋅ Zhongjun Yang ⋅ Rongjin Li ⋅ Zihua Rong ⋅ Haoyang He ⋅ Zhuodi Hao ⋅ Xinyang Hu ⋅ Kun Ji ⋅ Ziyan Ma ⋅ Mengyuan Ji ⋅ Jun Zhang ⋅ Chenghao Ma ⋅ Qianhe Zheng ⋅ Yang Liu ⋅ Yiling Huang ⋅ Xinyi Hu ⋅ Qing Huang ⋅ Zijian Xie ⋅ Shiyao Peng
|
Exhibit Hall I #300 | |
|
Iris: Breaking GUI Complexity with Adaptive Focus and Self-Refining
Poster Session 5 & Exhibit Hall
Zhiqi Ge ⋅ Juncheng Li ⋅ Xinglei Pang ⋅ Minghe Gao ⋅ Kaihang Pan ⋅ Wang Lin ⋅ Hao Fei ⋅ Wenqiao Zhang ⋅ Siliang Tang ⋅ Yueting Zhuang
|
Exhibit Hall I #446 | |
|
No Pose at All: Self-Supervised Pose-Free 3D Gaussian Splatting from Sparse Views
Ranran Huang ⋅ Krystian Mikolajczyk
|
Exhibit Hall I #311 | |
|
External Knowledge Injection for CLIP-Based Class-Incremental Learning
Poster Session 1 & Exhibit Hall
Da-Wei Zhou ⋅ Kai-Wen Li ⋅ Jingyi Ning ⋅ Han-Jia Ye ⋅ Lijun Zhang ⋅ De-Chuan Zhan
|
Exhibit Hall I #308 | |
|
Cooperative Pseudo Labeling for Unsupervised Federated Classification
Poster Session 1 & Exhibit Hall
Kuangpu Guo ⋅ Lijun Sheng ⋅ Yongcan Yu ⋅ Jian Liang ⋅ Zilei Wang ⋅ Ran He
|
Exhibit Hall I #309 | |
|
DreamLayer: Simultaneous Multi-Layer Generation via Diffusion Model
Junjia Huang ⋅ Pengxiang Yan ⋅ Jinhang Cai ⋅ Jiyang Liu ⋅ Zhao Wang ⋅ Yitong Wang ⋅ Xinglong Wu ⋅ Guanbin Li
|
Exhibit Hall I #312 | |
|
AutoOcc: Automatic Open-Ended Semantic Occupancy Annotation via Vision-Language Guided Gaussian Splatting
Xiaoyu Zhou ⋅ Jingqi Wang ⋅ Yongtao Wang ⋅ Yufei Wei ⋅ Nan Dong ⋅ Ming-Hsuan Yang
|
Exhibit Hall I #313 | |
|
Augmenting Moment Retrieval: Zero-Dependency Two-Stage Learning
Poster Session 1 & Exhibit Hall
Zhengxuan Wei ⋅ Jiajin Tang ⋅ Sibei Yang
|
Exhibit Hall I #316 | |
|
Rep-MTL: Unleashing the Power of Representation-level Task Saliency for Multi-Task Learning
Zedong Wang ⋅ Siyuan Li ⋅ Dan Xu
|
Exhibit Hall I #317 | |
|
Fine-Grained Evaluation of Large Vision-Language Models in Autonomous Driving
Poster Session 2 & Exhibit Hall with Coffee Break
Yue Li ⋅ Meng Tian ⋅ Zhenyu Lin ⋅ Jiangtong Zhu ⋅ Dechang Zhu ⋅ Haiqiang Liu ⋅ Yueyi Zhang ⋅ Zhiwei Xiong ⋅ Xinhai Zhao
|
Exhibit Hall I #414 | |
|
Activation Subspaces for Out-of-Distribution Detection
Poster Session 1 & Exhibit Hall
Barış Zöngür ⋅ Robin Hesse ⋅ Stefan Roth
|
Exhibit Hall I #326 | |
|
PAN-Crafter: Learning Modality-Consistent Alignment for PAN-Sharpening
Poster Session 1 & Exhibit Hall
Jeonghyeok Do ⋅ Sungpyo Kim ⋅ Geunhyuk Youk ⋅ Jaehyup Lee ⋅ Munchurl Kim
|
Exhibit Hall I #397 | |
|
Continual Adaptation: Environment-Conditional Parameter Generation for Object Detection in Dynamic Scenarios
Poster Session 1 & Exhibit Hall
Deng Li ⋅ Aming WU ⋅ Yang Li ⋅ Yaowei Wang ⋅ Yahong Han
|
Exhibit Hall I #416 | |
|
Differentially Private Fine-Tuning of Diffusion Models
Poster Session 1 & Exhibit Hall
Yu-Lin Tsai ⋅ Yizhe Li ⋅ Zekai Chen ⋅ Po-Yu Chen ⋅ Francois Buet-Golfouse ⋅ Chia-Mu Yu ⋅ Xuebin Ren
|
Exhibit Hall I #428 | |
|
IRGPT: Understanding Real-world Infrared Image with Bi-cross-modal Curriculum on Large-scale Benchmark
Poster Session 1 & Exhibit Hall
Zhe Cao ⋅ Jin Zhang ⋅ Ruiheng Zhang
|
Exhibit Hall I #6 | |
|
Multi-turn Consistent Image Editing
Poster Session 4 & Exhibit Hall with Coffee Break
Zijun Zhou ⋅ Yingying Deng ⋅ Xiangyu He ⋅ Weiming Dong ⋅ Fan Tang
|
Exhibit Hall I #86 | |
|
A Hidden Stumbling Block in Generalized Category Discovery: Distracted Attention
Poster Session 1 & Exhibit Hall
Qiyu Xu ⋅ Zhanxuan Hu ⋅ Yu Duan ⋅ Ercheng Pei ⋅ Yonghang Tai
|
Exhibit Hall I #28 | |
|
CAFA: a Controllable Automatic Foley Artist
Poster Session 4 & Exhibit Hall with Coffee Break
Roi Benita ⋅ Michael Finkelson ⋅ Tavi Halperin ⋅ Gleb Sterkin ⋅ Yossi Adi
|
Exhibit Hall I #98 | |
|
Unknown Text Learning for CLIP-based Few-Shot Open-set Recognition
Poster Session 1 & Exhibit Hall
Rui Ma ⋅ Qilong Wang ⋅ Bing Cao ⋅ Qinghua Hu ⋅ Yahong Han
|
Exhibit Hall I #52 | |
|
Reducing Unimodal Bias in Multi-Modal Semantic Segmentation with Multi-Scale Functional Entropy Regularization
Poster Session 5 & Exhibit Hall
Xu Zheng ⋅ Yuanhuiyi Lyu ⋅ Lutao Jiang ⋅ Danda Pani Paudel ⋅ Luc Gool ⋅ Xuming Hu
|
Exhibit Hall I #127 | |
|
Personalized Federated Learning under Local Supervision
Poster Session 1 & Exhibit Hall
Qiqi Liu ⋅ Jiaqiang Li ⋅ Yuchen Liu ⋅ Yaochu Jin ⋅ Lingjuan Lyu ⋅ Xiaohu Wu ⋅ Han Yu
|
Exhibit Hall I #379 | |
|
Multi-View 3D Point Tracking
Poster Session 1 & Exhibit Hall
Frano Rajič ⋅ Haofei Xu ⋅ Marko Mihajlovic ⋅ Siyuan Li ⋅ Irem Demir ⋅ Emircan Gündoğdu ⋅ Lei Ke ⋅ Sergey Prokudin ⋅ Marc Pollefeys ⋅ Siyu Tang
|
Exhibit Hall I #75 | |
|
Hyper-Depth: Hypergraph-based Multi-Scale Representation Fusion for Monocular Depth Estimation
Poster Session 2 & Exhibit Hall with Coffee Break
Lin Bie ⋅ Siqi Li ⋅ Yifan Feng ⋅ Yue Gao
|
Exhibit Hall I #6 | |
|
Learning Separable Fine-Grained Representation via Dendrogram Construction from Coarse Labels for Fine-grained Visual Recognition
Poster Session 1 & Exhibit Hall
Guanghui Shi ⋅ Xuefeng liang ⋅ Wenjie Li ⋅ Xiaoyu Lin
|
Exhibit Hall I #72 | |
|
PRVQL: Progressive Knowledge-guided Refinement for Robust Egocentric Visual Query Localization
Poster Session 2 & Exhibit Hall with Coffee Break
Bing Fan ⋅ Yunhe Feng ⋅ Yapeng Tian ⋅ James Liang ⋅ Yuewei Lin ⋅ Yan Huang ⋅ Heng Fan
|
Exhibit Hall I #13 | |
|
PRO-VPT: Distribution-Adaptive Visual Prompt Tuning via Prompt Relocation
Poster Session 1 & Exhibit Hall
Chikai Shang ⋅ Mengke Li ⋅ Yiqun Zhang ⋅ Zhen Chen ⋅ Jinlin Wu ⋅ Fangqing Gu ⋅ Yang Lu ⋅ Yiu-ming Cheung
|
Exhibit Hall I #138 | |
|
Language-Driven Multi-Label Zero-Shot Learning with Semantic Granularity
Poster Session 1 & Exhibit Hall
Shouwen Wang ⋅ Qian Wan ⋅ Junbin Gao ⋅ Zhigang Zeng
|
Exhibit Hall I #178 | |
|
Generalized Deep Multi-view Clustering via Causal Learning with Partially Aligned Cross-view Correspondence
Poster Session 1 & Exhibit Hall
Xihong Yang ⋅ Siwei Wang ⋅ Jiaqi Jin ⋅ Fangdi Wang ⋅ Tianrui Liu ⋅ Yueming Jin ⋅ Xinwang Liu ⋅ En Zhu ⋅ Kunlun He
|
Exhibit Hall I #180 | |
|
Granular Concept Circuits: Toward a Fine-Grained Circuit Discovery for Concept Representations
Poster Session 1 & Exhibit Hall
Dahee Kwon ⋅ Sehyun Lee ⋅ Jaesik Choi
|
Exhibit Hall I #214 | |
|
Learning an Implicit Physics Model for Image-based Fluid Simulation
Poster Session 2 & Exhibit Hall with Coffee Break
Emily Jia ⋅ Jiageng Mao ⋅ Zhiyuan Gao ⋅ Yajie Zhao ⋅ Yue Wang
|
Exhibit Hall I #190 | |
|
Less is More: Empowering GUI Agent with Context-Aware Simplification
Gongwei Chen ⋅ Xurui Zhou ⋅ Rui Shao ⋅ Yibo Lyu ⋅ Kaiwen Zhou ⋅ Shuai Wang ⋅ WenTao Li ⋅ Yinchuan Li ⋅ Zhongang Qi ⋅ Liqiang Nie
|
Exhibit Hall I #83 | |
|
Trace3D: Consistent Segmentation Lifting via Gaussian Instance Tracing
Poster Session 2 & Exhibit Hall with Coffee Break
Hongyu Shen ⋅ Junfeng Ni ⋅ Weishuo Li ⋅ Mingtao Pei ⋅ Yixin Chen ⋅ Siyuan Huang
|
Exhibit Hall I #155 | |
|
IM360: Large-scale Indoor Mapping with 360 Cameras
Poster Session 6 & Exhibit Hall with Coffee Break
Dongki Jung ⋅ Jaehoon Choi ⋅ Yonghan Lee ⋅ Dinesh Manocha
|
Exhibit Hall I #416 | |
|
EventUPS: Uncalibrated Photometric Stereo Using an Event Camera
Jinxiu Liang ⋅ Bohan Yu ⋅ Siqi Yang ⋅ Haotian Zhuang ⋅ Jieji Ren ⋅ Peiqi Duan ⋅ Boxin Shi
|
Exhibit Hall I #235 | |
|
Noise-Modeled Diffusion Models for Low-Light Spike Image Restoration
Ruonan Liu ⋅ Lin Zhu ⋅ Xijie Xiang ⋅ Lizhi Wang ⋅ Hua Huang
|
Exhibit Hall I #382 | |
|
Rethinking the Upsampling Process in Light Field Super-Resolution with Spatial-Epipolar Implicit Image Function
Poster Session 2 & Exhibit Hall with Coffee Break
Ruixuan Cong ⋅ Yu Wang ⋅ Mingyuan Zhao ⋅ Da Yang ⋅ Rongshan Chen ⋅ Hao Sheng
|
Exhibit Hall I #239 | |
|
Harnessing Input-Adaptive Inference for Efficient VLN
Poster Session 2 & Exhibit Hall with Coffee Break
Dongwoo Kang ⋅ Akhil Perincherry ⋅ Zachary Coalson ⋅ Aiden Gabriel ⋅ Stefan Lee ⋅ Sanghyun Hong
|
Exhibit Hall I #300 | |
|
When Lighting Deceives: Exposing Vision-Language Models' Illumination Vulnerability Through Illumination Transformation Attack
Poster Session 3 & Exhibit Hall
Hanqing Liu ⋅ Shouwei Ruan ⋅ Yao Huang ⋅ Shiji Zhao ⋅ Xingxing Wei
|
Exhibit Hall I #44 | |
|
SemTalk: Holistic Co-speech Motion Generation with Frame-level Semantic Emphasis
Poster Session 3 & Exhibit Hall
Xiangyue Zhang ⋅ Jianfang Li ⋅ Jiaxu Zhang ⋅ Ziqiang Dang ⋅ Jianqiang Ren ⋅ Liefeng Bo ⋅ Zhigang Tu
|
Exhibit Hall I #353 | |
|
PersonaCraft: Personalized and Controllable Full-Body Multi-Human Scene Generation Using Occlusion-Aware 3D-Conditioned Diffusion
Poster Session 3 & Exhibit Hall
Gwanghyun Kim ⋅ Suh Jeon Jeon ⋅ Seunggyu Lee ⋅ Se Young Chun
|
Exhibit Hall I #191 | |
|
MotionFollower: Editing Video Motion via Score-Guided Diffusion
Poster Session 3 & Exhibit Hall
Shuyuan Tu ⋅ Qi Dai ⋅ Zihao Zhang ⋅ Sicheng Xie ⋅ Zhi-Qi Cheng ⋅ Chong Luo ⋅ Xintong Han ⋅ Zuxuan Wu ⋅ Yu-Gang Jiang
|
Exhibit Hall I #265 | |
|
Online Generic Event Boundary Detection
Poster Session 3 & Exhibit Hall
Hyung Rok Jung ⋅ Daneul Kim ⋅ Seunggyun Lim ⋅ Jeany Son ⋅ Jonghyun Choi
|
Exhibit Hall I #351 | |
|
A Recipe for Generating 3D Worlds from a Single Image
Poster Session 1 & Exhibit Hall
Katja Schwarz ⋅ Denis Rozumny ⋅ Samuel Rota Bulò ⋅ Lorenzo Porzi ⋅ Peter Kontschieder
|
Exhibit Hall I #327 | |
|
Guiding Diffusion Models with Adaptive Negative Sampling Without External Resources
Poster Session 4 & Exhibit Hall with Coffee Break
Alakh Desai ⋅ Nuno Vasconcelos
|
Exhibit Hall I #117 | |
|
Transformed Low-rank Adaptation via Tensor Decomposition and Its Applications to Text-to-image Models
Poster Session 4 & Exhibit Hall with Coffee Break
Zerui Tao ⋅ Yuhta Takida ⋅ Naoki Murata ⋅ Qibin Zhao ⋅ Yuki Mitsufuji
|
Exhibit Hall I #137 | |
|
DLFR-Gen: Diffusion-based Video Generation with Dynamic Latent Frame Rate
Poster Session 4 & Exhibit Hall with Coffee Break
Zhihang Yuan ⋅ Rui Xie ⋅ Yuzhang Shang ⋅ Hanling Zhang ⋅ Siyuan Wang ⋅ Shengen Yan ⋅ Guohao Dai ⋅ Yu Wang
|
Exhibit Hall I #144 | |
|
DiffDoctor: Diagnosing Image Diffusion Models Before Treating
Poster Session 4 & Exhibit Hall with Coffee Break
Yiyang Wang ⋅ Xi Chen ⋅ Xiaogang Xu ⋅ Sihui Ji ⋅ Yu Liu ⋅ Yujun Shen ⋅ Hengshuang Zhao
|
Exhibit Hall I #387 | |
|
GroundingSuite: Measuring Complex Multi-Granular Pixel Grounding
Poster Session 5 & Exhibit Hall
Rui Hu ⋅ Yuxuan Zhang ⋅ Lianghui Zhu ⋅ Tianheng Cheng ⋅ Lei Liu ⋅ Heng Liu ⋅ Longjin Ran ⋅ Xiaoxin Chen ⋅ Wenyu Liu ⋅ Xinggang Wang
|
Exhibit Hall I #312 | |
|
Video Motion Graphs
Haiyang Liu ⋅ Zhan Xu ⋅ Fating Hong ⋅ Hsin-Ping Huang ⋅ Yi Zhou ⋅ Yang Zhou
|
Exhibit Hall I #350 | |
|
Adaptive Learning of High-Value Regions for Semi-Supervised Medical Image Segmentation
Poster Session 5 & Exhibit Hall
Tao Lei ⋅ Ziyao Yang ⋅ Xingwu wang ⋅ Yi Wang ⋅ Xuan Wang ⋅ FeimanSun FeimanSun ⋅ Asoke Nandi
|
Exhibit Hall I #153 | |
|
Constructing Ophthalmic MLLM for Positioning-diagnosis Collaboration Through Clinical Cognitive Chain Reasoning
Poster Session 5 & Exhibit Hall
Xinyao Liu ⋅ Diping Song
|
Exhibit Hall I #164 | |
|
Hallucinatory Image Tokens: A Training-free EAZY Approach to Detecting and Mitigating Object Hallucinations in LVLMs
Poster Session 5 & Exhibit Hall
Liwei Che ⋅ Qingze T Liu ⋅ Jing Jia ⋅ Weiyi Qin ⋅ Ruixiang Tang ⋅ Vladimir Pavlovic
|
Exhibit Hall I #172 | |
|
Keep Your Friends Close, and Your Enemies Farther: Distance-aware Voxel-wise Contrastive Learning for Semi-supervised Multi-organ Segmentation
Poster Session 5 & Exhibit Hall
Haochen Zhao ⋅ Jianwei Niu ⋅ Xuefeng Liu ⋅ Xiaozheng Xie ⋅ Li Kuang ⋅ Haotian Yang ⋅ Bin Dai ⋅ Hui Meng ⋅ Yong Wang
|
Exhibit Hall I #190 | |
|
Integrating Biological Knowledge for Robust Microscopy Image Profiling on De Novo Cell Lines
Jiayuan Chen ⋅ Thai-Hoang Pham ⋅ Yuanlong Wang ⋅ Ping Zhang
|
Exhibit Hall I #286 | |
|
Spectral Sensitivity Estimation with an Uncalibrated Diffraction Grating
Poster Session 6 & Exhibit Hall with Coffee Break
Lilika Makabe ⋅ Hiroaki Santo ⋅ Fumio Okura ⋅ Michael Brown ⋅ Yasuyuki Matsushita
|
Exhibit Hall I #245 | |
|
TransiT: Transient Transformer for Non-line-of-sight Videography
Poster Session 6 & Exhibit Hall with Coffee Break
Ruiqian Li ⋅ Siyuan Shen ⋅ Suan Xia ⋅ Ziheng Wang ⋅ Xingyue Peng ⋅ Chengxuan Song ⋅ Yingsheng Zhu ⋅ Tao Wu ⋅ Shiying Li ⋅ Jingyi Yu
|
Exhibit Hall I #272 | |
|
LA-MOTR: End-to-End Multi-Object Tracking by Learnable Association
Poster Session 3 & Exhibit Hall
Peng Wang ⋅ Yongcai Wang ⋅ Hualong Cao ⋅ Wang Chen ⋅ Deying Li
|
Exhibit Hall I #230 | |
|
Contact-Aware Amodal Completion for Human-Object Interaction via Multi-Regional Inpainting
Seunggeun Chi ⋅ Pin-Hao Huang ⋅ Enna Sachdeva ⋅ Kwonjoon Lee
|
Exhibit Hall I #419 | |
|
On the Complexity-Faithfulness Trade-off of Gradient-Based Explanations
Poster Session 1 & Exhibit Hall
Amir Mehrpanah ⋅ Matteo Gamba ⋅ Kevin Smith ⋅ Hossein Azizpour
|
Exhibit Hall I #328 | |
|
TrajectoryCrafter: Redirecting Camera Trajectory for Monocular Videos via Diffusion Models
Poster Session 1 & Exhibit Hall
Mark YU ⋅ Wenbo Hu ⋅ Jinbo Xing ⋅ Ying Shan
|
Exhibit Hall I #154 | |
|
CSD-VAR: Content-Style Decomposition in Visual Autoregressive Models
Poster Session 4 & Exhibit Hall with Coffee Break
Quang-Binh Nguyen ⋅ Minh Luu ⋅ Quang Nguyen ⋅ Anh Tran ⋅ Khoi Nguyen
|
Exhibit Hall I #203 | |
|
Oasis: One Image is All You Need for Multimodal Instruction Data Synthesis
Poster Session 1 & Exhibit Hall
Letian Zhang ⋅ Quan Cui ⋅ Bingchen Zhao ⋅ Cheng Yang
|
Exhibit Hall I #329 | |
|
Learning to Inference Adaptively for Multimodal Large Language Models
Poster Session 1 & Exhibit Hall
Zhuoyan Xu ⋅ Khoi Nguyen ⋅ Preeti Mukherjee ⋅ Saurabh Bagchi ⋅ Somali Chaterji ⋅ Yingyu Liang ⋅ Yin Li
|
Exhibit Hall I #330 | |
|
Self-Reinforcing Prototype Evolution with Dual-Knowledge Cooperation for Semi-Supervised Lifelong Person Re-Identification
Poster Session 1 & Exhibit Hall
Kunlun Xu ⋅ Fan Zhuo ⋅ Jiangmeng Li ⋅ Xu Zou ⋅ Jiahuan Zhou
|
Exhibit Hall I #331 | |
|
Hierarchical Divide-and-Conquer Grouping for Classification Adaptation of Pre-Trained Models
Poster Session 1 & Exhibit Hall
Ziqian Lu ⋅ Yunlong Yu ⋅ Qinyue Tong ⋅ Jun Liu
|
Exhibit Hall I #332 | |
|
Lark: Low-Rank Updates After Knowledge Localization for Few-shot Class-Incremental Learning
Poster Session 1 & Exhibit Hall
Jinxin Shi ⋅ Jiabao Zhao ⋅ Yifan Yang ⋅ Xingjiao Wu ⋅ Jiawen Li ⋅ Liang He
|
Exhibit Hall I #335 | |
|
Large Multi-modal Models Can Interpret Features in Large Multi-modal Models
Poster Session 1 & Exhibit Hall
Kaichen Zhang ⋅ Yifei Shen ⋅ Bo Li ⋅ Ziwei Liu
|
Exhibit Hall I #339 | |
|
A Conditional Probability Framework for Compositional Zero-shot Learning
Poster Session 1 & Exhibit Hall
Peng Wu ⋅ Qiuxia Lai ⋅ Hao Fang ⋅ Guo-Sen Xie ⋅ Yilong Yin ⋅ Xiankai Lu ⋅ Wenguan Wang
|
Exhibit Hall I #341 | |
|
Mind the Gap: Preserving and Compensating for the Modality Gap in CLIP-Based Continual Learning
Linlan Huang ⋅ Xusheng Cao ⋅ Haori Lu ⋅ Yifan Meng ⋅ Fei Yang ⋅ Xialei Liu
|
Exhibit Hall I #351 | |
|
BUFFER-X: Towards Zero-Shot Point Cloud Registration in Diverse Scenes
Minkyun Seo ⋅ Hyungtae Lim ⋅ Kanghee Lee ⋅ Luca Carlone ⋅ Jaesik Park
|
Exhibit Hall I #358 | |
|
RANKCLIP: Ranking-Consistent Language-Image Pretraining
Poster Session 1 & Exhibit Hall
Yiming Zhang ⋅ Zhuokai Zhao ⋅ Zhaorun Chen ⋅ Zhili Feng ⋅ Zenghui Ding ⋅ Yining Sun
|
Exhibit Hall I #360 | |
|
An Efficient Post-hoc Framework for Reducing Task Discrepancy of Text Encoders for Composed Image Retrieval
Poster Session 1 & Exhibit Hall
Jaeseok Byun ⋅ Seokhyeon Jeong ⋅ Wonjae Kim ⋅ Sanghyuk Chun ⋅ Taesup Moon
|
Exhibit Hall I #362 | |
|
ZIUM: Zero-Shot Intent-Aware Adversarial Attack on Unlearned Models
Poster Session 1 & Exhibit Hall
Hyun Jun Yook ⋅ Ga San Jhun ⋅ Cho Hyun ⋅ Min Jeon ⋅ Donghyun Kim ⋅ Tae Hyung Kim ⋅ Youn Lee
|
Exhibit Hall I #365 | |
|
Federated Prompt-Tuning with Heterogeneous and Incomplete Multimodal Client Data
Poster Session 1 & Exhibit Hall
Hang Phung ⋅ Manh Nguyen ⋅ Thanh Huynh ⋅ Quoc Viet Hung Nguyen ⋅ Trong Nghia Hoang ⋅ Phi Le Nguyen
|
Exhibit Hall I #366 | |
|
Find a Scapegoat: Poisoning Membership Inference Attack and Defense to Federated Learning
Poster Session 1 & Exhibit Hall
Wenjin Mo ⋅ Zhiyuan Li ⋅ Minghong Fang ⋅ Mingwei Fang
|
Exhibit Hall I #369 | |
|
OpenVision: A Fully-Open, Cost-Effective Family of Advanced Vision Encoders for Multimodal Learning
Poster Session 1 & Exhibit Hall
Xianhang Li ⋅ Yanqing Liu ⋅ Haoqin Tu ⋅ Cihang Xie
|
Exhibit Hall I #370 | |
|
Integrating Visual Interpretation and Linguistic Reasoning for Geometric Problem Solving
Poster Session 1 & Exhibit Hall
Zixian Guo ⋅ Ming Liu ⋅ Qilong Wang ⋅ Zhilong Ji ⋅ Jinfeng Bai ⋅ Lei Zhang ⋅ Wangmeng Zuo
|
Exhibit Hall I #371 | |
|
SAFER: Sharpness Aware layer-selective Finetuning for Enhanced Robustness in vision transformers
Poster Session 1 & Exhibit Hall
Bhavna Gopal ⋅ Huanrui Yang ⋅ Mark Horton ⋅ Yiran Chen
|
Exhibit Hall I #372 | |
|
To Label or Not to Label: PALM – A Predictive Model for Evaluating Sample Efficiency in Active Learning Models
Poster Session 1 & Exhibit Hall
Julia Machnio ⋅ Mads Nielsen ⋅ Mostafa Mehdipour Ghazi
|
Exhibit Hall I #376 | |
|
Uncalibrated Structure from Motion on a Sphere
Poster Session 1 & Exhibit Hall
Jonathan Ventura ⋅ Viktor Larsson ⋅ Fredrik Kahl
|
Exhibit Hall I #303 | |
|
Prototype-based Contrastive Learning with Stage-wise Progressive Augmentation for Self-Supervised Fine-Grained Learning
Poster Session 1 & Exhibit Hall
BaoFeng Tan ⋅ Xiu-Shen Wei ⋅ Lin Zhao
|
Exhibit Hall I #386 | |
|
Radiant Foam: Real-Time Differentiable Ray Tracing
Shrisudhan Govindarajan ⋅ Daniel Rebain ⋅ Kwang Moo Yi ⋅ Andrea Tagliasacchi
|
Exhibit Hall I #387 | |
|
COSTARR: Consolidated Open Set Technique with Attenuation for Robust Recognition
Poster Session 1 & Exhibit Hall
Ryan Rabinowitz ⋅ Steve Cruz ⋅ Walter Scheirer ⋅ Terrance Boult
|
Exhibit Hall I #388 | |
|
Information Density Principle for MLLM Benchmarks
Poster Session 1 & Exhibit Hall
Chunyi Li ⋅ Xiaozhe Li ⋅ Zicheng Zhang ⋅ Yuan Tian ⋅ Ziheng Jia ⋅ Xiaohong Liu ⋅ Xiongkuo Min ⋅ Jia Wang ⋅ Haodong Duan ⋅ Kai Chen ⋅ Guangtao Zhai
|
Exhibit Hall I #390 | |
|
ReTracker: Exploring Image Matching for Robust Online Any Point Tracking
Dongli Tan ⋅ Xingyi He ⋅ Sida Peng ⋅ Yiqing Gong ⋅ Xing Zhu ⋅ Jiaming Sun ⋅ Ruizhen Hu ⋅ Yujun Shen ⋅ Hujun Bao ⋅ Xiaowei Zhou
|
Exhibit Hall I #404 | |
|
Perspective-Aware Teaching: Adapting Knowledge for Heterogeneous Distillation
Poster Session 1 & Exhibit Hall
Jhe-Hao Lin ⋅ Yi Yao ⋅ Chan-Feng Hsu ⋅ Hongxia Xie ⋅ Hong-Han Shuai ⋅ Wen-Huang Cheng
|
Exhibit Hall I #391 | |
|
Is Meta-Learning Out? Rethinking Unsupervised Few-Shot Classification with Limited Entropy
Poster Session 1 & Exhibit Hall
Yunchuan Guan ⋅ Yu Liu ⋅ Ke Zhou ⋅ Zhiqi Shen ⋅ Jenq-Newng Hwang ⋅ Serge Belongie ⋅ Lei Li
|
Exhibit Hall I #392 | |
|
Learning to Unlearn while Retaining: Combating Gradient Conflicts in Machine Unlearning
Poster Session 1 & Exhibit Hall
Gaurav Patel ⋅ Qiang Qiu
|
Exhibit Hall I #394 | |
|
Robust Multi-View Learning via Representation Fusion of Sample-Level Attention and Alignment of Simulated Perturbation
Poster Session 1 & Exhibit Hall
Jie Xu ⋅ Na Zhao ⋅ Gang Niu ⋅ Masashi Sugiyama ⋅ Xiaofeng Zhu
|
Exhibit Hall I #396 | |
|
Beyond the Frame: Generating 360° Panoramic Videos from Perspective Videos
Poster Session 3 & Exhibit Hall
Rundong Luo ⋅ Matthew Wallingford ⋅ Ali Farhadi ⋅ Noah Snavely ⋅ Wei-Chiu Ma
|
Exhibit Hall I #408 | |
|
A Linear N-Point Solver for Structure and Motion from Asynchronous Tracks
Hang Su ⋅ Yunlong Feng ⋅ Daniel Gehrig ⋅ Panfeng Jiang ⋅ Ling Gao ⋅ Xavier Lagorce ⋅ Laurent Kneip
|
Exhibit Hall I #407 | |
|
Differentiable Room Acoustic Rendering with Multi-View Vision Priors
Poster Session 1 & Exhibit Hall
Derong Jin ⋅ Ruohan Gao
|
Exhibit Hall I #381 | |
|
Long-LRM: Long-sequence Large Reconstruction Model for Wide-coverage Gaussian Splats
Chen Ziwen ⋅ Hao Tan ⋅ Kai Zhang ⋅ Sai Bi ⋅ Fujun Luan ⋅ Yicong Hong ⋅ Li Fuxin ⋅ Zexiang Xu
|
Exhibit Hall I #408 | |
|
SplatTalk: 3D VQA with Gaussian Splatting
Poster Session 1 & Exhibit Hall
Anh Thai ⋅ Kyle Genova ⋅ Songyou Peng ⋅ Leonidas Guibas ⋅ Thomas Funkhouser
|
Exhibit Hall I #442 | |
|
Joint Diffusion Models in Continual Learning
Poster Session 1 & Exhibit Hall
Paweł Skierś ⋅ Kamil Deja
|
Exhibit Hall I #411 | |
|
GT-Loc: Unifying When and Where in Images through a Joint Embedding Space
Poster Session 1 & Exhibit Hall
David G. Shatwell ⋅ Ishan Rajendrakumar Dave ⋅ Swetha Sirnam ⋅ Mubarak Shah
|
Exhibit Hall I #153 | |
|
TurboTrain: Towards Efficient and Balanced Multi-Task Learning for Multi-Agent Perception and Prediction
Poster Session 1 & Exhibit Hall
Zewei Zhou ⋅ Zhihao Zhao ⋅ Tianhui Cai ⋅ Zhiyu Huang ⋅ Bolei Zhou ⋅ Jiaqi Ma
|
Exhibit Hall I #412 | |
|
InfiniDreamer: Arbitrarily Long Human Motion Generation via Segment Score Distillation
Poster Session 3 & Exhibit Hall
Wenjie Zhuo ⋅ Fan Ma ⋅ Hehe Fan
|
Exhibit Hall I #441 | |
|
Multimodal Large Language Model-Guided ISP Hyperparameter Optimization with Dynamic Preference Learning
Poster Session 1 & Exhibit Hall
Xinyu Sun ⋅ Zhikun Zhao ⋅ congyan lang ⋅ Bing Li ⋅ Juan Wang
|
Exhibit Hall I #31 | |
|
VITAL: More Understandable Feature Visualization through Distribution Alignment and Relevant Information Flow
Poster Session 1 & Exhibit Hall
Ada Görgün ⋅ Bernt Schiele ⋅ Jonas Fischer
|
Exhibit Hall I #413 | |
|
ToolVQA: A Dataset for Multi-step Reasoning VQA with External Tools
Poster Session 1 & Exhibit Hall
Shaofeng Yin ⋅ Ting Lei ⋅ Yang Liu
|
Exhibit Hall I #415 | |
|
MMCR: Benchmarking Cross-Source Reasoning in Scientific Papers
Poster Session 1 & Exhibit Hall
Yang Tian ⋅ Zheng Lu ⋅ Mingqi Gao ⋅ Zheng Liu ⋅ Bo Zhao
|
Exhibit Hall I #36 | |
|
MosaicDiff: Training-free Structural Pruning for Diffusion Model Acceleration Reflecting Pretraining Dynamics
Poster Session 1 & Exhibit Hall
Bowei Guo ⋅ Shengkun Tang ⋅ Cong Zeng ⋅ Zhiqiang Shen
|
Exhibit Hall I #147 | |
|
FEVER-OOD: Free Energy Vulnerability Elimination for Robust Out-of-Distribution Detection
Poster Session 1 & Exhibit Hall
Brian Isaac-Medina ⋅ Mauricio Che ⋅ Yona Falinie A. Gaus ⋅ Samet Akcay ⋅ Toby Breckon
|
Exhibit Hall I #425 | |
|
CAVIS: Context-Aware Video Instance Segmentation
Poster Session 1 & Exhibit Hall
Seunghun Lee ⋅ Jiwan Seo ⋅ Kiljoon Han ⋅ Minwoo Choi ⋅ Sunghoon Im
|
Exhibit Hall I #423 | |
|
Adversarial Purification via Super-Resolution and Diffusion
Poster Session 1 & Exhibit Hall
Mincheol Park ⋅ Cheonjun Park ⋅ Seungseop Lim ⋅ Mijin Koo ⋅ Hyunwuk Lee ⋅ Won Woo Ro ⋅ Suhyun Kim
|
Exhibit Hall I #432 | |
|
FedWSQ: Efficient Federated Learning with Weight Standardization and Distribution-Aware Non-Uniform Quantization
Poster Session 1 & Exhibit Hall
Seung-Wook Kim ⋅ Seongyeol Kim ⋅ Jiah Kim ⋅ Seowon Ji ⋅ Se-Ho Lee
|
Exhibit Hall I #433 | |
|
CMAD: Correlation-Aware and Modalities-Aware Distillation for Multimodal Sentiment Analysis with Missing Modalities
Poster Session 1 & Exhibit Hall
Yan Zhuang ⋅ Minhao Liu ⋅ Wei Bai ⋅ Yanru Zhang ⋅ Xiaoyue Zhang ⋅ Jiawen Deng ⋅ Fuji Ren
|
Exhibit Hall I #434 | |
|
SimpleVQA: Multimodal Factuality Evaluation for Multimodal Large Language Models
Poster Session 1 & Exhibit Hall
Xianfu Cheng ⋅ Wei Zhang ⋅ Shiwei Zhang ⋅ Jian Yang ⋅ Xiangyuan Guan ⋅ Xianjie Wu ⋅ Xiang Li ⋅ Ge Zhang ⋅ Jiaheng Liu ⋅ Yuying Mai ⋅ Yutao Zeng ⋅ Zhoufutu Wen ⋅ JinKe JinKe ⋅ Baorui Wang ⋅ Weixiao Zhou ⋅ Lu Yunhong ⋅ Hangyuan Ji ⋅ Tongliang Li ⋅ Wenhao Huang ⋅ Zhoujun Li
|
Exhibit Hall I #435 | |
|
2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining
Wenqi Zhang ⋅ Hang Zhang ⋅ Xin Li ⋅ Jiashuo Sun ⋅ Yongliang Shen ⋅ Weiming Lu ⋅ Deli Zhao ⋅ Yueting Zhuang ⋅ Lidong Bing
|
Exhibit Hall I #436 | |
|
Revelio: Interpreting and leveraging semantic information in diffusion models
Poster Session 1 & Exhibit Hall
Dahye Kim ⋅ Xavier Thomas ⋅ Deepti Ghadiyaram
|
Exhibit Hall I #437 | |
|
CLIP-GS: Unifying Vision-Language Representation with 3D Gaussian Splatting
Poster Session 1 & Exhibit Hall
Siyu Jiao ⋅ Haoye Dong ⋅ Yuyang Yin ⋅ ZEQUN JIE ⋅ Yinlong Qian ⋅ Yao Zhao ⋅ Humphrey Shi ⋅ Yunchao Wei
|
Exhibit Hall I #438 | |
|
ProJudge: A Multi-Modal Multi-Discipline Benchmark and Instruction-Tuning Dataset for MLLM-based Process Judges
Poster Session 1 & Exhibit Hall
Jiaxin Ai ⋅ Pengfei Zhou ⋅ xu Pan ⋅ Ming Li ⋅ Fanrui Zhang ⋅ Zizhen Li ⋅ Jianwen Sun ⋅ Yukang Feng ⋅ Baojin Huang ⋅ Zhongyuan Wang ⋅ Kaipeng Zhang
|
Exhibit Hall I #439 | |
|
Failure Cases Are Better Learned But Boundary Says Sorry: Facilitating Smooth Perception Change for Accuracy-Robustness Trade-Off in Adversarial Training
Poster Session 1 & Exhibit Hall
Yanyun Wang ⋅ Li Liu
|
Exhibit Hall I #440 | |
|
Taming the Untamed: Graph-Based Knowledge Retrieval and Reasoning for MLLMs to Conquer the Unknown
Poster Session 1 & Exhibit Hall
Bowen Wang ⋅ Zhouqiang Jiang ⋅ Yasuaki Susumu ⋅ Shotaro Miwa ⋅ Tianwei Chen ⋅ Yuta Nakashima
|
Exhibit Hall I #444 | |
|
Causality-guided Prompt Learning for Vision-language Models via Visual Granulation
Poster Session 1 & Exhibit Hall
Mengyu Gao ⋅ Qiulei Dong
|
Exhibit Hall I #99 | |
|
MUNBa: Machine Unlearning via Nash Bargaining
Poster Session 1 & Exhibit Hall
Jing Wu ⋅ Mehrtash Harandi
|
Exhibit Hall I #446 | |
|
Auxiliary Prompt Tuning of Vision-Language Models for Few-Shot Out-of-Distribution Detection
Poster Session 1 & Exhibit Hall
Wenjun Miao ⋅ Guansong Pang ⋅ Zihan Wang ⋅ Jin Zheng ⋅ Xiao Bai
|
Exhibit Hall I #448 | |
|
Improved Noise Schedule for Diffusion Training
Poster Session 1 & Exhibit Hall
Tiankai Hang ⋅ Shuyang Gu ⋅ Jianmin Bao ⋅ Fangyun Wei ⋅ Dong Chen ⋅ Xin Geng ⋅ Baining Guo
|
Exhibit Hall I #450 | |
|
Secure On-Device Video OOD Detection Without Backpropagation
Poster Session 1 & Exhibit Hall
Li Li ⋅ Peilin Cai ⋅ Yuxiao Zhou ⋅ Zhiyu Ni ⋅ Renjie Liang ⋅ QIN YOU ⋅ Yi Nian ⋅ Zhengzhong Tu ⋅ Xiyang Hu ⋅ Yue Zhao
|
Exhibit Hall I #1 | |
|
Learning Counterfactually Decoupled Attention for Open-World Model Attribution
Poster Session 1 & Exhibit Hall
Yu Zheng ⋅ Boyang Gong ⋅ Fanye Kong ⋅ Yueqi Duan ⋅ Bingyao Yu ⋅ Wenzhao Zheng ⋅ Lei Chen ⋅ Jiwen Lu ⋅ Jie Zhou
|
Exhibit Hall I #2 | |
|
Latte: Collaborative Test-Time Adaptation of Vision-Language Models in Federated Learning
Poster Session 1 & Exhibit Hall
Wenxuan Bao ⋅ Ruxi Deng ⋅ Ruizhong Qiu ⋅ Tianxin Wei ⋅ Hanghang Tong ⋅ Jingrui He
|
Exhibit Hall I #3 | |
|
Is Less More? Exploring Token Condensation as Training-free Test-time Adaptation
Poster Session 1 & Exhibit Hall
Zixin Wang ⋅ Dong Gong ⋅ Sen Wang ⋅ Zi Huang ⋅ Yadan Luo
|
Exhibit Hall I #4 | |
|
WIPES: Wavelet-based Visual Primitives
Poster Session 6 & Exhibit Hall with Coffee Break
Wenhao Zhang ⋅ Hao Zhu ⋅ Delong Wu ⋅ Di Kang ⋅ Linchao Bao ⋅ Xun Cao ⋅ Zhan Ma
|
Exhibit Hall I #253 | |
|
Mastering Collaborative Multi-modal Data Selection: A Focus on Informativeness, Uniqueness, and Representativeness
Qifan Yu ⋅ Zhebei Shen ⋅ Zhongqi Yue ⋅ Yang Wu ⋅ Bosheng Qin ⋅ Wenqiao Zhang ⋅ Yunfei Li ⋅ Juncheng Li ⋅ Siliang Tang ⋅ Yueting Zhuang
|
Exhibit Hall I #5 | |
|
SMoLoRA: Exploring and Defying Dual Catastrophic Forgetting in Continual Visual Instruction Tuning
Poster Session 1 & Exhibit Hall
Ziqi Wang ⋅ Chang Che ⋅ Qi Wang ⋅ Yangyang Li ⋅ Zenglin Shi ⋅ Meng Wang
|
Exhibit Hall I #7 | |
|
Generalized Tensor-based Parameter-Efficient Fine-Tuning via Lie Group Transformations
Poster Session 1 & Exhibit Hall
Chongjie Si ⋅ Zhiyi Shi ⋅ Xuehui Wang ⋅ Yichen Xiao ⋅ Xiaokang Yang ⋅ Wei Shen
|
Exhibit Hall I #9 | |
|
Bootstrapping Grounded Chain-of-Thought in Multimodal LLMs for Data-Efficient Model Adaptation
Jiaer Xia ⋅ Bingkui Tong ⋅ Yuhang Zang ⋅ Rui Shao ⋅ Kaiyang Zhou
|
Exhibit Hall I #10 | |
|
One Encoder to Rule them All: Representation Learning for Model-free Visual Reinforcement Learning using Fourier Neural Operators
Poster Session 1 & Exhibit Hall
Parag Dutta ⋅ Mohd Ayyoob ⋅ Shalabh Bhatnagar ⋅ Ambedkar Dukkipati
|
Exhibit Hall I #452 | |
|
Deciphering Cross-Modal Alignment in Large Vision-Language Models via Modality Integration Rate
Poster Session 1 & Exhibit Hall
Qidong Huang ⋅ Xiaoyi Dong ⋅ Pan Zhang ⋅ Yuhang Zang ⋅ Yuhang Cao ⋅ Jiaqi Wang ⋅ Weiming Zhang ⋅ Nenghai Yu
|
Exhibit Hall I #11 | |
|
X-Fusion: Introducing New Modality to Frozen Large Language Models
Poster Session 1 & Exhibit Hall
Sicheng Mo ⋅ Thao Nguyen ⋅ Xun Huang ⋅ Siddharth Iyer ⋅ Yijun Li ⋅ Yuchen Liu ⋅ Abhishek Tandon ⋅ Eli Shechtman ⋅ Krishna Kumar Singh ⋅ Yong Jae Lee ⋅ Bolei Zhou ⋅ Yuheng Li
|
Exhibit Hall I #12 | |
|
LLaVA-KD: A Framework of Distilling Multimodal Large Language Models
Poster Session 1 & Exhibit Hall
Yuxuan Cai ⋅ Jiangning Zhang ⋅ Haoyang He ⋅ Xinwei He ⋅ Ao Tong ⋅ Zhenye Gan ⋅ Chengjie Wang ⋅ Zhucun Xue ⋅ Yong Liu ⋅ Xiang Bai
|
Exhibit Hall I #13 | |
|
Doodle Your Keypoints: Sketch-Based Few-Shot Keypoint Detection
Poster Session 1 & Exhibit Hall
Subhajit Maity ⋅ Ayan Bhunia ⋅ Subhadeep Koley ⋅ Pinaki Chowdhury ⋅ Aneeshan Sain ⋅ Yi-Zhe Song
|
Exhibit Hall I #17 | |
|
Dissecting Generalized Category Discovery: Multiplex Consensus under Self-Deconstruction
Luyao Tang ⋅ Kunze Huang ⋅ Yuxuan Yuan ⋅ Chenxin Li ⋅ Xiaotong Tu ⋅ Xinghao Ding ⋅ Chaoqi Chen ⋅ Yue Huang
|
Exhibit Hall I #18 | |
|
Partial Forward Blocking: A Novel Data Pruning Paradigm for Lossless Training Acceleration
Poster Session 1 & Exhibit Hall
Dongyue Wu ⋅ Zilin Guo ⋅ Jialong Zuo ⋅ Nong Sang ⋅ Changxin Gao
|
Exhibit Hall I #20 | |
|
LIFT: Latent Implicit Functions for Task- and Data-Agnostic Encoding
Poster Session 1 & Exhibit Hall
Amirhossein Kazerouni ⋅ Soroush Mehraban ⋅ Michael Brudno ⋅ Babak Taati
|
Exhibit Hall I #453 | |
|
ChartPoint: Guiding MLLMs with Grounding Reflection for Chart Reasoning
Poster Session 1 & Exhibit Hall
Zhengzhuo Xu ⋅ Sinan Du ⋅ Yiyan Qi ⋅ Siwen Lu ⋅ Chengjin Xu ⋅ Chun Yuan ⋅ Jian Guo
|
Exhibit Hall I #30 | |
|
ShortV: Efficient Multimodal Large Language Models by Freezing Visual Tokens in Ineffective Layers
Poster Session 1 & Exhibit Hall
Qianhao Yuan ⋅ Qingyu Zhang ⋅ yanjiang liu ⋅ Jiawei Chen ⋅ Yaojie Lu ⋅ Hongyu Lin ⋅ Jia Zheng ⋅ Xianpei Han ⋅ Le Sun
|
Exhibit Hall I #21 | |
|
Achieving More with Less: Additive Prompt Tuning for Rehearsal-Free Class-Incremental Learning
Poster Session 1 & Exhibit Hall
Haoran Chen ⋅ Ping Wang ⋅ Zihan Zhou ⋅ Xu Zhang ⋅ Zuxuan Wu ⋅ Yu-Gang Jiang
|
Exhibit Hall I #22 | |
|
CIARD: Cyclic Iterative Adversarial Robustness Distillation
Poster Session 1 & Exhibit Hall
Liming Lu ⋅ Shuchao Pang ⋅ Xu Zheng ⋅ Xiang GU ⋅ Anan Du ⋅ Yunhuai Liu ⋅ Yongbin Zhou
|
Exhibit Hall I #23 | |
|
MOBIUS: Big-to-Mobile Universal Instance Segmentation via Multi-modal Bottleneck Fusion and Calibrated Decoder Pruning
Poster Session 5 & Exhibit Hall
Mattia Segu ⋅ Marta Tintore Gazulla ⋅ Yongqin Xian ⋅ Luc Gool ⋅ Federico Tombari
|
Exhibit Hall I #88 | |
|
MambaML: Exploring State Space Models for Multi-Label Image Classification
Poster Session 1 & Exhibit Hall
Xuelin Zhu ⋅ Jian liu ⋅ Jiuxin Cao ⋅ Bing WANG
|
Exhibit Hall I #445 | |
|
Moderating the Generalization of Score-based Generative Model
Poster Session 1 & Exhibit Hall
Wan Jiang ⋅ He Wang ⋅ Xin Zhang ⋅ Dan Guo ⋅ Zhaoxin Fan ⋅ Yunfeng Diao ⋅ Richang Hong
|
Exhibit Hall I #24 | |
|
Scaling Language-Free Visual Representation Learning
David Fan ⋅ Shengbang Tong ⋅ Jiachen Zhu ⋅ Koustuv Sinha ⋅ Zhuang Liu ⋅ Xinlei Chen ⋅ Michael Rabbat ⋅ Nicolas Ballas ⋅ Yann LeCun ⋅ Amir Bar ⋅ Saining Xie
|
Exhibit Hall I #25 | |
|
Improving Noise Efficiency in Privacy-preserving Dataset Distillation
Poster Session 1 & Exhibit Hall
Runkai Zheng ⋅ Vishnu Dasu ⋅ Yinong Wang ⋅ Haohan Wang ⋅ Fernando De la Torre
|
Exhibit Hall I #454 | |
|
LLM-assisted Entropy-based Adaptive Distillation for Unsupervised Fine-grained Visual Representation Learning
Poster Session 1 & Exhibit Hall
Jianfeng Dong ⋅ Danfeng Luo ⋅ Daizong Liu ⋅ Jie Sun ⋅ Xiaoye Qu ⋅ Xun Yang ⋅ Dongsheng Liu ⋅ Xun Wang
|
Exhibit Hall I #26 | |
|
DiffRefine: Diffusion-based Proposal Specific Point Cloud Densification for Cross-Domain Object Detection
Sangyun Shin ⋅ Yuhang He ⋅ Xinyu Hou ⋅ Samuel Hodgson ⋅ Andrew Markham ⋅ Niki Trigoni
|
Exhibit Hall I #459 | |
|
On the Robustness Tradeoff in Fine-Tuning
Poster Session 1 & Exhibit Hall
Kunyang Li ⋅ Jean-Charles Noirot Ferrand ⋅ Ryan Sheatsley ⋅ Blaine Hoak ⋅ Yohan Beugin ⋅ Eric Pauley ⋅ Patrick McDaniel
|
Exhibit Hall I #460 | |
|
Gradient Short-Circuit: Efficient Out-of-Distribution Detection via Feature Intervention
Poster Session 1 & Exhibit Hall
Jiawei Gu ⋅ Ziyue Qiao ⋅ Zechao Li
|
Exhibit Hall I #33 | |
|
Boundary Probing for Input Privacy Protection When Using LMM Services
Poster Session 1 & Exhibit Hall
Xiaofei Hui ⋅ Haoxuan Qu ⋅ Ping Hu ⋅ Hossein Rahmani ⋅ Jun Liu
|
Exhibit Hall I #34 | |
|
Intrepretable Zero-Shot Learning with Locally-Aligned Vision-Language Model
Poster Session 1 & Exhibit Hall
Shiming Chen ⋅ Bowen Duan ⋅ Salman Khan ⋅ Fahad Khan
|
Exhibit Hall I #35 | |
|
UPRE: Zero-Shot Domain Adaptation for Object Detection via Unified Prompt and Representation Enhancement
Poster Session 1 & Exhibit Hall
Xiao Zhang ⋅ Fei Wei ⋅ Yong Wang ⋅ Wenda Zhao ⋅ Feiyi Li ⋅ Xiangxiang Chu
|
Exhibit Hall I #38 | |
|
HAMSt3R: Human-Aware Multi-view Stereo 3D Reconstruction
Poster Session 2 & Exhibit Hall with Coffee Break
Sara Rojas Martinez ⋅ Matthieu Armando ⋅ Bernard Ghanem ⋅ Philippe Weinzaepfel ⋅ Vincent Leroy ⋅ Grégory Rogez
|
Exhibit Hall I #1 | |
|
Dataset Distillation as Data Compression: A Rate-Utility Perspective
Poster Session 1 & Exhibit Hall
Youneng Bao ⋅ Yiping Liu ⋅ Zhuo Chen ⋅ Yongsheng Liang ⋅ Mu Li ⋅ Kede Ma
|
Exhibit Hall I #39 | |
|
Boosting Generative Adversarial Transferability with Self-supervised Vision Transformer Features
Poster Session 1 & Exhibit Hall
Shangbo Wu ⋅ Yu-an Tan ⋅ Ruinan Ma ⋅ Wencong Ma ⋅ Dehua Zhu ⋅ Yuanzhang Li
|
Exhibit Hall I #40 | |
|
TinyViM: Frequency Decoupling for Tiny Hybrid Vision Mamba
Poster Session 5 & Exhibit Hall
Xiaowen Ma ⋅ Zhen-Liang Ni ⋅ Xinghao Chen
|
Exhibit Hall I #350 | |
|
Open-set Cross Modal Generalization via Multimodal Unified Representation
Poster Session 1 & Exhibit Hall
Hai Huang ⋅ Yan Xia ⋅ Shulei Wang ⋅ Hanting Wang ⋅ Minghui Fang ⋅ Shengpeng Ji ⋅ Sashuai Zhou ⋅ Tao Jin ⋅ Zhou Zhao
|
Exhibit Hall I #41 | |
|
Adversarial Data Augmentation for Single Domain Generalization via Lyapunov Exponent-Guided Optimization
Poster Session 1 & Exhibit Hall
ZUYU ZHANG ⋅ Ning Chen ⋅ Yongshan Liu ⋅ Qinghua Zhang ⋅ Xu Zhang
|
Exhibit Hall I #42 | |
|
Adversarial Robust Memory-Based Continual Learner
Poster Session 1 & Exhibit Hall
Xiaoyue Mi ⋅ Fan Tang ⋅ Zonghan Yang ⋅ Danding Wang ⋅ Juan Cao ⋅ Peng Li ⋅ Yang Liu
|
Exhibit Hall I #43 | |
|
NegRefine: Refining Negative Label-Based Zero-Shot OOD Detection
Poster Session 1 & Exhibit Hall
Amirhossein Ansari ⋅ Ke Wang ⋅ Pulei Xiong
|
Exhibit Hall I #44 | |
|
Divide-and-Conquer for Enhancing Unlabeled Learning, Stability, and Plasticity in Semi-supervised Continual Learning
Poster Session 1 & Exhibit Hall
Yue Duan ⋅ Taicai Chen ⋅ Lei Qi ⋅ Yinghuan Shi
|
Exhibit Hall I #45 | |
|
A Unified Framework to BRIDGE Complete and Incomplete Deep Multi-View Clustering under Non-IID Missing Patterns
Poster Session 1 & Exhibit Hall
Xiaorui Jiang ⋅ Buyun He ⋅ Peng Yuan Zhou ⋅ Xinyue Chen ⋅ Jingcai Guo ⋅ Jie Xu ⋅ Yong Liao
|
Exhibit Hall I #46 | |
|
HumorDB: Can AI understand graphical humor?
Poster Session 1 & Exhibit Hall
Vedaant V Jain ⋅ Gabriel Kreiman ⋅ Felipe Feitosa
|
Exhibit Hall I #47 | |
|
GCAV: A Global Concept Activation Vector Framework for Cross-Layer Consistency in Interpretability
Poster Session 1 & Exhibit Hall
Zhenghao He ⋅ Sanchit Sinha ⋅ Guangzhi Xiong ⋅ Aidong Zhang
|
Exhibit Hall I #48 | |
|
Ensemble Foreground Management for Unsupervised Object Discovery
Ziling Wu ⋅ Armaghan Moemeni ⋅ Praminda Caleb-Solly
|
Exhibit Hall I #44 | |
|
Detect Anything 3D in the Wild
Poster Session 2 & Exhibit Hall with Coffee Break
Hanxue Zhang ⋅ Haoran Jiang ⋅ Qingsong Yao ⋅ Yanan SUN ⋅ Renrui Zhang ⋅ Hao Zhao ⋅ Hongyang Li ⋅ Hongzi Zhu ⋅ Zetong Yang
|
Exhibit Hall I #3 | |
|
Confound from All Sides, Distill with Resilience: Multi-Objective Adversarial Paths to Zero-Shot Robustness
Junhao Dong ⋅ Jiao Liu ⋅ Xinghua Qu ⋅ YEW-SOON ONG
|
Exhibit Hall I #49 | |
|
VolumetricSMPL: A Neural Volumetric Body Model for Efficient Interactions, Contacts, and Collisions
Marko Mihajlovic ⋅ Siwei Zhang ⋅ Gen Li ⋅ KAIFENG ZHAO ⋅ Lea Müller ⋅ Siyu Tang
|
Exhibit Hall I #4 | |
|
Mitigating Object Hallucinations via Sentence-Level Early Intervention
Poster Session 1 & Exhibit Hall
Shangpin Peng ⋅ Senqiao Yang ⋅ Li Jiang ⋅ Zhuotao Tian
|
Exhibit Hall I #50 | |
|
Active Membership Inference Test (aMINT): Enhancing Model Auditability with Multi-Task Learning.
Poster Session 1 & Exhibit Hall
Daniel DeAlcala ⋅ Aythami Morales ⋅ Julian Fierrez ⋅ Gonzalo Mancera ⋅ Ruben Tolosana ⋅ Javier Ortega-Garcia
|
Exhibit Hall I #51 | |
|
One-Shot Knowledge Transfer for Scalable Person Re-Identification
Poster Session 1 & Exhibit Hall
Longhua Li ⋅ Lei Qi ⋅ Xin Geng
|
Exhibit Hall I #53 | |
|
ShortFT: Diffusion Model Alignment via Shortcut-based Fine-Tuning
Poster Session 1 & Exhibit Hall
Xiefan Guo ⋅ Miaomiao Cui ⋅ Liefeng Bo ⋅ Di Huang
|
Exhibit Hall I #54 | |
|
PRISM: Reducing Spurious Implicit Biases in Vision-Language Models with LLM-Guided Embedding Projection
Poster Session 1 & Exhibit Hall
Mahdiyar Molahasani ⋅ Azadeh Motamedi ⋅ Michael Greenspan ⋅ Il-Min Kim ⋅ Ali Etemad
|
Exhibit Hall I #55 | |
|
Open-Unfairness Adversarial Mitigation for Generalized Deepfake Detection
Poster Session 1 & Exhibit Hall
Zhaoyang Li ⋅ Zhu Teng ⋅ Baopeng Zhang ⋅ Jianping Fan
|
Exhibit Hall I #56 | |
|
EA-KD: Entropy-based Adaptive Knowledge Distillation
Poster Session 1 & Exhibit Hall
Chi-Ping Su ⋅ Ching-Hsun Tseng ⋅ Bin Pu ⋅ Lei Zhao ⋅ Jiewen Yang ⋅ Zhuangzhuang Chen ⋅ Shin-Jye Lee
|
Exhibit Hall I #59 | |
|
Structured Policy Optimization: Enhance Large Vision-Language Model via Self-referenced Dialogue
Poster Session 1 & Exhibit Hall
Guohao Sun ⋅ Can Qin ⋅ Yihao Feng ⋅ Zeyuan Chen ⋅ Ran Xu ⋅ Sohail Dianat ⋅ MAJID RABBANI ⋅ Raghuveer Rao ⋅ Zhiqiang Tao
|
Exhibit Hall I #60 | |
|
Seal Your Backdoor with Variational Defense
Poster Session 1 & Exhibit Hall
Ivan Sabolic ⋅ Matej Grcic ⋅ Siniša Šegvić
|
Exhibit Hall I #61 | |
|
Semi-ViM: Bidirectional State Space Model for Mitigating Label Imbalance in Semi-Supervised Learning
Poster Session 1 & Exhibit Hall
Hongyang He ⋅ Hongyang Xie ⋅ Haochen You ⋅ Victor Sanchez
|
Exhibit Hall I #62 | |
|
Integrating Task-Specific and Universal Adapters for Pre-Trained Model-based Class-Incremental Learning
Poster Session 1 & Exhibit Hall
yan wang ⋅ Da-Wei Zhou ⋅ Han-Jia Ye
|
Exhibit Hall I #66 | |
|
Contact-Aware Refinement of Human Pose Pseudo-Ground Truth via Bioimpedance Sensing
Poster Session 2 & Exhibit Hall with Coffee Break
Maria-Paola Forte ⋅ Nikos Athanasiou ⋅ Giulia Ballardini ⋅ Jan Bartels ⋅ Katherine J. Kuchenbecker ⋅ Michael Black
|
Exhibit Hall I #5 | |
|
CODE-CL: Conceptor-Based Gradient Projection for Deep Continual Learning
Poster Session 1 & Exhibit Hall
Marco P. Apolinario ⋅ Sakshi Choudhary ⋅ Kaushik Roy
|
Exhibit Hall I #63 | |
|
SAMO: A Lightweight Sharpness-Aware Approach for Multi-Task Optimization with Joint Global-Local Perturbation
Poster Session 1 & Exhibit Hall
Hao Ban ⋅ Gokul Ram Subramani ⋅ Kaiyi Ji
|
Exhibit Hall I #64 | |
|
Beyond the Limits: Overcoming Negative Correlation of Activation-Based Training-Free NAS
Poster Session 1 & Exhibit Hall
Haidong Kang ⋅ Lianbo Ma ⋅ Pengjun Chen ⋅ Guo Yu ⋅ Xingwei Wang ⋅ Min Huang
|
Exhibit Hall I #65 | |
|
Diffusion Guided Adaptive Augmentation for Generalization in Visual Reinforcement Learning
Poster Session 1 & Exhibit Hall
Jeong Woon Lee ⋅ Hyoseok Hwang
|
Exhibit Hall I #73 | |
|
I Am Big, You Are Little; I Am Right, You Are Wrong
Poster Session 1 & Exhibit Hall
David A Kelly ⋅ Akchunya Chanchal ⋅ Nathan Blake
|
Exhibit Hall I #67 | |
|
Semi-supervised Deep Transfer for Regression without Domain Alignment
Poster Session 1 & Exhibit Hall
Mainak Biswas ⋅ Ambedkar Dukkipati ⋅ Devarajan Sridharan
|
Exhibit Hall I #68 | |
|
DocThinker: Explainable Multimodal Large Language Models with Rule-based Reinforcement Learning for Document Understanding
Poster Session 1 & Exhibit Hall
Wenwen Yu ⋅ Zhibo Yang ⋅ Yuliang Liu ⋅ Xiang Bai
|
Exhibit Hall I #69 | |
|
From Easy to Hard: The MIR Benchmark for Progressive Interleaved Multi-Image Reasoning
Poster Session 1 & Exhibit Hall
Hang Du ⋅ Jiayang Zhang ⋅ Guoshun Nan ⋅ Wendi Deng ⋅ Zhenyan Chen ⋅ Chenyang Zhang ⋅ Wang Xiao ⋅ Shan Huang ⋅ Yuqi Pan ⋅ Tao Qi ⋅ Sicong Leng
|
Exhibit Hall I #71 | |
|
Fast Globally Optimal and Geometrically Consistent 3D Shape Matching
Paul Roetzer ⋅ Florian Bernard
|
Exhibit Hall I #78 | |
|
A Framework for Double-Blind Federated Adaptation of Foundation Models
Poster Session 1 & Exhibit Hall
Nurbek Tastan ⋅ Karthik Nandakumar
|
Exhibit Hall I #79 | |
|
VGGSounder: Audio-Visual Evaluations for Foundation Models
Poster Session 1 & Exhibit Hall
Daniil Zverev ⋅ Thaddäus Wiedemer ⋅ Ameya Prabhu ⋅ Matthias Bethge ⋅ Wieland Brendel ⋅ A. Sophia Koepke
|
Exhibit Hall I #88 | |
|
EA-Vit: Efficient Adaptation for Elastic Vision Transformer
Poster Session 1 & Exhibit Hall
Chen Zhu ⋅ Wangbo Zhao ⋅ Huiwen Zhang ⋅ Yuhao Zhou ⋅ Weidong Tang ⋅ Shuo Wang ⋅ Zhihang Yuan ⋅ Yuzhang Shang ⋅ Xiaojiang Peng ⋅ Kai Wang ⋅ Dawei Yang
|
Exhibit Hall I #89 | |
|
Web Artifact Attacks Disrupt Vision Language Models
Poster Session 1 & Exhibit Hall
Maan Qraitem ⋅ Piotr Teterwak ⋅ Kate Saenko ⋅ Bryan Plummer
|
Exhibit Hall I #90 | |
|
Feature Coding in the Era of Large Models: Dataset, Test Conditions, and Benchmark
Poster Session 1 & Exhibit Hall
Changsheng Gao ⋅ Yifan Ma ⋅ Qiaoxi Chen ⋅ Xu yenan ⋅ Dong Liu ⋅ Weisi Lin
|
Exhibit Hall I #92 | |
|
Generate, Refine, and Encode: Leveraging Synthesized Novel Samples for On-the-Fly Fine-Grained Category Discovery
Poster Session 1 & Exhibit Hall
Xiao Liu ⋅ Nan Pu ⋅ Haiyang Zheng ⋅ Wenjing Li ⋅ Nicu Sebe ⋅ Zhun Zhong
|
Exhibit Hall I #93 | |
|
MMOne: Representing Multiple Modalities in One Scene
Poster Session 1 & Exhibit Hall
Zhifeng Gu ⋅ Bing WANG
|
Exhibit Hall I #94 | |
|
MM-IFEngine: Towards Multimodal Instruction Following
Poster Session 1 & Exhibit Hall
Shengyuan Ding ⋅ Wu Shenxi ⋅ Xiangyu Zhao ⋅ Yuhang Zang ⋅ Haodong Duan ⋅ Xiaoyi Dong ⋅ Pan Zhang ⋅ Yuhang Cao ⋅ Dahua Lin ⋅ Jiaqi Wang
|
Exhibit Hall I #95 | |
|
RainbowPrompt: Diversity-Enhanced Prompt-Evolving for Continual Learning
Poster Session 1 & Exhibit Hall
Kiseong Hong ⋅ Gyeong-Hyeon Kim ⋅ Eunwoo Kim
|
Exhibit Hall I #98 | |
|
VisionMath: Vision-Form Mathematical Problem-Solving
Poster Session 1 & Exhibit Hall
Zongyang Ma ⋅ Yuxin Chen ⋅ Ziqi Zhang ⋅ Zhongang Qi ⋅ Chunfeng Yuan ⋅ Shaojie Zhu ⋅ Chengxiang Zhuo ⋅ Bing Li ⋅ Ye Liu ⋅ Zang Li ⋅ Ying Shan ⋅ Weiming Hu
|
Exhibit Hall I #101 | |
|
Dataset Distillation via the Wasserstein Metric
Poster Session 1 & Exhibit Hall
Haoyang Liu ⋅ Peiran Wang ⋅ Yijiang Li ⋅ Tiancheng Xing ⋅ Vibhu Dalal ⋅ Luwei LI ⋅ Jingrui He ⋅ Haohan Wang
|
Exhibit Hall I #105 | |
|
A Good Teacher Adapts Their Knowledge for Distillation
Poster Session 1 & Exhibit Hall
Chengyao Qian ⋅ Trung Le ⋅ Mehrtash Harandi
|
Exhibit Hall I #108 | |
|
Quanta Neural Networks: From Photons to Perception
Poster Session 2 & Exhibit Hall with Coffee Break
Varun Sundar ⋅ Tianyi Zhang ⋅ Sacha Jungerman ⋅ Mohit Gupta
|
Exhibit Hall I #7 | |
|
AdaDrive: Self-Adaptive Slow-Fast System for Language-Grounded Autonomous Driving
Poster Session 2 & Exhibit Hall with Coffee Break
Ruifei Zhang ⋅ Junlin Xie ⋅ Wei Zhang ⋅ Weikai Chen ⋅ Xiao Tan ⋅ Xiang Wan ⋅ Guanbin Li
|
Exhibit Hall I #9 | |
|
Consistent Time-of-Flight Depth Denoising via Graph-Informed Geometric Attention
Poster Session 2 & Exhibit Hall with Coffee Break
Weida Wang ⋅ Changyong He ⋅ Jin Zeng ⋅ Di Qiu
|
Exhibit Hall I #16 | |
|
SIGMAN: Scaling 3D Human Gaussian Generation with Millions of Assets
Poster Session 2 & Exhibit Hall with Coffee Break
Yuhang Yang ⋅ Fengqi Liu ⋅ Yixing Lu ⋅ Qin Zhao ⋅ Pingyu Wu ⋅ Wei Zhai ⋅ Ran Yi ⋅ Yang Cao ⋅ Lizhuang Ma ⋅ Zheng-Jun Zha ⋅ Junting Dong
|
Exhibit Hall I #10 | |
|
Depth Any Event Stream: Enhancing Event-based Monocular Depth Estimation via Dense-to-Sparse Distillation
Poster Session 2 & Exhibit Hall with Coffee Break
Jinjing Zhu ⋅ Tianbo Pan ⋅ Zidong Cao ⋅ Yexin Liu ⋅ James Kwok ⋅ Hui Xiong
|
Exhibit Hall I #12 | |
|
Evading Data Provenance in Deep Neural Networks
Hongyu Zhu ⋅ Sichu Liang ⋅ Wenwen Wang ⋅ Zhuomeng Zhang ⋅ Fangqi Li ⋅ Shi-Lin Wang
|
Exhibit Hall I #109 | |
|
WildSeg3D: Segment Any 3D Objects in the Wild from 2D Images
Poster Session 2 & Exhibit Hall with Coffee Break
Yansong Guo ⋅ Jie Hu ⋅ Yansong Qu ⋅ Liujuan Cao
|
Exhibit Hall I #14 | |
|
AllTracker: Efficient Dense Point Tracking at High Resolution
Poster Session 2 & Exhibit Hall with Coffee Break
Adam Harley ⋅ Yang You ⋅ Yang Zheng ⋅ Xinglong Sun ⋅ Nikhil Raghuraman ⋅ Sheldon Liang ⋅ Yunqi Gu ⋅ Wen-Hsuan Chu ⋅ Suya You ⋅ Achal Dave ⋅ Rares Ambrus ⋅ Katerina Fragkiadaki ⋅ Leonidas Guibas
|
Exhibit Hall I #22 | |
|
Extending Foundational Monocular Depth Estimators to Fisheye Cameras with Calibration Tokens
Poster Session 2 & Exhibit Hall with Coffee Break
Suchisrit Gangopadhyay ⋅ Jung Hee Kim ⋅ Xien Chen ⋅ Patrick Rim ⋅ Hyoungseob Park ⋅ Alex Wong
|
Exhibit Hall I #17 | |
|
MPBR: Multimodal Progressive Bidirectional Reasoning for Open-Set Fine-Grained Recognition
Poster Session 1 & Exhibit Hall
Junfu Tan ⋅ Peiguang Jing ⋅ Yu Zhu ⋅ YU LIU
|
Exhibit Hall I #112 | |
|
MAVias: Mitigate any Visual Bias
Poster Session 1 & Exhibit Hall
Ioannis Sarridis ⋅ Christos Koutlis ⋅ Symeon Papadopoulos ⋅ Christos Diou
|
Exhibit Hall I #111 | |
|
OpenSubstance: A High-quality Measured Dataset of Multi-View and -Lighting Images and Shapes
Poster Session 2 & Exhibit Hall with Coffee Break
Fan Pei ⋅ jinchen bai ⋅ Xiang Feng ⋅ Zoubin Bi ⋅ Kun Zhou ⋅ Hongzhi Wu
|
Exhibit Hall I #19 | |
|
DIP: Unsupervised Dense In-Context Post-training of Visual Representations
Poster Session 1 & Exhibit Hall
Sophia Sirko-Galouchenko ⋅ Spyros Gidaris ⋅ Antonin Vobecky ⋅ Andrei Bursuc ⋅ Nicolas THOME
|
Exhibit Hall I #399 | |
|
Towards Higher Effective Rank in Parameter-Efficient Fine-tuning using Khatri-Rao Product
Poster Session 1 & Exhibit Hall
Paul Albert ⋅ Frederic Zhang ⋅ Hemanth Saratchandran ⋅ Anton Hengel ⋅ Ehsan Abbasnejad
|
Exhibit Hall I #113 | |
|
PseudoMapTrainer: Learning Online Mapping without HD Maps
Poster Session 2 & Exhibit Hall with Coffee Break
Christian Löwens ⋅ Thorben Funke ⋅ Jingchao Xie ⋅ Alexandru Condurache
|
Exhibit Hall I #23 | |
|
LONG3R: Long Sequence Streaming 3D Reconstruction
Poster Session 2 & Exhibit Hall with Coffee Break
Zhuoguang Chen ⋅ Minghui Qin ⋅ Tianyuan Yuan ⋅ Zhe Liu ⋅ Hang Zhao
|
Exhibit Hall I #24 | |
|
VGMamba: Attribute-to-Location Clue Reasoning for Quantity-Agnostic 3D Visual Grounding
Poster Session 2 & Exhibit Hall with Coffee Break
Zhu Yihang ⋅ Jinhao Zhang ⋅ Yuxuan Wang ⋅ Aming WU ⋅ Cheng Deng
|
Exhibit Hall I #26 | |
|
AnnofreeOD: Detecting All Classes at Low Frame Rates Without Human Annotations
Poster Session 2 & Exhibit Hall with Coffee Break
Boyi Sun ⋅ Yuhang Liu ⋅ Houxin He ⋅ Yonglin Tian ⋅ Fei-Yue Wang
|
Exhibit Hall I #28 | |
|
Federated Continual Instruction Tuning
Poster Session 1 & Exhibit Hall
Haiyang Guo ⋅ Fanhu Zeng ⋅ Fei Zhu ⋅ Wenzhuo Liu ⋅ Da-Han Wang ⋅ Jian Xu ⋅ Xu-Yao Zhang ⋅ Cheng-Lin Liu
|
Exhibit Hall I #116 | |
|
TWIST & SCOUT: Grounding Multimodal LLM-Experts by Forget-Free Tuning
Poster Session 1 & Exhibit Hall
Aritra Bhowmik ⋅ Mohammad Mahdi Derakhshani ⋅ Dennis Koelma ⋅ Yuki Asano ⋅ Martin R. Oswald ⋅ Cees Snoek
|
Exhibit Hall I #119 | |
|
Generate, Transduct, Adapt: Iterative Transduction with VLMs
Poster Session 1 & Exhibit Hall
Oindrila Saha ⋅ Logan Lawrence ⋅ Grant Horn ⋅ Subhransu Maji
|
Exhibit Hall I #120 | |
|
BabyVLM: Data-Efficient Pretraining of VLMs Inspired by Infant Learning
Poster Session 1 & Exhibit Hall
Shengao Wang ⋅ Arjun Chandra ⋅ Aoming Liu ⋅ Boqing Gong ⋅ Venkatesh Saligrama
|
Exhibit Hall I #121 | |
|
Controlling Multimodal LLMs via Reward-guided Decoding
Poster Session 1 & Exhibit Hall
Oscar Mañas ⋅ Pierluca D'Oro ⋅ Koustuv Sinha ⋅ Adriana Romero-Soriano ⋅ Michal Drozdzal ⋅ Aishwarya Agrawal
|
Exhibit Hall I #122 | |
|
Improving Large Vision and Language Models by Learning from a Panel of Peers
Poster Session 1 & Exhibit Hall
Jefferson Hernandez ⋅ Jing Shi ⋅ Simon Jenni ⋅ Vicente Ordonez ⋅ Kushal Kafle
|
Exhibit Hall I #123 | |
|
CE-FAM: Concept-Based Explanation via Fusion of Activation Maps
Poster Session 1 & Exhibit Hall
Michihiro Kuroki ⋅ Toshihiko Yamasaki
|
Exhibit Hall I #124 | |
|
PEFTDiff: Diffusion-Guided Transferability Estimation for Parameter-Efficient Fine-Tuning
Poster Session 1 & Exhibit Hall
PRAFFUL KHOBA ⋅ Zijian Wang ⋅ Chetan Arora ⋅ Mahsa Baktashmotlagh
|
Exhibit Hall I #128 | |
|
Towards Privacy-preserved Pre-training of Remote Sensing Foundation Models with Federated Mutual-guidance Learning
Poster Session 1 & Exhibit Hall
Jieyi Tan ⋅ Chengwei Zhang ⋅ Bo Dang ⋅ Yansheng Li
|
Exhibit Hall I #163 | |
|
AVTrustBench: Assessing and Enhancing Reliability and Robustness in Audio-Visual LLMs
Poster Session 1 & Exhibit Hall
Sanjoy Chowdhury ⋅ Sayan Nag ⋅ Subhrajyoti Dasgupta ⋅ Yaoting Wang ⋅ Mohamed Elhoseiny ⋅ Ruohan Gao ⋅ Dinesh Manocha
|
Exhibit Hall I #141 | |
|
Verbalized Representation Learning for Interpretable Few-Shot Generalization
Poster Session 1 & Exhibit Hall
Cheng-Fu Yang ⋅ Da Yin ⋅ Wenbo Hu ⋅ Heng Ji ⋅ Nanyun Peng ⋅ Bolei Zhou ⋅ Kai-Wei Chang
|
Exhibit Hall I #142 | |
|
RMultiplex200K: Toward Reliable Multimodal Process Supervision for Visual Language Models on Telecommunications
Poster Session 1 & Exhibit Hall
Sijia Chen ⋅ Bin Song
|
Exhibit Hall I #150 | |
|
Equipping Vision Foundation Model with Mixture of Experts for Out-of-Distribution Detection
Poster Session 1 & Exhibit Hall
Shizhen Zhao ⋅ Jiahui Liu ⋅ Xin Wen ⋅ Haoru Tan ⋅ Xiaojuan Qi
|
Exhibit Hall I #158 | |
|
Class-Wise Federated Averaging for Efficient Personalization
Poster Session 1 & Exhibit Hall
Gyuejeong Lee ⋅ Daeyoung Choi
|
Exhibit Hall I #160 | |
|
Multi-view Gaze Target Estimation
Poster Session 2 & Exhibit Hall with Coffee Break
Qiaomu Miao ⋅ Vivek Golani ⋅ Jingyi Xu ⋅ Progga Paromita Dutta ⋅ Minh Hoai ⋅ Dimitris Samaras
|
Exhibit Hall I #33 | |
|
EFTViT: Efficient Federated Training of Vision Transformers with Masked Images on Resource-Constrained Clients
Poster Session 1 & Exhibit Hall
meihan wu ⋅ Tao Chang ⋅ Cui Miao ⋅ Jie Zhou ⋅ Chun Li ⋅ Xiangyu Xu ⋅ Ming Li ⋅ Xiaodong Wang
|
Exhibit Hall I #164 | |
|
ODP-Bench: Benchmarking Out-of-Distribution Performance Prediction
Poster Session 1 & Exhibit Hall
Han Yu ⋅ Kehan Li ⋅ Dongbai Li ⋅ Yue He ⋅ Xingxuan Zhang ⋅ Peng Cui
|
Exhibit Hall I #167 | |
|
R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization
Poster Session 1 & Exhibit Hall
Jingyi Zhang ⋅ Jiaxing Huang ⋅ Huanjin Yao ⋅ Shunyu Liu ⋅ Xikun ZHANG ⋅ Shijian Lu ⋅ Dacheng Tao
|
Exhibit Hall I #168 | |
|
Human-Object Interaction from Human-Level Instructions
Poster Session 3 & Exhibit Hall
Zhen Wu ⋅ Jiaman Li ⋅ Pei Xu ⋅ Karen Liu
|
Exhibit Hall I #110 | |
|
FG-OrIU: Towards Better Forgetting via Feature-Gradient Orthogonality for Incremental Unlearning
Poster Session 1 & Exhibit Hall
qian feng ⋅ Jiahang Tu ⋅ Mintong Kang ⋅ Hanbin Zhao ⋅ Chao Zhang ⋅ Hui Qian
|
Exhibit Hall I #177 | |
|
ViT-EnsembleAttack: Augmenting Ensemble Models for Stronger Adversarial Transferability in Vision Transformers
Poster Session 1 & Exhibit Hall
Hanwen Cao ⋅ Haobo Lu ⋅ Xiaosen Wang ⋅ Kun He
|
Exhibit Hall I #181 | |
|
Instruction-Oriented Preference Alignment for Enhancing Multi-Modal Comprehension Capability of MLLMs
Poster Session 1 & Exhibit Hall
Zitian Wang ⋅ Yue Liao ⋅ RONG KANG ⋅ Fengyun Rao ⋅ Yibo Yang ⋅ Si Liu
|
Exhibit Hall I #182 | |
|
Visual-RFT: Visual Reinforcement Fine-Tuning
Poster Session 1 & Exhibit Hall
Ziyu Liu ⋅ Zeyi Sun ⋅ Yuhang Zang ⋅ Xiaoyi Dong ⋅ Yuhang Cao ⋅ Haodong Duan ⋅ Dahua Lin ⋅ Jiaqi Wang
|
Exhibit Hall I #184 | |
|
Enhancing Transformers Through Conditioned Embedded Tokens
Poster Session 1 & Exhibit Hall
Hemanth Saratchandran ⋅ Simon Lucey
|
Exhibit Hall I #449 | |
|
Jailbreaking Multimodal Large Language Models via Shuffle Inconsistency
Poster Session 1 & Exhibit Hall
Shiji Zhao ⋅ Ranjie Duan ⋅ Fengxiang Wang ⋅ Chi Chen ⋅ Caixin KANG ⋅ Shouwei Ruan ⋅ Jialing Tao ⋅ YueFeng Chen ⋅ Hui Xue ⋅ Xingxing Wei
|
Exhibit Hall I #185 | |
|
Large Learning Rates Simultaneously Achieve Robustness to Spurious Correlations and Compressibility
Poster Session 1 & Exhibit Hall
Melih Barsbey ⋅ Lucas Prieto ⋅ Stefanos Zafeiriou ⋅ Tolga Birdal
|
Exhibit Hall I #186 | |
|
Dynamic Multi-Layer Null Space Projection for Vision-Language Continual Learning
Poster Session 1 & Exhibit Hall
Borui Kang ⋅ Lei Wang ⋅ Zhiping Wu ⋅ Tao Feng ⋅ Yawen Li ⋅ Yang Gao ⋅ Wenbin Li
|
Exhibit Hall I #188 | |
|
LLaVA-CoT: Let Vision Language Models Reason Step-by-Step
Poster Session 1 & Exhibit Hall
Guowei Xu ⋅ Peng Jin ⋅ ZiangWu ZiangWu ⋅ Li Hao ⋅ Yibing Song ⋅ Lichao Sun ⋅ Li Yuan
|
Exhibit Hall I #189 | |
|
Visual Modality Prompt for Adapting Vision-Language Object Detectors
Poster Session 1 & Exhibit Hall
Heitor Rapela Medeiros ⋅ Atif Belal ⋅ Srikanth Muralidharan ⋅ Eric Granger ⋅ Marco Pedersoli
|
Exhibit Hall I #197 | |
|
What's in a Latent? Leveraging Diffusion Latent Space for Domain Generalization
Poster Session 1 & Exhibit Hall
Xavier Thomas ⋅ Deepti Ghadiyaram
|
Exhibit Hall I #198 | |
|
Prototype Guided Backdoor Defense via Activation Space Manipulation
Poster Session 1 & Exhibit Hall
Venkat Adithya Amula ⋅ Sunayana Samavedam ⋅ Saurabh Saini ⋅ Avani Gupta ⋅ P J Narayanan
|
Exhibit Hall I #199 | |
|
RIPE: Reinforcement Learning on Unlabeled Image Pairs for Robust Keypoint Extraction
Poster Session 1 & Exhibit Hall
Johannes Künzel ⋅ Anna Hilsmann ⋅ Peter Eisert
|
Exhibit Hall I #457 | |
|
Analyzing Finetuning Representation Shift for Multimodal LLMs Steering
Poster Session 1 & Exhibit Hall
Pegah KHAYATAN ⋅ Mustafa Shukor ⋅ Jayneel Parekh ⋅ Arnaud Dapogny ⋅ Matthieu Cord
|
Exhibit Hall I #200 | |
|
Efficient Unsupervised Shortcut Learning Detection and Mitigation in Transformers
Poster Session 1 & Exhibit Hall
Lukas Kuhn ⋅ sari sadiya ⋅ Jörg Schlötterer ⋅ Florian Buettner ⋅ Christin Seifert ⋅ Gemma Roig
|
Exhibit Hall I #201 | |
|
VSP: Diagnosing the Dual Challenges of Perception and Reasoning in Spatial Planning Tasks for MLLMs
Poster Session 1 & Exhibit Hall
Qiucheng Wu ⋅ Handong Zhao ⋅ Michael Saxon ⋅ Trung Bui ⋅ William Yang Wang ⋅ Yang Zhang ⋅ Shiyu Chang
|
Exhibit Hall I #206 | |
|
Multi-Cache Enhanced Prototype Learning for Test-Time Generalization of Vision-Language Models
Poster Session 1 & Exhibit Hall
Xinyu Chen ⋅ Haotian Zhai ⋅ Can Zhang ⋅ XIUPENG SHI ⋅ Ruirui Li
|
Exhibit Hall I #207 | |
|
AVAM: a Universal Training-free Adaptive Visual Anchoring Embedded into Multimodal Large Language Model for Multi-image Question Answering
Poster Session 1 & Exhibit Hall
Kang Zeng ⋅ Guojin Zhong ⋅ Jintao Cheng ⋅ Jin Yuan ⋅ Zhiyong Li
|
Exhibit Hall I #208 | |
|
R1-Onevision: Advancing Generalized Multimodal Reasoning through Cross-Modal Formalization
Poster Session 1 & Exhibit Hall
yi yang ⋅ Xiaoxuan He ⋅ Hongkun Pan ⋅ Xiyan Jiang ⋅ Yan Deng ⋅ Xingtao Yang ⋅ Haoyu Lu ⋅ Dacheng Yin ⋅ Fengyun Rao ⋅ Minfeng Zhu ⋅ Bo Zhang ⋅ Wei Chen
|
Exhibit Hall I #216 | |
|
The Inter-Intra Modal Measure: A Predictive Lens on Fine-Tuning Outcomes in Vision-Language Models
Poster Session 1 & Exhibit Hall
Laura Niss ⋅ Kevin Vogt-Lowell ⋅ Theodoros Tsiligkaridis
|
Exhibit Hall I #218 | |
|
What to Distill? Fast Knowledge Distillation with Adaptive Sampling
Byungchul Chae ⋅ Seonyeong Heo
|
Exhibit Hall I #219 | |
|
Flexi-FSCIL: Adaptive Knowledge Retention for Breaking the Stability-Plasticity Dilemma in Few-Shot Class-Incremental Learning
Poster Session 1 & Exhibit Hall
Wufei Xie ⋅ Yalin Wang ⋅ Chenliang Liu ⋅ Zhaohui Jiang ⋅ Xue Yang
|
Exhibit Hall I #223 | |
|
Multispectral Demosaicing via Dual Cameras
SaiKiran Tedla ⋅ Junyong Lee ⋅ Beixuan Yang ⋅ Mahmoud Afifi ⋅ Michael Brown
|
Exhibit Hall I #36 | |
|
Generative Modeling of Shape-Dependent Self-Contact Human Poses
Poster Session 2 & Exhibit Hall with Coffee Break
Takehiko Ohkawa ⋅ Jihyun Lee ⋅ Shunsuke Saito ⋅ Jason Saragih ⋅ Fabian Prada ⋅ Yichen Xu ⋅ Shoou-I Yu ⋅ Ryosuke Furuta ⋅ Yoichi Sato ⋅ Takaaki Shiratori
|
Exhibit Hall I #38 | |
|
Met2Net: A Decoupled Two-Stage Spatio-Temporal Forecasting Model for Complex Meteorological Systems
Poster Session 2 & Exhibit Hall with Coffee Break
Shaohan Li ⋅ Hao Yang ⋅ Min Chen ⋅ Xiaolin Qin
|
Exhibit Hall I #41 | |
|
TriDi: Trilateral Diffusion of 3D Humans, Objects, and Interactions
Poster Session 2 & Exhibit Hall with Coffee Break
Ilya A. Petrov ⋅ Riccardo Marin ⋅ Julian Chibane ⋅ Gerard Pons-Moll
|
Exhibit Hall I #47 | |
|
Beyond RGB: Adaptive Parallel Processing for RAW Object Detection
Poster Session 2 & Exhibit Hall with Coffee Break
Shani Gamrian ⋅ Hila Barel ⋅ Feiran Li ⋅ Masakazu Yoshimura ⋅ Daisuke Iso
|
Exhibit Hall I #49 | |
|
egoPPG: Heart Rate Estimation from Eye-Tracking Cameras in Egocentric Systems to Benefit Downstream Vision Tasks
Poster Session 2 & Exhibit Hall with Coffee Break
Björn Braun ⋅ Rayan Armani ⋅ Manuel Meier ⋅ Max Moebus ⋅ Christian Holz
|
Exhibit Hall I #52 | |
|
PoseSyn: Synthesizing Diverse 3D Pose Data from In-the-Wild 2D Data
Poster Session 2 & Exhibit Hall with Coffee Break
CHANGHEE YANG ⋅ Hyeonseop Song ⋅ Seokhun Choi ⋅ Seungwoo Lee ⋅ Jaechul Kim ⋅ Hoseok Do
|
Exhibit Hall I #55 | |
|
Diffusion-Based Extreme High-speed Scenes Reconstruction with the Complementary Vision Sensor
Poster Session 2 & Exhibit Hall with Coffee Break
Yapeng Meng ⋅ Yihan Lin ⋅ Taoyi Wang ⋅ Yuguo Chen ⋅ Lijian Wang ⋅ Rong Zhao
|
Exhibit Hall I #63 | |
|
TorchAdapt: Towards Light-Agnostic Real-Time Visual Perception
Poster Session 2 & Exhibit Hall with Coffee Break
Khurram Azeem Hashmi ⋅ Karthik Suresh ⋅ Didier Stricker ⋅ Muhammad Zeshan Afzal
|
Exhibit Hall I #58 | |
|
Human-in-the-Loop Local Corrections of 3D Scene Layouts via Infilling
Christopher Xie ⋅ Armen Avetisyan ⋅ Henry Howard-Jenkins ⋅ Yawar Siddiqui ⋅ Julian Straub ⋅ Richard Newcombe ⋅ Vasileios Balntas ⋅ Jakob Engel
|
Exhibit Hall I #59 | |
|
POMATO: Marrying Pointmap Matching with Temporal Motions for Dynamic 3D Reconstruction
Poster Session 2 & Exhibit Hall with Coffee Break
Songyan Zhang ⋅ Yongtao Ge ⋅ Jinyuan Tian ⋅ Guangkai Xu ⋅ Hao Chen ⋅ Chen Lv ⋅ Chunhua Shen
|
Exhibit Hall I #61 | |
|
Boosting Class Representation via Semantically Related Instances for Robust Long-Tailed Learning with Noisy Labels
Poster Session 1 & Exhibit Hall
Yuhang Li ⋅ Zhuying Li ⋅ Yuheng Jia
|
Exhibit Hall I #134 | |
|
CAT: A Unified Click-and-Track Framework for Realistic Tracking
Poster Session 2 & Exhibit Hall with Coffee Break
Yongsheng Yuan ⋅ Jie Zhao ⋅ Dong Wang ⋅ Huchuan Lu
|
Exhibit Hall I #62 | |
|
DepR: Depth Guided Single-view Scene Reconstruction with Instance-level Diffusion
Poster Session 2 & Exhibit Hall with Coffee Break
Qingcheng Zhao ⋅ Xiang Zhang ⋅ Haiyang Xu ⋅ Zeyuan Chen ⋅ Jianwen Xie ⋅ Yuan Gao ⋅ Zhuowen Tu
|
Exhibit Hall I #65 | |
|
Invisible Watermarks, Visible Gains: Steering Machine Unlearning with Bi-Level Watermarking Design
Poster Session 1 & Exhibit Hall
Yuhao Sun ⋅ Yihua Zhang ⋅ Gaowen Liu ⋅ Hongtao Xie ⋅ Sijia Liu
|
Exhibit Hall I #220 | |
|
DiffuMatch: Category-Agnostic Spectral Diffusion Priors for Robust Non-rigid Shape Matching
Poster Session 2 & Exhibit Hall with Coffee Break
Emery Pierson ⋅ Lei Li ⋅ Angela Dai ⋅ Maks Ovsjanikov
|
Exhibit Hall I #67 | |
|
SAC-GNC: SAmple Consensus for adaptive Graduated Non-Convexity
Valter Piedade ⋅ Chitturi Sidhartha ⋅ José Gaspar ⋅ Venu Madhav Govindu ⋅ Pedro Miraldo
|
Exhibit Hall I #70 | |
|
AstroLoc: Robust Space to Ground Image Localizer
Poster Session 2 & Exhibit Hall with Coffee Break
Gabriele Berton ⋅ Alex Stoken ⋅ Carlo Masone
|
Exhibit Hall I #73 | |
|
Do It Yourself: Learning Semantic Correspondence from Pseudo-Labels
Poster Session 2 & Exhibit Hall with Coffee Break
Olaf Dünkel ⋅ Thomas Wimmer ⋅ Christian Theobalt ⋅ Christian Rupprecht ⋅ Adam Kortylewski
|
Exhibit Hall I #77 | |
|
Stochastic Interpolants for Revealing Stylistic Flows across the History of Art
Poster Session 2 & Exhibit Hall with Coffee Break
Pingchuan Ma ⋅ Ming Gui ⋅ Johannes Schusterbauer ⋅ Xiaopei Yang ⋅ Olga Grebenkova ⋅ Vincent Tao Hu ⋅ Björn Ommer
|
Exhibit Hall I #80 | |
|
Is Tracking really more challenging in First Person Egocentric Vision?
Matteo Dunnhofer ⋅ Zaira Manigrasso ⋅ Christian Micheloni
|
Exhibit Hall I #81 | |
|
VLDrive: Vision-Augmented Lightweight MLLMs for Efficient Language-grounded Autonomous Driving
Poster Session 2 & Exhibit Hall with Coffee Break
Ruifei Zhang ⋅ Wei Zhang ⋅ Xiao Tan ⋅ Sibei Yang ⋅ Xiang Wan ⋅ Xiaonan Luo ⋅ Guanbin Li
|
Exhibit Hall I #85 | |
|
Rethinking Cross-Modal Interaction in Multimodal Diffusion Transformers
Poster Session 2 & Exhibit Hall with Coffee Break
Zhengyao Lyu ⋅ Tianlin Pan ⋅ Chenyang Si ⋅ Zhaoxi Chen ⋅ Wangmeng Zuo ⋅ Ziwei Liu ⋅ Kwan-Yee K. Wong
|
Exhibit Hall I #86 | |
|
Toward Material-Agnostic System Identification from Videos
Poster Session 2 & Exhibit Hall with Coffee Break
Yizhou Zhao ⋅ Haoyu Chen ⋅ Chunjiang Liu ⋅ Zhenyang Li ⋅ Charles Herrmann ⋅ Junhwa Hur ⋅ Yinxiao Li ⋅ Ming-Hsuan Yang ⋅ Bhiksha Raj ⋅ Min Xu
|
Exhibit Hall I #87 | |
|
MagicHOI: Leveraging 3D Priors for Accurate Hand-object Reconstruction from Short Monocular Video Clips
Poster Session 2 & Exhibit Hall with Coffee Break
SHIBO WANG ⋅ Haonan He ⋅ Maria Parelli ⋅ Christoph Gebhardt ⋅ Zicong Fan ⋅ Jie Song
|
Exhibit Hall I #88 | |
|
Boosting Multi-View Indoor 3D Object Detection via Adaptive 3D Volume Construction
Poster Session 2 & Exhibit Hall with Coffee Break
Runmin Zhang ⋅ Zhu Yu ⋅ Si-Yuan Cao ⋅ Lingyu Zhu ⋅ Guangyi Zhang ⋅ Xiaokai Bai ⋅ Hui-liang Shen
|
Exhibit Hall I #90 | |
|
ETA: Energy-based Test-time Adaptation for Depth Completion
Poster Session 2 & Exhibit Hall with Coffee Break
Younjoon Chung ⋅ Hyoungseob Park ⋅ Patrick Rim ⋅ Xiaoran Zhang ⋅ Jihe He ⋅ Ziyao Zeng ⋅ Safa Cicek ⋅ Byung-Woo Hong ⋅ James Duncan ⋅ Alex Wong
|
Exhibit Hall I #92 | |
|
CoTracker3: Simpler and Better Point Tracking by Pseudo-Labelling Real Videos
Nikita Karaev ⋅ Iurii Makarov ⋅ Jianyuan Wang ⋅ Natalia Neverova ⋅ Andrea Vedaldi ⋅ Christian Rupprecht
|
Exhibit Hall I #93 | |
|
SceneMI: Motion In-betweening for Modeling Human-Scene Interaction
Inwoo Hwang ⋅ Bing Zhou ⋅ Young Min Kim ⋅ Jian Wang ⋅ chuan guo
|
Exhibit Hall I #95 | |
|
DEPTHOR: Depth Enhancement from a Practical Light-Weight dToF Sensor and RGB Image
Poster Session 2 & Exhibit Hall with Coffee Break
Jijun Xiang ⋅ Xuan Zhu ⋅ Xianqi Wang ⋅ Yu Wang ⋅ Hong Zhang ⋅ Fei Guo ⋅ Xin Yang
|
Exhibit Hall I #101 | |
|
GeoExplorer: Active Geo-localization with Curiosity-Driven Exploration
Poster Session 2 & Exhibit Hall with Coffee Break
Li Mi ⋅ Manon Béchaz ⋅ Zeming Chen ⋅ Antoine Bosselut ⋅ Devis Tuia
|
Exhibit Hall I #103 | |
|
ROADWork: A Dataset and Benchmark for Learning to Recognize, Observe, Analyze and Drive Through Work Zones
Poster Session 2 & Exhibit Hall with Coffee Break
Anurag Ghosh ⋅ Shen Zheng ⋅ Robert Tamburo ⋅ Khiem Vuong ⋅ Juan Alvarez-Padilla ⋅ Hailiang Zhu ⋅ Nicholas Dunn ⋅ Michael Cardei ⋅ Christoph Mertz ⋅ Srinivasa Narasimhan
|
Exhibit Hall I #104 | |
|
RoMo: Robust Motion Segmentation Improves Structure from Motion
Poster Session 2 & Exhibit Hall with Coffee Break
Lily Goli ⋅ Sara Sabour ⋅ Mark Matthews ⋅ Marcus Brubaker ⋅ Dmitry Lagun ⋅ Alec Jacobson ⋅ David Fleet ⋅ Saurabh Saxena ⋅ Andrea Tagliasacchi
|
Exhibit Hall I #106 | |
|
Hints of Prompt: Enhancing Visual Representation for Multimodal LLMs in Autonomous Driving
Poster Session 2 & Exhibit Hall with Coffee Break
Hao Zhou ⋅ Zhanning Gao ⋅ Zhili Chen ⋅ Maosheng Ye ⋅ Qifeng Chen ⋅ Tongyi Cao ⋅ Honggang Qi
|
Exhibit Hall I #107 | |
|
Learning Large Motion Estimation from Intermediate Representations with a High-Resolution Optical Flow Dataset Featuring Long-Range Dynamic Motion
Hoonhee Cho ⋅ Yuhwan Jeong ⋅ Kuk-Jin Yoon
|
Exhibit Hall I #108 | |
|
Robust Low-light Scene Restoration via Illumination Transition
Poster Session 2 & Exhibit Hall with Coffee Break
Ze Li ⋅ Feng Zhang ⋅ Xiatian Zhu ⋅ Zhang Meng ⋅ Yanghong Zhou ⋅ P.Y. Mok
|
Exhibit Hall I #109 | |
|
Towards Real Unsupervised Anomaly Detection Via Confident Meta-Learning
Poster Session 1 & Exhibit Hall
Muhammad Aqeel ⋅ Shakiba Sharifi ⋅ Marco Cristani ⋅ Francesco Setti
|
Exhibit Hall I #456 | |
|
CCMNet: Leveraging Calibrated Color Correction Matrices for Cross-Camera Color Constancy
Poster Session 2 & Exhibit Hall with Coffee Break
Dongyoung Kim ⋅ Mahmoud Afifi ⋅ Dongyun Kim ⋅ Michael Brown ⋅ Seon Joo Kim
|
Exhibit Hall I #110 | |
|
MGSfM: Multi-Camera Geometry Driven Global Structure-from-Motion
Poster Session 2 & Exhibit Hall with Coffee Break
peilin Tao ⋅ Hainan Cui ⋅ Diantao Tu ⋅ Shuhan Shen
|
Exhibit Hall I #20 | |
|
UrbanLLaVA: A Multi-modal Large Language Model for Urban Intelligence
Poster Session 2 & Exhibit Hall with Coffee Break
Jie Feng ⋅ Shengyuan Wang ⋅ Tianhui Liu ⋅ Yanxin Xi ⋅ Yong Li
|
Exhibit Hall I #111 | |
|
Zero-shot Inexact CAD Model Alignment from a Single Image
Poster Session 2 & Exhibit Hall with Coffee Break
Pattaramanee Arsomngern ⋅ Sasikarn Khwanmuang ⋅ Matthias Nießner ⋅ Supasorn Suwajanakorn
|
Exhibit Hall I #113 | |
|
HazeFlow: Revisit Haze Physical Model as ODE and Non-Homogeneous Haze Generation for Real-World Dehazing
Poster Session 2 & Exhibit Hall with Coffee Break
Junseong Shin ⋅ Seungwoo Chung ⋅ Yunjeong Yang ⋅ Tae Hyun Kim
|
Exhibit Hall I #116 | |
|
Motal: Unsupervised 3D Object Detection by Modality and Task-specific Knowledge Transfer
Poster Session 2 & Exhibit Hall with Coffee Break
Hai Wu ⋅ Hongwei Lin ⋅ Xusheng Guo ⋅ Xin Li ⋅ Mingming Wang ⋅ Cheng Wang ⋅ Chenglu Wen
|
Exhibit Hall I #118 | |
|
Dual-Rate Dynamic Teacher for Source-Free Domain Adaptive Object Detection
Poster Session 1 & Exhibit Hall
Qi He ⋅ Xiao Wu ⋅ Jun-Yan He ⋅ Shuai Li
|
Exhibit Hall I #187 | |
|
DeGauss: Dynamic-Static Decomposition with Gaussian Splatting for Distractor-free 3D Reconstruction
Poster Session 2 & Exhibit Hall with Coffee Break
Rui Wang ⋅ Quentin Lohmeyer ⋅ Mirko Meboldt ⋅ Siyu Tang
|
Exhibit Hall I #119 | |
|
Scaling Inference-Time Search with Vision Value Model for Improved Visual Comprehension
Poster Session 1 & Exhibit Hall
Xiyao Wang ⋅ Zhengyuan Yang ⋅ Linjie Li ⋅ Hongjin Lu ⋅ Yuancheng Xu ⋅ Chung-Ching Lin ⋅ Kevin Lin ⋅ Furong Huang ⋅ Lijuan Wang
|
Exhibit Hall I #102 | |
|
Manual-PA: Learning 3D Part Assembly from Instruction Diagrams
Poster Session 2 & Exhibit Hall with Coffee Break
Jiahao Zhang ⋅ Anoop Cherian ⋅ Cristian Rodriguez-Opazo ⋅ Weijian Deng ⋅ Stephen Gould
|
Exhibit Hall I #120 | |
|
MoMa-Kitchen: A 100K+ Benchmark for Affordance-Grounded Last-Mile Navigation in Mobile Manipulation
Poster Session 2 & Exhibit Hall with Coffee Break
Pingrui Zhang ⋅ Xianqiang Gao ⋅ Yuhan Wu ⋅ Kehui Liu ⋅ Dong Wang ⋅ Zhigang Wang ⋅ Bin Zhao ⋅ Yan Ding ⋅ Xuelong Li
|
Exhibit Hall I #121 | |
|
NavQ: Learning a Q-Model for Foresighted Vision-and-Language Navigation
Poster Session 2 & Exhibit Hall with Coffee Break
Peiran Xu ⋅ Xicheng Gong ⋅ Yadong Mu
|
Exhibit Hall I #122 | |
|
Embodied VideoAgent: Persistent Memory from Egocentric Videos and Embodied Sensors Enables Dynamic Scene Understanding
Yue Fan ⋅ Xiaojian Ma ⋅ Rongpeng Su ⋅ Jun Guo ⋅ Rujie Wu ⋅ Xi Chen ⋅ Qing Li
|
Exhibit Hall I #123 | |
|
LightsOut: Diffusion-based Outpainting for Enhanced Lens Flare Removal
Poster Session 2 & Exhibit Hall with Coffee Break
Shr-Ruei Tsai ⋅ Wei-Cheng Chang ⋅ Jie-Ying Lee ⋅ Chih-Hai Su ⋅ Yu-Lun Liu
|
Exhibit Hall I #124 | |
|
Rethinking Multi-modal Object Detection from the Perspective of Mono-Modality Feature Learning
Poster Session 2 & Exhibit Hall with Coffee Break
Tianyi Zhao ⋅ Boyang Liu ⋅ Yanglei Gao ⋅ Yiming Sun ⋅ Maoxun Yuan ⋅ Xingxing Wei
|
Exhibit Hall I #125 | |
|
GeoDiffusion: A Training-Free Framework for Accurate 3D Geometric Conditioning in Image Generation
Poster Session 2 & Exhibit Hall with Coffee Break
Phillip Mueller ⋅ Talip Ünlü ⋅ Sebastian Schmidt ⋅ Marcel Kollovieh ⋅ Jiajie Fan ⋅ Stephan Günnemann ⋅ Lars Mikelsons
|
Exhibit Hall I #126 | |
|
OVA-Fields: Weakly Supervised Open-Vocabulary Affordance Fields for Robot Operational Part Detection
Poster Session 2 & Exhibit Hall with Coffee Break
Heng Su ⋅ Mengying Xie ⋅ Nieqing Cao ⋅ Yan Ding ⋅ Beichen Shao ⋅ Xianlei Long ⋅ Fuqiang Gu ⋅ Chao Chen
|
Exhibit Hall I #127 | |
|
Arti-PG: A Toolbox for Procedurally Synthesizing Large-Scale and Diverse Articulated Objects with Rich Annotations
Poster Session 2 & Exhibit Hall with Coffee Break
Jianhua Sun ⋅ Yuxuan Li ⋅ Jiude Wei ⋅ Xu Longfei ⋅ Wang Nange ⋅ Yining Zhang ⋅ Cewu Lu
|
Exhibit Hall I #128 | |
|
Scaling 3D Compositional Models for Robust Classification and Pose Estimation
Poster Session 2 & Exhibit Hall with Coffee Break
Xiaoding Yuan ⋅ Prakhar Kaushik ⋅ Guofeng Zhang ⋅ Artur Jesslen ⋅ Adam Kortylewski ⋅ Alan Yuille
|
Exhibit Hall I #129 | |
|
RoboTron-Nav: A Unified Framework for Embodied Navigation Integrating Perception, Planning, and Prediction
Poster Session 2 & Exhibit Hall with Coffee Break
Yufeng Zhong ⋅ Chengjian Feng ⋅ Feng yan ⋅ Fanfan Liu ⋅ Liming Zheng ⋅ Lin Ma
|
Exhibit Hall I #130 | |
|
Corvid: Improving Multimodal Large Language Models Towards Chain-of-Thought Reasoning
Jingjing Jiang ⋅ Chao Ma ⋅ Xurui Song ⋅ Hanwang Zhang ⋅ Jun Luo
|
Exhibit Hall I #280 | |
|
DAMap: Distance-aware MapNet for High Quality HD Map Construction
Poster Session 2 & Exhibit Hall with Coffee Break
JINPENG DONG ⋅ Chen Li ⋅ Yutong Lin ⋅ Jingwen Fu ⋅ Sanping Zhou ⋅ Nanning Zheng
|
Exhibit Hall I #25 | |
|
X-Capture: An Open-Source Portable Device for Multi-Sensory Learning
Poster Session 2 & Exhibit Hall with Coffee Break
Samuel Clarke ⋅ Suzannah Wistreich ⋅ Yanjie Ze ⋅ Jiajun Wu
|
Exhibit Hall I #132 | |
|
DRaM-LHM: A Quaternion Framework for Iterative Camera Pose Estimation
Poster Session 2 & Exhibit Hall with Coffee Break
Chen Lin ⋅ Weizhi Du ⋅ Zhixiang Min ⋅ Baochen She ⋅ Enrique Dunn ⋅ Sonya Hanson
|
Exhibit Hall I #133 | |
|
Focal Plane Visual Feature Generation and Matching on a Pixel Processor Array
Poster Session 6 & Exhibit Hall with Coffee Break
Hongyi Zhang ⋅ Laurie Bose ⋅ Jianing Chen ⋅ Piotr Dudek ⋅ Walterio Mayol-Cuevas
|
Exhibit Hall I #415 | |
|
VoteSplat: Hough Voting Gaussian Splatting for 3D Scene Understanding
Poster Session 2 & Exhibit Hall with Coffee Break
Minchao Jiang ⋅ Shunyu Jia ⋅ Jiaming Gu ⋅ Xiaoyuan Lu ⋅ Guangming Zhu ⋅ Anqi Dong ⋅ zhang liang
|
Exhibit Hall I #134 | |
|
Prior-aware Dynamic Temporal Modeling Framework for Sequential 3D Hand Pose Estimation
Poster Session 2 & Exhibit Hall with Coffee Break
Pengfei Ren ⋅ Jingyu Wang ⋅ Haifeng Sun ⋅ Qi Qi ⋅ Xingyu Liu ⋅ Menghao Zhang ⋅ Lei Zhang ⋅ Jing Wang ⋅ Jianxin Liao
|
Exhibit Hall I #136 | |
|
Epipolar Consistent Attention Aggregation Network for Unsupervised Light Field Disparity Estimation
Poster Session 2 & Exhibit Hall with Coffee Break
Chen Gao ⋅ Shuo Zhang ⋅ Youfang Lin
|
Exhibit Hall I #137 | |
|
ATLAS: Decoupling Skeletal and Shape Parameters for Expressive Parametric Human Modeling
Poster Session 2 & Exhibit Hall with Coffee Break
Jinhyung Park ⋅ Javier Romero ⋅ Shunsuke Saito ⋅ Fabian Prada ⋅ Takaaki Shiratori ⋅ Yichen Xu ⋅ Federica Bogo ⋅ Shoou-I Yu ⋅ Kris Kitani ⋅ Rawal Khirodkar
|
Exhibit Hall I #139 | |
|
On the Generalization of Representation Uncertainty in Earth Observation
Poster Session 2 & Exhibit Hall with Coffee Break
Spyros Kondylatos ⋅ Nikolaos Ioannis Bountos ⋅ Dimitrios Michail ⋅ Xiao Xiang Zhu ⋅ Gustau Camps-Valls ⋅ Ioannis Papoutsis
|
Exhibit Hall I #143 | |
|
Predict-Optimize-Distill: A Self-Improving Cycle for 4D Object Understanding
Poster Session 2 & Exhibit Hall with Coffee Break
Mingxuan Wu ⋅ Huang Huang ⋅ Justin Kerr ⋅ Chung Min Kim ⋅ Anthony Zhang ⋅ Brent Yi ⋅ Angjoo Kanazawa
|
Exhibit Hall I #145 | |
|
Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data and Metric Perspectives
Poster Session 2 & Exhibit Hall with Coffee Break
Shaoyuan Xie ⋅ Lingdong Kong ⋅ Yuhao Dong ⋅ Chonghao Sima ⋅ Wenwei Zhang ⋅ Qi Alfred Chen ⋅ Ziwei Liu ⋅ Liang Pan
|
Exhibit Hall I #148 | |
|
Humans as a Calibration Pattern: Dynamic 3D Scene Reconstruction from Unsynchronized and Uncalibrated Videos
Poster Session 2 & Exhibit Hall with Coffee Break
Changwoon Choi ⋅ Jeongjun Kim ⋅ Geonho Cha ⋅ Minkwan Kim ⋅ Dongyoon Wee ⋅ Young Min Kim
|
Exhibit Hall I #149 | |
|
GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors
Poster Session 2 & Exhibit Hall with Coffee Break
Tian-Xing Xu ⋅ Xiangjun Gao ⋅ Wenbo Hu ⋅ Xiaoyu Li ⋅ Song-Hai Zhang ⋅ Ying Shan
|
Exhibit Hall I #153 | |
|
Hybrid-grained Feature Aggregation with Coare-to-fine Language Guidance for Self-supervised Monocular Depth Estimation
Poster Session 2 & Exhibit Hall with Coffee Break
Wenyao Zhang ⋅ Hongsi Liu ⋅ Bohan Li ⋅ Jiawei He ⋅ Zekun Qi ⋅ Yunnan Wang ⋅ Eastern Institute of Technology Shengyang ⋅ Ningbo Institute Of Digital Twin XinQiang ⋅ Galbot Wenjun ⋅ Eastern Institute for Advanced Study Xin
|
Exhibit Hall I #157 | |
|
Zero-AVSR: Zero-Shot Audio-Visual Speech Recognition with LLMs by Learning Language-Agnostic Speech Representations
Poster Session 2 & Exhibit Hall with Coffee Break
Jeong Hun Yeo ⋅ Minsu Kim ⋅ Chae Won Kim ⋅ Stavros Petridis ⋅ Yong Man Ro
|
Exhibit Hall I #158 | |
|
Jigsaw++: Imagining Complete Shape Priors for Object Reassembly
Poster Session 2 & Exhibit Hall with Coffee Break
Jiaxin Lu ⋅ Gang Hua ⋅ Qixing Huang
|
Exhibit Hall I #159 | |
|
Seeing and Seeing Through the Glass: Real and Synthetic Data for Multi-Layer Depth Estimation
Poster Session 2 & Exhibit Hall with Coffee Break
Hongyu Wen ⋅ Yiming Zuo ⋅ Venkat Subramanian ⋅ Patrick Chen ⋅ Jia Deng
|
Exhibit Hall I #160 | |
|
SpatialTrackerV2: Advancing 3D Point Tracking with Explicit Camera Motion
Poster Session 2 & Exhibit Hall with Coffee Break
Yuxi Xiao ⋅ Jianyuan Wang ⋅ Nan Xue ⋅ Nikita Karaev ⋅ Iurii Makarov ⋅ Bingyi Kang ⋅ Xing Zhu ⋅ Hujun Bao ⋅ Yujun Shen ⋅ Xiaowei Zhou
|
Exhibit Hall I #161 | |
|
A Simple yet Mighty Hartley Diffusion Versatilist for Generalizable Dense Vision Tasks
Poster Session 2 & Exhibit Hall with Coffee Break
Qi Bi ⋅ Jingjun Yi ⋅ Huimin Huang ⋅ Hao Zheng ⋅ Haolan Zhan ⋅ Wei Ji ⋅ Yawen Huang ⋅ Yuexiang Li ⋅ Yefeng Zheng
|
Exhibit Hall I #163 | |
|
IGL-Nav: Incremental 3D Gaussian Localization for Image-goal Navigation
Poster Session 2 & Exhibit Hall with Coffee Break
Wenxuan Guo ⋅ Xiuwei Xu ⋅ Hang Yin ⋅ Ziwei Wang ⋅ Jianjiang Feng ⋅ Jie Zhou ⋅ Jiwen Lu
|
Exhibit Hall I #168 | |
|
AR-VRM: Imitating Human Motions for Visual Robot Manipulation with Analogical Reasoning
Poster Session 2 & Exhibit Hall with Coffee Break
Dejie Yang ⋅ Zijing Zhao ⋅ Yang Liu
|
Exhibit Hall I #169 | |
|
Unleashing the Temporal Potential of Stereo Event Cameras for Continuous-Time 3D Object Detection
Poster Session 2 & Exhibit Hall with Coffee Break
Jae Young Kang ⋅ Hoonhee Cho ⋅ Kuk-Jin Yoon
|
Exhibit Hall I #174 | |
|
PlaneRAS: Learning Planar Primitives for 3D Plane Recovery
Poster Session 2 & Exhibit Hall with Coffee Break
Fang Zhang ⋅ Wenzhao Zheng ⋅ Linqing Zhao ⋅ Zelan Zhu ⋅ Jiwen Lu ⋅ Xiuzhuang Zhou
|
Exhibit Hall I #175 | |
|
FedVLA: Federated Vision-Language-Action Learning with Dual Gating Mixture-of-Experts for Robotic Manipulation
Poster Session 2 & Exhibit Hall with Coffee Break
Cui Miao ⋅ Tao Chang ⋅ meihan wu ⋅ Hongbin Xu ⋅ Chun Li ⋅ Ming Li ⋅ Xiaodong Wang
|
Exhibit Hall I #177 | |
|
3DSRBench: A Comprehensive 3D Spatial Reasoning Benchmark
Poster Session 2 & Exhibit Hall with Coffee Break
Wufei Ma ⋅ Haoyu Chen ⋅ Guofeng Zhang ⋅ Yu-Cheng Chou ⋅ Celso de Melo ⋅ Alan Yuille ⋅ Jieneng Chen
|
Exhibit Hall I #179 | |
|
3DGraphLLM: Combining Semantic Graphs and Large Language Models for 3D Scene Understanding
Poster Session 2 & Exhibit Hall with Coffee Break
Tatiana Zemskova ⋅ Dmitry Yudin
|
Exhibit Hall I #363 | |
|
TAR3D: Creating High-Quality 3D Assets via Next-Part Prediction
Poster Session 2 & Exhibit Hall with Coffee Break
Xuying Zhang ⋅ Yutong Liu ⋅ Yangguang Li ⋅ Renrui Zhang ⋅ Yufei Liu ⋅ Kai Wang ⋅ Wanli Ouyang ⋅ Zhiwei Xiong ⋅ Peng Gao ⋅ Qibin Hou ⋅ Ming-Ming Cheng
|
Exhibit Hall I #11 | |
|
Exploring the Adversarial Vulnerabilities of Vision-Language-Action Models in Robotics
Poster Session 2 & Exhibit Hall with Coffee Break
Taowen Wang ⋅ Cheng Han ⋅ James Liang ⋅ Wenhao Yang ⋅ Dongfang Liu ⋅ Luna Zhang ⋅ Qifan Wang ⋅ Jiebo Luo ⋅ Ruixiang Tang
|
Exhibit Hall I #181 | |
|
Simultaneous Motion And Noise Estimation with Event Cameras
Poster Session 2 & Exhibit Hall with Coffee Break
Shintaro Shiba ⋅ Yoshimitsu Aoki ⋅ Guillermo Gallego
|
Exhibit Hall I #182 | |
|
Layer-wise Vision Injection with Disentangled Attention for Efficient LVLMs
Poster Session 2 & Exhibit Hall with Coffee Break
Xuange Zhang ⋅ Dengjie Li ⋅ Bo Liu ⋅ Zenghao Bao ⋅ Yao Zhou ⋅ Baisong Yang ⋅ liuzhongying liuzhongying ⋅ Yujie Zhong ⋅ Tongtong Yuan
|
Exhibit Hall I #186 | |
|
CMT: A Cascade MAR with Topology Predictor for Multimodal Conditional CAD Generation
Poster Session 2 & Exhibit Hall with Coffee Break
Jianyu Wu ⋅ Yizhou Wang ⋅ Xiangyu Yue ⋅ Xinzhu Ma ⋅ Jinyang Guo ⋅ Dongzhan Zhou ⋅ Wanli Ouyang ⋅ SHIXIANG TANG
|
Exhibit Hall I #187 | |
|
StableDepth: Scene-Consistent and Scale-Invariant Monocular Depth
Zheng Zhang ⋅ Lihe Yang ⋅ Tianyu Yang ⋅ Chaohui Yu ⋅ Xiaoyang Guo ⋅ Yixing Lao ⋅ Hengshuang Zhao
|
Exhibit Hall I #192 | |
|
4DSegStreamer: Streaming 4D Panoptic Segmentation via Dual Threads
Poster Session 2 & Exhibit Hall with Coffee Break
Ling Liu ⋅ Jun Tian ⋅ Li Yi
|
Exhibit Hall I #194 | |
|
Color Matching Using Hypernetwork-Based Kolmogorov-Arnold Networks
Poster Session 2 & Exhibit Hall with Coffee Break
Artem Nikonorov ⋅ Georgy Perevozchikov ⋅ Andrei Korepanov ⋅ Nancy Mehta ⋅ Mahmoud Afifi ⋅ Egor Ershov ⋅ Radu Timofte
|
Exhibit Hall I #195 | |
|
HccePose (BF): Predicting Front & Back Surfaces to Construct Ultra-Dense 2D-3D Correspondences for Pose Estimation
Yulin Wang ⋅ Mengting Hu ⋅ Hongli Li ⋅ Chen LUO
|
Exhibit Hall I #201 | |
|
GaussianVideo: Efficient Video Representation via Hierarchical Gaussian Splatting
Poster Session 2 & Exhibit Hall with Coffee Break
Andrew Bond ⋅ Jui-Hsien Wang ⋅ Long Mai ⋅ Erkut Erdem ⋅ Aykut Erdem
|
Exhibit Hall I #203 | |
|
PhysTwin: Physics-Informed Reconstruction and Simulation of Deformable Objects from Videos
Poster Session 2 & Exhibit Hall with Coffee Break
Hanxiao Jiang ⋅ Hao-Yu Hsu ⋅ Kaifeng Zhang ⋅ Hsin-Ni Yu ⋅ Shenlong Wang ⋅ Yunzhu Li
|
Exhibit Hall I #206 | |
|
GaussianProperty: Integrating Physical Properties to 3D Gaussians with LMMs
Poster Session 2 & Exhibit Hall with Coffee Break
Xinli Xu ⋅ Wenhang Ge ⋅ Dicong Qiu ⋅ ZhiFei Chen ⋅ Dongyu Yan ⋅ Zhuoyun LIU ⋅ Haoyu Zhao ⋅ hanfeng Zhao ⋅ Shunsi Zhang ⋅ Junwei Liang ⋅ Ying-Cong Chen
|
Exhibit Hall I #207 | |
|
Enhancing Image Restoration Transformer via Adaptive Translation Equivariance
Poster Session 4 & Exhibit Hall with Coffee Break
JiaKui Hu ⋅ Zhengjian Yao ⋅ Lujia Jin ⋅ Hangzhou He ⋅ Yanye Lu
|
Exhibit Hall I #110 | |
|
Frequency-Aligned Knowledge Distillation for Lightweight Spatiotemporal Forecasting
Poster Session 2 & Exhibit Hall with Coffee Break
Yuqi Li ⋅ Chuanguang Yang ⋅ Hansheng Zeng ⋅ Zeyu Dong ⋅ Zhulin An ⋅ Yongjun Xu ⋅ Yingli Tian ⋅ Hao Wu
|
Exhibit Hall I #210 | |
|
CAD-Assistant: Tool-Augmented VLLMs as Generic CAD Task Solvers
Poster Session 2 & Exhibit Hall with Coffee Break
Dimitrios Mallis ⋅ Ahmet Karadeniz ⋅ Sebastian Cavada ⋅ Danila Rukhovich ⋅ Niki Foteinopoulou ⋅ Kseniya Cherenkova ⋅ Anis Kacem ⋅ Djamila Aouada
|
Exhibit Hall I #212 | |
|
Dynamic Point Maps: A Versatile Representation for Dynamic 3D Reconstruction
Edgar Sucar ⋅ Zihang Lai ⋅ Eldar Insafutdinov ⋅ Andrea Vedaldi
|
Exhibit Hall I #213 | |
|
Physics Context Builders: A Modular Framework for Physical Reasoning in Vision-Language Models
Poster Session 2 & Exhibit Hall with Coffee Break
Vahid Balazadeh ⋅ Mohammadmehdi Ataei ⋅ Hyunmin Cheong ⋅ Amir Khasahmadi ⋅ Rahul Krishnan
|
Exhibit Hall I #215 | |
|
VOccl3D: A Video Benchmark Dataset for 3D Human Pose and Shape Estimation under real Occlusions
Poster Session 2 & Exhibit Hall with Coffee Break
Yash Garg ⋅ Saketh Bachu ⋅ Arindam Dutta ⋅ Rohit Lal ⋅ Sarosij Bose ⋅ Calvin-Khang Ta ⋅ M. Salman Asif ⋅ Amit Roy-Chowdhury
|
Exhibit Hall I #218 | |
|
Tracking Tiny Drones against Clutter: Large-Scale Infrared Benchmark with Motion-Centric Adaptive Algorithm
Poster Session 2 & Exhibit Hall with Coffee Break
Jiahao Zhang ⋅ Zongli Jiang ⋅ Gang Wang ⋅ Jinli Zhang ⋅ Yixin Wei ⋅ Liang Li ⋅ Yizheng Wang
|
Exhibit Hall I #219 | |
|
MM-Spatial: Exploring 3D Spatial Understanding in Multimodal LLMs
Poster Session 2 & Exhibit Hall with Coffee Break
Erik Daxberger ⋅ Nina Wenzel ⋅ David Griffiths ⋅ Haiming Gang ⋅ Justin Lazarow ⋅ Gefen Kohavi ⋅ Kai Kang ⋅ Marcin Eichner ⋅ Yinfei Yang ⋅ Afshin Dehghan ⋅ Peter Grasch
|
Exhibit Hall I #222 | |
|
AutoComPose: Automatic Generation of Pose Transition Descriptions for Composed Pose Retrieval Using Multimodal LLMs
Poster Session 2 & Exhibit Hall with Coffee Break
Yi-Ting Shen ⋅ Sungmin Eum ⋅ Doheon Lee ⋅ Rohit Shete ⋅ Chiao-Yi Wang ⋅ Heesung Kwon ⋅ Shuvra Bhattacharyya
|
Exhibit Hall I #223 | |
|
Understanding Flatness in Generative Models: Its Role and Benefits
Poster Session 1 & Exhibit Hall
Taehwan Lee ⋅ Kyeongkook Seo ⋅ Jaejun Yoo ⋅ Sung Whan Yoon
|
Exhibit Hall I #461 | |
|
Image-Guided Shape-from-Template Using Mesh Inextensibility Constraints
Poster Session 2 & Exhibit Hall with Coffee Break
Dinh-Vinh-Thuy Tran ⋅ Ruochen Chen ⋅ Shaifali Parashar
|
Exhibit Hall I #224 | |
|
3D-MOOD: Lifting 2D to 3D for Monocular Open-Set Object Detection
Poster Session 2 & Exhibit Hall with Coffee Break
Yung-Hsu Yang ⋅ Luigi Piccinelli ⋅ Mattia Segu ⋅ Siyuan Li ⋅ Rui Huang ⋅ Yuqian Fu ⋅ Marc Pollefeys ⋅ Hermann Blum ⋅ Zuria Bauer
|
Exhibit Hall I #227 | |
|
LUDVIG: Learning-Free Uplifting of 2D Visual Features to Gaussian Splatting Scenes
Poster Session 2 & Exhibit Hall with Coffee Break
Juliette Marrie ⋅ Romain Menegaux ⋅ Michael Arbel ⋅ Diane Larlus ⋅ Julien Mairal
|
Exhibit Hall I #228 | |
|
VOVTrack: Exploring the Potentiality in Raw Videos for Open-Vocabulary Multi-Object Tracking
Poster Session 2 & Exhibit Hall with Coffee Break
Zekun Qian ⋅ Ruize Han ⋅ Junhui Hou ⋅ Linqi Song ⋅ Wei Feng
|
Exhibit Hall I #231 | |
|
PHD: Personalized 3D Human Body Fitting with Point Diffusion
Poster Session 2 & Exhibit Hall with Coffee Break
Hsuan-I Ho ⋅ Chen Guo ⋅ Po-Chen Wu ⋅ Ivan Shugurov ⋅ Chengcheng Tang ⋅ Abhay Mittal ⋅ Sizhe An ⋅ Manuel Kaufmann ⋅ Linguang Zhang
|
Exhibit Hall I #236 | |
|
Frequency Domain-Based Diffusion Model for Unpaired Image Dehazing
Poster Session 2 & Exhibit Hall with Coffee Break
Chengxu Liu ⋅ Lu Qi ⋅ Jinshan Pan ⋅ Xueming Qian ⋅ Ming-Hsuan Yang
|
Exhibit Hall I #237 | |
|
Language Driven Occupancy Prediction
Poster Session 2 & Exhibit Hall with Coffee Break
Zhu Yu ⋅ Bowen Pang ⋅ Lizhe Liu ⋅ Runmin Zhang ⋅ Qiang Li ⋅ Si-Yuan Cao ⋅ Maochun Luo ⋅ Mingxia Chen ⋅ Sheng Yang ⋅ Hui-liang Shen
|
Exhibit Hall I #238 | |
|
C4D: 4D Made from 3D through Dual Correspondences
Poster Session 2 & Exhibit Hall with Coffee Break
Shizun Wang ⋅ Zhenxiang Jiang ⋅ Xingyi Yang ⋅ Xinchao Wang
|
Exhibit Hall I #240 | |
|
ScoreHOI: Physically Plausible Reconstruction of Human-Object Interaction via Score-Guided Diffusion
Poster Session 2 & Exhibit Hall with Coffee Break
AO LI ⋅ Jinpeng Liu ⋅ Yixuan Zhu ⋅ Yansong Tang
|
Exhibit Hall I #242 | |
|
Estimating 2D Camera Motion with Hybrid Motion Basis
Poster Session 2 & Exhibit Hall with Coffee Break
Haipeng Li ⋅ Tianhao Zhou ⋅ Zhanglei Yang ⋅ WuYi WuYi ⋅ Chen Yan ⋅ Zijing Mao ⋅ Shen Cheng ⋅ Bing Zeng ⋅ Shuaicheng Liu
|
Exhibit Hall I #245 | |
|
AgroBench: Vision-Language Model Benchmark in Agriculture
Poster Session 2 & Exhibit Hall with Coffee Break
Risa Shinoda ⋅ Nakamasa Inoue ⋅ Hirokatsu Kataoka ⋅ Masaki Onishi ⋅ Yoshitaka Ushiku
|
Exhibit Hall I #246 | |
|
Princeton365: A Diverse Dataset with Accurate Camera Pose
Poster Session 2 & Exhibit Hall with Coffee Break
Karhan Kayan ⋅ Stamatis Alexandropoulos ⋅ Rishabh Jain ⋅ Yiming Zuo ⋅ Erich Liang ⋅ Jia Deng
|
Exhibit Hall I #247 | |
|
H3R: Hybrid Multi-view Correspondence for Generalizable 3D Reconstruction
Poster Session 2 & Exhibit Hall with Coffee Break
Heng Jia ⋅ Na Zhao ⋅ Linchao Zhu
|
Exhibit Hall I #248 | |
|
After the Party: Navigating the Mapping From Color to Ambient Lighting
Poster Session 2 & Exhibit Hall with Coffee Break
Florin-Alexandru Vasluianu ⋅ Tim Seizinger ⋅ Zongwei Wu ⋅ Radu Timofte
|
Exhibit Hall I #395 | |
|
From Abyssal Darkness to Blinding Glare: A Benchmark on Extreme Exposure Correction in Real World
Poster Session 2 & Exhibit Hall with Coffee Break
Bo Wang ⋅ Huiyuan Fu ⋅ Zhiye Huang ⋅ Siru Zhang ⋅ Xin Wang ⋅ Huadong Ma
|
Exhibit Hall I #249 | |
|
Dita: Scaling Diffusion Transformer for Generalist Vision-Language-Action Policy
Poster Session 2 & Exhibit Hall with Coffee Break
Zhi Hou ⋅ Tianyi Zhang ⋅ Yuwen Xiong ⋅ Haonan Duan ⋅ Hengjun Pu ⋅ Ronglei Tong ⋅ Chengyang Zhao ⋅ Xizhou Zhu ⋅ Yu Qiao ⋅ Jifeng Dai ⋅ Yuntao Chen
|
Exhibit Hall I #251 | |
|
Voyaging into Perpetual Dynamic Scenes from a Single View
Poster Session 2 & Exhibit Hall with Coffee Break
Fengrui Tian ⋅ Tianjiao Ding ⋅ Jinqi Luo ⋅ Hancheng Min ⋅ Rene Vidal
|
Exhibit Hall I #252 | |
|
Learnable Feature Patches and Vectors for Boosting Low-light Image Enhancement without External Knowledge
Poster Session 2 & Exhibit Hall with Coffee Break
Xiaogang Xu ⋅ Jiafei Wu ⋅ Qingsen Yan ⋅ Jiequan Cui ⋅ Richang Hong ⋅ Bei Yu
|
Exhibit Hall I #258 | |
|
TESPEC: Temporally-Enhanced Self-Supervised Pretraining for Event Cameras
Poster Session 2 & Exhibit Hall with Coffee Break
Mohammad Mohammadi ⋅ Ziyi Wu ⋅ Igor Gilitschenski
|
Exhibit Hall I #260 | |
|
CL-Splats: Continual Learning of Gaussian Splatting with Local Optimization
Poster Session 2 & Exhibit Hall with Coffee Break
Jan Ackermann ⋅ Jonas Kulhanek ⋅ Shengqu Cai ⋅ Haofei Xu ⋅ Marc Pollefeys ⋅ Gordon Wetzstein ⋅ Leonidas Guibas ⋅ Songyou Peng
|
Exhibit Hall I #262 | |
|
Find Any Part in 3D
Ziqi Ma ⋅ Yisong Yue ⋅ Georgia Gkioxari
|
Exhibit Hall I #263 | |
|
Learning 3D Scene Analogies with Neural Contextual Scene Maps
Poster Session 2 & Exhibit Hall with Coffee Break
Junho Kim ⋅ Gwangtak Bae ⋅ Eun Sun Lee ⋅ Young Min Kim
|
Exhibit Hall I #264 | |
|
GausSim: Foreseeing Reality by Gaussian Simulator for Elastic Objects
Poster Session 2 & Exhibit Hall with Coffee Break
Yidi Shao ⋅ Mu Huang ⋅ Chen Change Loy ⋅ Bo Dai
|
Exhibit Hall I #265 | |
|
Global Motion Corresponder for 3D Point-Based Scene Interpolation under Large Motion
Poster Session 2 & Exhibit Hall with Coffee Break
Junru Lin ⋅ Chirag Vashist ⋅ Mikaela Uy ⋅ Colton Stearns ⋅ Xuan Luo ⋅ Leonidas Guibas ⋅ Ke Li
|
Exhibit Hall I #269 | |
|
AdvDreamer Unveils: Are Vision-Language Models Truly Ready for Real-World 3D Variations?
Shouwei Ruan ⋅ Hanqing Liu ⋅ Yao Huang ⋅ XIaoqi Wang ⋅ Caixin KANG ⋅ Hang Su ⋅ Yinpeng Dong ⋅ Xingxing Wei
|
Exhibit Hall I #270 | |
|
SpikeDiff: Zero-shot High-Quality Video Reconstruction from Chromatic Spike Camera and Sub-millisecond Spike Streams
Poster Session 2 & Exhibit Hall with Coffee Break
Siqi Yang ⋅ Jinxiu Liang ⋅ Zhaojun Huang ⋅ Yeliduosi Xiaokaiti ⋅ Yakun Chang ⋅ Zhaofei Yu ⋅ Boxin Shi
|
Exhibit Hall I #271 | |
|
VA-MoE: Variables-Adaptive Mixture of Experts for Incremental Weather Forecasting
Poster Session 2 & Exhibit Hall with Coffee Break
Hao Chen ⋅ Tao Han ⋅ Song Guo ⋅ Jie ZHANG ⋅ Yonghan Dong ⋅ Yunlong Yu ⋅ LEI BAI
|
Exhibit Hall I #272 | |
|
AJAHR: Amputated Joint Aware 3D Human Mesh Recovery
Poster Session 2 & Exhibit Hall with Coffee Break
hyunjin cho ⋅ Giyun choi ⋅ Jongwon Choi
|
Exhibit Hall I #273 | |
|
EquiCaps: Predictor-Free Pose-Aware Pre-Trained Capsule Networks
Poster Session 2 & Exhibit Hall with Coffee Break
Athinoulla Konstantinou ⋅ Georgios Leontidis ⋅ Mamatha Thota ⋅ Aiden Durrant
|
Exhibit Hall I #275 | |
|
A Structure-aware and Motion-adaptive Framework for 3D Human Pose Estimation with Mamba
Poster Session 2 & Exhibit Hall with Coffee Break
Ye Lu ⋅ Jie Wang ⋅ Jianjun Gao ⋅ Rui Gong ⋅ Chen Cai ⋅ Kim-Hui Yap
|
Exhibit Hall I #276 | |
|
Unsupervised Joint Learning of Optical Flow and Intensity with Event Cameras
Shuang Guo ⋅ Friedhelm Hamann ⋅ Guillermo Gallego
|
Exhibit Hall I #278 | |
|
CAPTURE: Evaluating Spatial Reasoning in Vision Language Models via Occluded Object Counting
Poster Session 2 & Exhibit Hall with Coffee Break
Atin Pothiraj ⋅ Jaemin Cho ⋅ Elias Stengel-Eskin ⋅ Mohit Bansal
|
Exhibit Hall I #280 | |
|
RoboTron-Drive: All-in-One Large Multimodal Model for Autonomous Driving
Poster Session 2 & Exhibit Hall with Coffee Break
Zhijian Huang ⋅ Chengjian Feng ⋅ Baihui Xiao ⋅ Feng yan ⋅ ZEQUN JIE ⋅ Yujie Zhong ⋅ Xiaodan Liang ⋅ Lin Ma
|
Exhibit Hall I #281 | |
|
6DOPE-GS: Online 6D Object Pose Estimation using Gaussian Splatting
Poster Session 2 & Exhibit Hall with Coffee Break
Yufeng Jin ⋅ Vignesh Prasad ⋅ Snehal Jauhri ⋅ Mathias Franzius ⋅ Georgia Chalvatzaki
|
Exhibit Hall I #283 | |
|
AnyCalib: On-Manifold Learning for Model-Agnostic Single-View Camera Calibration
Poster Session 2 & Exhibit Hall with Coffee Break
Javier Tirado-Garín ⋅ Javier Civera
|
Exhibit Hall I #284 | |
|
Background Invariance Testing According to Semantic Proximity
Poster Session 2 & Exhibit Hall with Coffee Break
Zukang Liao ⋅ Min Chen
|
Exhibit Hall I #285 | |
|
NuPlanQA: A Large-Scale Dataset and Benchmark for Multi-View Driving Scene Understanding in Multi-Modal Large Language Models
Poster Session 2 & Exhibit Hall with Coffee Break
Sung-Yeon Park ⋅ Can Cui ⋅ Yunsheng Ma ⋅ Ahmadreza Moradipari ⋅ Rohit Gupta ⋅ Kyungtae Han ⋅ Ziran Wang
|
Exhibit Hall I #286 | |
|
One Look is Enough: Seamless Patchwise Refinement for Zero-Shot Monocular Depth Estimation on High-Resolution Images
Poster Session 2 & Exhibit Hall with Coffee Break
Byeongjun Kwon ⋅ Munchurl Kim
|
Exhibit Hall I #287 | |
|
Adapting Vehicle Detectors for Aerial Imagery to Unseen Domains with Weak Supervision
Poster Session 2 & Exhibit Hall with Coffee Break
Xiao Fang ⋅ Minhyek Jeon ⋅ Zheyang Qin ⋅ Stanislav Panev ⋅ Celso de Melo ⋅ Shuowen Hu ⋅ Shayok Chakraborty ⋅ Fernando De la Torre
|
Exhibit Hall I #288 | |
|
RegGS: Unposed Sparse Views Gaussian Splatting with 3DGS Registration
Poster Session 2 & Exhibit Hall with Coffee Break
Chong Cheng ⋅ Yu Hu ⋅ Sicheng Yu ⋅ Beizhen ZHAO ⋅ Zijian Wang ⋅ Hao Wang
|
Exhibit Hall I #289 | |
|
PersPose: 3D Human Pose Estimation with Perspective Encoding and Perspective Rotation
Poster Session 2 & Exhibit Hall with Coffee Break
Xiaoyang Hao ⋅ Han Li
|
Exhibit Hall I #290 | |
|
Training-Free Generation of Temporally Consistent Rewards from VLMs
Poster Session 2 & Exhibit Hall with Coffee Break
Yinuo Zhao ⋅ Jiale Yuan ⋅ Zhiyuan Xu ⋅ Xiaoshuai Hao ⋅ Xinyi Zhang ⋅ Kun Wu ⋅ Zhengping Che ⋅ Chi Liu ⋅ Jian Tang
|
Exhibit Hall I #292 | |
|
MEMFOF: High-Resolution Training for Memory-Efficient Multi-Frame Optical Flow Estimation
Vladislav Bargatin ⋅ Egor Chistov ⋅ Alexander Yakovenko ⋅ Dmitriy Vatolin
|
Exhibit Hall I #297 | |
|
Breaking Rectangular Shackles: Cross-View Object Segmentation for Fine-Grained Object Geo-Localization
Poster Session 2 & Exhibit Hall with Coffee Break
Qingwang Zhang ⋅ Yingying Zhu
|
Exhibit Hall I #298 | |
|
TopicGeo: An Efficient Unified Framework for Geolocation
Poster Session 2 & Exhibit Hall with Coffee Break
Xin Wang ⋅ Xinlin Wang ⋅ Shuiping Gou
|
Exhibit Hall I #302 | |
|
ETCH: Generalizing Body Fitting to Clothed Humans via Equivariant Tightness
Boqian Li ⋅ Zeyu Cai ⋅ Michael Black ⋅ Haiwen Feng ⋅ Yuliang Xiu
|
Exhibit Hall I #306 | |
|
Revisiting Image Fusion for Multi-Illuminant White-Balance Correction
Poster Session 2 & Exhibit Hall with Coffee Break
David Serrano ⋅ Aditya Arora ⋅ Luis Herranz ⋅ Kosta Derpanis ⋅ Michael Brown ⋅ Javier Vazquez-Corral
|
Exhibit Hall I #307 | |
|
Partially Matching Submap Helps: Uncetainty Modeling and Propagation for Text to Point Cloud Localization
Poster Session 2 & Exhibit Hall with Coffee Break
Mingtao Feng ⋅ Longlong Mei ⋅ Zijie Wu ⋅ Jianqiao Luo ⋅ Fenghao Tian ⋅ Jie Feng ⋅ Weisheng Dong ⋅ Yaonan Wang
|
Exhibit Hall I #309 | |
|
Medical World Model
Poster Session 2 & Exhibit Hall with Coffee Break
Yijun Yang ⋅ Zhao-Yang Wang ⋅ Qiuping Liu ⋅ Shu Wen Sun ⋅ Kang Wang ⋅ Rama Chellappa ⋅ Zongwei Zhou ⋅ Alan Yuille ⋅ Lei Zhu ⋅ Yu-Dong Zhang ⋅ Jieneng Chen
|
Exhibit Hall I #311 | |
|
NormalCrafter: Learning Temporally Consistent Normals from Video Diffusion Priors
Poster Session 2 & Exhibit Hall with Coffee Break
Yanrui Bin ⋅ Wenbo Hu ⋅ Haoyuan Wang ⋅ Xinya Chen ⋅ Bing WANG
|
Exhibit Hall I #312 | |
|
DuCos: Duality Constrained Depth Super-Resolution via Foundation Model
Poster Session 2 & Exhibit Hall with Coffee Break
Zhiqiang Yan ⋅ Zhengxue Wang ⋅ Haoye Dong ⋅ Jun Li ⋅ Jian Yang ⋅ Gim Hee Lee
|
Exhibit Hall I #315 | |
|
MaskHand: Generative Masked Modeling for Robust Hand Mesh Reconstruction in the Wild
Poster Session 2 & Exhibit Hall with Coffee Break
Muhammad Usama Saleem ⋅ Ekkasit Pinyoanuntapong ⋅ Mayur Patel ⋅ Hongfei Xue ⋅ Ahmed Helmy ⋅ Srijan Das ⋅ Pu Wang
|
Exhibit Hall I #316 | |
|
Passing the Driving Knowledge Test
Poster Session 2 & Exhibit Hall with Coffee Break
Maolin Wei ⋅ Wanzhou Liu ⋅ Eshed Ohn-Bar
|
Exhibit Hall I #318 | |
|
Uncertainty-Aware Gradient Stabilization for Small Object Detection
Poster Session 2 & Exhibit Hall with Coffee Break
Huixin Sun ⋅ Yanjing Li ⋅ Linlin Yang ⋅ Xianbin Cao ⋅ Baochang Zhang
|
Exhibit Hall I #319 | |
|
Can Generative Geospatial Diffusion Models Excel as Discriminative Geospatial Foundation Models?
Poster Session 2 & Exhibit Hall with Coffee Break
Yuru Jia ⋅ Valerio Marsocci ⋅ Ziyang Gong ⋅ Xue Yang ⋅ Maarten Vergauwen ⋅ Andrea Nascetti
|
Exhibit Hall I #321 | |
|
4D Visual Pre-training for Robot Learning
Poster Session 2 & Exhibit Hall with Coffee Break
Chengkai Hou ⋅ Yanjie Ze ⋅ Yankai Fu ⋅ Zeyu Gao ⋅ Songbo Hu ⋅ Yue Yu ⋅ Shanghang Zhang ⋅ Huazhe Xu
|
Exhibit Hall I #323 | |
|
Fish2Mesh Transformer: 3D Human Mesh Recovery from Egocentric Vision
Poster Session 2 & Exhibit Hall with Coffee Break
Tianma Shen ⋅ Aditya Shrish Puranik ⋅ James Vong ⋅ Vrushabh Deogirikar ⋅ Ryan Fell ⋅ Julianna Dietrich ⋅ Maria Kyrarini ⋅ Christopher Kitts ⋅ David Jeong
|
Exhibit Hall I #138 | |
|
CryoFastAR: Fast Cryo-EM Ab initio Reconstruction Made Easy
Poster Session 2 & Exhibit Hall with Coffee Break
Jiakai Zhang ⋅ Shouchen Zhou ⋅ Haizhao Dai ⋅ Xinhang Liu ⋅ Peihao Wang ⋅ Zhiwen Fan ⋅ Yuan Pei ⋅ Jingyi Yu
|
Exhibit Hall I #324 | |
|
Beyond Pixel Uncertainty: Bounding the OoD Objects in Road Scenes
Poster Session 2 & Exhibit Hall with Coffee Break
Huachao Zhu ⋅ Zelong Liu ⋅ Zhichao Sun ⋅ Yuda Zou ⋅ Gui-Song Xia ⋅ Yongchao Xu
|
Exhibit Hall I #325 | |
|
HoliTracer: Holistic Vectorization of Geographic Objects from Large-Size Remote Sensing Imagery
Poster Session 2 & Exhibit Hall with Coffee Break
Yu Wang ⋅ Bo Dang ⋅ Wanchun Li ⋅ Wei Chen ⋅ Yansheng Li
|
Exhibit Hall I #326 | |
|
DialNav: Multi-turn Dialog Navigation with a Remote Guide
Poster Session 2 & Exhibit Hall with Coffee Break
Leekyeung Han ⋅ Hyunji Min ⋅ Gyeom Hwangbo ⋅ Jonghyun Choi ⋅ Paul Hongsuck Seo
|
Exhibit Hall I #329 | |
|
TaxaDiffusion: Progressively Trained Diffusion Model for Fine-Grained Species Generation
Poster Session 2 & Exhibit Hall with Coffee Break
Amin Karimi Monsefi ⋅ Mridul Khurana ⋅ Rajiv Ramnath ⋅ Anuj Karpatne ⋅ Wei-Lun (Harry) Chao ⋅ Cheng Zhang
|
Exhibit Hall I #335 | |
|
VLM4D: Towards Spatiotemporal Awareness in Vision Language Models
Poster Session 2 & Exhibit Hall with Coffee Break
Shijie Zhou ⋅ Alexander Vilesov ⋅ Xuehai He ⋅ Ziyu Wan ⋅ Shuwang Zhang ⋅ Aditya Nagachandra ⋅ Di Chang ⋅ Dongdong Chen ⋅ Xin Wang ⋅ Achuta Kadambi
|
Exhibit Hall I #337 | |
|
Spatial Alignment and Temporal Matching Adapter for Video-Radar Remote Physiological Measurement
Poster Session 2 & Exhibit Hall with Coffee Break
Qian Liang ⋅ Ruixu Geng ⋅ Jinbo Chen ⋅ Haoyu Wang ⋅ Yan Chen ⋅ Yang Hu
|
Exhibit Hall I #339 | |
|
Bias in Gender Bias Benchmarks: How Spurious Features Distort Evaluation
Poster Session 2 & Exhibit Hall with Coffee Break
Yusuke Hirota ⋅ Ryo Hachiuma ⋅ Boyi Li ⋅ Ximing Lu ⋅ Michael Boone ⋅ Boris Ivanovic ⋅ Yejin Choi ⋅ Marco Pavone ⋅ Yu-Chiang Frank Wang ⋅ Noa Garcia ⋅ Yuta Nakashima ⋅ Chao-Han Yang
|
Exhibit Hall I #340 | |
|
AGO: Adaptive Grounding for Open World 3D Occupancy Prediction
Poster Session 2 & Exhibit Hall with Coffee Break
Peizheng Li ⋅ Shuxiao Ding ⋅ You Zhou ⋅ Qingwen Zhang ⋅ Onat Inak ⋅ Larissa Triess ⋅ Niklas Hanselmann ⋅ Marius Cordts ⋅ Andreas Zell
|
Exhibit Hall I #341 | |
|
Environment-Agnostic Pose: Generating Environment-independent Object Representations for 6D Pose Estimation
Poster Session 2 & Exhibit Hall with Coffee Break
Shaobo Zhang ⋅ Yuhang Huang ⋅ Wanqing Zhao ⋅ Wei Zhao ⋅ Ziyu Guan ⋅ Jinye Peng
|
Exhibit Hall I #344 | |
|
OpenM3D: Open Vocabulary Multi-view Indoor 3D Object Detection without Human Annotations
Poster Session 2 & Exhibit Hall with Coffee Break
Peng-Hao Hsu ⋅ Ke Zhang ⋅ Fu-En Wang ⋅ Tao Tu ⋅ Ming-Feng Li ⋅ Yu-Lun Liu ⋅ Albert Y. C. Chen ⋅ Min Sun ⋅ Cheng-Hao Kuo
|
Exhibit Hall I #345 | |
|
Online Dense Point Tracking with Streaming Memory
Poster Session 2 & Exhibit Hall with Coffee Break
Qiaole Dong ⋅ Yanwei Fu
|
Exhibit Hall I #347 | |
|
MaGS: Reconstructing and Simulating Dynamic 3D Objects with Mesh-adsorbed Gaussian Splatting
Shaojie Ma ⋅ Yawei Luo ⋅ Wei Yang ⋅ Yi Yang
|
Exhibit Hall I #350 | |
|
GM-MoE: Low-Light Enhancement with Gated-Mechanism Mixture-of-Experts
Poster Session 2 & Exhibit Hall with Coffee Break
Minwen Liao ⋅ Hao Dong ⋅ Xinyi Wang ⋅ Kurban Ubul ⋅ Ziyang Yan ⋅ Yihua Shao
|
Exhibit Hall I #352 | |
|
CHARM3R: Towards Unseen Camera Height Robust Monocular 3D Detector
Poster Session 2 & Exhibit Hall with Coffee Break
Abhinav Kumar ⋅ Yuliang Guo ⋅ Zhihao Zhang ⋅ Xinyu Huang ⋅ Liu Ren ⋅ Xiaoming Liu
|
Exhibit Hall I #353 | |
|
Test-Time Retrieval-Augmented Adaptation for Vision-Language Models
Poster Session 2 & Exhibit Hall with Coffee Break
Xinqi Fan ⋅ Xueli CHEN ⋅ Luoxiao Yang ⋅ Chuin Hong Yap ⋅ Rizwan Qureshi ⋅ Qi Dou ⋅ Moi Hoon Yap ⋅ Mubarak Shah
|
Exhibit Hall I #356 | |
|
RnGCam: High-speed video from rolling & global shutter measurements
Poster Session 2 & Exhibit Hall with Coffee Break
Kevin Tandi ⋅ Xiang Dai ⋅ Chinmay Talegaonkar ⋅ Gal Mishne ⋅ Nicholas Antipa
|
Exhibit Hall I #358 | |
|
Self-Supervised Monocular 4D Scene Reconstruction for Egocentric Videos
Poster Session 2 & Exhibit Hall with Coffee Break
Chengbo Yuan ⋅ Geng Chen ⋅ Li Yi ⋅ Yang Gao
|
Exhibit Hall I #361 | |
|
Diorama: Unleashing Zero-shot Single-view 3D Indoor Scene Modeling
Qirui Wu ⋅ Denys Iliash ⋅ Daniel Ritchie ⋅ Manolis Savva ⋅ Angel Chang
|
Exhibit Hall I #364 | |
|
Bokehlicious: Photorealistic Bokeh Rendering with Controllable Apertures
Tim Seizinger ⋅ Florin-Alexandru Vasluianu ⋅ Marcos Conde ⋅ Zongwei Wu ⋅ Radu Timofte
|
Exhibit Hall I #365 | |
|
Hybrid-TTA: Continual Test-time Adaptation via Dynamic Domain Shift Detection
Poster Session 1 & Exhibit Hall
Hyewon Park ⋅ Hyejin Park ⋅ Jueun Ko ⋅ Dongbo Min
|
Exhibit Hall I #265 | |
|
Learning on the Go: A Meta-learning Object Navigation Model
Poster Session 2 & Exhibit Hall with Coffee Break
Xiaorong Qin ⋅ Xinhang Song ⋅ Sixian Zhang ⋅ Xinyao Yu ⋅ Xinmiao Zhang ⋅ Shuqiang Jiang
|
Exhibit Hall I #368 | |
|
Unlocking Constraints: Source-Free Occlusion-Aware Seamless Segmentation
Poster Session 2 & Exhibit Hall with Coffee Break
Yihong Cao ⋅ Jiaming Zhang ⋅ Xu Zheng ⋅ Hao Shi ⋅ Kunyu Peng ⋅ Hang Liu ⋅ Kailun Yang ⋅ Hui Zhang
|
Exhibit Hall I #370 | |
|
3D Gaussian Map with Open-Set Semantic Grouping for Vision-Language Navigation
Poster Session 2 & Exhibit Hall with Coffee Break
Jianzhe Gao ⋅ Rui Liu ⋅ Wenguan Wang
|
Exhibit Hall I #398 | |
|
ProGait: A Multi-Purpose Video Dataset and Benchmark for Transfemoral Prosthesis Users
Xiangyu Yin ⋅ Boyuan Yang ⋅ Weichen Liu ⋅ Qiyao Xue ⋅ Abrar Alamri ⋅ Goeran Fiedler ⋅ Wei Gao
|
Exhibit Hall I #372 | |
|
MixRI: Mixing Features of Reference Images for Novel Object Pose Estimation
Poster Session 2 & Exhibit Hall with Coffee Break
Xinhang Liu ⋅ Jiawei Shi ⋅ Zheng Dang ⋅ Yuchao Dai
|
Exhibit Hall I #376 | |
|
ReassembleNet: Learnable Keypoints and Diffusion for 2D Fresco Reconstruction
Poster Session 2 & Exhibit Hall with Coffee Break
ADEELA ISLAM ⋅ Stefano Fiorini ⋅ Stuart James ⋅ Pietro Morerio ⋅ ALESSIO DEL BUE
|
Exhibit Hall I #378 | |
|
WonderPlay: Dynamic 3D Scene Generation from a Single Image and Actions
Zizhang Li ⋅ Hong-Xing Yu ⋅ Wei Liu ⋅ Yin Yang ⋅ Charles Herrmann ⋅ Gordon Wetzstein ⋅ Jiajun Wu
|
Exhibit Hall I #383 | |
|
OMNI-DC: Highly Robust Depth Completion with Multiresolution Depth Integration
Poster Session 2 & Exhibit Hall with Coffee Break
Yiming Zuo ⋅ Willow Yang ⋅ Zeyu Ma ⋅ Jia Deng
|
Exhibit Hall I #401 | |
|
Beyond the Destination: A Novel Benchmark for Exploration-Aware Embodied Question Answering
Poster Session 2 & Exhibit Hall with Coffee Break
Kaixuan Jiang ⋅ Yang Liu ⋅ Weixing Chen ⋅ Jingzhou Luo ⋅ Ziliang Chen ⋅ Ling Pan ⋅ Guanbin Li ⋅ Liang Lin
|
Exhibit Hall I #384 | |
|
Not all Views are Created Equal: Analyzing Viewpoint Instabilities in Vision Foundation Models
Poster Session 2 & Exhibit Hall with Coffee Break
Mateusz Michalkiewicz ⋅ Xinyue Bai ⋅ Mahsa Baktashmotlagh ⋅ Varun Jampani ⋅ Guha Balakrishnan
|
Exhibit Hall I #386 | |
|
CHROME: Clothed Human Reconstruction with Occlusion-Resilience and Multiview-Consistency from a Single Image
Arindam Dutta ⋅ Meng Zheng ⋅ Zhongpai Gao ⋅ Benjamin Planche ⋅ Anwesa Choudhuri ⋅ Terrence Chen ⋅ Amit Roy-Chowdhury ⋅ Ziyan Wu
|
Exhibit Hall I #387 | |
|
ReCoT: Reflective Self-Correction Training for Mitigating Confirmation Bias in Large Vision-Language Models
Poster Session 2 & Exhibit Hall with Coffee Break
Mengxue Qu ⋅ Yibo Hu ⋅ Kunyang Han ⋅ Yunchao Wei ⋅ Yao Zhao
|
Exhibit Hall I #389 | |
|
Easi3R: Estimating Disentangled Motion from DUSt3R Without Training
Poster Session 2 & Exhibit Hall with Coffee Break
Xingyu Chen ⋅ Yue Chen ⋅ Yuliang Xiu ⋅ Andreas Geiger ⋅ Anpei Chen
|
Exhibit Hall I #390 | |
|
PRE-Mamba: A 4D State Space Model for Ultra-High-Frequent Event Camera Deraining
Poster Session 2 & Exhibit Hall with Coffee Break
Ciyu Ruan ⋅ Ruishan Guo ⋅ Zihang GONG ⋅ Jingao Xu ⋅ Wenhan Yang ⋅ Xinlei Chen
|
Exhibit Hall I #391 | |
|
GenHaze: Pioneering Controllable One-Step Realistic Haze Generation for Real-World Dehazing
Poster Session 2 & Exhibit Hall with Coffee Break
Sixiang Chen ⋅ Tian Ye ⋅ Yunlong Lin ⋅ Yeying Jin ⋅ Yijun Yang ⋅ Haoyu Chen ⋅ Jianyu Lai ⋅ Song Fei ⋅ Zhaohu Xing ⋅ Fugee Tsung ⋅ Lei Zhu
|
Exhibit Hall I #393 | |
|
When Large Vision-Language Model Meets Large Remote Sensing Imagery: Coarse-to-Fine Text-Guided Token Pruning
Poster Session 2 & Exhibit Hall with Coffee Break
Junwei Luo ⋅ Yingying Zhang ⋅ Xue Yang ⋅ Kang Wu ⋅ Qi Zhu ⋅ Lei Liang ⋅ Jingdong Chen ⋅ Yansheng Li
|
Exhibit Hall I #394 | |
|
Can3Tok: Canonical 3D Tokenization and Latent Modeling of Scene-Level 3D Gaussians
Poster Session 2 & Exhibit Hall with Coffee Break
Quankai Gao ⋅ Iliyan Georgiev ⋅ Tuanfeng Wang ⋅ Krishna Kumar Singh ⋅ Ulrich Neumann ⋅ Jae Shin Yoon
|
Exhibit Hall I #404 | |
|
GEOPARD: Geometric Pretraining for Articulation Prediction in 3D Shapes
Poster Session 2 & Exhibit Hall with Coffee Break
Pradyumn Goyal ⋅ Dmitrii Petrov ⋅ Sheldon Andrews ⋅ Yizhak Ben-Shabat ⋅ Hsueh-Ti Derek Liu ⋅ Evangelos Kalogerakis
|
Exhibit Hall I #405 | |
|
Dream-to-Recon: Monocular 3D Reconstruction with Diffusion-Depth Distillation from Single Images
Poster Session 2 & Exhibit Hall with Coffee Break
Philipp Wulff ⋅ Felix Wimbauer ⋅ Dominik Muhle ⋅ Daniel Cremers
|
Exhibit Hall I #407 | |
|
LocalDyGS: Multi-view Global Dynamic Scene Modeling via Adaptive Local Implicit Feature Decoupling
Poster Session 2 & Exhibit Hall with Coffee Break
Jiahao Wu ⋅ Rui Peng ⋅ Jianbo Jiao ⋅ Jiayu Yang ⋅ Luyang Tang ⋅ Kaiqiang Xiong ⋅ Jie Liang ⋅ Jinbo Yan ⋅ runling liu ⋅ Ronggang Wang
|
Exhibit Hall I #422 | |
|
Combinative Matching for Geometric Shape Assembly
Nahyuk Lee ⋅ Juhong Min ⋅ Junhong Lee ⋅ Chunghyun Park ⋅ Minsu Cho
|
Exhibit Hall I #424 | |
|
CogNav: Cognitive Process Modeling for Object Goal Navigation with LLMs
Poster Session 2 & Exhibit Hall with Coffee Break
Yihan Cao ⋅ Jiazhao Zhang ⋅ Zhinan Yu ⋅ Shuzhen Liu ⋅ Zheng Qin ⋅ Qin Zou ⋅ Bo Du ⋅ Kai Xu
|
Exhibit Hall I #425 | |
|
LLaVA-3D: A Simple yet Effective Pathway to Empowering LMMs with 3D Capabilities
Poster Session 1 & Exhibit Hall
CHENMING ZHU ⋅ Tai Wang ⋅ Wenwei Zhang ⋅ Jiangmiao Pang ⋅ Xihui Liu
|
Exhibit Hall I #402 | |
|
DyGS-SLAM: Real-Time Accurate Localization and Gaussian Reconstruction for Dynamic Scenes
Poster Session 2 & Exhibit Hall with Coffee Break
Xinggang Hu ⋅ Chenyangguang Zhang ⋅ Mingyuan Zhao ⋅ Yuanze Gui ⋅ Xiangkui Zhang ⋅ Xiangyang Ji
|
Exhibit Hall I #426 | |
|
CAD-Recode: Reverse Engineering CAD Code from Point Clouds
Poster Session 2 & Exhibit Hall with Coffee Break
Danila Rukhovich ⋅ Elona Dupont ⋅ Dimitrios Mallis ⋅ Kseniya Cherenkova ⋅ Anis Kacem ⋅ Djamila Aouada
|
Exhibit Hall I #448 | |
|
Teaching VLMs to Localize Specific Objects from In-context Examples
Poster Session 2 & Exhibit Hall with Coffee Break
Sivan Doveh ⋅ Nimrod Shabtay ⋅ Eli Schwartz ⋅ Leonid Karlinsky ⋅ Raja Giryes ⋅ Hilde Kuehne ⋅ Rogerio Feris ⋅ James Glass ⋅ Assaf Arbelle ⋅ Shimon Ullman ⋅ Muhammad Jehanzeb Mirza
|
Exhibit Hall I #427 | |
|
SDFit: 3D Object Pose and Shape by Fitting a Morphable SDF to a Single Image
Poster Session 2 & Exhibit Hall with Coffee Break
Dimitrije Antić ⋅ Georgios Paschalidis ⋅ Shashank Tripathi ⋅ Theo Gevers ⋅ Sai Kumar Dwivedi ⋅ Dimitrios Tzionas
|
Exhibit Hall I #431 | |
|
Details Matter for Indoor Open-vocabulary 3D Instance Segmentation
Poster Session 2 & Exhibit Hall with Coffee Break
Sanghun Jung ⋅ Jingjing Zheng ⋅ Ke Zhang ⋅ Nan Qiao ⋅ Albert Y. C. Chen ⋅ Lu Xia ⋅ Chi Liu ⋅ Yuyin Sun ⋅ Xiao Zeng ⋅ Hsiang-Wei Huang ⋅ Byron Boots ⋅ Min Sun ⋅ Cheng-Hao Kuo
|
Exhibit Hall I #432 | |
|
FlashDepth: Real-time Streaming Video Depth Estimation at 2K Resolution
Gene Chou ⋅ Wenqi Xian ⋅ Guandao Yang ⋅ Mohamed Abdelfattah ⋅ Bharath Hariharan ⋅ Noah Snavely ⋅ Ning Yu ⋅ Paul Debevec
|
Exhibit Hall I #433 | |
|
Self-supervised Learning of Hybrid Part-aware 3D Representations of 2D Gaussians and Superquadrics
Poster Session 2 & Exhibit Hall with Coffee Break
Zhirui Gao ⋅ Renjiao Yi ⋅ Yuhang Huang ⋅ Wei Chen ⋅ Chenyang Zhu ⋅ Kai Xu
|
Exhibit Hall I #434 | |
|
Training-Free Personalization via Retrieval and Reasoning on Fingerprints
Poster Session 2 & Exhibit Hall with Coffee Break
Deepayan Das ⋅ Davide Talon ⋅ Yiming Wang ⋅ Massimiliano Mancini ⋅ Elisa Ricci
|
Exhibit Hall I #437 | |
|
TAPNext: Tracking Any Point (TAP) as Next Token Prediction
Poster Session 2 & Exhibit Hall with Coffee Break
Artem Zholus ⋅ Carl Doersch ⋅ Yi Yang ⋅ Skanda Koppula ⋅ Viorica Patraucean ⋅ Xu He ⋅ Ignacio Rocco ⋅ Mehdi S. M. Sajjadi ⋅ Sarath Chandar ⋅ Ross Goroshin
|
Exhibit Hall I #438 | |
|
PartField: Learning 3D Feature Fields for Part Segmentation and Beyond
Poster Session 2 & Exhibit Hall with Coffee Break
Minghua Liu ⋅ Mikaela Uy ⋅ Donglai Xiang ⋅ Hao Su ⋅ Sanja Fidler ⋅ Nicholas Sharp ⋅ Jun Gao
|
Exhibit Hall I #439 | |
|
MoMaps: Semantics-Aware Scene Motion Generation with Motion Maps
Poster Session 3 & Exhibit Hall
Jiahui Lei ⋅ Kyle Genova ⋅ George Kopanas ⋅ Noah Snavely ⋅ Leonidas Guibas
|
Exhibit Hall I #2 | |
|
Bridging the Sky and Ground: Towards View-Invariant Feature Learning for Aerial-Ground Person Re-Identification
Poster Session 2 & Exhibit Hall with Coffee Break
Wajahat Khalid ⋅ Bin Liu ⋅ Xulin Li ⋅ MUHAMMAD WAQAS ⋅ MUHAMMAD SHER AFGAN
|
Exhibit Hall I #443 | |
|
PASD: A Pixel-Adaptive Swarm Dynamics Approach for Unsupervised Low-Light Image Enhancement
Poster Session 2 & Exhibit Hall with Coffee Break
Shuai Jin ⋅ Yuhua Qian ⋅ Feijiang Li ⋅ Guoqing Liu ⋅ Xinyan Liang
|
Exhibit Hall I #380 | |
|
CoA-VLA: Improving Vision-Language-Action Models via Visual-Text Chain-of-Affordance
Poster Session 2 & Exhibit Hall with Coffee Break
Jinming Li ⋅ Yichen Zhu ⋅ Zhibin Tang ⋅ Junjie Wen ⋅ Minjie Zhu ⋅ Xiaoyu Liu ⋅ Chengmeng Li ⋅ Ran Cheng ⋅ Yaxin Peng ⋅ Yan Peng ⋅ Feifei Feng
|
Exhibit Hall I #444 | |
|
Proactive Scene Decomposition and Reconstruction
Poster Session 2 & Exhibit Hall with Coffee Break
Baicheng Li ⋅ Zike Yan ⋅ Dong Wu ⋅ Hongbin Zha
|
Exhibit Hall I #446 | |
|
Unified Category-Level Object Detection and Pose Estimation from RGB Images using 3D Prototypes
Poster Session 2 & Exhibit Hall with Coffee Break
Tom Fischer ⋅ Xiaojie Zhang ⋅ Eddy Ilg
|
Exhibit Hall I #447 | |
|
EvRT-DETR: Latent Space Adaptation of Image Detectors for Event-based Vision
Poster Session 2 & Exhibit Hall with Coffee Break
Dmitrii Torbunov ⋅ Yihui Ren ⋅ Animesh Ghose ⋅ Odera Dim ⋅ Yonggang Cui
|
Exhibit Hall I #449 | |
|
A Hyperdimensional One Place Signature to Represent Them All: Stackable Descriptors For Visual Place Recognition
Poster Session 2 & Exhibit Hall with Coffee Break
Connor Malone ⋅ Somayeh Hussaini ⋅ Tobias Fischer ⋅ Michael Milford
|
Exhibit Hall I #450 | |
|
IRASim: A Fine-Grained World Model for Robot Manipulation
Poster Session 2 & Exhibit Hall with Coffee Break
Fangqi Zhu ⋅ Hongtao Wu ⋅ Song Guo ⋅ Yuxiao Liu ⋅ Chilam Cheang ⋅ Tao Kong
|
Exhibit Hall I #451 | |
|
WalkVLM: Aid Visually Impaired People Walking by Vision Language Model
Poster Session 2 & Exhibit Hall with Coffee Break
Zhiqiang Yuan ⋅ Ting Zhang ⋅ Yeshuang Zhu ⋅ Jiapei Zhang ⋅ Ying Deng ⋅ Zexi Jia ⋅ Peixiang Luo ⋅ Xiaoyue Duan ⋅ Jie Zhou ⋅ Jinchao Zhang
|
Exhibit Hall I #452 | |
|
Error Recognition in Procedural Videos using Generalized Task Graph
Poster Session 3 & Exhibit Hall
Shih-Po Lee ⋅ Ehsan Elhamifar
|
Exhibit Hall I #1 | |
|
VIGFace: Virtual Identity Generation for Privacy-Free Face Recognition Dataset
Poster Session 3 & Exhibit Hall
Minsoo Kim ⋅ Min-Cheol Sagong ⋅ Gi Pyo Nam ⋅ Junghyun Cho ⋅ Ig-Jae Kim
|
Exhibit Hall I #4 | |
|
RoboFactory: Exploring Embodied Agent Collaboration with Compositional Constraints
Poster Session 3 & Exhibit Hall
Yiran Qin ⋅ Li Kang ⋅ Xiufeng Song ⋅ Zhenfei Yin ⋅ Xiaohong Liu ⋅ Xihui Liu ⋅ Ruimao Zhang ⋅ LEI BAI
|
Exhibit Hall I #7 | |
|
MotionStreamer: Streaming Motion Generation via Diffusion-based Autoregressive Model in Causal Latent Space
Poster Session 3 & Exhibit Hall
Lixing Xiao ⋅ Shunlin Lu ⋅ Huaijin Pi ⋅ Ke Fan ⋅ Liang Pan ⋅ Yueer Zhou ⋅ Ziyong Feng ⋅ Xiaowei Zhou ⋅ Sida Peng ⋅ Jingbo Wang
|
Exhibit Hall I #8 | |
|
RapVerse: Coherent Vocals and Whole-Body Motion Generation from Text
Poster Session 3 & Exhibit Hall
Jiaben Chen ⋅ Xin Yan ⋅ Yihang Chen ⋅ Siyuan Cen ⋅ Zixin Wang ⋅ Qinwei Ma ⋅ Haoyu Zhen ⋅ Kaizhi Qian ⋅ Lie Lu ⋅ Chuang Gan
|
Exhibit Hall I #9 | |
|
RoboPearls: Editable Video Simulation for Robot Manipulation
Poster Session 3 & Exhibit Hall
Tao Tang ⋅ Likui Zhang ⋅ Youpeng Wen ⋅ Kaidong Zhang ⋅ Jia-Wang Bian ⋅ xia zhou ⋅ Tianyi Yan ⋅ Kun Zhan ⋅ Peng Jia ⋅ Hefeng Wu ⋅ Liang Lin ⋅ Xiaodan Liang
|
Exhibit Hall I #11 | |
|
GraspCoT: Integrating Physical Property Reasoning for 6-DoF Grasping under Flexible Language Instructions
Poster Session 3 & Exhibit Hall
Xiaomeng Chu ⋅ Jiajun Deng ⋅ Guoliang You ⋅ Wei Liu ⋅ Xingchen Li ⋅ Jianmin Ji ⋅ Yanyong Zhang
|
Exhibit Hall I #12 | |
|
Towards Immersive Human-X Interaction: A Real-Time Framework for Physically Plausible Motion Synthesis
Kaiyang Ji ⋅ Ye Shi ⋅ Zichen Jin ⋅ Kangyi Chen ⋅ Lan Xu ⋅ Yuexin Ma ⋅ Jingyi Yu ⋅ Jingya Wang
|
Exhibit Hall I #16 | |
|
SynFER: Towards Boosting Facial Expression Recognition with Synthetic Data
Poster Session 3 & Exhibit Hall
Xilin He ⋅ Cheng Luo ⋅ Xiaole Xian ⋅ Bing Li ⋅ Siyang Song ⋅ Muhammad Haris Khan ⋅ Weicheng Xie ⋅ Linlin Shen ⋅ Zongyuan Ge ⋅ Bernard Ghanem ⋅ Xiangyu Yue
|
Exhibit Hall I #17 | |
|
Multi-modal Multi-platform Person Re-Identification: Benchmark and Method
Poster Session 3 & Exhibit Hall
Ruiyang Ha ⋅ Songyi Jiang ⋅ Bin Li ⋅ Bikang Pan ⋅ Yihang Zhu ⋅ Junjie Zhang ⋅ Xiatian Zhu ⋅ Shaogang Gong ⋅ Jingya Wang
|
Exhibit Hall I #23 | |
|
Mixture of Experts Guided by Gaussian Splatters Matters: A new Approach to Weakly-Supervised Video Anomaly Detection
Giacomo D'Amicantonio ⋅ Snehashis Majhi ⋅ Quan Kong ⋅ Lorenzo Garattoni ⋅ Gianpiero Francesca ⋅ Egor Bondarev ⋅ Francois Bremond
|
Exhibit Hall I #25 | |
|
What If: Understanding Motion Through Sparse Interactions
Poster Session 3 & Exhibit Hall
Stefan A. Baumann ⋅ Nick Stracke ⋅ Timy Phan ⋅ Björn Ommer
|
Exhibit Hall I #26 | |
|
PROGRESSOR: A Perceptually Guided Reward Estimator with Self-Supervised Online Refinement
Poster Session 3 & Exhibit Hall
Tewodros W. Ayalew ⋅ Xiao Zhang ⋅ Kevin Y Wu ⋅ Tianchong Jiang ⋅ Michael Maire ⋅ Matthew Walter
|
Exhibit Hall I #27 | |
|
UniEgoMotion: A Unified Model for Egocentric Motion Reconstruction, Forecasting, and Generation
Poster Session 3 & Exhibit Hall
Chaitanya Patel ⋅ Hiroki Nakamura ⋅ Yuta Kyuragi ⋅ Kazuki Kozuka ⋅ Juan Carlos Niebles ⋅ Ehsan Adeli
|
Exhibit Hall I #29 | |
|
DAViD: Modeling Dynamic Affordance of 3D Objects Using Pre-trained Video Diffusion Models
Poster Session 3 & Exhibit Hall
Hyeonwoo Kim ⋅ Sangwon Baik ⋅ Hanbyul Joo
|
Exhibit Hall I #30 | |
|
RoboAnnotatorX: A Comprehensive and Universal Annotation Framework for Accurate Understanding of Long-horizon Robot Demonstration
Poster Session 3 & Exhibit Hall
Longxin Kou ⋅ Fei Ni ⋅ Jianye HAO ⋅ Han Peilong ⋅ Jinyi Liu ⋅ Haiqin Cui ⋅ Rui Liu ⋅ YAN ZHENG
|
Exhibit Hall I #32 | |
|
FaceShield: Defending Facial Image against Deepfake Threats
Poster Session 3 & Exhibit Hall
Jaehwan Jeong ⋅ Sumin In ⋅ Sieun Kim ⋅ Shin yi ⋅ Jongheon Jeong ⋅ Sang Yoon ⋅ Jaewook Chung ⋅ Sangpil Kim
|
Exhibit Hall I #33 | |
|
Task-Oriented Human Grasp Synthesis via Context- and Task-Aware Diffusers
Poster Session 3 & Exhibit Hall
An Lun Liu ⋅ Yu-Wei Chao ⋅ Yi-Ting Chen
|
Exhibit Hall I #34 | |
|
Ouroboros: Single-step Diffusion Models for Cycle-consistent Forward and Inverse Rendering
Poster Session 3 & Exhibit Hall
shanlin sun ⋅ Yifan Wang ⋅ Hanwen Zhang ⋅ Yifeng Xiong ⋅ Qin Ren ⋅ Ruogu Fang ⋅ Xiaohui Xie ⋅ Chenyu You
|
Exhibit Hall I #35 | |
|
Expressive Talking Human from Single-Image with Imperfect Priors
Poster Session 3 & Exhibit Hall
Jun Xiang ⋅ Yudong Guo ⋅ Leipeng Hu ⋅ Boyang Guo ⋅ Yancheng Yuan ⋅ Juyong Zhang
|
Exhibit Hall I #36 | |
|
Beyond Label Semantics: Language-Guided Action Anatomy for Few-shot Action Recognition
Poster Session 3 & Exhibit Hall
Zefeng Qian ⋅ Xincheng Yao ⋅ Yifei Huang ⋅ Chong-Yang Zhang ⋅ Jiangyong Ying ⋅ Hong Sun
|
Exhibit Hall I #38 | |
|
Few-Shot Image Quality Assessment via Adaptation of Vision-Language Models
Poster Session 3 & Exhibit Hall
Xudong Li ⋅ Zihao Huang ⋅ Yan Zhang ⋅ Yunhang Shen ⋅ Ke Li ⋅ Xiawu Zheng ⋅ Liujuan Cao ⋅ Rongrong Ji
|
Exhibit Hall I #40 | |
|
Unleashing High-Quality Image Generation in Diffusion Sampling Using Second-Order Levenberg-Marquardt-Langevin
Poster Session 3 & Exhibit Hall
Fangyikang Wang ⋅ Hubery Yin ⋅ Lei Qian ⋅ Yinan Li ⋅ SHAOBIN ZHUANG ⋅ Huminhao Zhu ⋅ Yilin Zhang ⋅ Yanlong Tang ⋅ Chao Zhang ⋅ Hanbin Zhao ⋅ Hui Qian ⋅ Chen Li
|
Exhibit Hall I #41 | |
|
Reverse Convolution and Its Applications to Image Restoration
Poster Session 3 & Exhibit Hall
Xuhong Huang ⋅ Shiqi Liu ⋅ Kai Zhang ⋅ Ying Tai ⋅ Jian Yang ⋅ Hui Zeng ⋅ Lei Zhang
|
Exhibit Hall I #46 | |
|
MamTiff-CAD: Multi-Scale Latent Diffusion with Mamba+ for Complex Parametric Sequence
Poster Session 3 & Exhibit Hall
Liyuan Deng ⋅ Yunpeng Bai ⋅ Yongkang Dai ⋅ Xiaoshui Huang ⋅ Hongping Gan ⋅ Dongshuo Huang ⋅ Hao jiacheng ⋅ Yilei Shi
|
Exhibit Hall I #47 | |
|
Local Scale Equivariance with Latent Deep Equilibrium Canonicalizer
Poster Session 3 & Exhibit Hall
Md Ashiqur Rahman ⋅ Chiao-An Yang ⋅ Michael N Cheng ⋅ Lim Hao ⋅ Jeremiah Jiang ⋅ Teck-Yian Lim ⋅ Raymond A. Yeh
|
Exhibit Hall I #48 | |
|
DynamicID: Zero-Shot Multi-ID Image Personalization with Flexible Facial Editability
Poster Session 3 & Exhibit Hall
Xirui Hu ⋅ Jiahao Wang ⋅ Hao chen ⋅ Weizhan Zhang ⋅ Benqi Wang ⋅ yikun Li ⋅ Haishun Nan
|
Exhibit Hall I #50 | |
|
EfficientMT: Efficient Temporal Adaptation for Motion Transfer in Text-to-Video Diffusion Models
Poster Session 3 & Exhibit Hall
Yufei Cai ⋅ Hu Han ⋅ Yuxiang Wei ⋅ Shiguang Shan ⋅ Xilin Chen
|
Exhibit Hall I #54 | |
|
InteractAvatar: Modeling Hand-Face Interaction in Photorealistic Avatars with Deformable Gaussians
Poster Session 3 & Exhibit Hall
Kefan Chen ⋅ Sergiu Oprea ⋅ Justin Theiss ⋅ Sreyas Mohan ⋅ Srinath Sridhar ⋅ Aayush Prakash
|
Exhibit Hall I #37 | |
|
X-Dancer: Expressive Music to Human Dance Video Generation
Zeyuan Chen ⋅ Hongyi Xu ⋅ Guoxian Song ⋅ You Xie ⋅ Chenxu Zhang ⋅ Xin Chen ⋅ Chao Wang ⋅ Di Chang ⋅ Linjie Luo
|
Exhibit Hall I #55 | |
|
DeepMesh: Auto-Regressive Artist-mesh Creation with Reinforcement Learning
Poster Session 3 & Exhibit Hall
Ruowen Zhao ⋅ James Jun Liang Chen Ye ⋅ Zhengyi Wang ⋅ Guangce Liu ⋅ Yiwen Chen ⋅ Yikai Wang ⋅ Jun Zhu
|
Exhibit Hall I #56 | |
|
Im2Haircut: Single-view Strand-based Hair Reconstruction for Human Avatars
Poster Session 3 & Exhibit Hall
Vanessa Sklyarova ⋅ Egor Zakharov ⋅ Malte Prinzler ⋅ Giorgio Becherini ⋅ Michael Black ⋅ Justus Thies
|
Exhibit Hall I #60 | |
|
AFUNet: Cross-Iterative Alignment-Fusion Synergy for HDR Reconstruction via Deep Unfolding Paradigm
Poster Session 3 & Exhibit Hall
Xinyue Li ⋅ Zhangkai Ni ⋅ Wenhan Yang
|
Exhibit Hall I #61 | |
|
PINO: Person-Interaction Noise Optimization for Long-Duration and Customizable Motion Generation of Arbitrary-Sized Groups
Poster Session 3 & Exhibit Hall
Sakuya Ota ⋅ Qing Yu ⋅ Kent Fujiwara ⋅ Satoshi Ikehata ⋅ Ikuro Sato
|
Exhibit Hall I #62 | |
|
TeRA: Rethinking Text-guided Realistic 3D Avatar Generation
Poster Session 3 & Exhibit Hall
Yanwen Wang ⋅ Yiyu Zhuang ⋅ Jiawei Zhang ⋅ Li Wang ⋅ Yifei Zeng ⋅ Xun Cao ⋅ Xinxin Zuo ⋅ Hao Zhu
|
Exhibit Hall I #63 | |
|
A Unified Framework for Motion Reasoning and Generation in Human Interaction
Poster Session 3 & Exhibit Hall
Jeongeun Park ⋅ Sungjoon Choi ⋅ Sangdoo Yun
|
Exhibit Hall I #64 | |
|
Open-World Skill Discovery from Unsegmented Demonstration Videos
Poster Session 3 & Exhibit Hall
Jingwen Deng ⋅ Zihao Wang ⋅ Shaofei Cai ⋅ Anji Liu ⋅ Yitao Liang
|
Exhibit Hall I #65 | |
|
Deep Adaptive Unfolded Network via Spatial Morphology Stripping and Spectral Filtration for Pan-sharpening
Poster Session 3 & Exhibit Hall
Hebaixu Wang ⋅ Jiayi Ma
|
Exhibit Hall I #67 | |
|
EgoAdapt: Adaptive Multisensory Distillation and Policy Learning for Efficient Egocentric Perception
Poster Session 3 & Exhibit Hall
Sanjoy Chowdhury ⋅ Subrata Biswas ⋅ Sayan Nag ⋅ Tushar Nagarajan ⋅ Calvin Murdock ⋅ Ishwarya Ananthabhotla ⋅ Yijun Qian ⋅ Vamsi Ithapu ⋅ Dinesh Manocha ⋅ Ruohan Gao
|
Exhibit Hall I #68 | |
|
Reference-based Super-Resolution via Image-based Retrieval-Augmented Generation Diffusion
Poster Session 3 & Exhibit Hall
Byeonghun Lee ⋅ Hyunmin Cho ⋅ Honggyu Choi ⋅ Soo Min Kang ⋅ ILJUN AHN ⋅ Kyong Hwan Jin
|
Exhibit Hall I #70 | |
|
Vulnerability-Aware Spatio-Temporal Learning for Generalizable Deepfake Video Detection
Poster Session 3 & Exhibit Hall
Dat NGUYEN ⋅ Marcella Astrid ⋅ Anis Kacem ⋅ Enjie Ghorbel ⋅ Djamila Aouada
|
Exhibit Hall I #72 | |
|
EgoM2P: Egocentric Multimodal Multitask Pretraining
Poster Session 3 & Exhibit Hall
Gen Li ⋅ Yutong Chen ⋅ Yiqian Wu ⋅ KAIFENG ZHAO ⋅ Marc Pollefeys ⋅ Siyu Tang
|
Exhibit Hall I #78 | |
|
E-NeMF: Event-based Neural Motion Field for Novel Space-time View Synthesis of Dynamic Scenes
Poster Session 3 & Exhibit Hall
Yan Liu ⋅ Zehao Chen ⋅ Haojie Yan ⋅ De Ma ⋅ Huajin Tang ⋅ Qian Zheng ⋅ Gang Pan
|
Exhibit Hall I #80 | |
|
HUMOTO: A 4D Dataset of Mocap Human Object Interactions
Poster Session 3 & Exhibit Hall
Jiaxin Lu ⋅ Chun-Hao Huang ⋅ Uttaran Bhattacharya ⋅ Qixing Huang ⋅ Yi Zhou
|
Exhibit Hall I #83 | |
|
CombatVLA: An Efficient Vision-Language-Action Model for Combat Tasks in 3D Action Role-Playing Games
Poster Session 3 & Exhibit Hall
Peng Chen ⋅ Pi Bu ⋅ Yingyao Wang ⋅ Xinyi Wang ⋅ Ziming Wang ⋅ Jie Guo ⋅ Yingxiu Zhao ⋅ Qi Zhu ⋅ Jun Song ⋅ Siran Yang ⋅ Jiamang Wang ⋅ Bo Zheng
|
Exhibit Hall I #86 | |
|
CharaConsist: Fine-Grained Consistent Character Generation
Poster Session 4 & Exhibit Hall with Coffee Break
Mengyu Wang ⋅ Henghui Ding ⋅ Jianing Peng ⋅ Yao Zhao ⋅ Yunpeng Chen ⋅ Yunchao Wei
|
Exhibit Hall I #111 | |
|
MonSTeR: a Unified Model for Motion, Scene, Text Retrieval
Poster Session 3 & Exhibit Hall
Luca Collorone ⋅ Matteo Gioia ⋅ Massimiliano Pappa ⋅ Paolo Leoni ⋅ Giovanni Ficarra ⋅ Or Litany ⋅ Indro Spinelli ⋅ Fabio Galasso
|
Exhibit Hall I #88 | |
|
Nautilus: Locality-aware Autoencoder for Scalable Mesh Generation
Poster Session 3 & Exhibit Hall
Yuxuan Wang ⋅ Xuanyu Yi ⋅ Haohan Weng ⋅ Qingshan Xu ⋅ xiaokang wei ⋅ Xianghui Yang ⋅ Chunchao Guo ⋅ Long Chen ⋅ Hanwang Zhang
|
Exhibit Hall I #90 | |
|
F-Bench: Rethinking Human Preference Evaluation Metrics for Benchmarking Face Generation, Customization, and Restoration
Lu Liu ⋅ Huiyu Duan ⋅ Qiang Hu ⋅ Liu Yang ⋅ Chunlei Cai ⋅ Tianxiao Ye ⋅ Huayu Liu ⋅ Xiaoyun Zhang ⋅ Guangtao Zhai
|
Exhibit Hall I #92 | |
|
Decouple and Track: Benchmarking and Improving Video Diffusion Transformers For Motion Transfer
Poster Session 3 & Exhibit Hall
Qingyu Shi ⋅ Jianzong Wu ⋅ Jinbin Bai ⋅ Lu Qi ⋅ Jiangning Zhang ⋅ Yunhai Tong ⋅ Xiangtai Li
|
Exhibit Hall I #93 | |
|
Latent Swap Joint Diffusion for 2D Long-Form Latent Generation
Poster Session 3 & Exhibit Hall
Yusheng Dai ⋅ Chenxi Wang ⋅ Chang Li ⋅ Chen Wang ⋅ Kewei Li ⋅ Jun Du ⋅ Lei Sun ⋅ Jianqing Gao ⋅ Ruoyu Wang ⋅ Jiefeng Ma
|
Exhibit Hall I #94 | |
|
Blind Noisy Image Deblurring Using Residual Guidance Strategy
Poster Session 3 & Exhibit Hall
Heyan Liu ⋅ Jianing Sun ⋅ Jun Liu ⋅ Xi-Le Zhao ⋅ Tingting WU ⋅ Tieyong Zeng
|
Exhibit Hall I #95 | |
|
Drawing Developmental Trajectory from Cortical Surface Reconstruction
Poster Session 3 & Exhibit Hall
WENXUAN WU ⋅ ruowen qu ⋅ Zhongliang Liu ⋅ Zhuoyan Dai ⋅ Dongzi Shi ⋅ Sijin Yu ⋅ Tong Xiong ⋅ Shiping Liu ⋅ Xiangmin Xu ⋅ Xiaofen Xing ⋅ Xin Zhang
|
Exhibit Hall I #96 | |
|
Diffuman4D: 4D Consistent Human View Synthesis from Sparse-View Videos with Spatio-Temporal Diffusion Models
Poster Session 3 & Exhibit Hall
Yudong Jin ⋅ Sida Peng ⋅ Xuan Wang ⋅ Tao Xie ⋅ Zhen Xu ⋅ Yifan Yang ⋅ Yujun Shen ⋅ Hujun Bao ⋅ Xiaowei Zhou
|
Exhibit Hall I #98 | |
|
Less is More: Improving Motion Diffusion Models with Sparse Keyframes
Poster Session 3 & Exhibit Hall
Jinseok Bae ⋅ Inwoo Hwang ⋅ Young-Yoon Lee ⋅ Ziyu Guo ⋅ Joseph Liu ⋅ Yizhak Ben-Shabat ⋅ Young Min Kim ⋅ Mubbasir Kapadia
|
Exhibit Hall I #100 | |
|
DGTalker: Disentangled Generative Latent Space Learning for Audio-Driven Gaussian Talking Heads
Poster Session 3 & Exhibit Hall
Xiaoxi Liang ⋅ Yanbo Fan ⋅ Qiya Yang ⋅ Xuan Wang ⋅ Wei Gao ⋅ Ge Li
|
Exhibit Hall I #101 | |
|
VQ-VLA: Improving Vision-Language-Action Models via Scaling Vector-Quantized Action Tokenizers
Poster Session 3 & Exhibit Hall
Yating Wang ⋅ Haoyi Zhu ⋅ Mingyu Liu ⋅ Jiange Yang ⋅ Hao-Shu Fang ⋅ Tong He
|
Exhibit Hall I #102 | |
|
Augmented and Softened Matching for Unsupervised Visible-Infrared Person Re-Identification
Poster Session 3 & Exhibit Hall
Zhiqi Pang ⋅ Chunyu Wang ⋅ Lingling Zhao ⋅ Junjie Wang
|
Exhibit Hall I #103 | |
|
Temporal Unlearnable Examples: Preventing Personal Video Data from Unauthorized Exploitation by Object Tracking
Poster Session 3 & Exhibit Hall
Qiangqiang Wu ⋅ Yi Yu ⋅ Chenqi Kong ⋅ Ziquan Liu ⋅ Jia Wan ⋅ Haoliang Li ⋅ Alex Kot ⋅ Antoni Chan
|
Exhibit Hall I #104 | |
|
VLABench: A Large-Scale Benchmark for Language-Conditioned Robotics Manipulation with Long-Horizon Reasoning Tasks
Poster Session 3 & Exhibit Hall
shiduo zhang ⋅ Zhe Xu ⋅ Peiju Liu ⋅ Xiaopeng Yu ⋅ Qinghui Gao ⋅ Yuan Li ⋅ Zhaoye Fei ⋅ Zhangyue Yin ⋅ Zuxuan Wu ⋅ Yu-Gang Jiang ⋅ Xipeng Qiu
|
Exhibit Hall I #107 | |
|
TrackVerse: A Large-Scale Object-Centric Video Dataset for Image-Level Representation Learning
Poster Session 3 & Exhibit Hall
Yibing Wei ⋅ Samuel Church ⋅ Victor Suciu ⋅ Jinhong Lin ⋅ Cheng-En Wu ⋅ Pedro Morgado
|
Exhibit Hall I #108 | |
|
Reangle-A-Video: 4D Video Generation as Video-to-Video Translation
Poster Session 3 & Exhibit Hall
Hyeonho Jeong ⋅ Suhyeon Lee ⋅ Jong Ye
|
Exhibit Hall I #109 | |
|
Beyond Spatial Frequency: Pixel-wise Temporal Frequency-based Deepfake Video Detection
Taehoon Kim ⋅ Jongwook Choi ⋅ Yonghyun Jeong ⋅ Haeun Noh ⋅ Jaejun Yoo ⋅ Seungryul Baek ⋅ Jongwon Choi
|
Exhibit Hall I #112 | |
|
Causal-Entity Reflected Egocentric Traffic Accident Video Synthesis
Poster Session 3 & Exhibit Hall
Lei-lei Li ⋅ Jianwu Fang ⋅ Junbin Xiao ⋅ Shanmin Pang ⋅ Hongkai Yu ⋅ Chen Lv ⋅ Jianru Xue ⋅ Tat-Seng Chua
|
Exhibit Hall I #113 | |
|
Robust Test-Time Adaptation for Single Image Denoising Using Deep Gaussian Prior
Poster Session 3 & Exhibit Hall
Qing Ma ⋅ Pengwei Liang ⋅ Xiong Zhou ⋅ Jiayi Ma ⋅ Junjun Jiang ⋅ Zhe Peng
|
Exhibit Hall I #115 | |
|
Hierarchical-aware Orthogonal Disentanglement Framework for Fine-grained Skeleton-based Action Recognition
Poster Session 3 & Exhibit Hall
Haochen Chang ⋅ Pengfei Ren ⋅ Haoyang Zhang ⋅ Liang Xie ⋅ Hongbo Chen ⋅ Erwei Yin
|
Exhibit Hall I #117 | |
|
MBTI: Masked Blending Transformers with Implicit Positional Encoding for Frame-rate Agnostic Motion Estimation
Poster Session 3 & Exhibit Hall
Jungwoo Huh ⋅ Yeseung Park ⋅ Seongjean Kim ⋅ Jungsu Kim ⋅ Sanghoon Lee
|
Exhibit Hall I #146 | |
|
Balancing Task-invariant Interaction and Task-specific Adaptation for Unified Image Fusion
Poster Session 3 & Exhibit Hall
Xingyu Hu ⋅ Junjun Jiang ⋅ Chenyang Wang ⋅ Kui Jiang ⋅ Xianming Liu ⋅ Jiayi Ma
|
Exhibit Hall I #118 | |
|
PatchScaler: An Efficient Patch-Independent Diffusion Model for Image Super-Resolution
Poster Session 3 & Exhibit Hall
Yong Liu ⋅ Hang Dong ⋅ Jinshan Pan ⋅ Qingji dong ⋅ Kai Chen ⋅ Rongxiang Zhang ⋅ Lean Fu ⋅ Fei Wang
|
Exhibit Hall I #120 | |
|
Disentangled Clothed Avatar Generation with Layered Representation
Weitian Zhang ⋅ Yichao Yan ⋅ Sijing Wu ⋅ Manwen Liao ⋅ Xiaokang Yang
|
Exhibit Hall I #124 | |
|
Augmented Mass-Spring Model for Real-Time Dense Hair Simulation
Poster Session 3 & Exhibit Hall
Jorge Herrera ⋅ Yi Zhou ⋅ Xin Sun ⋅ Zhixin Shu ⋅ Chengan He ⋅ Soren Pirk ⋅ Dominik Michels
|
Exhibit Hall I #125 | |
|
Punching Bag vs. Punching Person: Motion Transferability in Videos
Poster Session 3 & Exhibit Hall
Raiyaan Abdullah ⋅ Jared Claypoole ⋅ Michael Cogswell ⋅ Ajay Divakaran ⋅ Yogesh Rawat
|
Exhibit Hall I #126 | |
|
G-DexGrasp: Generalizable Dexterous Grasping Synthesis Via Part-Aware Prior Retrieval and Prior-Assisted Generation
Poster Session 3 & Exhibit Hall
Juntao Jian ⋅ Xiuping Liu ⋅ Zixuanchen Zixuanchen ⋅ Manyi Li ⋅ Jian Liu ⋅ Ruizhen Hu
|
Exhibit Hall I #135 | |
|
WarpHE4D: Dense 4D Head Map toward Full Head Reconstruction
Poster Session 3 & Exhibit Hall
Jongseob Yun ⋅ Yong-Hoon Kwon ⋅ Min-Gyu Park ⋅ Ju-Mi Kang ⋅ Min-Ho Lee ⋅ Inho Chang ⋅ Ju Yoon ⋅ Kuk-Jin Yoon
|
Exhibit Hall I #138 | |
|
PrimHOI: Compositional Human-Object Interaction via Reusable Primitives
Poster Session 3 & Exhibit Hall
Kai Jia ⋅ Tengyu Liu ⋅ Mingtao Pei ⋅ Yixin Zhu ⋅ Siyuan Huang
|
Exhibit Hall I #139 | |
|
Continuous-Time Human Motion Field from Event Cameras
Poster Session 3 & Exhibit Hall
Ziyun (Claude) Wang ⋅ Ruijun Zhang ⋅ Zi-Yan Liu ⋅ Yufu Wang ⋅ Kostas Daniilidis
|
Exhibit Hall I #140 | |
|
GENMO: A GENeralist Model for Human MOtion
Jiefeng Li ⋅ Jinkun Cao ⋅ Haotian Zhang ⋅ Davis Rempe ⋅ Jan Kautz ⋅ Umar Iqbal ⋅ Ye Yuan
|
Exhibit Hall I #166 | |
|
Efficient Track Anything
Poster Session 3 & Exhibit Hall
Yunyang Xiong ⋅ Chong Zhou ⋅ Xiaoyu Xiang ⋅ Lemeng Wu ⋅ Chenchen Zhu ⋅ Zechun Liu ⋅ Saksham Suri ⋅ Balakrishnan Varadarajan ⋅ Ramya Akula ⋅ Forrest Iandola ⋅ Raghuraman Krishnamoorthi ⋅ Bilge Soran ⋅ Vikas Chandra
|
Exhibit Hall I #141 | |
|
HAMoBE: Hierarchical and Adaptive Mixture of Biometric Experts for Video-based Person ReID
Poster Session 3 & Exhibit Hall
Yiyang Su ⋅ Yunping Shi ⋅ Feng Liu ⋅ Xiaoming Liu
|
Exhibit Hall I #142 | |
|
Multi-Object Sketch Animation by Scene Decomposition and Motion Planning
Poster Session 3 & Exhibit Hall
Jingyu Liu ⋅ Zijie Xin ⋅ Yuhan Fu ⋅ Ruixiang Zhao ⋅ Bangxiang Lan ⋅ Xirong Li
|
Exhibit Hall I #143 | |
|
ISP2HRNet: Learning to Reconstruct High Resolution Image from Irregularly Sampled Pixels via Hierarchical Gradient Learning
Yuanlin Wang ⋅ Ruiqin Xiong ⋅ Rui Zhao ⋅ Jin Wang ⋅ Xiaopeng Fan ⋅ Tiejun Huang
|
Exhibit Hall I #144 | |
|
Sequential keypoint density estimator: an overlooked baseline of skeleton-based video anomaly detection
Anja Delić ⋅ Matej Grcic ⋅ Siniša Šegvić
|
Exhibit Hall I #147 | |
|
GameFactory: Creating New Games with Generative Interactive Videos
Jiwen Yu ⋅ Yiran Qin ⋅ Xintao Wang ⋅ Pengfei Wan ⋅ Di ZHANG ⋅ Xihui Liu
|
Exhibit Hall I #150 | |
|
FaceCraft4D: Animated 3D Facial Avatar Generation from a Single Image
Poster Session 3 & Exhibit Hall
Fei Yin ⋅ Mallikarjun Reddy ⋅ Chun-Han Yao ⋅ Rafal Mantiuk ⋅ Varun Jampani
|
Exhibit Hall I #152 | |
|
Event-Driven Storytelling with Multiple Lifelike Humans in a 3D Scene
Poster Session 3 & Exhibit Hall
Donggeun Lim ⋅ Jinseok Bae ⋅ Inwoo Hwang ⋅ Seungmin Lee ⋅ Hwanhee Lee ⋅ Young Min Kim
|
Exhibit Hall I #156 | |
|
EvolvingGrasp: Evolutionary Grasp Generation via Efficient Preference Alignment
Poster Session 3 & Exhibit Hall
Yufei Zhu ⋅ Yiming Zhong ⋅ Zemin Yang ⋅ Peishan Cong ⋅ Jingyi Yu ⋅ Xinge Zhu ⋅ Yuexin Ma
|
Exhibit Hall I #157 | |
|
Efficient Autoregressive Shape Generation via Octree-Based Adaptive Tokenization
Poster Session 3 & Exhibit Hall
Kangle Deng ⋅ Hsueh-Ti Derek Liu ⋅ Yiheng Zhu ⋅ Xiaoxia Sun ⋅ Chong Shang ⋅ Kiran Bhat ⋅ Deva Ramanan ⋅ Jun-Yan Zhu ⋅ Maneesh Agrawala ⋅ Tinghui Zhou
|
Exhibit Hall I #159 | |
|
EAMamba: Efficient All-Around Vision State Space Model for Image Restoration
Poster Session 3 & Exhibit Hall
Yu-Cheng Lin ⋅ Yu-Syuan Xu ⋅ Hao-Wei Chen ⋅ Hsien-Kai Kuo ⋅ Chun-Yi Lee
|
Exhibit Hall I #161 | |
|
SyncDiff: Synchronized Motion Diffusion for Multi-Body Human-Object Interaction Synthesis
Poster Session 3 & Exhibit Hall
Wenkun He ⋅ Yun Liu ⋅ Ruitao Liu ⋅ Li Yi
|
Exhibit Hall I #163 | |
|
Fast Image Super-Resolution via Consistency Rectified Flow
Poster Session 3 & Exhibit Hall
Jiaqi Xu ⋅ Wenbo Li ⋅ Haoze Sun ⋅ Fan Li ⋅ Zhixin Wang ⋅ Long Peng ⋅ Jingjing Ren ⋅ HAORAN YANG ⋅ Xiaowei Hu ⋅ Renjing Pei ⋅ Pheng-Ann Heng
|
Exhibit Hall I #165 | |
|
Event-guided HDR Reconstruction with Diffusion Priors
Poster Session 3 & Exhibit Hall
Yixin Yang ⋅ jiawei zhang ⋅ Yang Zhang ⋅ Yunxuan Wei ⋅ Dongqing Zou ⋅ Jimmy Ren ⋅ Boxin Shi
|
Exhibit Hall I #168 | |
|
Learning Efficient and Generalizable Human Representation with Human Gaussian Model
Poster Session 3 & Exhibit Hall
Yifan Liu ⋅ Shengjun Zhang ⋅ Chensheng Dai ⋅ Yang Chen ⋅ Hao Liu ⋅ Chen Li ⋅ Yueqi Duan
|
Exhibit Hall I #169 | |
|
SMGDiff: Soccer Motion Generation using Diffusion Probabilistic Models
Poster Session 3 & Exhibit Hall
Hongdi Yang ⋅ Chengyang Li ⋅ Zhenxuan Wu ⋅ Gaozheng Li ⋅ Jingya Wang ⋅ Jingyi Yu ⋅ Zhuo Su ⋅ Lan Xu
|
Exhibit Hall I #170 | |
|
AffordDexGrasp: Open-set Language-guided Dexterous Grasp with Generalizable-Instructive Affordance
Poster Session 3 & Exhibit Hall
Yilin Wei ⋅ Mu Lin ⋅ Yuhao Lin ⋅ Jian-Jian Jiang ⋅ Xiao-Ming Wu ⋅ Ling-An Zeng ⋅ Wei-Shi Zheng
|
Exhibit Hall I #171 | |
|
Robust Adverse Weather Removal via Spectral-based Spatial Grouping
Poster Session 3 & Exhibit Hall
Yuhwan Jeong ⋅ Yunseo Yang ⋅ Youngho Yoon ⋅ Kuk-Jin Yoon
|
Exhibit Hall I #176 | |
|
Switch-a-View: View Selection Learned from Unlabeled In-the-wild Videos
Poster Session 3 & Exhibit Hall
Sagnik Majumder ⋅ Tushar Nagarajan ⋅ Ziad Al-Halah ⋅ Kristen Grauman
|
Exhibit Hall I #185 | |
|
DiTaiListener: Controllable High Fidelity Listener Video Generation with Diffusion
Poster Session 3 & Exhibit Hall
Maksim Siniukov ⋅ Di Chang ⋅ Minh Tran ⋅ Hongkun Gong ⋅ Ashutosh Chaubey ⋅ Mohammad Soleymani
|
Exhibit Hall I #187 | |
|
Hipandas: Hyperspectral Image Joint Denoising and Super-Resolution by Image Fusion with the Panchromatic Image
Poster Session 3 & Exhibit Hall
Shuang Xu ⋅ Zixiang Zhao ⋅ Haowen Bai ⋅ Chang Yu ⋅ Jiangjun Peng ⋅ Xiangyong Cao ⋅ Deyu Meng
|
Exhibit Hall I #188 | |
|
Ponimator: Unfolding Interactive Pose for Versatile Human-human Interaction Animation
Poster Session 3 & Exhibit Hall
Shaowei Liu ⋅ chuan guo ⋅ Bing Zhou ⋅ Jian Wang
|
Exhibit Hall I #194 | |
|
Scaling Action Detection: AdaTAD++ with Transformer-Enhanced Temporal-Spatial Adaptation
Poster Session 3 & Exhibit Hall
Tanay Agrawal ⋅ Abid Ali ⋅ Antitza Dantcheva ⋅ Francois Bremond
|
Exhibit Hall I #208 | |
|
Avat3r: Large Animatable Gaussian Reconstruction Model for High-fidelity 3D Head Avatars
Poster Session 3 & Exhibit Hall
Tobias Kirschstein ⋅ Javier Romero ⋅ Artem Sevastopolsky ⋅ Matthias Nießner ⋅ Shunsuke Saito
|
Exhibit Hall I #196 | |
|
Skeleton Motion Words for Unsupervised Skeleton-based Temporal Action Segmentation
Poster Session 3 & Exhibit Hall
Uzay Gökay ⋅ Federico Spurio ⋅ Dominik Bach ⋅ Juergen Gall
|
Exhibit Hall I #197 | |
|
DH-FaceVid-1K: A Large-Scale High-Quality Dataset for Face Video Generation
Poster Session 3 & Exhibit Hall
Donglin Di ⋅ He Feng ⋅ Wenzhang SUN ⋅ Yongjia Ma ⋅ Hao Li ⋅ Chen Wei ⋅ Lei Fan ⋅ Tonghua Su ⋅ Xun Yang
|
Exhibit Hall I #199 | |
|
Synthetic Video Enhances Physical Fidelity in Video Synthesis
Poster Session 3 & Exhibit Hall
Qi Zhao ⋅ Xingyu Ni ⋅ Ziyu Wang ⋅ Feng Cheng ⋅ Ziyan Yang ⋅ Lu Jiang ⋅ Bohan Wang
|
Exhibit Hall I #200 | |
|
TimeBooth: Disentangled Facial Invariant Representation for Diverse and Personalized Face Aging
Poster Session 3 & Exhibit Hall
Zepeng Su ⋅ zhulin liu ⋅ Zongyan Zhang ⋅ Tong Zhang ⋅ C.L.Philip Chen
|
Exhibit Hall I #201 | |
|
Identity Preserving 3D Head Stylization with Multiview Score Distillation
Poster Session 3 & Exhibit Hall
Bahri Batuhan Bilecen ⋅ Ahmet Berke Gokmen ⋅ Furkan Güzelant ⋅ Aysegul Dundar
|
Exhibit Hall I #203 | |
|
IDF: Iterative Dynamic Filtering Networks for Generalizable Image Denoising
Poster Session 3 & Exhibit Hall
Dongjin Kim ⋅ Jaekyun Ko ⋅ Muhammad Kashif Ali ⋅ Tae Hyun Kim
|
Exhibit Hall I #204 | |
|
Who is a Better Talker: Subjective and Objective Quality Assessment for AI-Generated Talking Heads
Poster Session 3 & Exhibit Hall
Yingjie Zhou ⋅ Jiezhang Cao ⋅ Zicheng Zhang ⋅ Farong Wen ⋅ Jiang Yanwei ⋅ Jun Jia ⋅ Xiaohong Liu ⋅ Xiongkuo Min ⋅ Guangtao Zhai
|
Exhibit Hall I #206 | |
|
Towards Efficient General Feature Prediction in Masked Skeleton Modeling
Poster Session 3 & Exhibit Hall
Shengkai Sun ⋅ Zefan Zhang ⋅ Jianfeng Dong ⋅ Zhiyong Cheng ⋅ Xiaojun Chang ⋅ Meng Wang
|
Exhibit Hall I #207 | |
|
How Would It Sound? Material-Controlled Multimodal Acoustic Profile Generation for Indoor Scenes
Poster Session 3 & Exhibit Hall
Mahnoor Saad ⋅ Ziad Al-Halah
|
Exhibit Hall I #209 | |
|
VideoSetDiff: Identifying and Reasoning Similarities and Differences in Similar Videos
Poster Session 3 & Exhibit Hall
YUE QIU ⋅ Yanjun Sun ⋅ Takuma Yagi ⋅ Shusaku Egami ⋅ Natsuki Miyata ⋅ Ken Fukuda ⋅ Kensho Hara ⋅ Ryusuke Sagawa
|
Exhibit Hall I #210 | |
|
Occlusion-robust Stylization for Drawing-based 3D Animation
Poster Session 3 & Exhibit Hall
Sunjae Yoon ⋅ Gwanhyeong Koo ⋅ Younghwan Lee ⋅ Ji Woo Hong ⋅ Chang Yoo
|
Exhibit Hall I #212 | |
|
Video Individual Counting for Moving Drones
Yaowu Fan ⋅ Jia Wan ⋅ Tao Han ⋅ Antoni Chan ⋅ Jinhua Ma
|
Exhibit Hall I #214 | |
|
NAPPure: Adversarial Purification for Robust Image Classification under Non-Additive Perturbations
Poster Session 1 & Exhibit Hall
Junjie Nan ⋅ Jianing Li ⋅ Wei Chen ⋅ Mingkun Zhang ⋅ Xueqi Cheng
|
Exhibit Hall I #205 | |
|
FaceLift: Learning Generalizable Single Image 3D Face Reconstruction from Synthetic Heads
Poster Session 3 & Exhibit Hall
Weijie Lyu ⋅ Yi Zhou ⋅ Ming-Hsuan Yang ⋅ Zhixin Shu
|
Exhibit Hall I #253 | |
|
What Changed and What Could Have Changed? State-Change Counterfactuals for Procedure-Aware Video Representation Learning
Poster Session 3 & Exhibit Hall
Chi-Hsi Kung ⋅ Frangil Ramirez ⋅ Juhyung Ha ⋅ Yi-Hsuan Tsai ⋅ Yi-Ting Chen ⋅ David Crandall
|
Exhibit Hall I #215 | |
|
HADES: Human Avatar with Dynamic Explicit Hair Strands
Poster Session 3 & Exhibit Hall
Zhanfeng Liao ⋅ Hanzhang Tu ⋅ Cheng Peng ⋅ Hongwen Zhang ⋅ Boyao Zhou ⋅ Yebin Liu
|
Exhibit Hall I #217 | |
|
FlowDPS : Flow-Driven Posterior Sampling for Inverse Problems
Poster Session 3 & Exhibit Hall
Jeongsol Kim ⋅ Bryan Sangwoo Kim ⋅ Jong Ye
|
Exhibit Hall I #218 | |
|
ZFusion: Efficient Deep Compositional Zero-shot Learning for Blind Image Super-Resolution with Generative Diffusion Prior
Poster Session 3 & Exhibit Hall
Alireza Esmaeilzehi ⋅ Hossein Zaredar ⋅ Yapeng Tian ⋅ Laleh Seyyed-Kalantari
|
Exhibit Hall I #219 | |
|
Stable Virtual Camera: Generative View Synthesis with Diffusion Models
Poster Session 3 & Exhibit Hall
Jensen Zhou ⋅ Hang Gao ⋅ Vikram Voleti ⋅ Aaryaman Vasishta ⋅ Chun-Han Yao ⋅ Mark Boss ⋅ Philip Torr ⋅ Christian Rupprecht ⋅ Varun Jampani
|
Exhibit Hall I #227 | |
|
VLIPP: Towards Physically Plausible Video Generation with Vision and Language Informed Physical Prior
Poster Session 3 & Exhibit Hall
Xindi Yang ⋅ Baolu Li ⋅ Yiming Zhang ⋅ Zhenfei Yin ⋅ LEI BAI ⋅ Liqian Ma ⋅ Zhiyong Wang ⋅ Jianfei Cai ⋅ Tien-Tsin Wong ⋅ Huchuan Lu ⋅ Xu Jia
|
Exhibit Hall I #221 | |
|
StreamDiffusion: A Pipeline-level Solution for Real-Time Interactive Generation
Poster Session 3 & Exhibit Hall
Akio Kodaira ⋅ Chenfeng Xu ⋅ Toshiki Hazama ⋅ Takanori Yoshimoto ⋅ Kohei Ohno ⋅ Shogo Mitsuhori ⋅ Soichi Sugano ⋅ Hanying Cho ⋅ Zhijian Liu ⋅ Masayoshi Tomizuka ⋅ Kurt Keutzer
|
Exhibit Hall I #222 | |
|
DreamRelation: Relation-Centric Video Customization
Poster Session 3 & Exhibit Hall
Yujie Wei ⋅ Shiwei Zhang ⋅ Hangjie Yuan ⋅ Biao Gong ⋅ Longxiang Tang ⋅ Xiang Wang ⋅ Haonan Qiu ⋅ Hengjia Li ⋅ Shuai Tan ⋅ Yingya Zhang ⋅ Hongming Shan
|
Exhibit Hall I #225 | |
|
ModSkill: Physical Character Skill Modularization
Poster Session 3 & Exhibit Hall
Yiming Huang ⋅ Zhiyang Dou ⋅ Lingjie Liu
|
Exhibit Hall I #226 | |
|
Rethinking Bimanual Robotic Manipulation: Learning with Decoupled Interaction Framework
Poster Session 3 & Exhibit Hall
Jian-Jian Jiang ⋅ Xiao-Ming Wu ⋅ Yi-Xiang He ⋅ Ling-An Zeng ⋅ Yilin Wei ⋅ Dandan Zhang ⋅ Wei-Shi Zheng
|
Exhibit Hall I #229 | |
|
Free-Form Motion Control: Controlling the 6D Poses of Camera and Objects in Video Generation
Poster Session 3 & Exhibit Hall
Xincheng Shuai ⋅ Henghui Ding ⋅ Zhenyuan Qin ⋅ Hao Luo ⋅ Xingjun Ma ⋅ Dacheng Tao
|
Exhibit Hall I #231 | |
|
Learning A Unified Template for Gait Recognition
Poster Session 3 & Exhibit Hall
Panjian Huang ⋅ Saihui Hou ⋅ Junzhou Huang ⋅ Yongzhen Huang
|
Exhibit Hall I #232 | |
|
Synchronization of Multiple Videos
Poster Session 3 & Exhibit Hall
Avihai Naaman ⋅ Ron Shapira Weber ⋅ Oren Freifeld
|
Exhibit Hall I #237 | |
|
DeepShield: Fortifying Deepfake Video Detection with Local and Global Forgery Analysis
Poster Session 3 & Exhibit Hall
Yinqi Cai ⋅ Jichang Li ⋅ Zhaolun Li ⋅ Weikai Chen ⋅ Rushi Lan ⋅ Xi Xie ⋅ Xiaonan Luo ⋅ Guanbin Li
|
Exhibit Hall I #238 | |
|
VertexRegen: Mesh Generation with Continuous Level of Detail
Poster Session 3 & Exhibit Hall
Xiang Zhang ⋅ Yawar Siddiqui ⋅ Armen Avetisyan ⋅ Christopher Xie ⋅ Jakob Engel ⋅ Henry Howard-Jenkins
|
Exhibit Hall I #242 | |
|
GestureHYDRA: Semantic Co-speech Gesture Synthesis via Hybrid Modality Diffusion Transformer and Cascaded-Synchronized Retrieval-Augmented Generation
Poster Session 3 & Exhibit Hall
Quanwei Yang ⋅ Luying Huang ⋅ Kaisiyuan Wang ⋅ Jiazhi Guan ⋅ Shengyi He ⋅ Fengguo Li ⋅ Hang Zhou ⋅ Lingyun Yu ⋅ Yingying Li ⋅ Haocheng Feng ⋅ Hongtao Xie
|
Exhibit Hall I #246 | |
|
FoundIR: Unleashing Million-scale Training Data to Advance Foundation Models for Image Restoration
Poster Session 3 & Exhibit Hall
Hao Li ⋅ Xiang Chen ⋅ Jiangxin Dong ⋅ Jinhui Tang ⋅ Jinshan Pan
|
Exhibit Hall I #247 | |
|
Highlight What You Want: Weakly-Supervised Instance-Level Controllable Infrared-Visible Image Fusion
Poster Session 3 & Exhibit Hall
Zeyu Wang ⋅ Jizheng Zhang ⋅ Haiyu Song ⋅ Mingyu Ge ⋅ Jiayu Wang ⋅ Haoran Duan
|
Exhibit Hall I #248 | |
|
Adaptive Hyper-Graph Convolution Network for Skeleton-based Human Action Recognition with Virtual Connections
Poster Session 3 & Exhibit Hall
Youwei Zhou ⋅ Tianyang Xu ⋅ Cong Wu ⋅ Xiaojun Wu ⋅ Josef Kittler
|
Exhibit Hall I #249 | |
|
Precise Action-to-Video Generation Through Visual Action Prompts
Poster Session 3 & Exhibit Hall
Yuang Wang ⋅ Chao Wen ⋅ Haoyu Guo ⋅ Sida Peng ⋅ Minghan Qin ⋅ Hujun Bao ⋅ Ruizhen Hu ⋅ Xiaowei Zhou
|
Exhibit Hall I #255 | |
|
PRIMAL: Physically Reactive and Interactive Motor Model for Avatar Learning
Poster Session 3 & Exhibit Hall
Yan Zhang ⋅ Yao Feng ⋅ Alpár Cseke ⋅ Nitin Saini ⋅ Nathan Bajandas ⋅ Nicolas Heron ⋅ Michael Black
|
Exhibit Hall I #256 | |
|
Bridging the Skeleton-Text Modality Gap: Diffusion-Powered Modality Alignment for Zero-shot Skeleton-based Action Recognition
Poster Session 3 & Exhibit Hall
Jeonghyeok Do ⋅ Munchurl Kim
|
Exhibit Hall I #259 | |
|
Visual Chronicles: Using Multimodal LLMs to Analyze Massive Collections of Images
Boyang Deng ⋅ Kyle Genova ⋅ Songyou Peng ⋅ Gordon Wetzstein ⋅ Noah Snavely ⋅ Leonidas Guibas ⋅ Thomas Funkhouser
|
Exhibit Hall I #260 | |
|
Latent-Reframe: Enabling Camera Control for Video Diffusion Models without Training
Poster Session 3 & Exhibit Hall
Zhenghong Zhou ⋅ Jie An ⋅ Jiebo Luo
|
Exhibit Hall I #261 | |
|
GeoAvatar: Adaptive Geometrical Gaussian Splatting for 3D Head Avatar
Poster Session 3 & Exhibit Hall
SeungJun Moon ⋅ Hah Min Lew ⋅ Seungeun Lee ⋅ Ji-Su Kang ⋅ Gyeong-Moon Park
|
Exhibit Hall I #264 | |
|
Image Intrinsic Scale Assessment: Bridging the Gap Between Quality and Resolution
Poster Session 3 & Exhibit Hall
Vlad Hosu ⋅ Lorenzo Agnolucci ⋅ Daisuke Iso ⋅ Dietmar Saupe
|
Exhibit Hall I #269 | |
|
Frequency-Guided Posterior Sampling for Diffusion-Based Image Restoration
Poster Session 3 & Exhibit Hall
Darshan Thaker ⋅ Abhishek Goyal ⋅ Rene Vidal
|
Exhibit Hall I #270 | |
|
GAS: Generative Avatar Synthesis from a Single Image
Poster Session 3 & Exhibit Hall
Yixing Lu ⋅ Junting Dong ⋅ YoungJoong Kwon ⋅ Qin Zhao ⋅ Bo Dai ⋅ Fernando De la Torre
|
Exhibit Hall I #271 | |
|
Less Static, More Private: Towards Transferable Privacy-Preserving Action Recognition by Generative Decoupled Learning
Poster Session 3 & Exhibit Hall
Zhi-Wei Xia ⋅ Kun-Yu Lin ⋅ Yuan-Ming Li ⋅ Wei-Jin Huang ⋅ Xian-Tuo Tan ⋅ Wei-Shi Zheng
|
Exhibit Hall I #272 | |
|
Bitrate-Controlled Diffusion for Disentangling Motion and Content in Video
Poster Session 3 & Exhibit Hall
Xiao Li ⋅ Qi Chen ⋅ Xiulian Peng ⋅ Kai Yu ⋅ Xie Chen ⋅ Yan Lu
|
Exhibit Hall I #273 | |
|
Blind2Sound: Self-Supervised Image Denoising without Residual Noise
Poster Session 3 & Exhibit Hall
Jiazheng Liu ⋅ Zejin Wang ⋅ Bohao Chen ⋅ Hua Han
|
Exhibit Hall I #276 | |
|
Unified Multimodal Understanding via Byte-Pair Visual Encoding
Wanpeng Zhang ⋅ Yicheng Feng ⋅ Hao Luo ⋅ Yijiang Li ⋅ Zihao Yue ⋅ Sipeng Zheng ⋅ Zongqing Lu
|
Exhibit Hall I #280 | |
|
IMoRe: Implicit Program-Guided Reasoning for Human Motion Q&A
Poster Session 3 & Exhibit Hall
Chen Li ⋅ Chinthani Sugandhika ⋅ Ee Yeo Keat ⋅ Eric Peh ⋅ Hao Zhang ⋅ HONG YANG ⋅ Deepu Rajan ⋅ Basura Fernando
|
Exhibit Hall I #281 | |
|
Privacy-centric Deep Motion Retargeting for Anonymization of Skeleton-Based Motion Visualization
Poster Session 3 & Exhibit Hall
Thomas Carr ⋅ Depeng Xu ⋅ Shuhan Yuan ⋅ Aidong Lu
|
Exhibit Hall I #297 | |
|
AdaDCP: Learning an Adapter with Discrete Cosine Prior for Clear-to-Adverse Domain Generalization
Poster Session 3 & Exhibit Hall
Qi Bi ⋅ Yixian Shen ⋅ Jingjun Yi ⋅ Gui-Song Xia
|
Exhibit Hall I #282 | |
|
MorphoGen: Efficient Unconditional Generation of Long-Range Projection Neuronal Morphology via a Global-to-Local Framework
Poster Session 3 & Exhibit Hall
Tianfang Zhu ⋅ Hongyang Zhou ⋅ Anan LI
|
Exhibit Hall I #284 | |
|
GaussianSpeech: Audio-Driven Personalized 3D Gaussian Avatars
Poster Session 3 & Exhibit Hall
Shivangi Aneja ⋅ Artem Sevastopolsky ⋅ Tobias Kirschstein ⋅ Justus Thies ⋅ Angela Dai ⋅ Matthias Nießner
|
Exhibit Hall I #288 | |
|
A Quality-Guided Mixture of Score-Fusion Experts Framework for Human Recognition
Poster Session 3 & Exhibit Hall
Jie Zhu ⋅ Yiyang Su ⋅ Minchul Kim ⋅ Anil Jain ⋅ Xiaoming Liu
|
Exhibit Hall I #289 | |
|
Capturing head avatar with hand contacts from a monocular video
Poster Session 3 & Exhibit Hall
Haonan He ⋅ Yufeng Zheng ⋅ Jie Song
|
Exhibit Hall I #291 | |
|
Tiling artifacts and trade-offs of feature normalization in the segmentation of large biological images
Elena Buglakova ⋅ Anwai Archit ⋅ Edoardo D'Imprima ⋅ Julia Mahamid ⋅ Constantin Pape ⋅ Anna Kreshuk
|
Exhibit Hall I #292 | |
|
GenM3: Generative Pretrained Multi-path Motion Model for Text Conditional Human Motion Generation
Poster Session 3 & Exhibit Hall
Junyu Shi ⋅ Lijiang LIU ⋅ Yong Sun ⋅ Zhiyuan Zhang ⋅ JINNI ZHOU ⋅ Qiang Nie
|
Exhibit Hall I #294 | |
|
Saliency-Aware Quantized Imitation Learning for Efficient Robotic Control
Poster Session 3 & Exhibit Hall
Seongmin Park ⋅ Hyungmin Kim ⋅ Sangwoo kim ⋅ Wonseok Jeon ⋅ Juyoung Yang ⋅ Byeongwook Jeon ⋅ Yoonseon Oh ⋅ Jungwook Choi
|
Exhibit Hall I #295 | |
|
MAVFlow: Preserving Paralinguistic Elements with Conditional Flow Matching for Zero-Shot AV2AV Multilingual Translation
Poster Session 3 & Exhibit Hall
Sungwoo Cho ⋅ Jeongsoo Choi ⋅ Sungnyun Kim ⋅ Se-Young Yun
|
Exhibit Hall I #296 | |
|
UniPhys: Unified Planner and Controller with Diffusion for Flexible Physics-Based Character Control
Yan Wu ⋅ Korrawe Karunratanakul ⋅ Zhengyi Luo ⋅ Siyu Tang
|
Exhibit Hall I #304 | |
|
UniRes: Universal Image Restoration for Complex Degradations
Poster Session 3 & Exhibit Hall
Mo Zhou ⋅ Keren Ye ⋅ Mauricio Delbracio ⋅ Peyman Milanfar ⋅ Vishal Patel ⋅ Hossein Talebi
|
Exhibit Hall I #306 | |
|
SV4D 2.0: Enhancing Spatio-Temporal Consistency in Multi-View Video Diffusion for High-Quality 4D Generation
Poster Session 3 & Exhibit Hall
Chun-Han Yao ⋅ Yiming Xie ⋅ Vikram Voleti ⋅ Huaizu Jiang ⋅ Varun Jampani
|
Exhibit Hall I #307 | |
|
Light-A-Video: Training-free Video Relighting via Progressive Light Fusion
Poster Session 3 & Exhibit Hall
Yujie Zhou ⋅ Jiazi Bu ⋅ Pengyang Ling ⋅ Pan Zhang ⋅ Tong Wu ⋅ Qidong Huang ⋅ Jinsong Li ⋅ Xiaoyi Dong ⋅ Yuhang Zang ⋅ Yuhang Cao ⋅ Anyi Rao ⋅ Jiaqi Wang ⋅ Li Niu
|
Exhibit Hall I #313 | |
|
Group-wise Scaling and Orthogonal Decomposition for Domain-Invariant Feature Extraction in Face Anti-Spoofing
Poster Session 3 & Exhibit Hall
Seungjin Jung ⋅ Kanghee Lee ⋅ Yonghyun Jeong ⋅ Haeun Noh ⋅ Jungmin Lee ⋅ Jongwon Choi
|
Exhibit Hall I #318 | |
|
SVG-Head: Hybrid Surface-Volumetric Gaussians for High-Fidelity Head Reconstruction and Real-Time Editing
Poster Session 3 & Exhibit Hall
Heyi Sun ⋅ Cong Wang ⋅ Tian-Xing Xu ⋅ Jingwei Huang ⋅ Di Kang ⋅ Chunchao Guo ⋅ Song-Hai Zhang
|
Exhibit Hall I #314 | |
|
Go to Zero: Towards Zero-shot Motion Generation with Million-scale Data
Ke Fan ⋅ Shunlin Lu ⋅ Minyue Dai ⋅ Runyi Yu ⋅ Lixing Xiao ⋅ Zhiyang Dou ⋅ Junting Dong ⋅ Lizhuang Ma ⋅ Jingbo Wang
|
Exhibit Hall I #315 | |
|
StyleMotif: Multi-Modal Motion Stylization using Style-Content Cross Fusion
Poster Session 3 & Exhibit Hall
Ziyu Guo ⋅ Young-Yoon Lee ⋅ Joseph Liu ⋅ Yizhak Ben-Shabat ⋅ Victor Zordan ⋅ Mubbasir Kapadia
|
Exhibit Hall I #316 | |
|
I2V3D: Controllable Image-to-video Generation with 3D Guidance
Poster Session 3 & Exhibit Hall
Zhiyuan Zhang ⋅ Dongdong Chen ⋅ Jing Liao
|
Exhibit Hall I #317 | |
|
FakeRadar: Probing Forgery Outliers to Detect Unknown Deepfake Videos
Poster Session 3 & Exhibit Hall
Zhaolun Li ⋅ Jichang Li ⋅ Yinqi Cai ⋅ Junye Chen ⋅ Xiaonan Luo ⋅ Guanbin Li ⋅ Rushi Lan
|
Exhibit Hall I #319 | |
|
CameraCtrl II: Dynamic Scene Exploration via Camera-controlled Video Diffusion Models
Poster Session 3 & Exhibit Hall
Hao He ⋅ Ceyuan Yang ⋅ Shanchuan Lin ⋅ Yinghao Xu ⋅ Meng Wei ⋅ Liangke Gui ⋅ Qi Zhao ⋅ Gordon Wetzstein ⋅ Lu Jiang ⋅ Hongsheng Li
|
Exhibit Hall I #322 | |
|
DynamicFace: High-Quality and Consistent Face Swapping for Image and Video using Composable 3D Facial Priors
Poster Session 3 & Exhibit Hall
Runqi Wang ⋅ Yang Chen ⋅ Sijie Xu ⋅ Tianyao He ⋅ Wei Zhu ⋅ Dejia Song ⋅ Nemo Chen ⋅ Xu Tang ⋅ Yao Hu
|
Exhibit Hall I #324 | |
|
CARP: Visuomotor Policy Learning via Coarse-to-Fine Autoregressive Prediction
Poster Session 3 & Exhibit Hall
Zhefei Gong ⋅ Pengxiang Ding ⋅ Shangke Lyu ⋅ Siteng Huang ⋅ Mingyang Sun ⋅ Wei Zhao ⋅ Zhaoxin Fan ⋅ Donglin Wang
|
Exhibit Hall I #326 | |
|
AdaHuman: Animatable Detailed 3D Human Generation with Compositional Multiview Diffusion
Poster Session 3 & Exhibit Hall
Yangyi Huang ⋅ Ye Yuan ⋅ Xueting Li ⋅ Jan Kautz ⋅ Umar Iqbal
|
Exhibit Hall I #333 | |
|
Trokens: Semantic-Aware Relational Trajectory Tokens for Few-Shot Action Recognition
Poster Session 3 & Exhibit Hall
Pulkit Kumar ⋅ Shuaiyi Huang ⋅ Matthew Walmer ⋅ Sai Saketh Rambhatla ⋅ Abhinav Shrivastava
|
Exhibit Hall I #334 | |
|
Controllable Weather Synthesis and Removal with Video Diffusion Models
Poster Session 3 & Exhibit Hall
Chih-Hao Lin ⋅ Zian Wang ⋅ Ruofan Liang ⋅ Yuxuan Zhang ⋅ Sanja Fidler ⋅ Shenlong Wang ⋅ Zan Gojcic
|
Exhibit Hall I #337 | |
|
Sequential Gaussian Avatars with Hierarchical Motion Context
Poster Session 3 & Exhibit Hall
Wangze Xu ⋅ Yifan Zhan ⋅ Zhihang Zhong ⋅ Xiao Sun
|
Exhibit Hall I #338 | |
|
TokenUnify: Scaling Up Autoregressive Pretraining for Neuron Segmentation
Poster Session 3 & Exhibit Hall
Yinda Chen ⋅ Haoyuan Shi ⋅ Xiaoyu Liu ⋅ Te Shi ⋅ Ruobing Zhang ⋅ Dong Liu ⋅ Zhiwei Xiong ⋅ Feng Wu
|
Exhibit Hall I #339 | |
|
SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree
Poster Session 3 & Exhibit Hall
Shuangrui Ding ⋅ Rui Qian ⋅ Xiaoyi Dong ⋅ Pan Zhang ⋅ Yuhang Zang ⋅ Yuhang Cao ⋅ Yuwei Guo ⋅ Dahua Lin ⋅ Jiaqi Wang
|
Exhibit Hall I #340 | |
|
T2Bs: Text-to-Character Blendshapes via Video Generation
Poster Session 3 & Exhibit Hall
Jiahao Luo ⋅ Chaoyang Wang ⋅ Michael Vasilkovsky ⋅ Vladislav Shakhrai ⋅ Di Liu ⋅ Peiye Zhuang ⋅ Sergey Tulyakov ⋅ Peter Wonka ⋅ Hsin-Ying Lee ⋅ James Davis ⋅ Jian Wang
|
Exhibit Hall I #341 | |
|
Unfolding-Associative Encoder-Decoder Network with Progressive Alignment for Pansharpening
Poster Session 3 & Exhibit Hall
Shijie Fang ⋅ Hongping Gan
|
Exhibit Hall I #343 | |
|
MOERL: When Mixture-of-Experts Meet Reinforcement Learning for Adverse Weather Image Restoration
Poster Session 3 & Exhibit Hall
Tao Wang ⋅ Peiwen Xia ⋅ Bo Li ⋅ Peng-Tao Jiang ⋅ Zhe Kong ⋅ Kaihao Zhang ⋅ Tong Lu ⋅ Wenhan Luo
|
Exhibit Hall I #345 | |
|
DimensionX: Create Any 3D and 4D Scenes from a Single Image with Decoupled Video Diffusion
Poster Session 3 & Exhibit Hall
Wenqiang Sun ⋅ Shuo Chen ⋅ Fangfu Liu ⋅ Zilong Chen ⋅ Yueqi Duan ⋅ Jun Zhu ⋅ Jun Zhang ⋅ Yikai Wang
|
Exhibit Hall I #347 | |
|
LOMM: Latest Object Memory Management for Temporally Consistent Video Instance Segmentation
Poster Session 3 & Exhibit Hall
Seunghun Lee ⋅ Jiwan Seo ⋅ Minwoo Choi ⋅ Kiljoon Han ⋅ Jaehoon Jeong ⋅ Zane Durante ⋅ Ehsan Adeli ⋅ Sang Hyun Park ⋅ Sunghoon Im
|
Exhibit Hall I #349 | |
|
VoluMe – Authentic 3D Video Calls from Live Gaussian Splat Prediction
Poster Session 3 & Exhibit Hall
Martin de La Gorce ⋅ Charlie Hewitt ⋅ Tibor Takács ⋅ Robert Gerdisch ⋅ Zafiirah Hosenie ⋅ Givi Meishvili ⋅ Marek Kowalski ⋅ Thomas J. Cashman ⋅ Antonio Criminisi
|
Exhibit Hall I #355 | |
|
EVDM: Event-based Real-world Video Deblurring with Mamba
Poster Session 3 & Exhibit Hall
Zhijing Sun ⋅ Senyan Xu ⋅ Kean Liu ⋅ Runze Tian ⋅ Xueyang Fu ⋅ Zheng-Jun Zha
|
Exhibit Hall I #356 | |
|
iManip: Skill-Incremental Learning for Robotic Manipulation
Poster Session 3 & Exhibit Hall
Zexin Zheng ⋅ Jia-Feng Cai ⋅ Xiao-Ming Wu ⋅ Yilin Wei ⋅ Yu-Ming Tang ⋅ Wei-Shi Zheng ⋅ Ancong Wu
|
Exhibit Hall I #365 | |
|
Q-Norm: Robust Representation Learning via Quality-Adaptive Normalization
Poster Session 3 & Exhibit Hall
Lanning Zhang ⋅ Ying Zhou ⋅ Fei Gao ⋅ Ziyun Li ⋅ Maoying Qiao ⋅ Jinlan Xu ⋅ Nannan Wang
|
Exhibit Hall I #366 | |
|
Proxy-Bridged Game Transformer for Interactive Extreme Motion Prediction
Poster Session 3 & Exhibit Hall
Yanwen Fang ⋅ Wenqi Jia ⋅ Xu Cao ⋅ Peng-Tao Jiang ⋅ Guodong Li ⋅ Jintai CHEN
|
Exhibit Hall I #367 | |
|
MeshAnything V2: Artist-Created Mesh Generation with Adjacent Mesh Tokenization
Poster Session 3 & Exhibit Hall
Yiwen Chen ⋅ Yikai Wang ⋅ Yihao Luo ⋅ Zhengyi Wang ⋅ Zilong Chen ⋅ Jun Zhu ⋅ Chi Zhang ⋅ Guosheng Lin
|
Exhibit Hall I #368 | |
|
π-AVAS: Can Physics-Integrated Audio-Visual Modeling Boost Neural Acoustic Synthesis?
Poster Session 3 & Exhibit Hall
Susan Liang ⋅ Chao Huang ⋅ Yolo Yunlong Tang ⋅ Zeliang Zhang ⋅ Chenliang Xu
|
Exhibit Hall I #370 | |
|
SemGes: Semantics-aware Co-Speech Gesture Generation using Semantic Coherence and Relevance Learning
Poster Session 3 & Exhibit Hall
Lanmiao Liu ⋅ Esam Ghaleb ⋅ asli ozyurek ⋅ Zerrin Yumak
|
Exhibit Hall I #372 | |
|
Metric Convolutions: A Unifying Theory to Adaptive Image Convolutions
Poster Session 3 & Exhibit Hall
Thomas Dagès ⋅ Michael Lindenbaum ⋅ Alfred Bruckstein
|
Exhibit Hall I #373 | |
|
RobAVA: A Large-scale Dataset and Baseline Towards Video based Robotic Arm Action Understanding
Poster Session 3 & Exhibit Hall
Baoli Sun ⋅ Ning Wang ⋅ Xinzhu Ma ⋅ Anqi Zou ⋅ Lu Yihang ⋅ Chuixuan Fan ⋅ Zhihui Wang ⋅ Kun Lu ⋅ Zhiyong Wang
|
Exhibit Hall I #374 | |
|
IDFace: Face Template Protection for Efficient and Secure Identification
Poster Session 3 & Exhibit Hall
Sunpill Kim ⋅ Seunghun Paik ⋅ Chanwoo Hwang ⋅ Dongsoo Kim ⋅ Junbum Shin ⋅ Jae Hong Seo
|
Exhibit Hall I #375 | |
|
Not All Degradations Are Equal: A Targeted Feature Denoising Framework for Generalizable Image Super-Resolution
Poster Session 3 & Exhibit Hall
hongjun wang ⋅ Jiyuan Chen ⋅ Zhengwei Yin ⋅ Xuan Song ⋅ Yinqiang Zheng
|
Exhibit Hall I #391 | |
|
I2VControl: Disentangled and Unified Video Motion Synthesis Control
Poster Session 3 & Exhibit Hall
Wanquan Feng ⋅ Tianhao Qi ⋅ Jiawei Liu ⋅ Mingzhen Sun ⋅ Pengqi Tu ⋅ Tianxiang Ma ⋅ Fei Dai ⋅ Songtao Zhao ⋅ SiYu Zhou ⋅ Qian HE
|
Exhibit Hall I #382 | |
|
MeshLLM: Empowering Large Language Models to Progressively Understand and Generate 3D Mesh
Shuangkang Fang ⋅ I-Chao Shen ⋅ Yufeng Wang ⋅ Yi-Hsuan Tsai ⋅ Yi Yang ⋅ Shuchang Zhou ⋅ Wenrui Ding ⋅ Takeo Igarashi ⋅ Ming-Hsuan Yang
|
Exhibit Hall I #383 | |
|
On-Device Diffusion Transformer Policy for Efficient Robot Manipulation
Poster Session 3 & Exhibit Hall
Yiming Wu ⋅ Huan Wang ⋅ Zhenghao Chen ⋅ Jianxin Pang ⋅ Dong Xu
|
Exhibit Hall I #384 | |
|
Generic Event Boundary Detection via Denoising Diffusion
Poster Session 3 & Exhibit Hall
Jaejun Hwang ⋅ Dayoung Gong ⋅ Manjin Kim ⋅ Minsu Cho
|
Exhibit Hall I #385 | |
|
LiT: Delving into a Simple Linear Diffusion Transformer for Image Generation
Poster Session 4 & Exhibit Hall with Coffee Break
Jiahao Wang ⋅ Ning Kang ⋅ Lewei Yao ⋅ Mengzhao Chen ⋅ Chengyue Wu ⋅ Songyang Zhang ⋅ Shuchen Xue ⋅ Yong Liu ⋅ Taiqiang Wu ⋅ Xihui Liu ⋅ Kaipeng Zhang ⋅ Shifeng Zhang ⋅ Wenqi Shao ⋅ Zhenguo Li ⋅ Ping Luo
|
Exhibit Hall I #112 | |
|
SHeaP: Self-supervised Head Geometry Predictor Learned via 2D Gaussians
Poster Session 3 & Exhibit Hall
Liam Schoneveld ⋅ Zhe Chen ⋅ Davide Davoli ⋅ Jiapeng Tang ⋅ Saimon Terazawa ⋅ Ko Nishino ⋅ Matthias Nießner
|
Exhibit Hall I #392 | |
|
TARO: Timestep-Adaptive Representation Alignment with Onset-Aware Conditioning for Synchronized Video-to-Audio Synthesis
Poster Session 3 & Exhibit Hall
Tri Ton ⋅ Ji Woo Hong ⋅ Chang Yoo
|
Exhibit Hall I #398 | |
|
DexVLG: Dexterous Vision-Language-Grasp Model at Scale
Jiawei He ⋅ Danshi Li ⋅ Xinqiang Yu ⋅ Zekun Qi ⋅ Wenyao Zhang ⋅ Jiayi Chen ⋅ Zhaoxiang Zhang ⋅ Zhizheng Zhang ⋅ Li Yi ⋅ He Wang
|
Exhibit Hall I #400 | |
|
Towards Explicit Exoskeleton for the Reconstruction of Complicated 3D Human Avatars
Poster Session 3 & Exhibit Hall
Yifan Zhan ⋅ Qingtian Zhu ⋅ Muyao Niu ⋅ Mingze Ma ⋅ Jiancheng Zhao ⋅ Zhihang Zhong ⋅ Xiao Sun ⋅ Yu Qiao ⋅ Yinqiang Zheng
|
Exhibit Hall I #401 | |
|
Fine-Grained 3D Gaussian Head Avatars Modeling from Static Captures via Joint Reconstruction and Registration
Poster Session 3 & Exhibit Hall
Yuan Sun ⋅ Xuan Wang ⋅ Cong Wang ⋅ WeiLi Zhang ⋅ Yanbo Fan ⋅ Yu Guo ⋅ Fei Wang
|
Exhibit Hall I #404 | |
|
IM-LUT: Interpolation Mixing Look-Up Tables for Image Super-Resolution
Poster Session 3 & Exhibit Hall
Sejin Park ⋅ Sangmin Lee ⋅ Kyong Hwan Jin ⋅ Seung-Won Jung
|
Exhibit Hall I #406 | |
|
Attention to Trajectory: Trajectory-Aware Open-Vocabulary Tracking
Poster Session 3 & Exhibit Hall
Yunhao Li ⋅ Yifan Jiao ⋅ Dan Meng ⋅ Heng Fan ⋅ Libo Zhang
|
Exhibit Hall I #413 | |
|
UniPortrait: A Unified Framework for Identity-Preserving Single- and Multi-Human Image Personalization
Poster Session 3 & Exhibit Hall
Junjie He ⋅ Yifeng Geng ⋅ Liefeng Bo
|
Exhibit Hall I #414 | |
|
Music-Aligned Holistic 3D Dance Generation via Hierarchical Motion Modeling
Poster Session 3 & Exhibit Hall
LI XIAOJIE ⋅ Ronghui Li ⋅ Shukai Fang ⋅ Shuzhao Xie ⋅ Xiaoyang Guo ⋅ Jiaqing Zhou ⋅ Junkun Peng ⋅ Zhi Wang
|
Exhibit Hall I #416 | |
|
NoiseController: Towards Consistent Multi-view Video Generation via Noise Decomposition and Collaboration
Poster Session 3 & Exhibit Hall
Haotian Dong ⋅ Xin WANG ⋅ Di Lin ⋅ Yipeng Wu ⋅ Qin Chen ⋅ Ruonan Liu ⋅ Kairui Yang ⋅ Ping Li ⋅ Qing Guo
|
Exhibit Hall I #418 | |
|
FED-PsyAU: Privacy-Preserving Micro-Expression Recognition via Psychological AU Coordination and Dynamic Facial Motion Modeling
Poster Session 3 & Exhibit Hall
Jingting Li ⋅ Yu Qian ⋅ Lin Zhao ⋅ Su-Jing Wang
|
Exhibit Hall I #419 | |
|
PUMPS: Skeleton-Agnostic Point-based Universal Motion Pre-Training for Synthesis in Human Motion Tasks
Poster Session 3 & Exhibit Hall
Clinton A Mo ⋅ Kun Hu ⋅ Chengjiang Long ⋅ Dong Yuan ⋅ Wan-Chi Siu ⋅ Zhiyong Wang
|
Exhibit Hall I #423 | |
|
MistSense: Versatile Online Detection of Procedural and Execution Mistakes
Poster Session 3 & Exhibit Hall
Constantin Patsch ⋅ Yuankai Wu ⋅ Marsil Zakour ⋅ Driton Salihu ⋅ Eckehard Steinbach
|
Exhibit Hall I #426 | |
|
SEREP: Semantic Facial Expression Representation for Robust In-the-Wild Capture and Retargeting
Poster Session 3 & Exhibit Hall
Arthur Josi ⋅ Luiz Gustavo Hafemann ⋅ Abdallah Dib ⋅ Emeline Got ⋅ Rafael M. O. Cruz ⋅ Marc-André Carbonneau
|
Exhibit Hall I #427 | |
|
LUT-Fuse: Towards Extremely Fast Infrared and Visible Image Fusion via Distillation to Learnable Look-Up Tables
Poster Session 3 & Exhibit Hall
Xunpeng Yi ⋅ yibing zhang ⋅ Xinyu Xiang ⋅ Qinglong Yan ⋅ Han Xu ⋅ Jiayi Ma
|
Exhibit Hall I #429 | |
|
Morph: A Motion-free Physics Optimization Framework for Human Motion Generation
Poster Session 3 & Exhibit Hall
Zhuo Li ⋅ Mingshuang Luo ⋅ RuiBing Hou ⋅ XIN ZHAO ⋅ Hao Liu ⋅ Hong Chang ⋅ Zimo Liu ⋅ Chen Li
|
Exhibit Hall I #431 | |
|
DisCoRD: Discrete Tokens to Continuous Motion via Rectified Flow Decoding
Jungbin Cho ⋅ Junwan Kim ⋅ Jisoo Kim ⋅ Minseo Kim ⋅ Mingu Kang ⋅ Sungeun Hong ⋅ Tae-Hyun Oh ⋅ Youngjae Yu
|
Exhibit Hall I #433 | |
|
MixANT: Observation-dependent Memory Propagation for Stochastic Dense Action Anticipation
Poster Session 3 & Exhibit Hall
Syed Talal Wasim ⋅ Hamid Suleman ⋅ Olga Zatsarynna ⋅ Muzammal Naseer ⋅ Juergen Gall
|
Exhibit Hall I #434 | |
|
VoiceCraft-Dub: Automated Video Dubbing with Neural Codec Language Models
Poster Session 3 & Exhibit Hall
Kim Sung-Bin ⋅ Jeongsoo Choi ⋅ Puyuan Peng ⋅ Joon Chung Chung ⋅ Tae-Hyun Oh ⋅ David Harwath
|
Exhibit Hall I #435 | |
|
DeSPITE: Exploring Contrastive Deep Skeleton-Pointcloud-IMU-Text Embeddings for Advanced Point Cloud Human Activity Understanding
Poster Session 3 & Exhibit Hall
Thomas Kreutz ⋅ Max Mühlhäuser ⋅ Alejandro Sanchez Guinea
|
Exhibit Hall I #436 | |
|
DISTA-Net: Dynamic Closely-Spaced Infrared Small Target Unmixing
Poster Session 3 & Exhibit Hall
Shengdong Han ⋅ Shangdong Yang ⋅ Yuxuan Li ⋅ Xin Zhang ⋅ Xiang Li ⋅ jian Yang ⋅ Ming-Ming Cheng ⋅ Yimian Dai
|
Exhibit Hall I #438 | |
|
FLOAT: Generative Motion Latent Flow Matching for Audio-driven Talking Portrait
Poster Session 3 & Exhibit Hall
Taekyung Ki ⋅ Dongchan Min ⋅ Gyeongsu Chae
|
Exhibit Hall I #442 | |
|
VSRM: A Robust Mamba-Based Framework for Video Super-Resolution
Poster Session 3 & Exhibit Hall
Phu Tran Dinh ⋅ Hung Dao ⋅ Daeyoung Kim
|
Exhibit Hall I #443 | |
|
2HandedAfforder: Learning Precise Actionable Bimanual Affordances from Human Videos
Poster Session 3 & Exhibit Hall
Marvin Heidinger ⋅ Snehal Jauhri ⋅ Vignesh Prasad ⋅ Georgia Chalvatzaki
|
Exhibit Hall I #446 | |
|
AnimalClue: Recognizing Animals by their Traces
Risa Shinoda ⋅ Nakamasa Inoue ⋅ Iro Laina ⋅ Christian Rupprecht ⋅ Hirokatsu Kataoka
|
Exhibit Hall I #451 | |
|
TIP-I2V: A Million-Scale Real Text and Image Prompt Dataset for Image-to-Video Generation
Poster Session 4 & Exhibit Hall with Coffee Break
Wenhao Wang ⋅ Yi Yang
|
Exhibit Hall I #1 | |
|
SCFlow: Implicitly Learning Style and Content Disentanglement with Flow Models
Poster Session 4 & Exhibit Hall with Coffee Break
Pingchuan Ma ⋅ Xiaopei Yang ⋅ Ming Gui ⋅ Yusong Li ⋅ Felix Krause ⋅ Johannes Schusterbauer ⋅ Björn Ommer
|
Exhibit Hall I #3 | |
|
OminiControl: Minimal and Universal Control for Diffusion Transformer
Zhenxiong Tan ⋅ Songhua Liu ⋅ Xingyi Yang ⋅ Qiaochu Xue ⋅ Xinchao Wang
|
Exhibit Hall I #5 | |
|
Penalizing Boundary Activation for Object Completeness in Diffusion Models
Poster Session 4 & Exhibit Hall with Coffee Break
Haoyang Xu ⋅ Tianhao Zhao ⋅ Sibei Yang ⋅ Yutian Lin
|
Exhibit Hall I #7 | |
|
RayZer: A Self-supervised Large View Synthesis Model
Poster Session 2 & Exhibit Hall with Coffee Break
Hanwen Jiang ⋅ Hao Tan ⋅ Peng Wang ⋅ Haian Jin ⋅ Yue Zhao ⋅ Sai Bi ⋅ Kai Zhang ⋅ Fujun Luan ⋅ Kalyan Sunkavalli ⋅ Qixing Huang ⋅ Georgios Pavlakos
|
Exhibit Hall I #74 | |
|
MatchDiffusion: Training-free Generation of Match-Cuts
Poster Session 4 & Exhibit Hall with Coffee Break
Alejandro Pardo ⋅ Fabio Pizzati ⋅ Tong Zhang ⋅ Alexander Pondaven ⋅ Philip Torr ⋅ Juan Perez ⋅ Bernard Ghanem
|
Exhibit Hall I #8 | |
|
Dual-Expert Consistency Model for Efficient and High-Quality Video Generation
Poster Session 4 & Exhibit Hall with Coffee Break
Zhengyao Lyu ⋅ Chenyang Si ⋅ Tianlin Pan ⋅ Zhaoxi Chen ⋅ Kwan-Yee K. Wong ⋅ Yu Qiao ⋅ Ziwei Liu
|
Exhibit Hall I #9 | |
|
Straighten Viscous Rectified Flow via Noise Optimization
Jimin Dai ⋅ Jiexi Yan ⋅ Jian Yang ⋅ lei luo
|
Exhibit Hall I #11 | |
|
Scalable Dual Fingerprinting for Hierarchical Attribution of Text-to-Image Models
Jianwei Fei ⋅ Yunshu Dai ⋅ Peipeng Yu ⋅ Zhe Kong ⋅ Jiantao Zhou ⋅ Zhihua Xia
|
Exhibit Hall I #13 | |
|
QuantCache: Adaptive Importance-Guided Quantization with Hierarchical Latent and Layer Caching for Video Generation
Poster Session 4 & Exhibit Hall with Coffee Break
Junyi Wu ⋅ Zhiteng Li ⋅ Zheng Hui ⋅ YULUN ZHANG ⋅ Linghe Kong ⋅ Xiaokang Yang
|
Exhibit Hall I #14 | |
|
CRAM: Large Scale Video Continual Learning with Bootstrapped Compression
Poster Session 4 & Exhibit Hall with Coffee Break
Shivani Mall ⋅ Joao F. Henriques
|
Exhibit Hall I #15 | |
|
Tree-NeRV: Efficient Non-Uniform Sampling for Neural Video Representation via Tree-Structured Feature Grids
Poster Session 4 & Exhibit Hall with Coffee Break
Jiancheng Zhao ⋅ Yifan Zhan ⋅ Qingtian Zhu ⋅ Mingze Ma ⋅ Muyao Niu ⋅ Zunian Wan ⋅ Xiang Ji ⋅ Yinqiang Zheng
|
Exhibit Hall I #18 | |
|
MaTe: Images Are All You Need for Material Transfer via Diffusion Transformer
Poster Session 4 & Exhibit Hall with Coffee Break
Nisha Huang ⋅ Henglin Liu ⋅ Yizhou Lin ⋅ Kaer Huang ⋅ Chubin Chen ⋅ Jie Guo ⋅ Tong-Yee Lee ⋅ Xiu Li
|
Exhibit Hall I #22 | |
|
ForCenNet: Foreground-Centric Network for Document Image Rectification
Poster Session 4 & Exhibit Hall with Coffee Break
Peng Cai ⋅ liqiang liqiang ⋅ Kaicheng Yang ⋅ guodong guodong ⋅ lijia lijia ⋅ zhounan zhounan ⋅ Xiang An ⋅ Ninghua Yang ⋅ Jiankang Deng
|
Exhibit Hall I #24 | |
|
VEGGIE: Instructional Editing and Reasoning Video Concepts with Grounded Generation
Poster Session 4 & Exhibit Hall with Coffee Break
Shoubin Yu ⋅ Difan Liu ⋅ Ziqiao Ma ⋅ Yicong Hong ⋅ Yang Zhou ⋅ Hao Tan ⋅ Joyce Chai ⋅ Mohit Bansal
|
Exhibit Hall I #25 | |
|
Scale Your Instructions: Enhance the Instruction-Following Fidelity of Unified Image Generation Model by Self-Adaptive Attention Scaling
Poster Session 4 & Exhibit Hall with Coffee Break
Chao Zhou ⋅ Tianyi Wei ⋅ Nenghai Yu
|
Exhibit Hall I #27 | |
|
CycleVAR: Repurposing Autoregressive Model for Unsupervised One-Step Image Translation
Poster Session 4 & Exhibit Hall with Coffee Break
Yi Liu ⋅ Shengqian Li ⋅ Zuzeng Lin ⋅ Feng Wang ⋅ Si Liu
|
Exhibit Hall I #29 | |
|
SDMatte: Grafting Diffusion Models for Interactive Matting
Poster Session 4 & Exhibit Hall with Coffee Break
Longfei Huang ⋅ Yu Liang ⋅ Hao Zhang ⋅ Jinwei Chen ⋅ Wei Dong ⋅ Lunde Chen ⋅ Wanyu Liu ⋅ Bo Li ⋅ Peng-Tao Jiang
|
Exhibit Hall I #32 | |
|
Adaptive Caching for Faster Video Generation with Diffusion Transformers
Poster Session 4 & Exhibit Hall with Coffee Break
Kumara Kahatapitiya ⋅ Haozhe Liu ⋅ Sen He ⋅ Ding Liu ⋅ Menglin Jia ⋅ Chenyang Zhang ⋅ Michael Ryoo ⋅ Tian Xie
|
Exhibit Hall I #33 | |
|
CoMPaSS: Enhancing Spatial Understanding in Text-to-Image Diffusion Models
Poster Session 4 & Exhibit Hall with Coffee Break
Gaoyang Zhang ⋅ Bingtao Fu ⋅ Qingnan Fan ⋅ Qi Zhang ⋅ Runxing Liu ⋅ Hong Gu ⋅ Huaqi Zhang ⋅ Xinguo Liu
|
Exhibit Hall I #34 | |
|
Edicho: Consistent Image Editing in the Wild
Poster Session 4 & Exhibit Hall with Coffee Break
Qingyan Bai ⋅ Hao Ouyang ⋅ Yinghao Xu ⋅ Qiuyu Wang ⋅ Ceyuan Yang ⋅ Ka Leong Cheng ⋅ Yujun Shen ⋅ Qifeng Chen
|
Exhibit Hall I #36 | |
|
LUSD: Localized Update Score Distillation for Text-Guided Image Editing
Poster Session 4 & Exhibit Hall with Coffee Break
Worameth Chinchuthakun ⋅ Tossaporn Saengja ⋅ Nontawat Tritrong ⋅ Pitchaporn Rewatbowornwong ⋅ Pramook Khungurn ⋅ Supasorn Suwajanakorn
|
Exhibit Hall I #38 | |
|
FlowChef: Steering of Rectified Flow Models for Controlled Generations
Poster Session 4 & Exhibit Hall with Coffee Break
Maitreya Patel ⋅ Song Wen ⋅ Dimitris Metaxas ⋅ Yezhou Yang
|
Exhibit Hall I #39 | |
|
From Reflection to Perfection: Scaling Inference-Time Optimization for Text-to-Image Diffusion Models via Reflection Tuning
Poster Session 4 & Exhibit Hall with Coffee Break
Le Zhuo ⋅ Liangbing Zhao ⋅ Sayak Paul ⋅ Yue Liao ⋅ Renrui Zhang ⋅ Yi Xin ⋅ Peng Gao ⋅ Mohamed Elhoseiny ⋅ Hongsheng Li
|
Exhibit Hall I #41 | |
|
Translation of Text Embedding via Delta Vector to Suppress Strongly Entangled Content in Text-to-Image Diffusion Models
Poster Session 4 & Exhibit Hall with Coffee Break
Eunseo Koh ⋅ SeungHoo Hong ⋅ Tae-Young Kim ⋅ Jae-Pil Heo ⋅ Simon Woo
|
Exhibit Hall I #44 | |
|
Grouped Speculative Decoding for Autoregressive Image Generation
Poster Session 4 & Exhibit Hall with Coffee Break
Junhyuk So ⋅ Juncheol Shin ⋅ Hyunho Kook ⋅ Eunhyeok Park
|
Exhibit Hall I #45 | |
|
Generating, Fast and Slow: Scalable Parallel Video Generation with Video Interface Networks
Poster Session 4 & Exhibit Hall with Coffee Break
Bhishma Dedhia ⋅ David Bourgin ⋅ Krishna Kumar Singh ⋅ Yuheng Li ⋅ Yan Kang ⋅ Zhan Xu ⋅ Niraj Jha ⋅ Yuchen Liu
|
Exhibit Hall I #46 | |
|
SynTag: Enhancing the Geometric Robustness of Inversion-based Generative Image Watermarking
Poster Session 4 & Exhibit Hall with Coffee Break
Han Fang ⋅ Kejiang Chen ⋅ Zehua Ma ⋅ Jiajun Deng ⋅ Yicong Li ⋅ Weiming Zhang ⋅ Ee-Chien Chang
|
Exhibit Hall I #49 | |
|
Draw Your Mind: Personalized Generation via Condition-Level Modeling in Text-to-Image Diffusion Models
Poster Session 4 & Exhibit Hall with Coffee Break
Hyungjin Kim ⋅ Seokho Ahn ⋅ Young-Duk Seo
|
Exhibit Hall I #218 | |
|
Text Embedding Knows How to Quantize Text-Guided Diffusion Models
Poster Session 4 & Exhibit Hall with Coffee Break
Hongjae Lee ⋅ Myungjun Son ⋅ Dongjea Kang ⋅ Seung-Won Jung
|
Exhibit Hall I #50 | |
|
PUMA: Empowering Unified MLLM with Multi-granular Visual Generation
Poster Session 4 & Exhibit Hall with Coffee Break
Rongyao Fang ⋅ Chengqi Duan ⋅ Kun Wang ⋅ Hao Li ⋅ Linjiang Huang ⋅ Hao Tian ⋅ Xingyu Zeng ⋅ Rui Zhao ⋅ Jifeng Dai ⋅ Hongsheng Li ⋅ Xihui Liu
|
Exhibit Hall I #52 | |
|
NeuralSVG: An Implicit Representation for Text-to-Vector Generation
Poster Session 4 & Exhibit Hall with Coffee Break
Sagi Polaczek ⋅ Yuval Alaluf ⋅ Elad Richardson ⋅ Yael Vinker ⋅ Daniel Cohen-Or
|
Exhibit Hall I #53 | |
|
IQA-Adapter: Exploring Knowledge Transfer from Image Quality Assessment to Diffusion-based Generative Models
Khaled Abud ⋅ Sergey Lavrushkin ⋅ Alexey Kirillov ⋅ Dmitriy Vatolin
|
Exhibit Hall I #54 | |
|
Global and Local Entailment Learning for Natural World Imagery
Poster Session 4 & Exhibit Hall with Coffee Break
Srikumar Sastry ⋅ Aayush Dhakal ⋅ Eric Xing ⋅ Subash Khanal ⋅ Nathan Jacobs
|
Exhibit Hall I #84 | |
|
Dual Recursive Feedback on Generation and Appearance Latents for Pose-Robust Text-to-Image Diffusion
Poster Session 4 & Exhibit Hall with Coffee Break
Jiwon Kim ⋅ Pureum Kim ⋅ SeonHwa Kim ⋅ Soobin Park ⋅ Eunju Cha ⋅ Kyong Hwan Jin
|
Exhibit Hall I #56 | |
|
Anti-Tamper Protection for Unauthorized Individual Image Generation
Poster Session 4 & Exhibit Hall with Coffee Break
Zelin Li ⋅ Ruohan Zong ⋅ Yifan Liu ⋅ Ruichen Yao ⋅ Yaokun Liu ⋅ Yang Zhang ⋅ Dong Wang
|
Exhibit Hall I #57 | |
|
Continual Personalization for Diffusion Models
Poster Session 4 & Exhibit Hall with Coffee Break
Yu-Chien Liao ⋅ Jr-Jen Chen ⋅ Chi-Pin Huang ⋅ Ci-Siang Lin ⋅ Meng-Lin Wu ⋅ Yu-Chiang Frank Wang
|
Exhibit Hall I #58 | |
|
WikiAutoGen: Towards Multi-Modal Wikipedia-Style Article Generation
Poster Session 4 & Exhibit Hall with Coffee Break
Zhongyu Yang ⋅ Jun Chen ⋅ Dannong Xu ⋅ Junjie Fei ⋅ Xiaoqian Shen ⋅ Liangbing Zhao ⋅ Chun-Mei Feng ⋅ Mohamed Elhoseiny
|
Exhibit Hall I #60 | |
|
Spectral Image Tokenizer
Poster Session 4 & Exhibit Hall with Coffee Break
Carlos Esteves ⋅ Mohammed Suhail ⋅ Ameesh Makadia
|
Exhibit Hall I #219 | |
|
QuEST: Low-bit Diffusion Model Quantization via Efficient Selective Finetuning
Poster Session 4 & Exhibit Hall with Coffee Break
Haoxuan Wang ⋅ Yuzhang Shang ⋅ Zhihang Yuan ⋅ Junyi Wu ⋅ Junchi Yan ⋅ Yan Yan
|
Exhibit Hall I #61 | |
|
SPADE: Spatial-Aware Denoising Network for Open-vocabulary Panoptic Scene Graph Generation with Long- and Local-range Context Reasoning
Poster Session 4 & Exhibit Hall with Coffee Break
XIN Hu ⋅ Ke Qin ⋅ Guiduo Duan ⋅ Ming Li ⋅ Yuan-Fang Li ⋅ Tao He
|
Exhibit Hall I #63 | |
|
DropletVideo: A Dataset and Approach to Explore Integral Spatio-Temporal Consistent Video Generation
Runze Zhang ⋅ Guoguang Du ⋅ Xiaochuan Li ⋅ Qi Jia ⋅ Liang Jin ⋅ Lu Liu ⋅ Jingjing Wang ⋅ Cong Xu ⋅ Zhenhua Guo ⋅ Yaqian Zhao ⋅ Xiaoli Gong ⋅ Rengang Li ⋅ Baoyu Fan
|
Exhibit Hall I #65 | |
|
Split-and-Combine: Enhancing Style Augmentation for Single Domain Generalization
Poster Session 4 & Exhibit Hall with Coffee Break
Zhen Zhang ⋅ Zhen Zhang ⋅ Qianlong Dang ⋅ Zhize Wu ⋅ LiChuan Gu
|
Exhibit Hall I #68 | |
|
RefEdit: A Benchmark and Method for Improving Instruction-based Image Editing Model on Referring Expressions
Poster Session 4 & Exhibit Hall with Coffee Break
Bimsara Pathiraja ⋅ Maitreya Patel ⋅ Shivam Singh ⋅ Yezhou Yang ⋅ Chitta Baral
|
Exhibit Hall I #71 | |
|
CuRe: Cultural Gaps in the Long Tail of Text-to-Image Systems
Poster Session 4 & Exhibit Hall with Coffee Break
Aniket Rege ⋅ Zinnia Nie ⋅ Unmesh Raskar ⋅ Mahesh Ramesh ⋅ Zhuoran Yu ⋅ Aditya Kusupati ⋅ Yong Jae Lee ⋅ Ramya Vinayak
|
Exhibit Hall I #76 | |
|
TREAD: Token Routing for Efficient Architecture-agnostic Diffusion Training
Poster Session 4 & Exhibit Hall with Coffee Break
Felix Krause ⋅ Timy Phan ⋅ Ming Gui ⋅ Stefan A. Baumann ⋅ Vincent Tao Hu ⋅ Björn Ommer
|
Exhibit Hall I #78 | |
|
Bootstrap3D: Improving Multi-view Diffusion Model with Synthetic Data
Poster Session 4 & Exhibit Hall with Coffee Break
Zeyi Sun ⋅ Tong Wu ⋅ Pan Zhang ⋅ Yuhang Zang ⋅ Xiaoyi Dong ⋅ Yuanjun Xiong ⋅ Dahua Lin ⋅ Jiaqi Wang
|
Exhibit Hall I #79 | |
|
Zero-Shot Depth Aware Image Editing with Diffusion Models
Poster Session 4 & Exhibit Hall with Coffee Break
Rishubh Parihar ⋅ Sachidanand VS ⋅ Venkatesh Babu Radhakrishnan
|
Exhibit Hall I #82 | |
|
StyleKeeper: Prevent Content Leakage using Negative Visual Query Guidance
Poster Session 4 & Exhibit Hall with Coffee Break
Jaeseok Jeong ⋅ Junho Kim ⋅ Youngjung Uh ⋅ Gayoung Lee ⋅ Yunjey Choi
|
Exhibit Hall I #83 | |
|
TRKT: Weakly Supervised Dynamic Scene Graph Generation with Temporal-enhanced Relation-aware Knowledge Transferring
Poster Session 4 & Exhibit Hall with Coffee Break
Zhu Xu ⋅ Ting Lei ⋅ Zhimin Li ⋅ Guan Wang ⋅ Qingchao Chen ⋅ Yuxin Peng ⋅ Yang Liu
|
Exhibit Hall I #88 | |
|
Pose-Star: Anatomy-Aware Editing for Open-World Fashion Images
Poster Session 4 & Exhibit Hall with Coffee Break
Yuran Dong ⋅ Mang Ye
|
Exhibit Hall I #89 | |
|
Who Controls the Authorization? Invertible Networks for Copyright Protection in Text-to-Image Synthesis
Poster Session 4 & Exhibit Hall with Coffee Break
Baoyue Hu ⋅ Yang Wei ⋅ Junhao Xiao ⋅ Wendong Huang ⋅ Xiuli Bi ⋅ Bin Xiao
|
Exhibit Hall I #90 | |
|
SegmentDreamer: Towards High-fidelity Text-to-3D Synthesis with Segmented Consistency Trajectory Distillation
Poster Session 4 & Exhibit Hall with Coffee Break
Jiahao Zhu ⋅ Zixuan Chen ⋅ Guangcong Wang ⋅ Xiaohua Xie ⋅ Yi Zhou
|
Exhibit Hall I #93 | |
|
MUSE: Multi-Subject Unified Synthesis via Explicit Layout Semantic Expansion
Poster Session 4 & Exhibit Hall with Coffee Break
Fei Peng ⋅ Junqiang Wu ⋅ Yan Li ⋅ Tingting Gao ⋅ Di ZHANG ⋅ Huiyuan Fu
|
Exhibit Hall I #95 | |
|
Magic Insert: Style-Aware Drag-and-Drop
Nataniel Ruiz ⋅ Yuanzhen Li ⋅ Neal Wadhwa ⋅ Yael Pritch ⋅ Michael Rubinstein ⋅ David Jacobs ⋅ Shlomi Fruchter
|
Exhibit Hall I #103 | |
|
DIVE: Taming DINO for Subject-Driven Video Editing
Poster Session 4 & Exhibit Hall with Coffee Break
Yi Huang ⋅ Wei Xiong ⋅ He Zhang ⋅ Chaoqi Chen ⋅ Jianzhuang Liu ⋅ Mingfu Yan ⋅ Shifeng Chen
|
Exhibit Hall I #106 | |
|
FontAnimate: High Quality Few-shot Font Generation via Animating Font Transfer Process
Poster Session 4 & Exhibit Hall with Coffee Break
Bin Fu ⋅ Zixuan Wang ⋅ Kainan Yan ⋅ Shitian Zhao ⋅ Qi Qin ⋅ Jie Wen ⋅ Junjun He ⋅ Peng Gao
|
Exhibit Hall I #107 | |
|
PromptDresser: Improving the Quality and Controllability of Virtual Try-On via Generative Textual Prompt and Prompt-aware Mask
Poster Session 4 & Exhibit Hall with Coffee Break
Jeongho Kim ⋅ Hoiyeong Jin ⋅ Sunghyun Park ⋅ Jaegul Choo
|
Exhibit Hall I #108 | |
|
IMG: Calibrating Diffusion Models via Implicit Multimodal Guidance
Poster Session 4 & Exhibit Hall with Coffee Break
Jiayi Guo ⋅ Chuanhao Yan ⋅ Xingqian Xu ⋅ Yulin Wang ⋅ Kai Wang ⋅ Gao Huang ⋅ Humphrey Shi
|
Exhibit Hall I #113 | |
|
SAGI: Semantically Aligned and Uncertainty Guided AI Image Inpainting
Poster Session 4 & Exhibit Hall with Coffee Break
Paschalis Giakoumoglou ⋅ Dimitrios Karageorgiou ⋅ Symeon Papadopoulos ⋅ Panagiotis Petrantonakis
|
Exhibit Hall I #114 | |
|
TextMaster: A Unified Framework for Realistic Text Editing via Glyph-Style Dual-Control
Poster Session 4 & Exhibit Hall with Coffee Break
Zhenyu Yan ⋅ Jian Wang ⋅ Aoqiang Wang ⋅ Yuhan Li ⋅ Wenxiang Shang ⋅ Zhu Hangcheng
|
Exhibit Hall I #116 | |
|
LoRA.rar: Learning to Merge LoRAs via Hypernetworks for Subject-Style Conditioned Image Generation
Poster Session 4 & Exhibit Hall with Coffee Break
Donald Shenaj ⋅ Ondrej Bohdal ⋅ Mete Ozay ⋅ Pietro Zanuttigh ⋅ Umberto Michieli
|
Exhibit Hall I #118 | |
|
Beyond Perspective: Neural 360-Degree Video Compression
Poster Session 4 & Exhibit Hall with Coffee Break
Andy Regensky ⋅ Marc Windsheimer ⋅ Fabian Brand ⋅ Andre Kaup
|
Exhibit Hall I #119 | |
|
MCID: Multi-aspect Copyright Infringement Detection for Generated Images
Poster Session 4 & Exhibit Hall with Coffee Break
Chuanwei Huang ⋅ Zexi Jia ⋅ Hongyan Fei ⋅ Yeshuang Zhu ⋅ Zhiqiang Yuan ⋅ Ying Deng ⋅ Jiapei Zhang ⋅ Xiaoyue Duan ⋅ Jinchao Zhang ⋅ Jie Zhou
|
Exhibit Hall I #120 | |
|
Text2Outfit: Controllable Outfit Generation with Multimodal Language Models
Poster Session 4 & Exhibit Hall with Coffee Break
Yuanhao Zhai ⋅ Yen-Liang Lin ⋅ Minxu Peng ⋅ Larry Davis ⋅ Ashwin Chandramouli ⋅ Junsong Yuan ⋅ David Doermann
|
Exhibit Hall I #121 | |
|
Outlier-Aware Post-Training Quantization for Image Super-Resolution
Hailing Wang ⋅ Jianglin Lu ⋅ Yitian Zhang ⋅ Yun Fu
|
Exhibit Hall I #122 | |
|
DualReal: Adaptive Joint Training for Lossless Identity-Motion Fusion in Video Customization
Poster Session 4 & Exhibit Hall with Coffee Break
Wenchuan Wang ⋅ Mengqi Huang ⋅ Yijing Tu ⋅ Zhendong Mao
|
Exhibit Hall I #161 | |
|
Modeling Human Gaze Behavior with Diffusion Models for Unified Scanpath Prediction
Poster Session 4 & Exhibit Hall with Coffee Break
Giuseppe Cartella ⋅ Vittorio Cuculo ⋅ Alessandro D'Amelio ⋅ Marcella Cornia ⋅ Giuseppe Boccignone ⋅ Rita Cucchiara
|
Exhibit Hall I #125 | |
|
What Changed? Detecting and Evaluating Instruction-Guided Image Edits with Multimodal Large Language Models
Poster Session 4 & Exhibit Hall with Coffee Break
Lorenzo Baraldi ⋅ Davide Bucciarelli ⋅ Federico Betti ⋅ Marcella Cornia ⋅ Lorenzo Baraldi ⋅ Nicu Sebe ⋅ Rita Cucchiara
|
Exhibit Hall I #126 | |
|
MeshPad: Interactive Sketch-Conditioned Artist-Reminiscent Mesh Generation and Editing
Poster Session 4 & Exhibit Hall with Coffee Break
Haoxuan Li ⋅ Ziya Erkoç ⋅ Lei Li ⋅ Daniele Sirigatti ⋅ Vladislav Rosov ⋅ Angela Dai ⋅ Matthias Nießner
|
Exhibit Hall I #127 | |
|
PLADIS: Pushing the Limits of Attention in Diffusion Models at Inference Time by Leveraging Sparsity
Poster Session 4 & Exhibit Hall with Coffee Break
Kwanyoung Kim ⋅ Byeongsu Sim
|
Exhibit Hall I #128 | |
|
STIV: Scalable Text and Image Conditioned Video Generation
Poster Session 4 & Exhibit Hall with Coffee Break
Zongyu Lin ⋅ Wei Liu ⋅ Chen Chen ⋅ Jiasen Lu ⋅ Wenze Hu ⋅ Tsu-Jui Fu ⋅ Jesse Allardice ⋅ Zhengfeng Lai ⋅ Liangchen Song ⋅ Bowen Zhang ⋅ cha chen ⋅ Yiran Fei ⋅ Lezhi Li ⋅ Yizhou Sun ⋅ Kai-Wei Chang ⋅ Yinfei Yang
|
Exhibit Hall I #129 | |
|
D3QE: Learning Discrete Distribution Discrepancy-aware Quantization Error for Autoregressive-Generated Image Detection
Poster Session 4 & Exhibit Hall with Coffee Break
Yanran Zhang ⋅ Bingyao Yu ⋅ Yu Zheng ⋅ Wenzhao Zheng ⋅ Yueqi Duan ⋅ Lei Chen ⋅ Jie Zhou ⋅ Jiwen Lu
|
Exhibit Hall I #133 | |
|
OmniCache: A Trajectory-Oriented Global Perspective on Training-Free Cache Reuse for Diffusion Transformer Models
Poster Session 4 & Exhibit Hall with Coffee Break
Huanpeng Chu ⋅ Wei Wu ⋅ Guanyu Feng ⋅ Yutao Zhang
|
Exhibit Hall I #134 | |
|
One-Step Specular Highlight Removal with Adapted Diffusion Models
Poster Session 4 & Exhibit Hall with Coffee Break
Mahir Atmis ⋅ LEVENT KARACAN ⋅ Mehmet SARIGÜL
|
Exhibit Hall I #135 | |
|
DiGA3D: Coarse-to-Fine Diffusional Propagation of Geometry and Appearance for Versatile 3D Inpainting
Poster Session 4 & Exhibit Hall with Coffee Break
Jingyi Pan ⋅ Dan Xu ⋅ Qiong Luo
|
Exhibit Hall I #138 | |
|
Holistic Unlearning Benchmark: A Multi-Faceted Evaluation for Text-to-Image Diffusion Model Unlearning
Poster Session 4 & Exhibit Hall with Coffee Break
Saemi Moon ⋅ Minjong Lee ⋅ Sangdon Park ⋅ Dongwoo Kim
|
Exhibit Hall I #139 | |
|
MV-Adapter: Multi-View Consistent Image Generation Made Easy
Poster Session 4 & Exhibit Hall with Coffee Break
Zehuan Huang ⋅ Yuan-Chen Guo ⋅ Haoran Wang ⋅ Ran Yi ⋅ Lizhuang Ma ⋅ Yanpei Cao ⋅ Lu Sheng
|
Exhibit Hall I #141 | |
|
On Large Multimodal Models as Open-World Image Classifiers
Poster Session 4 & Exhibit Hall with Coffee Break
Alessandro Conti ⋅ Massimiliano Mancini ⋅ Enrico Fini ⋅ Yiming Wang ⋅ Paolo Rota ⋅ Elisa Ricci
|
Exhibit Hall I #142 | |
|
VACE: All-in-One Video Creation and Editing
Poster Session 4 & Exhibit Hall with Coffee Break
Zeyinzi Jiang ⋅ Zhen Han ⋅ Chaojie Mao ⋅ Jingfeng Zhang ⋅ Yulin Pan ⋅ Yu Liu
|
Exhibit Hall I #220 | |
|
DiTFastAttnV2: Head-wise Attention Compression for Multi-Modality Diffusion Transformers
Poster Session 4 & Exhibit Hall with Coffee Break
Hanling Zhang ⋅ Rundong Su ⋅ Zhihang Yuan ⋅ Pengtao Chen ⋅ Mingzhu Shen ⋅ Yibo Fan ⋅ Shengen Yan ⋅ Guohao Dai ⋅ Yu Wang
|
Exhibit Hall I #143 | |
|
DIMCIM: A Quantitative Evaluation Framework for Default-mode Diversity and Generalization in Text-to-Image Generative Models
Poster Session 4 & Exhibit Hall with Coffee Break
Revant Teotia ⋅ Candace Ross ⋅ Karen Ullrich ⋅ Sumit Chopra ⋅ Adriana Romero-Soriano ⋅ Melissa Hall ⋅ Matthew Muckley
|
Exhibit Hall I #146 | |
|
From Linearity to Non-Linearity: How Masked Autoencoders Capture Spatial Correlations
Poster Session 4 & Exhibit Hall with Coffee Break
Anthony Bisulco ⋅ Rahul Ramesh ⋅ Randall Balestriero ⋅ Pratik Chaudhari
|
Exhibit Hall I #147 | |
|
Reusing Computation in Text-to-Image Diffusion for Efficient Generation of Image Sets
Poster Session 4 & Exhibit Hall with Coffee Break
Dale Decatur ⋅ Thibault Groueix ⋅ Wang Yifan ⋅ Rana Hanocka ⋅ Vladimir Kim ⋅ Matheus Gadelha
|
Exhibit Hall I #153 | |
|
Repurposing 2D Diffusion Models with Gaussian Atlas for 3D Generation
Poster Session 4 & Exhibit Hall with Coffee Break
Tiange Xiang ⋅ Kai Li ⋅ Chengjiang Long ⋅ Christian Häne ⋅ Peihong Guo ⋅ Scott Delp ⋅ Ehsan Adeli ⋅ Li Fei-Fei
|
Exhibit Hall I #154 | |
|
Cross-Granularity Online Optimization with Masked Compensated Information for Learned Image Compression
Poster Session 4 & Exhibit Hall with Coffee Break
Haowei Kuang ⋅ Wenhan Yang ⋅ Zongming Guo ⋅ Jiaying Liu
|
Exhibit Hall I #156 | |
|
Generating Multi-Image Synthetic Data for Text-to-Image Customization
Poster Session 4 & Exhibit Hall with Coffee Break
Nupur Kumari ⋅ Xi Yin ⋅ Jun-Yan Zhu ⋅ Ishan Misra ⋅ Samaneh Azadi
|
Exhibit Hall I #157 | |
|
Deeply Supervised Flow-Based Generative Models
Poster Session 4 & Exhibit Hall with Coffee Break
Inkyu Shin ⋅ Chenglin Yang ⋅ Liang-Chieh (Jay) Chen
|
Exhibit Hall I #158 | |
|
Stroke2Sketch: Harnessing Stroke Attributes for Training-Free Sketch Generation
Poster Session 4 & Exhibit Hall with Coffee Break
Rui Yang ⋅ Huining Li ⋅ Yiyi Long ⋅ Xiaojun Wu ⋅ Shengfeng He
|
Exhibit Hall I #159 | |
|
ICE-Bench: A Unified and Comprehensive Benchmark for Image Creating and Editing
Poster Session 4 & Exhibit Hall with Coffee Break
Yulin Pan ⋅ Xiangteng He ⋅ Chaojie Mao ⋅ Zhen Han ⋅ Zeyinzi Jiang ⋅ Jingfeng Zhang ⋅ Yu Liu
|
Exhibit Hall I #163 | |
|
Edit360: 2D Image Edits to 3D Assets from Any Angle
Junchao Huang ⋅ Xinting Hu ⋅ Shaoshuai Shi ⋅ Zhuotao Tian ⋅ Li Jiang
|
Exhibit Hall I #166 | |
|
FlowTok: Flowing Seamlessly Across Text and Image Tokens
Poster Session 4 & Exhibit Hall with Coffee Break
Ju He ⋅ Qihang Yu ⋅ Qihao Liu ⋅ Liang-Chieh (Jay) Chen
|
Exhibit Hall I #167 | |
|
TeEFusion: Blending Text Embeddings to Distill Classifier-Free Guidance
Poster Session 4 & Exhibit Hall with Coffee Break
Minghao Fu ⋅ Guo-Hua Wang ⋅ Xiaohao Chen ⋅ Qing-Guo Chen ⋅ Zhao Xu ⋅ Weihua Luo ⋅ Kaifu Zhang
|
Exhibit Hall I #169 | |
|
YOLO-Count: Differentiable Object Counting for Text-to-Image Generation
Poster Session 4 & Exhibit Hall with Coffee Break
Guanning Zeng ⋅ Xiang Zhang ⋅ Zirui Wang ⋅ Haiyang Xu ⋅ Zeyuan Chen ⋅ Bingnan Li ⋅ Zhuowen Tu
|
Exhibit Hall I #180 | |
|
TITAN-Guide: Taming Inference-Time Alignment for Guided Text-to-Video Diffusion Models
Poster Session 4 & Exhibit Hall with Coffee Break
Christian Simon ⋅ Masato Ishii ⋅ Akio Hayakawa ⋅ Zhi Zhong ⋅ Shusuke Takahashi ⋅ Takashi Shibuya ⋅ Yuki Mitsufuji
|
Exhibit Hall I #170 | |
|
FiVE-Bench: A Fine-grained Video Editing Benchmark for Evaluating Emerging Diffusion and Rectified Flow Models
Poster Session 4 & Exhibit Hall with Coffee Break
Minghan LI ⋅ Chenxi Xie ⋅ Yichen Wu ⋅ Lei Zhang ⋅ Mengyu Wang
|
Exhibit Hall I #171 | |
|
CompSlider: Compositional Slider for Disentangled Multiple-Attribute Image Generation
Poster Session 4 & Exhibit Hall with Coffee Break
Zixin Zhu ⋅ Kevin Duarte ⋅ Mamshad Nayeem Rizve ⋅ Chengyuan Xu ⋅ Ratheesh Kalarot ⋅ Junsong Yuan
|
Exhibit Hall I #172 | |
|
DreamRenderer: Taming Multi-Instance Attribute Control in Large-Scale Text-to-Image Models
Poster Session 4 & Exhibit Hall with Coffee Break
Dewei Zhou ⋅ Mingwei Li ⋅ Zongxin Yang ⋅ Yi Yang
|
Exhibit Hall I #175 | |
|
DiffSim: Taming Diffusion Models for Evaluating Visual Similarity
Poster Session 4 & Exhibit Hall with Coffee Break
Yiren Song ⋅ Xiaokang Liu ⋅ Mike Zheng Shou
|
Exhibit Hall I #193 | |
|
Adversarial Distribution Matching for Diffusion Distillation Towards Efficient Image and Video Synthesis
Yanzuo Lu ⋅ Yuxi Ren ⋅ Xin Xia ⋅ Shanchuan Lin ⋅ XING WANG ⋅ Xuefeng Xiao ⋅ Jinhua Ma ⋅ Xiaohua Xie ⋅ Jianhuang Lai
|
Exhibit Hall I #185 | |
|
Co-Painter: Fine-Grained Controllable Image Stylization via Implicit Decoupling and Adaptive Injection
Poster Session 4 & Exhibit Hall with Coffee Break
Bowen Fu ⋅ Wei Wei ⋅ Jiaqi Tang ⋅ Jiangtao Nie ⋅ Yanyu Ye ⋅ Xiaogang Xu ⋅ Ying-Cong Chen ⋅ Lei Zhang
|
Exhibit Hall I #186 | |
|
PLA: Prompt Learning Attack against Text-to-Image Generative Models
Poster Session 4 & Exhibit Hall with Coffee Break
XINQI LYU ⋅ Yihao LIU ⋅ Yanjie Li ⋅ Bin Xiao
|
Exhibit Hall I #188 | |
|
FreeScale: Unleashing the Resolution of Diffusion Models via Tuning-Free Scale Fusion
Poster Session 4 & Exhibit Hall with Coffee Break
Haonan Qiu ⋅ Shiwei Zhang ⋅ Yujie Wei ⋅ Ruihang Chu ⋅ Hangjie Yuan ⋅ Xiang Wang ⋅ Yingya Zhang ⋅ Ziwei Liu
|
Exhibit Hall I #192 | |
|
Holistic Tokenizer for Autoregressive Image Generation
Poster Session 4 & Exhibit Hall with Coffee Break
Anlin Zheng ⋅ Haochen Wang ⋅ Yucheng Zhao ⋅ Weipeng DENG ⋅ Tiancai Wang ⋅ Xiangyu Zhang ⋅ Xiaojuan Qi
|
Exhibit Hall I #194 | |
|
Toward Better Out-painting: Improving the Image Composition with Initialization Policy Model
Poster Session 4 & Exhibit Hall with Coffee Break
Xuan Han ⋅ Yihao Zhao ⋅ Yanhao Ge ⋅ Mingyu You
|
Exhibit Hall I #196 | |
|
From Image to Video: An Empirical Study of Diffusion Representations
Pedro Vélez ⋅ Luisa Polania Cabrera ⋅ Yi Yang ⋅ Chuhan Zhang ⋅ Rishabh Kabra ⋅ Anurag Arnab ⋅ Mehdi S. M. Sajjadi
|
Exhibit Hall I #197 | |
|
Versatile Transition Generation with Image-to-Video Diffusion
Poster Session 4 & Exhibit Hall with Coffee Break
Zuhao Yang ⋅ Jiahui Zhang ⋅ Yingchen Yu ⋅ Shijian Lu ⋅ Song Bai
|
Exhibit Hall I #200 | |
|
MetaMorph: Multimodal Understanding and Generation via Instruction Tuning
Poster Session 4 & Exhibit Hall with Coffee Break
Shengbang Tong ⋅ David Fan ⋅ Jiachen Zhu ⋅ Yunyang Xiong ⋅ Xinlei Chen ⋅ Koustuv Sinha ⋅ Michael Rabbat ⋅ Yann LeCun ⋅ Saining Xie ⋅ Zhuang Liu
|
Exhibit Hall I #202 | |
|
SafetyDPO: Scalable Safety Alignment for Text-to-Image Generation
Poster Session 4 & Exhibit Hall with Coffee Break
Runtao Liu ⋅ I Chen ⋅ Jindong Gu ⋅ Jipeng Zhang ⋅ Renjie Pi ⋅ Qifeng Chen ⋅ Philip Torr ⋅ Ashkan Khakzar ⋅ Fabio Pizzati
|
Exhibit Hall I #204 | |
|
DiffIP: Representation Fingerprints for Robust IP Protection of Diffusion Models
Poster Session 4 & Exhibit Hall with Coffee Break
Zhuoling Li ⋅ Haoxuan Qu ⋅ Jason Kuen ⋅ Jiuxiang Gu ⋅ Qiuhong Ke ⋅ Jun Liu ⋅ Hossein Rahmani
|
Exhibit Hall I #205 | |
|
FairHuman: Boosting Hand and Face Quality in Human Image Generation with Minimum Potential Delay Fairness in Diffusion Models
Poster Session 4 & Exhibit Hall with Coffee Break
Yuxuan Wang ⋅ Tianwei Cao ⋅ Huayu Zhang ⋅ Zhongjiang He ⋅ Kongming Liang ⋅ Zhanyu Ma
|
Exhibit Hall I #206 | |
|
Processing and acquisition traces in visual encoders: What does CLIP know about your camera?
Ryan Ramos ⋅ Vladan Stojnić ⋅ Giorgos Kordopatis-Zilos ⋅ Yuta Nakashima ⋅ Giorgos Tolias ⋅ Noa Garcia
|
Exhibit Hall I #207 | |
|
AM-Adapter: Appearance Matching Adapter for Exemplar-based Semantic Image Synthesis in-the-Wild
Poster Session 4 & Exhibit Hall with Coffee Break
Siyoon Jin ⋅ Jisu Nam ⋅ Jiyoung Kim ⋅ Dahyun Chung ⋅ Yeong-Seok Kim ⋅ Joonhyung Park ⋅ HeonJeong Chu ⋅ Seungryong Kim
|
Exhibit Hall I #209 | |
|
Diffusion Epistemic Uncertainty with Asymmetric Learning for Diffusion-Generated Image Detection
Poster Session 4 & Exhibit Hall with Coffee Break
Yingsong Huang ⋅ Hui Guo ⋅ Jing Huang ⋅ Bing Bai ⋅ Qi Xiong
|
Exhibit Hall I #211 | |
|
HypDAE: Hyperbolic Diffusion Autoencoders for Hierarchical Few-shot Image Generation
Poster Session 4 & Exhibit Hall with Coffee Break
Lingxiao Li ⋅ Kaixuan Fan ⋅ Boqing Gong ⋅ Xiangyu Yue
|
Exhibit Hall I #212 | |
|
Model Reveals What to Cache: Profiling-Based Feature Reuse for Video Diffusion Models
Poster Session 4 & Exhibit Hall with Coffee Break
Xuran Ma ⋅ Yexin Liu ⋅ Yaofu LIU ⋅ Xianfeng Wu ⋅ Mingzhe Zheng ⋅ Zihao Wang ⋅ Ser-Nam Lim ⋅ Harry Yang
|
Exhibit Hall I #216 | |
|
Rectifying Magnitude Neglect in Linear Attention
Qihang Fan ⋅ Huaibo Huang ⋅ Yuang Ai ⋅ Ran He
|
Exhibit Hall I #160 | |
|
RomanTex: Decoupling 3D-aware Rotary Positional Embedded Multi-Attention Network for Texture Synthesis
Poster Session 4 & Exhibit Hall with Coffee Break
yifei feng ⋅ Mx Yang ⋅ Shuhui Yang ⋅ Sheng Zhang ⋅ Jiaao Yu ⋅ Zibo Zhao ⋅ Lliu Yuhong ⋅ Jie Jiang ⋅ Chunchao Guo
|
Exhibit Hall I #221 | |
|
Calibrating MLLM-as-a-judge via Multimodal Bayesian Prompt Ensembles
Poster Session 4 & Exhibit Hall with Coffee Break
Eric Slyman ⋅ Mehrab Tanjim ⋅ Kushal Kafle ⋅ Stefan Lee
|
Exhibit Hall I #223 | |
|
V.I.P. : Iterative Online Preference Distillation for Efficient Video Diffusion Models
Poster Session 4 & Exhibit Hall with Coffee Break
Jisoo Kim ⋅ Wooseok Seo ⋅ Junwan Kim ⋅ Seungho Park ⋅ Sooyeon Park ⋅ Youngjae Yu
|
Exhibit Hall I #224 | |
|
LOTA: Bit-Planes Guided AI-Generated Image Detection
Poster Session 4 & Exhibit Hall with Coffee Break
Renxi Cheng ⋅ Hongsong Wang ⋅ Yang Zhang ⋅ Chaolei Han ⋅ Jie Gui
|
Exhibit Hall I #225 | |
|
Balanced Image Stylization with Style Matching Score
Poster Session 4 & Exhibit Hall with Coffee Break
Yuxin Jiang ⋅ Liming Jiang ⋅ Shuai Yang ⋅ Jia-Wei Liu ⋅ Ivor Tsang ⋅ Mike Zheng Shou
|
Exhibit Hall I #236 | |
|
Trade-offs in Image Generation: How Do Different Dimensions Interact?
Poster Session 4 & Exhibit Hall with Coffee Break
Sicheng Zhang ⋅ Binzhu Xie ⋅ Zhonghao Yan ⋅ Yuli Zhang ⋅ Donghao Zhou ⋅ Xiaofei Chen ⋅ Shi Qiu ⋅ Jiaqi Liu ⋅ Guoyang Xie ⋅ Zhichao Lu
|
Exhibit Hall I #228 | |
|
X-Prompt: Generalizable Auto-Regressive Visual Learning with In-Context Prompting
Poster Session 4 & Exhibit Hall with Coffee Break
Zeyi Sun ⋅ Ziyang Chu ⋅ Pan Zhang ⋅ Tong Wu ⋅ Xiaoyi Dong ⋅ Yuhang Zang ⋅ Yuanjun Xiong ⋅ Dahua Lin ⋅ Jiaqi Wang
|
Exhibit Hall I #229 | |
|
Long Context Tuning for Video Generation
Poster Session 4 & Exhibit Hall with Coffee Break
Yuwei Guo ⋅ Ceyuan Yang ⋅ Ziyan Yang ⋅ Zhibei Ma ⋅ Zhijie Lin ⋅ Zhenheng Yang ⋅ Dahua Lin ⋅ Lu Jiang
|
Exhibit Hall I #230 | |
|
DreamFuse: Adaptive Image Fusion with Diffusion Transformer
Poster Session 4 & Exhibit Hall with Coffee Break
Junjia Huang ⋅ Pengxiang Yan ⋅ Jiyang Liu ⋅ Jie Wu ⋅ Zhao Wang ⋅ Yitong Wang ⋅ Liang Lin ⋅ Guanbin Li
|
Exhibit Hall I #231 | |
|
AnyI2V: Animating Any Conditional Image with Motion Control
Poster Session 4 & Exhibit Hall with Coffee Break
Ziye Li ⋅ Xincheng Shuai ⋅ Hao Luo ⋅ Henghui Ding
|
Exhibit Hall I #232 | |
|
EEdit : Rethinking the Spatial and Temporal Redundancy for Efficient Image Editing
Poster Session 4 & Exhibit Hall with Coffee Break
Zexuan Yan ⋅ Yue Ma ⋅ Chang Zou ⋅ Wenteng Chen ⋅ Qifeng Chen ⋅ Linfeng Zhang
|
Exhibit Hall I #248 | |
|
RAGDiffusion: Faithful Cloth Generation via External Knowledge Assimilation
Yuhan Li ⋅ Xianfeng Tan ⋅ Wenxiang Shang ⋅ Yubo Wu ⋅ Jian Wang ⋅ Xuanhong Chen ⋅ Yi Zhang ⋅ Zhu Hangcheng ⋅ Bingbing Ni
|
Exhibit Hall I #249 | |
|
Instruction-based Image Editing with Planning, Reasoning, and Generation
Poster Session 4 & Exhibit Hall with Coffee Break
Liya Ji ⋅ Chenyang Qi ⋅ Qifeng Chen
|
Exhibit Hall I #251 | |
|
HDR Image Generation via Gain Map Decomposed Diffusion
Poster Session 4 & Exhibit Hall with Coffee Break
Yuanshen Guan ⋅ Ruikang Xu ⋅ Yinuo Liao ⋅ Mingde Yao ⋅ Lizhi Wang ⋅ Zhiwei Xiong
|
Exhibit Hall I #254 | |
|
ESSENTIAL: Episodic and Semantic Memory Integration for Video Class-Incremental Learning
Jongseo Lee ⋅ Kyungho Bae ⋅ Kyle Min ⋅ Gyeong-Moon Park ⋅ Jinwoo Choi
|
Exhibit Hall I #255 | |
|
Fair Generation without Unfair Distortions: Debiasing Text-to-Image Generation with Entanglement-Free Attention
Poster Session 4 & Exhibit Hall with Coffee Break
Jeonghoon Park ⋅ Juyoung Lee ⋅ Chaeyeon Chung ⋅ Jaeseong Lee ⋅ Jaegul Choo ⋅ Jindong Gu
|
Exhibit Hall I #257 | |
|
Training-Free Text-Guided Image Editing with Visual Autoregressive Model
Poster Session 4 & Exhibit Hall with Coffee Break
Yufei Wang ⋅ Lanqing Guo ⋅ Zhihao Li ⋅ Jiaxing Huang ⋅ Pichao WANG ⋅ Bihan Wen ⋅ Jian Wang
|
Exhibit Hall I #258 | |
|
Accelerating Diffusion Transformer via Gradient-Optimized Cache
Poster Session 4 & Exhibit Hall with Coffee Break
Junxiang Qiu ⋅ Lin Liu ⋅ Shuo Wang ⋅ Jinda Lu ⋅ Kezhou Chen ⋅ Yanbin Hao
|
Exhibit Hall I #261 | |
|
The Silent Assistant: NoiseQuery as Implicit Guidance for Goal-Driven Image Generation
Ruoyu Wang ⋅ Huayang Huang ⋅ Ye Zhu ⋅ Olga Russakovsky ⋅ Yu Wu
|
Exhibit Hall I #262 | |
|
Progressive Growing of Video Tokenizers for Temporally Compact Latent Spaces
Poster Session 4 & Exhibit Hall with Coffee Break
Aniruddha Mahapatra ⋅ Long Mai ⋅ David Bourgin ⋅ Yitian Zhang ⋅ Feng Liu
|
Exhibit Hall I #263 | |
|
ArtEditor: Learning Customized Instructional Image Editor from Few-Shot Examples
Poster Session 4 & Exhibit Hall with Coffee Break
Shijie Huang ⋅ Yiren Song ⋅ Yuxuan Zhang ⋅ Hailong Guo ⋅ Xueyin Wang ⋅ Jiaming Liu
|
Exhibit Hall I #265 | |
|
MC-Bench: A Benchmark for Multi-Context Visual Grounding in the Era of MLLMs
Poster Session 4 & Exhibit Hall with Coffee Break
Yunqiu Xu ⋅ Linchao Zhu ⋅ Yi Yang
|
Exhibit Hall I #267 | |
|
Disrupting Model Merging: A Parameter-Level Defense Without Sacrificing Accuracy
Poster Session 4 & Exhibit Hall with Coffee Break
JUNHAO WEI ⋅ YU ZHE ⋅ Jun Sakuma
|
Exhibit Hall I #269 | |
|
Harmonizing Visual Representations for Unified Multimodal Understanding and Generation
Poster Session 4 & Exhibit Hall with Coffee Break
Size Wu ⋅ Wenwei Zhang ⋅ Lumin Xu ⋅ Sheng Jin ⋅ Zhonghua Wu ⋅ Qingyi Tao ⋅ Wentao Liu ⋅ Wei Li ⋅ Chen Change Loy
|
Exhibit Hall I #273 | |
|
A3GS: Arbitrary Artistic Style into Arbitrary 3D Gaussian Splatting
Poster Session 4 & Exhibit Hall with Coffee Break
Zhiyuan Fang ⋅ Rengan Xie ⋅ Xuancheng Jin ⋅ Qi Ye ⋅ Wei Chen ⋅ Wenting Zheng ⋅ Rui Wang ⋅ Yuchi Huo
|
Exhibit Hall I #274 | |
|
LayerD: Decomposing Raster Graphic Designs into Layers
Poster Session 4 & Exhibit Hall with Coffee Break
Tomoyuki Suzuki ⋅ Kang-Jun Liu ⋅ Naoto Inoue ⋅ Kota Yamaguchi
|
Exhibit Hall I #277 | |
|
ViLU: Learning Vision-Language Uncertainties for Failure Prediction
Poster Session 4 & Exhibit Hall with Coffee Break
Marc Lafon ⋅ Yannis Karmim ⋅ Julio Silva-Rodríguez ⋅ Paul Couairon ⋅ Clément Rambour ⋅ Raphael Fournier-Sniehotta ⋅ Ismail Ayed ⋅ Jose Dolz ⋅ Nicolas THOME
|
Exhibit Hall I #279 | |
|
Subjective Camera 1.0: Bridging Human Cognition and Visual Reconstruction through Sequence-Aware Sketch-Guided Diffusion
Poster Session 4 & Exhibit Hall with Coffee Break
Haoyang Chen ⋅ Dongfang Sun ⋅ Caoyuan Ma ⋅ Shiqin Wang ⋅ Kewei Zhang ⋅ Zheng Wang ⋅ Zhixiang Wang
|
Exhibit Hall I #282 | |
|
GlassWizard: Harvesting Diffusion Priors for Glass Surface Detection
Poster Session 4 & Exhibit Hall with Coffee Break
Wenxue Li ⋅ Tian Ye ⋅ Xinyu Xiong ⋅ Jinbin Bai ⋅ feilong tang ⋅ Wenxuan Song ⋅ Zhaohu Xing ⋅ Lie Ju ⋅ Guanbin Li ⋅ Lei Zhu
|
Exhibit Hall I #283 | |
|
Zero-Shot Compositional Video Learning with Coding Rate Reduction
Poster Session 5 & Exhibit Hall
Heeseok Jung ⋅ Jun-Hyeon Bak ⋅ Yujin Jeong ⋅ Gyugeun Lee ⋅ Jinwoo Ahn ⋅ Eun-Sol Kim
|
Exhibit Hall I #66 | |
|
FedMVP: Federated Multimodal Visual Prompt Tuning for Vision-Language Models
Poster Session 4 & Exhibit Hall with Coffee Break
Mainak Singha ⋅ Subhankar Roy ⋅ Sarthak Mehrotra ⋅ Ankit Jha ⋅ Moloud Abdar ⋅ Biplab Banerjee ⋅ Elisa Ricci
|
Exhibit Hall I #285 | |
|
HyTIP: Hybrid Temporal Information Propagation for Masked Conditional Residual Video Coding
Poster Session 4 & Exhibit Hall with Coffee Break
Yi-Hsin Chen ⋅ Yi-Chen Yao ⋅ Kuan-Wei Ho ⋅ Chun-Hung Wu ⋅ Huu-Tai Phung ⋅ Martin Benjak ⋅ Jörn Ostermann ⋅ Wen-Hsiao Peng
|
Exhibit Hall I #287 | |
|
DACoN: DINO for Anime Paint Bucket Colorization with Any Number of Reference Images
Poster Session 4 & Exhibit Hall with Coffee Break
Kazuma Nagata ⋅ Naoshi Kaneko
|
Exhibit Hall I #288 | |
|
Lay-Your-Scene: Natural Scene Layout Generation with Diffusion Transformers
Poster Session 4 & Exhibit Hall with Coffee Break
Divyansh Srivastava ⋅ Xiang Zhang ⋅ He Wen ⋅ Chenru Wen ⋅ Zhuowen Tu
|
Exhibit Hall I #289 | |
|
Free2Guide: Training-Free Text-to-Video Alignment using Image LVLM
Poster Session 4 & Exhibit Hall with Coffee Break
Jaemin Kim ⋅ Bryan Sangwoo Kim ⋅ Jong Ye
|
Exhibit Hall I #290 | |
|
InfGen: A Resolution-Agnostic Paradigm for Scalable Image Synthesis
Poster Session 4 & Exhibit Hall with Coffee Break
Tao Han ⋅ Wanghan Xu ⋅ Junchao Gong ⋅ Xiaoyu Yue ⋅ Song Guo ⋅ Luping Zhou ⋅ LEI BAI
|
Exhibit Hall I #292 | |
|
VideoVAE+: Large Motion Video Autoencoding with Cross-modal Video VAE
Poster Session 4 & Exhibit Hall with Coffee Break
Yazhou Xing ⋅ Yang Fei ⋅ Yingqing He ⋅ Jingye Chen ⋅ Pengjun Fang ⋅ Xiaowei Chi ⋅ Qifeng Chen
|
Exhibit Hall I #293 | |
|
SpecGuard: Spectral Projection-based Advanced Invisible Watermarking
Poster Session 4 & Exhibit Hall with Coffee Break
Inzamamul Alam ⋅ Md Islam ⋅ Simon Woo ⋅ Khan Muhammad
|
Exhibit Hall I #296 | |
|
DIA: The Adversarial Exposure of Deterministic Inversion in Diffusion Models
Poster Session 4 & Exhibit Hall with Coffee Break
SeungHoo Hong ⋅ GeonHo Son ⋅ Juhun Lee ⋅ Simon Woo
|
Exhibit Hall I #297 | |
|
Supercharged One-step Text-to-Image Diffusion Models with Negative Prompts
Poster Session 4 & Exhibit Hall with Coffee Break
Viet Nguyen ⋅ Anh Nguyen ⋅ Trung Dao ⋅ Khoi Nguyen ⋅ Cuong Pham ⋅ Toan Tran ⋅ Anh Tran
|
Exhibit Hall I #298 | |
|
GFPack++: Attention-Driven Gradient Fields for Optimizing 2D Irregular Packing
Tianyang Xue ⋅ Lin Lu ⋅ Yang Liu ⋅ Mingdong Wu ⋅ Hao Dong ⋅ Yanbin Zhang ⋅ Renmin Han ⋅ Baoquan Chen
|
Exhibit Hall I #299 | |
|
Denoising Token Prediction in Masked Autoregressive Models
Poster Session 4 & Exhibit Hall with Coffee Break
Ting Yao ⋅ Yehao Li ⋅ Yingwei Pan ⋅ Zhaofan Qiu ⋅ Tao Mei
|
Exhibit Hall I #300 | |
|
LACONIC: A 3D Layout Adapter for Controllable Image Creation
Poster Session 4 & Exhibit Hall with Coffee Break
Léopold Maillard ⋅ Tom Durand ⋅ Adrien RAMANANA RAHARY ⋅ Maks Ovsjanikov
|
Exhibit Hall I #302 | |
|
Preserve Anything: Controllable Image Synthesis with Object Preservation
Poster Session 4 & Exhibit Hall with Coffee Break
Prasen Kumar Sharma ⋅ Neeraj Matiyali ⋅ Siddharth Srivastava ⋅ Gaurav Sharma
|
Exhibit Hall I #303 | |
|
Contrastive Test-Time Composition of Multiple LoRA Models for Image Generation
Tuna Meral ⋅ Enis Simsar ⋅ Federico Tombari ⋅ Pinar Yanardag
|
Exhibit Hall I #308 | |
|
FramePainter: Endowing Interactive Image Editing with Video Diffusion Priors
Poster Session 4 & Exhibit Hall with Coffee Break
Yabo Zhang ⋅ xinpeng zhou ⋅ Yihan Zeng ⋅ Hang Xu ⋅ Hui Li ⋅ Wangmeng Zuo
|
Exhibit Hall I #311 | |
|
PlanGen: Towards Unified Layout Planning and Image Generation in Auto-Regressive Vision Language Models
Poster Session 4 & Exhibit Hall with Coffee Break
Runze He ⋅ bo cheng ⋅ Yuhang Ma ⋅ QingxiangJia QingxiangJia ⋅ Shanyuan Liu ⋅ Ao Ma ⋅ Xiaoyu Wu ⋅ Liebucha Wu ⋅ Dawei Leng ⋅ Yuhui Yin
|
Exhibit Hall I #313 | |
|
Turbo2K: Towards Ultra-Efficient and High-Quality 2K Video Synthesis
Poster Session 4 & Exhibit Hall with Coffee Break
Jingjing Ren ⋅ Wenbo Li ⋅ Zhongdao Wang ⋅ Haoze Sun ⋅ Bangzhen Liu ⋅ Haoyu Chen ⋅ Jiaqi Xu ⋅ Aoxue Li ⋅ Shifeng Zhang ⋅ Bin Shao ⋅ Yong Guo ⋅ Lei Zhu
|
Exhibit Hall I #314 | |
|
Anchor Token Matching: Implicit Structure Locking for Training-free AR Image Editing
Poster Session 4 & Exhibit Hall with Coffee Break
Taihang Hu ⋅ Linxuan Li ⋅ Kai Wang ⋅ Yaxing Wang ⋅ jian Yang ⋅ Ming-Ming Cheng
|
Exhibit Hall I #315 | |
|
Parametric Shadow Control for Portrait Generation in Text-to-Image Diffusion Models
Poster Session 4 & Exhibit Hall with Coffee Break
Haoming Cai ⋅ Tsung-Wei Huang ⋅ Shiv Gehlot ⋅ Brandon Feng ⋅ Sachin Shah ⋅ Guan-Ming Su ⋅ Christopher Metzler
|
Exhibit Hall I #319 | |
|
GenDoP: Auto-regressive Camera Trajectory Generation as a Director of Photography
Poster Session 4 & Exhibit Hall with Coffee Break
Mengchen Zhang ⋅ Tong Wu ⋅ Jing Tan ⋅ Ziwei Liu ⋅ Gordon Wetzstein ⋅ Dahua Lin
|
Exhibit Hall I #321 | |
|
CompleteMe: Reference-based Human Image Completion
Poster Session 4 & Exhibit Hall with Coffee Break
Yu-Ju Tsai ⋅ Brian Price ⋅ Qing Liu ⋅ Luis Figueroa ⋅ Daniil Pakhomov ⋅ Zhihong Ding ⋅ Scott Cohen ⋅ Ming-Hsuan Yang
|
Exhibit Hall I #323 | |
|
REPA-E: Unlocking VAE for End-to-End Tuning of Latent Diffusion Transformers
Poster Session 4 & Exhibit Hall with Coffee Break
Xingjian Leng ⋅ Jaskirat Singh ⋅ Yunzhong Hou ⋅ Zhenchang Xing ⋅ Saining Xie ⋅ Liang Zheng
|
Exhibit Hall I #324 | |
|
EEGMirror: Leveraging EEG data in the wild via Montage-Agnostic Self-Supervision for EEG to Video Decoding
Poster Session 4 & Exhibit Hall with Coffee Break
Xuan-Hao Liu ⋅ Bao-liang Lu ⋅ Wei-Long Zheng
|
Exhibit Hall I #325 | |
|
Accelerating Diffusion Sampling via Exploiting Local Transition Coherence
Poster Session 4 & Exhibit Hall with Coffee Break
shangwen zhu ⋅ Han Zhang ⋅ Zhantao Yang ⋅ Qianyu Peng ⋅ Zhao Pu ⋅ Huangji Wang ⋅ Fan Cheng
|
Exhibit Hall I #326 | |
|
SA-LUT: Spatial Adaptive 4D Look-Up Table for Photorealistic Style Transfer
Poster Session 4 & Exhibit Hall with Coffee Break
Zerui Gong ⋅ Zhonghua Wu ⋅ Qingyi Tao ⋅ Qinyue Li ⋅ Chen Change Loy
|
Exhibit Hall I #327 | |
|
UniversalBooth: Model-Agnostic Personalized Text-to-Image Generation
Poster Session 4 & Exhibit Hall with Coffee Break
Songhua Liu ⋅ Ruonan Yu ⋅ Xinchao Wang
|
Exhibit Hall I #329 | |
|
UniGlyph: Unified Segmentation-Conditioned Diffusion for Precise Visual Text Synthesis
Poster Session 4 & Exhibit Hall with Coffee Break
Yuanrui Wang ⋅ Cong Han ⋅ Yafei Li ⋅ Zhipeng Jin ⋅ Xiawei Li ⋅ Sinan Du ⋅ Wen Tao ⋅ Yi Yang ⋅ shuanglong li ⋅ Chun Yuan ⋅ LIU LIN
|
Exhibit Hall I #331 | |
|
ADIEE: Automatic Dataset Creation and Scorer for Instruction-Guided Image Editing Evaluation
Poster Session 4 & Exhibit Hall with Coffee Break
Sherry Chen ⋅ Yi Wei ⋅ Luowei Zhou ⋅ Suren Kumar
|
Exhibit Hall I #332 | |
|
Semantic Discrepancy-aware Detector for Image Forgery Identification
Poster Session 4 & Exhibit Hall with Coffee Break
Wang Ziye ⋅ Minghang Yu ⋅ Chunyan Xu ⋅ Zhen Cui
|
Exhibit Hall I #336 | |
|
Scalable Ranked Preference Optimization for Text-to-Image Generation
Poster Session 4 & Exhibit Hall with Coffee Break
Shyamgopal Karthik ⋅ Huseyin Coskun ⋅ Zeynep Akata ⋅ Sergey Tulyakov ⋅ Jian Ren ⋅ Anil Kag
|
Exhibit Hall I #337 | |
|
FairGen: Enhancing Fairness in Text-to-Image Diffusion Models via Self-Discovering Latent Directions
Poster Session 4 & Exhibit Hall with Coffee Break
Yilei Jiang ⋅ Wei-Hong Li ⋅ Yiyuan Zhang ⋅ Minghong Cai ⋅ Xiangyu Yue
|
Exhibit Hall I #338 | |
|
Benchmarking and Learning Multi-Dimensional Quality Evaluator for Text-to-3D Generation
Poster Session 4 & Exhibit Hall with Coffee Break
Yujie Zhang ⋅ Bingyang Cui ⋅ Qi Yang ⋅ Zhu Li ⋅ Yiling Xu
|
Exhibit Hall I #352 | |
|
REGEN: Learning Compact Video Embedding with (Re-)Generative Decoder
Poster Session 4 & Exhibit Hall with Coffee Break
Yitian Zhang ⋅ Long Mai ⋅ Aniruddha Mahapatra ⋅ David Bourgin ⋅ Yicong Hong ⋅ Jonah Casebeer ⋅ Feng Liu ⋅ Yun Fu
|
Exhibit Hall I #342 | |
|
FonTS: Text Rendering With Typography and Style Controls
Poster Session 4 & Exhibit Hall with Coffee Break
Wenda SHI ⋅ Yiren Song ⋅ Dengming Zhang ⋅ Jiaming Liu ⋅ XINGXING ZOU
|
Exhibit Hall I #343 | |
|
CreatiLayout: Siamese Multimodal Diffusion Transformer for Creative Layout-to-Image Generation
Poster Session 4 & Exhibit Hall with Coffee Break
Hui Zhang ⋅ Dexiang Hong ⋅ Yitong Wang ⋅ Jie Shao ⋅ Xinglong Wu ⋅ Zuxuan Wu ⋅ Yu-Gang Jiang
|
Exhibit Hall I #345 | |
|
CoMatch: Dynamic Covisibility-Aware Transformer for Bilateral Subpixel-Level Semi-Dense Image Matching
Zizhuo Li ⋅ Yifan Lu ⋅ Linfeng Tang ⋅ Shihua Zhang ⋅ Jiayi Ma
|
Exhibit Hall I #348 | |
|
G2SF: Geometry-Guided Score Fusion for Multimodal Industrial Anomaly Detection
Poster Session 5 & Exhibit Hall
Chengyu Tao ⋅ Xuanming Cao ⋅ Juan Du
|
Exhibit Hall I #70 | |
|
PASTA: Part-Aware Sketch-to-3D Shape Generation with Text-Aligned Prior
Poster Session 4 & Exhibit Hall with Coffee Break
Seunggwan Lee ⋅ Hwanhee Jung ⋅ ByoungSoo Koh ⋅ Qixing Huang ⋅ Sang Yoon ⋅ Sangpil Kim
|
Exhibit Hall I #354 | |
|
Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation
Poster Session 4 & Exhibit Hall with Coffee Break
Yuqing Wang ⋅ Zhijie Lin ⋅ Yao Teng ⋅ Yuanzhi Zhu ⋅ Shuhuai Ren ⋅ Jiashi Feng ⋅ Xihui Liu
|
Exhibit Hall I #355 | |
|
Gain-MLP: Improving HDR Gain Map Encoding via a Lightweight MLP
Poster Session 4 & Exhibit Hall with Coffee Break
Trevor Canham ⋅ SaiKiran Tedla ⋅ Michael Murdoch ⋅ Michael Brown
|
Exhibit Hall I #357 | |
|
From Prompt to Progression: Taming Video Diffusion Models for Seamless Attribute Transition
Poster Session 4 & Exhibit Hall with Coffee Break
Ling Lo ⋅ Kelvin Chan ⋅ Wen-Huang Cheng ⋅ Ming-Hsuan Yang
|
Exhibit Hall I #360 | |
|
Dense2MoE: Restructuring Diffusion Transformer to MoE for Efficient Text-to-Image Generation
Poster Session 4 & Exhibit Hall with Coffee Break
Youwei Zheng ⋅ Yuxi Ren ⋅ Xin Xia ⋅ Xuefeng Xiao ⋅ Xiaohua Xie
|
Exhibit Hall I #361 | |
|
Less-to-More Generalization: Unlocking More Controllability by In-Context Generation
Poster Session 4 & Exhibit Hall with Coffee Break
shaojin wu ⋅ Mengqi Huang ⋅ wenxu wu ⋅ Yufeng Cheng ⋅ Fei Ding ⋅ Qian HE
|
Exhibit Hall I #363 | |
|
Sparse Fine-Tuning of Transformers for Generative Tasks
Poster Session 4 & Exhibit Hall with Coffee Break
Wei Chen ⋅ Jingxi Yu ⋅ Zichen Miao ⋅ Qiang Qiu
|
Exhibit Hall I #365 | |
|
FlexGen: Flexible Multi-View Generation from Text and Image Inputs
Poster Session 4 & Exhibit Hall with Coffee Break
Xinli Xu ⋅ Wenhang Ge ⋅ Jiantao Lin ⋅ Jiawei Feng ⋅ Lie XU ⋅ hanfeng Zhao ⋅ Shunsi Zhang ⋅ Ying-Cong Chen
|
Exhibit Hall I #366 | |
|
Prompt-A-Video: Prompt Your Video Diffusion Model via Preference-Aligned LLM
Poster Session 4 & Exhibit Hall with Coffee Break
Yatai Ji ⋅ Jiacheng Zhang ⋅ Jie Wu ⋅ Shilong Zhang ⋅ Shoufa Chen ⋅ Chongjian GE ⋅ Peize Sun ⋅ Weifeng Chen ⋅ Wenqi Shao ⋅ Xuefeng Xiao ⋅ Weilin Huang ⋅ Ping Luo
|
Exhibit Hall I #367 | |
|
Dynamic-VLM: Simple Dynamic Visual Token Compression for VideoLLM
Poster Session 5 & Exhibit Hall
Han Wang ⋅ Yuxiang Nie ⋅ Yongjie Ye ⋅ Yanjie Wang ⋅ SHUAI LI ⋅ Haiyang Yu ⋅ Jinghui Lu ⋅ Can Huang
|
Exhibit Hall I #96 | |
|
Learning Implicit Features with Flow-Infused Transformations for Realistic Virtual Try-On
Poster Session 4 & Exhibit Hall with Coffee Break
Delong Zhang ⋅ Qiwei Huang ⋅ Yang Sun ⋅ Yuanliu Liu ⋅ Wei-Shi Zheng ⋅ Pengfei Xiong ⋅ Wei Zhang
|
Exhibit Hall I #368 | |
|
AIGI-Holmes: Towards Explainable and Generalizable AI-Generated Image Detection via Multimodal Large Language Models
Poster Session 4 & Exhibit Hall with Coffee Break
Ziyin Zhou ⋅ Yunpeng Luo ⋅ Yuanchen Wu ⋅ Ke Sun ⋅ Jiayi Ji ⋅ Ke Yan ⋅ Shouhong Ding ⋅ Xiaoshuai Sun ⋅ Yunsheng Wu ⋅ Rongrong Ji
|
Exhibit Hall I #369 | |
|
GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation
Poster Session 4 & Exhibit Hall with Coffee Break
Tianwei Xiong ⋅ Jun Hao Liew ⋅ Zilong Huang ⋅ Jiashi Feng ⋅ Xihui Liu
|
Exhibit Hall I #371 | |
|
Collaborative Instance Object Navigation: Leveraging Uncertainty-Awareness to Minimize Human-Agent Dialogues
Poster Session 4 & Exhibit Hall with Coffee Break
Francesco Taioli ⋅ Edoardo Zorzi ⋅ Gianni Franchi ⋅ Alberto Castellini ⋅ Alessandro Farinelli ⋅ Marco Cristani ⋅ Yiming Wang
|
Exhibit Hall I #372 | |
|
Puppet-Master: Scaling Interactive Video Generation as a Motion Prior for Part-Level Dynamics
Poster Session 3 & Exhibit Hall
Ruining Li ⋅ Chuanxia Zheng ⋅ Christian Rupprecht ⋅ Andrea Vedaldi
|
Exhibit Hall I #321 | |
|
ReasonVQA: A Multi-hop Reasoning Benchmark with Structural Knowledge for Visual Question Answering
Poster Session 4 & Exhibit Hall with Coffee Break
Duong T. Tran ⋅ Trung-Kien Tran ⋅ Manfred Hauswirth ⋅ Danh Le-Phuoc
|
Exhibit Hall I #373 | |
|
LOCATEdit: Graph Laplacian Optimized Cross Attention for Localized Text-Guided Image Editing
Poster Session 4 & Exhibit Hall with Coffee Break
Achint Soni ⋅ Meet Soni ⋅ Sirisha Rambhatla
|
Exhibit Hall I #374 | |
|
Learned Image Compression with Hierarchical Progressive Context Modeling
Poster Session 4 & Exhibit Hall with Coffee Break
Yuqi Li ⋅ Haotian Zhang ⋅ Li Li ⋅ Dong Liu
|
Exhibit Hall I #377 | |
|
Early Timestep Zero-Shot Candidate Selection for Instruction-Guided Image Editing
Poster Session 4 & Exhibit Hall with Coffee Break
Joowon Kim ⋅ Ziseok Lee ⋅ Donghyeon Cho ⋅ Sanghyun Jo ⋅ Yeonsung Jung ⋅ Kyungsu Kim ⋅ Eunho Yang
|
Exhibit Hall I #378 | |
|
Teleportraits: Training-Free People Insertion into Any Scene
Poster Session 4 & Exhibit Hall with Coffee Break
Jialu Gao ⋅ Joseph K J ⋅ Fernando De la Torre
|
Exhibit Hall I #380 | |
|
DCT-Shield: A Robust Frequency Domain Defense against Malicious Image Editing
Aniruddha Bala ⋅ Rohit Chowdhury ⋅ Rohan Jaiswal ⋅ Siddharth Roheda
|
Exhibit Hall I #381 | |
|
Context Guided Transformer Entropy Modeling for Video Compression
Poster Session 4 & Exhibit Hall with Coffee Break
Junlong Tong ⋅ Wei Zhang ⋅ Yaohui Jin ⋅ Xiaoyu Shen
|
Exhibit Hall I #382 | |
|
UIP2P: Unsupervised Instruction-based Image Editing via Edit Reversibility Constraint
Poster Session 4 & Exhibit Hall with Coffee Break
Enis Simsar ⋅ Alessio Tonioni ⋅ Yongqin Xian ⋅ Thomas Hofmann ⋅ Federico Tombari
|
Exhibit Hall I #385 | |
|
DiT4SR: Taming Diffusion Transformer for Real-World Image Super-Resolution
Poster Session 4 & Exhibit Hall with Coffee Break
Zheng-Peng Duan ⋅ jiawei zhang ⋅ Xin Jin ⋅ Ziheng Zhang ⋅ Zheng Xiong ⋅ Dongqing Zou ⋅ Jimmy Ren ⋅ Chun-Le Guo ⋅ Chongyi Li
|
Exhibit Hall I #390 | |
|
USP: Unified Self-Supervised Pretraining for Image Generation and Understanding
Poster Session 4 & Exhibit Hall with Coffee Break
Xiangxiang Chu ⋅ Renda Li ⋅ Yong Wang
|
Exhibit Hall I #344 | |
|
Bi-Level Optimization for Self-Supervised AI-Generated Face Detection
Mian Zou ⋅ Nan Zhong ⋅ Baosheng Yu ⋅ Yibing Zhan ⋅ Kede Ma
|
Exhibit Hall I #391 | |
|
VisualCloze: A Universal Image Generation Framework via Visual In-Context Learning
Poster Session 4 & Exhibit Hall with Coffee Break
Zhong-Yu Li ⋅ Ruoyi Du ⋅ Juncheng Yan ⋅ Le Zhuo ⋅ Zhen Li ⋅ Peng Gao ⋅ Zhanyu Ma ⋅ Ming-Ming Cheng
|
Exhibit Hall I #392 | |
|
Neighboring Autoregressive Modeling for Efficient Visual Generation
Poster Session 4 & Exhibit Hall with Coffee Break
Yefei He ⋅ Yuanyu He ⋅ Shaoxuan He ⋅ Feng Chen ⋅ Hong Zhou ⋅ Kaipeng Zhang ⋅ Bohan Zhuang
|
Exhibit Hall I #395 | |
|
FastVAR: Linear Visual Autoregressive Modeling via Cached Token Pruning
Poster Session 4 & Exhibit Hall with Coffee Break
Hang Guo ⋅ Yawei Li ⋅ Taolin Zhang ⋅ Jiangshan Wang ⋅ Tao Dai ⋅ Shu-Tao Xia ⋅ Luca Benini
|
Exhibit Hall I #396 | |
|
Tune-Your-Style: Intensity-tunable 3D Style Transfer with Gaussian Splatting
Poster Session 4 & Exhibit Hall with Coffee Break
Yian Zhao ⋅ rushi ye ⋅ Ruochong Zheng ⋅ Zesen Cheng ⋅ Chaoran Feng ⋅ Jiashu Yang ⋅ Pengchong Qiao ⋅ Chang Liu ⋅ Jie Chen
|
Exhibit Hall I #398 | |
|
QK-Edit: Revisiting Attention-based Injection in MM-DiT for Image and Video Editing
Poster Session 4 & Exhibit Hall with Coffee Break
Tiancheng SHEN ⋅ Jun Hao Liew ⋅ Zilong Huang ⋅ Xiangtai Li ⋅ Zhijie Lin ⋅ Jiyang Liu ⋅ Yitong Wang ⋅ Jiashi Feng ⋅ Ming-Hsuan Yang
|
Exhibit Hall I #399 | |
|
Beyond Isolated Words: Diffusion Brush for Handwritten Text-Line Generation
Poster Session 4 & Exhibit Hall with Coffee Break
Gang Dai ⋅ Yifan Zhang ⋅ Yutao Qin ⋅ Qiangya Guo ⋅ Shuangping Huang ⋅ Shuicheng YAN
|
Exhibit Hall I #400 | |
|
Always Skip Attention
Poster Session 5 & Exhibit Hall
Yiping Ji ⋅ Hemanth Saratchandran ⋅ Peyman Moghadam ⋅ Simon Lucey
|
Exhibit Hall I #313 | |
|
BadVideo: Stealthy Backdoor Attack against Text-to-Video Generation
Poster Session 4 & Exhibit Hall with Coffee Break
Ruotong Wang ⋅ Mingli Zhu ⋅ Jiarong Ou ⋅ Rui Chen ⋅ Xin Tao ⋅ Pengfei Wan ⋅ Baoyuan Wu
|
Exhibit Hall I #402 | |
|
Any2AnyTryon: Leveraging Adaptive Position Embeddings for Versatile Virtual Clothing Tasks
Poster Session 4 & Exhibit Hall with Coffee Break
Hailong Guo ⋅ Bohan Zeng ⋅ Yiren Song ⋅ Wentao Zhang ⋅ Jiaming Liu ⋅ Chuang Zhang
|
Exhibit Hall I #403 | |
|
Blended Point Cloud Diffusion for Localized Text-guided Shape Editing
Etai Sella ⋅ Noam Atia ⋅ Ron Mokady ⋅ Hadar Averbuch-Elor
|
Exhibit Hall I #406 | |
|
VSC: Visual Search Compositional Text-to-Image Diffusion Model
Poster Session 4 & Exhibit Hall with Coffee Break
Do Dat ⋅ Nam Hyeon-Woo ⋅ Po-Yuan Mao ⋅ Tae-Hyun Oh
|
Exhibit Hall I #409 | |
|
Fine-Tuning Visual Autogressive Models for Subject-Driven Generation
Poster Session 4 & Exhibit Hall with Coffee Break
Jiwoo Chung ⋅ Sangeek Hyun ⋅ Hyunjun Kim ⋅ Eunseo Koh ⋅ Minkyu Lee ⋅ Jae-Pil Heo
|
Exhibit Hall I #411 | |
|
Flow to the Mode: Mode-Seeking Diffusion Autoencoders for State-of-the-Art Image Tokenization
Poster Session 4 & Exhibit Hall with Coffee Break
Kyle Sargent ⋅ Kyle Hsu ⋅ Justin Johnson ⋅ Li Fei-Fei ⋅ Jiajun Wu
|
Exhibit Hall I #439 | |
|
Pretrained Reversible Generation as Unsupervised Visual Representation Learning
Poster Session 4 & Exhibit Hall with Coffee Break
Rongkun Xue ⋅ Jinouwen Zhang ⋅ Yazhe Niu ⋅ Dazhong Shen ⋅ Bingqi Ma ⋅ Yu Liu ⋅ Jing Yang
|
Exhibit Hall I #415 | |
|
DLF: Extreme Image Compression with Dual-generative Latent Fusion
Naifu Xue ⋅ Zhaoyang Jia ⋅ Jiahao Li ⋅ Bin Li ⋅ Yuan Zhang ⋅ Yan Lu
|
Exhibit Hall I #416 | |
|
Tracing Copied Pixels and Regularizing Patch Affinity in Copy Detection
Poster Session 4 & Exhibit Hall with Coffee Break
Yichen Lu ⋅ Siwei Nie ⋅ Minlong Lu ⋅ Xudong Yang ⋅ Xiaobo Zhang ⋅ Peng Zhang
|
Exhibit Hall I #418 | |
|
Beyond Brain Decoding: Visual-Semantic Reconstructions to Mental Creation Extension Based on fMRI
Poster Session 4 & Exhibit Hall with Coffee Break
Haodong Jing ⋅ Dongyao Jiang ⋅ Yongqiang Ma ⋅ Haibo Hua ⋅ Bo Huang ⋅ Nanning Zheng
|
Exhibit Hall I #419 | |
|
Exploiting Domain Properties in Language-Driven Domain Generalization for Semantic Segmentation
Poster Session 5 & Exhibit Hall
Seogkyu Jeon ⋅ Kibeom Hong ⋅ Hyeran Byun
|
Exhibit Hall I #94 | |
|
PixTalk: Controlling Photorealistic Image Processing and Editing with Language
Poster Session 4 & Exhibit Hall with Coffee Break
Marcos Conde ⋅ Zihao Lu ⋅ Radu Timofte
|
Exhibit Hall I #420 | |
|
ADCD-Net: Robust Document Image Forgery Localization via Adaptive DCT Feature and Hierarchical Content Disentanglement
Poster Session 4 & Exhibit Hall with Coffee Break
KA WONG ⋅ Jicheng Zhou ⋅ Haiwei Wu ⋅ Yain-Whar Si ⋅ Jiantao Zhou
|
Exhibit Hall I #421 | |
|
Towards Robust Defense against Customization via Protective Perturbation Resistant to Diffusion-based Purification
Wenkui Yang ⋅ Jie Cao ⋅ Junxian Duan ⋅ Ran He
|
Exhibit Hall I #422 | |
|
A Unified Framework for Industrial Cel-Animation Colorization with Temporal-Structural Awareness
Poster Session 4 & Exhibit Hall with Coffee Break
Xiaoyi Feng ⋅ Tao Huang ⋅ Peng Wang ⋅ Zizhou Huang ⋅ Haihang Zhang ⋅ Yuntao Zou ⋅ Dagang Li ⋅ Kaifeng Zou
|
Exhibit Hall I #423 | |
|
Generative Video Bi-flow
Poster Session 4 & Exhibit Hall with Coffee Break
Chen Liu ⋅ Tobias Ritschel
|
Exhibit Hall I #429 | |
|
AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation
Poster Session 4 & Exhibit Hall with Coffee Break
Moayed Haji-Ali ⋅ Willi Menapace ⋅ Aliaksandr Siarohin ⋅ Ivan Skorokhodov ⋅ Alper Canberk ⋅ Kwot Sin Lee ⋅ Vicente Ordonez ⋅ Sergey Tulyakov
|
Exhibit Hall I #430 | |
|
T2I-Copilot: A Training-Free Multi-Agent Text-to-Image System for Enhanced Prompt Interpretation and Interactive Generation
Poster Session 4 & Exhibit Hall with Coffee Break
Chieh-Yun Chen ⋅ Min Shi ⋅ Gong Zhang ⋅ Humphrey Shi
|
Exhibit Hall I #432 | |
|
LayerLock: Non-collapsing Representation Learning with Progressive Freezing
Poster Session 4 & Exhibit Hall with Coffee Break
Goker Erdogan ⋅ Nikhil Parthasarathy ⋅ Catalin Ionescu ⋅ Drew Hudson ⋅ Alexander Lerchner ⋅ Andrew Zisserman ⋅ Mehdi S. M. Sajjadi ⋅ Joao Carreira
|
Exhibit Hall I #438 | |
|
Adaptive Routing of Text-to-Image Generation Requests Between Large Cloud Model and Light-Weight Edge Model
Poster Session 4 & Exhibit Hall with Coffee Break
Zewei Xin ⋅ Qinya Li ⋅ Chaoyue Niu ⋅ Fan Wu ⋅ Guihai Chen
|
Exhibit Hall I #440 | |
|
Exploring Multimodal Diffusion Transformers for Enhanced Prompt-based Image Editing
Poster Session 4 & Exhibit Hall with Coffee Break
Joonghyuk Shin ⋅ Alchan Hwang ⋅ Yujin Kim ⋅ Daneul Kim ⋅ Jaesik Park
|
Exhibit Hall I #441 | |
|
JPEG Processing Neural Operator for Backward-Compatible Coding
Poster Session 4 & Exhibit Hall with Coffee Break
Woo Kyoung Han ⋅ Yongjun Lee ⋅ Byeonghun Lee ⋅ Sang Hyun Park ⋅ Sunghoon Im ⋅ Kyong Hwan Jin
|
Exhibit Hall I #442 | |
|
EasyControl: Adding Efficient and Flexible Control for Diffusion Transformer
Poster Session 4 & Exhibit Hall with Coffee Break
Yuxuan Zhang ⋅ Yirui Yuan ⋅ Yiren Song ⋅ Haofan Wang ⋅ Jiaming Liu
|
Exhibit Hall I #443 | |
|
All Parts Matter: A Unified Mask-Free Virtual Try-On Framework
Poster Session 4 & Exhibit Hall with Coffee Break
Chenghu Du ⋅ Shengwu Xiong ⋅ Yi Rong
|
Exhibit Hall I #444 | |
|
Function-centric Bayesian Network for Zero-Shot Object Goal Navigation
Poster Session 4 & Exhibit Hall with Coffee Break
Sixian Zhang ⋅ Xinyao Yu ⋅ Xinhang Song ⋅ Yiyao Wang ⋅ Shuqiang Jiang
|
Exhibit Hall I #445 | |
|
Attention to Neural Plagiarism: Diffusion Models Can Plagiarize Your Copyrighted Images!
Poster Session 4 & Exhibit Hall with Coffee Break
zihang zou ⋅ Boqing Gong ⋅ Liqiang Wang
|
Exhibit Hall I #446 | |
|
Distilling Parallel Gradients for Fast ODE Solvers of Diffusion Models
Poster Session 4 & Exhibit Hall with Coffee Break
Beier Zhu ⋅ Ruoyu Wang ⋅ Tong Zhao ⋅ Hanwang Zhang ⋅ Chi Zhang
|
Exhibit Hall I #447 | |
|
LATINO-PRO: LAtent consisTency INverse sOlver with PRompt Optimization
Poster Session 4 & Exhibit Hall with Coffee Break
Alessio Spagnoletti ⋅ Jean Prost ⋅ Andres Almansa ⋅ Nicolas Papadakis ⋅ Marcelo Pereyra
|
Exhibit Hall I #451 | |
|
EDiT: Efficient Diffusion Transformers with Linear Compressed Attention
Poster Session 4 & Exhibit Hall with Coffee Break
Philipp Becker ⋅ Abhinav Mehrotra ⋅ Ruchika Chavhan ⋅ Malcolm Chadwick ⋅ Luca Morreale ⋅ Mehdi Noroozi ⋅ Alberto Gil Couto Pimentel Ramos ⋅ Sourav Bhattacharya
|
Exhibit Hall I #452 | |
|
Hate in Plain Sight: On the Risks of Moderating AI-Generated Hateful Illusions
Poster Session 4 & Exhibit Hall with Coffee Break
Yiting Qu ⋅ Ziqing Yang ⋅ Yihan Ma ⋅ Michael Backes ⋅ Savvas Zannettou ⋅ Yang Zhang
|
Exhibit Hall I #453 | |
|
DC-AE 1.5: Accelerating Diffusion Model Convergence with Structured Latent Space
Poster Session 4 & Exhibit Hall with Coffee Break
Junyu Chen ⋅ Dongyun Zou ⋅ Wenkun He ⋅ Junsong Chen ⋅ Enze Xie ⋅ Song Han ⋅ Han Cai
|
Exhibit Hall I #454 | |
|
MH-LVC: Multi-Hypothesis Temporal Prediction for Learned Conditional Residual Video Coding
Poster Session 4 & Exhibit Hall with Coffee Break
Gao Zong lin ⋅ Huu-Tai Phung ⋅ Yi-Chen Yao ⋅ Kuan-Wei Ho ⋅ Yi-Hsin Chen ⋅ Yu-Hsiang Lin ⋅ Alessandro Gnutti ⋅ Wen-Hsiao Peng
|
Exhibit Hall I #456 | |
|
Sculpting Memory: Multi-Concept Forgetting in Diffusion Models via Dynamic Mask and Concept-Aware Optimization
Poster Session 4 & Exhibit Hall with Coffee Break
Li ⋅ Yang Xiao ⋅ Jie Ji ⋅ Kaiyuan Deng ⋅ Bo Hui ⋅ Linke Guo ⋅ Xiaolong Ma
|
Exhibit Hall I #457 | |
|
On the Provable Importance of Gradients for Autonomous Language-Assisted Image Clustering
Bo Peng ⋅ Jie Lu ⋅ Guangquan Zhang ⋅ Zhen Fang
|
Exhibit Hall I #1 | |
|
Inter2Former: Dynamic Hybrid Attention for Efficient High-Precision Interactive Segmentation
Poster Session 5 & Exhibit Hall
You Huang ⋅ Lichao Chen ⋅ Jiayi Ji ⋅ Liujuan Cao ⋅ Shengchuan Zhang ⋅ Rongrong Ji
|
Exhibit Hall I #2 | |
|
OphCLIP: Hierarchical Retrieval-Augmented Learning for Ophthalmic Surgical Video-Language Pretraining
Poster Session 5 & Exhibit Hall
Ming Hu ⋅ Kun yuan ⋅ Yaling Shen ⋅ feilong tang ⋅ Xiaohao Xu ⋅ Lin Zhou ⋅ Wei Li ⋅ Ying Chen ⋅ Zhongxing Xu ⋅ Zelin Peng ⋅ Siyuan Yan ⋅ Vinkle Srivastav ⋅ Diping Song ⋅ Tianbin Li ⋅ Danli Shi ⋅ Jin Ye ⋅ Nicolas Padoy ⋅ Nassir Navab ⋅ Junjun He ⋅ Zongyuan Ge
|
Exhibit Hall I #4 | |
|
HiERO: Understanding the Hierarchy of Human Behavior Enhances Reasoning on Egocentric Videos
Poster Session 5 & Exhibit Hall
Simone Alberto Peirone ⋅ Francesca Pistilli ⋅ Giuseppe Averta
|
Exhibit Hall I #6 | |
|
CaptionSmiths: Flexibly Controlling Language Pattern in Image Captioning
Kuniaki Saito ⋅ Donghyun Kim ⋅ Kwanyong Park ⋅ Atsushi Hashimoto ⋅ Yoshitaka Ushiku
|
Exhibit Hall I #7 | |
|
An Efficient Hybrid Vision Transformer for TinyML Applications
Poster Session 5 & Exhibit Hall
Fanhong Zeng ⋅ Huanan LI ⋅ Juntao Guan ⋅ Rui Fan ⋅ Tong Wu ⋅ Xilong Wang ⋅ Lai Rui
|
Exhibit Hall I #11 | |
|
Graph Domain Adaptation with Dual-branch Encoder and Two-level Alignment for Whole Slide Image-based Survival Prediction
Poster Session 5 & Exhibit Hall
Yuntao Shou ⋅ Xiangyong Cao ⋅ PeiqiangYan PeiqiangYan ⋅ Qiaohui Qiaohui ⋅ Qian Zhao ⋅ Deyu Meng
|
Exhibit Hall I #12 | |
|
CNS-Bench: Benchmarking Image Classifier Robustness Under Continuous Nuisance Shifts
Poster Session 5 & Exhibit Hall
Olaf Dünkel ⋅ Artur Jesslen ⋅ Jiahao Xie ⋅ Christian Theobalt ⋅ Christian Rupprecht ⋅ Adam Kortylewski
|
Exhibit Hall I #17 | |
|
Visual Test-time Scaling for GUI Agent Grounding
Tiange Luo ⋅ Lajanugen Logeswaran ⋅ Justin Johnson ⋅ Honglak Lee
|
Exhibit Hall I #18 | |
|
Multi-Schema Proximity Network for Composed Image Retrieval
Poster Session 5 & Exhibit Hall
Jiangming Shi ⋅ Xiangbo Yin ⋅ yeyunchen yeyunchen ⋅ Yachao Zhang ⋅ zhizhong zhang ⋅ Yuan Xie ⋅ Yanyun Qu
|
Exhibit Hall I #19 | |
|
ReferDINO: Referring Video Object Segmentation with Visual Grounding Foundations
Poster Session 5 & Exhibit Hall
Tianming Liang ⋅ Kun-Yu Lin ⋅ Chaolei Tan ⋅ Jianguo Zhang ⋅ Wei-Shi Zheng ⋅ Jian-Fang Hu
|
Exhibit Hall I #20 | |
|
GECKO: Gigapixel Vision-Concept Contrastive Pretraining in Histopathology
Saarthak Kapse ⋅ Pushpak Pati ⋅ Srikar Yellapragada ⋅ Srijan Das ⋅ Rajarsi Gupta ⋅ Joel Saltz ⋅ Dimitris Samaras ⋅ Prateek Prasanna
|
Exhibit Hall I #21 | |
|
Lumina-Image 2.0: A Unified and Efficient Image Generative Framework
Poster Session 5 & Exhibit Hall
Qi Qin ⋅ Le Zhuo ⋅ Yi Xin ⋅ Ruoyi Du ⋅ Zhen Li ⋅ Bin Fu ⋅ Yiting Lu ⋅ Xinyue Li ⋅ Dongyang Liu ⋅ Xiangyang Zhu ⋅ Will Beddow ⋅ Erwann Millon ⋅ Victor Perez ⋅ Wenhai Wang ⋅ Yu Qiao ⋅ Bo Zhang ⋅ Xiaohong Liu ⋅ Hongsheng Li ⋅ Chang Xu ⋅ Peng Gao
|
Exhibit Hall I #22 | |
|
DiSCO-3D : Discovering and Segmenting Sub-Concepts from Open-vocabulary Queries in NeRF
Poster Session 5 & Exhibit Hall
Doriand Petit ⋅ Steve Bourgeois ⋅ Vincent Gay-Bellile ⋅ Florian Chabot ⋅ Loïc Barthe
|
Exhibit Hall I #23 | |
|
ESCNet:Edge-Semantic Collaborative Network for Camouflaged Object Detection
Poster Session 5 & Exhibit Hall
Sheng Ye ⋅ Xin Chen ⋅ Yan Zhang ⋅ Xianming Lin ⋅ Liujuan Cao
|
Exhibit Hall I #24 | |
|
Test-time Adaptation for Foundation Medical Segmentation Model Without Parametric Updates
Kecheng Chen ⋅ Xinyu Luo ⋅ Tiexin Qin ⋅ Jie Liu ⋅ Hui Liu ⋅ Victor Ho Fun Lee ⋅ Hong Yan ⋅ Haoliang Li
|
Exhibit Hall I #26 | |
|
ResQ: A Novel Framework to Implement Residual Neural Networks on Analog Rydberg Atom Quantum Computers
Poster Session 5 & Exhibit Hall
Nicholas DiBrita ⋅ Jason Han ⋅ Tirthak Patel
|
Exhibit Hall I #27 | |
|
M-Net: MRI Brain Tumor Sequential Segmentation Network via Mesh-Cast
Poster Session 5 & Exhibit Hall
Jiacheng Lu ⋅ Hui Ding ⋅ Shiyu Zhang ⋅ Guoping Huo
|
Exhibit Hall I #30 | |
|
Moment Quantization for Video Temporal Grounding
Poster Session 5 & Exhibit Hall
Xiaolong Sun ⋅ Le Wang ⋅ Sanping Zhou ⋅ Liushuai Shi ⋅ Kun Xia ⋅ Mengnan Liu ⋅ Yabing Wang ⋅ Gang Hua
|
Exhibit Hall I #32 | |
|
SVTRv2: CTC Beats Encoder-Decoder Models in Scene Text Recognition
Poster Session 5 & Exhibit Hall
Yongkun Du ⋅ Zhineng Chen ⋅ Hongtao Xie ⋅ Caiyan Jia ⋅ Yu-Gang Jiang
|
Exhibit Hall I #33 | |
|
ROVI: A VLM-LLM Re-Captioned Dataset for Open-Vocabulary Instance-Grounded Text-to-Image Generation
Poster Session 5 & Exhibit Hall
Cihang Peng ⋅ Qiming HOU ⋅ Zhong Ren ⋅ Kun Zhou
|
Exhibit Hall I #38 | |
|
S⁴M: Boosting Semi-Supervised Instance Segmentation with SAM
Poster Session 5 & Exhibit Hall
Heeji Yoon ⋅ Heeseong Shin ⋅ Eunbeen Hong ⋅ Hyunwook Choi ⋅ Hansang Cho ⋅ Daun Jeong ⋅ Seungryong Kim
|
Exhibit Hall I #40 | |
|
Structure-aware Semantic Discrepancy and Consistency for 3D Medical Image Self-supervised Learning
Poster Session 5 & Exhibit Hall
Tan Pan ⋅ Zhaorui Tan ⋅ Kaiyu Guo ⋅ Dongli Xu ⋅ Weidi Xu ⋅ Chen Jiang ⋅ Xin Guo ⋅ Yuan Qi ⋅ Yuan Cheng
|
Exhibit Hall I #43 | |
|
ARGUS: Hallucination and Omission Evaluation in Video-LLMs
Poster Session 5 & Exhibit Hall
Ruchit Rawal ⋅ Reza Shirkavand ⋅ Heng Huang ⋅ Gowthami Somepalli ⋅ Tom Goldstein
|
Exhibit Hall I #45 | |
|
Feature Purification Matters: Suppressing Outlier Propagation for Training-Free Open-Vocabulary Semantic Segmentation
Shuo Jin ⋅ Siyue Yu ⋅ Bingfeng Zhang ⋅ Mingjie Sun ⋅ Yi Dong ⋅ Jimin XIAO
|
Exhibit Hall I #46 | |
|
DiffPS: Leveraging Prior Knowledge of Diffusion Model for Person Search
Giyeol Kim ⋅ Sooyoung Yang ⋅ Jihyong Oh ⋅ Myungjoo Kang ⋅ Chanho Eom
|
Exhibit Hall I #47 | |
|
Mind the Gap: Aligning Vision Foundation Models to Image Feature Matching
Poster Session 5 & Exhibit Hall
Yuhan Liu ⋅ Jingwen Fu ⋅ Yang Wu ⋅ Kangyi Wu ⋅ Pengna Li ⋅ Jiayi Wu ⋅ Sanping Zhou ⋅ Jingmin Xin
|
Exhibit Hall I #48 | |
|
COIN: Confidence Score-Guided Distillation for Annotation-Free Cell Segmentation
Poster Session 5 & Exhibit Hall
Sanghyun Jo ⋅ Seo Lee ⋅ Seungwoo Lee ⋅ Seohyung Hong ⋅ Hyungseok Seo ⋅ Kyungsu Kim
|
Exhibit Hall I #49 | |
|
OVG-HQ: Online Video Grounding with Hybrid-modal Queries
Poster Session 5 & Exhibit Hall
Runhao Zeng ⋅ Jiaqi Mao ⋅ Minghao Lai ⋅ Vu Phan ⋅ Yanjie Dong ⋅ Wei Wang ⋅ Qi Chen ⋅ Xiping Hu
|
Exhibit Hall I #120 | |
|
Learn2Synth: Learning Optimal Data Synthesis Using Hypergradients for Brain Image Segmentation
Poster Session 5 & Exhibit Hall
Xiaoling Hu ⋅ Xiangrui Zeng ⋅ Oula Puonti ⋅ Juan Iglesias ⋅ Bruce Fischl ⋅ Yaël Balbastre
|
Exhibit Hall I #53 | |
|
Representation Shift: Unifying Token Compression with FlashAttention
Poster Session 5 & Exhibit Hall
Joonmyung Choi ⋅ Sanghyeok Lee ⋅ Byungoh Ko ⋅ Eunseo Kim ⋅ Jihyung Kil ⋅ Hyunwoo Kim
|
Exhibit Hall I #61 | |
|
ZipVL: Accelerating Vision-Language Models through Dynamic Token Sparsity
Poster Session 5 & Exhibit Hall
Yefei He ⋅ Feng Chen ⋅ Jing Liu ⋅ Wenqi Shao ⋅ Hong Zhou ⋅ Kaipeng Zhang ⋅ Bohan Zhuang
|
Exhibit Hall I #63 | |
|
ProSAM: Enhancing the Robustness of SAM-based Visual Reference Segmentation with Probabilistic Prompts
Poster Session 5 & Exhibit Hall
Xiaoqi Wang ⋅ Clint Sebastian ⋅ Wenbin He ⋅ Liu Ren
|
Exhibit Hall I #64 | |
|
LaCoOT: Layer Collapse through Optimal Transport
Poster Session 5 & Exhibit Hall
Victor Quétu ⋅ Zhu LIAO ⋅ Nour Hezbri ⋅ Fabio Pizzati ⋅ Enzo Tartaglione
|
Exhibit Hall I #65 | |
|
Fuzzy Contrastive Decoding to Alleviate Object Hallucination in Large Vision-Language Models
Poster Session 5 & Exhibit Hall
Jieun Kim ⋅ Jinmyeong Kim ⋅ Yoonji Kim ⋅ Sung-Bae Cho
|
Exhibit Hall I #72 | |
|
Semantic versus Identity: A Divide-and-Conquer Approach towards Adjustable Medical Image De-Identification
Poster Session 5 & Exhibit Hall
Yuan Tian ⋅ Shuo Wang ⋅ Rongzhao Zhang ⋅ Zijian Chen ⋅ Yankai Jiang ⋅ Chunyi Li ⋅ Xiangyang Zhu ⋅ Fang Yan ⋅ Qiang Hu ⋅ Xiaosong Wang ⋅ Guangtao Zhai
|
Exhibit Hall I #78 | |
|
Towards Video Thinking Test: A Holistic Benchmark for Advanced Video Reasoning and Understanding
Poster Session 5 & Exhibit Hall
Yuanhan Zhang ⋅ Yunice Chew ⋅ Yuhao Dong ⋅ Aria Leo ⋅ Bo Hu ⋅ Ziwei Liu
|
Exhibit Hall I #79 | |
|
Cross-View Isolated Sign Language Recognition via View Synthesis and Feature Disentanglement
Poster Session 5 & Exhibit Hall
Xin Shen ⋅ Xinyu Wang ⋅ Lei Shen ⋅ Kaihao Zhang ⋅ Xin Yu
|
Exhibit Hall I #81 | |
|
Geo4D: Leveraging Video Generators for Geometric 4D Scene Reconstruction
Zeren Jiang ⋅ Chuanxia Zheng ⋅ Iro Laina ⋅ Diane Larlus ⋅ Andrea Vedaldi
|
Exhibit Hall I #82 | |
|
Fix-CLIP: Dual-Branch Hierarchical Contrastive Learning via Synthetic Captions for Better Understanding of Long Text
Poster Session 5 & Exhibit Hall
Bingchao Wang ⋅ Zhiwei Ning ⋅ Jianyu Ding ⋅ Xuanang Gao ⋅ Yin Li ⋅ Dongsheng Jiang ⋅ JIE YANG ⋅ Wei Liu
|
Exhibit Hall I #85 | |
|
Superpowering Open-Vocabulary Object Detectors for X-ray Vision
Poster Session 5 & Exhibit Hall
Pablo Garcia-Fernandez ⋅ Lorenzo Vaquero ⋅ Mingxuan Liu ⋅ Feng Xue ⋅ Daniel Cores ⋅ Nicu Sebe ⋅ Manuel Mucientes ⋅ Elisa Ricci
|
Exhibit Hall I #92 | |
|
RhythmGuassian: Repurposing Generalizable Gaussian Model For Remote Physiological Measurement
Hao LU ⋅ Yuting Zhang ⋅ Jiaqi Tang ⋅ Bowen Fu ⋅ Wenhang Ge ⋅ Wei Wei ⋅ Kaishun Wu ⋅ Ying-Cong Chen
|
Exhibit Hall I #93 | |
|
Beyond Text-Visual Attention: Exploiting Visual Cues for Effective Token Pruning in VLMs
Poster Session 5 & Exhibit Hall
Qizhe Zhang ⋅ Aosong Cheng ⋅ Ming Lu ⋅ Renrui Zhang ⋅ Zhiyong Zhuo ⋅ Jiajun Cao ⋅ Shaobo Guo ⋅ Qi She ⋅ Shanghang Zhang
|
Exhibit Hall I #100 | |
|
MaTVLM: Hybrid Mamba-Transformer for Efficient Vision-Language Modeling
Poster Session 5 & Exhibit Hall
Yingyue Li ⋅ Bencheng Liao ⋅ Wenyu Liu ⋅ Xinggang Wang
|
Exhibit Hall I #102 | |
|
UniConvNet: Expanding Effective Receptive Field while Maintaining Asymptotically Gaussian Distribution for ConvNets of Any Scale
Poster Session 5 & Exhibit Hall
Yuhao Wang ⋅ Wei Xi
|
Exhibit Hall I #106 | |
|
On the Recovery of Cameras from Fundamental Matrices
Rakshith Madhavan ⋅ Federica Arrigoni
|
Exhibit Hall I #107 | |
|
Wavelet Policy: Lifting Scheme for Policy Learning in Long-Horizon Tasks
Poster Session 3 & Exhibit Hall
Hao Huang ⋅ Shuaihang Yuan ⋅ Geeta Chandra Raju Bethala ⋅ Congcong Wen ⋅ Anthony Tzes ⋅ Yi Fang
|
Exhibit Hall I #220 | |
|
MultiverSeg: Scalable Interactive Segmentation of Biomedical Imaging Datasets with In-Context Guidance
Poster Session 5 & Exhibit Hall
Hallee Wong ⋅ Jose Javier Gonzalez Ortiz ⋅ John Guttag ⋅ Adrian Dalca
|
Exhibit Hall I #110 | |
|
The Devil is in the Spurious Correlations: Boosting Moment Retrieval with Dynamic Learning
Poster Session 5 & Exhibit Hall
Xinyang Zhou ⋅ Fanyue Wei ⋅ Lixin Duan ⋅ Angela Yao ⋅ Wen Li
|
Exhibit Hall I #111 | |
|
CABLD: Contrast-Agnostic Brain Landmark Detection with Consistency-Based Regularization
Poster Session 5 & Exhibit Hall
Soorena Salari ⋅ Arash Harirpoush ⋅ Hassan Rivaz ⋅ Yiming Xiao
|
Exhibit Hall I #112 | |
|
EVEv2: Improved Baselines for Encoder-Free Vision-Language Models
Haiwen Diao ⋅ Xiaotong Li ⋅ Yufeng Cui ⋅ Yueze Wang ⋅ Haoge Deng ⋅ Ting Pan ⋅ Wenxuan Wang ⋅ Huchuan Lu ⋅ Xinlong Wang
|
Exhibit Hall I #114 | |
|
Describe, Adapt and Combine: Empowering CLIP Encoders for Open-set 3D Object Retrieval
Poster Session 5 & Exhibit Hall
Zhichuan Wang ⋅ Yang Zhou ⋅ Zhe Liu ⋅ Rui Yu ⋅ Song Bai ⋅ Yulong Wang ⋅ Xinwei He ⋅ Xiang Bai
|
Exhibit Hall I #115 | |
|
Robustifying Zero-Shot Vision Language Models by Subspaces Alignment
Poster Session 5 & Exhibit Hall
Junhao Dong ⋅ Piotr Koniusz ⋅ Liaoyuan Feng ⋅ Yifei Zhang ⋅ Hao Zhu ⋅ Weiming Liu ⋅ Xinghua Qu ⋅ YEW-SOON ONG
|
Exhibit Hall I #116 | |
|
V2PE: Improving Multimodal Long-Context Capability of Vision-Language Models with Variable Visual Position Encoding
Poster Session 5 & Exhibit Hall
Junqi Ge ⋅ Ziyi Chen ⋅ Jintao Lin ⋅ Jinguo Zhu ⋅ Xihui Liu ⋅ Jifeng Dai ⋅ Xizhou Zhu
|
Exhibit Hall I #119 | |
|
Enhancing Zero-shot Object Counting via Text-guided Local Ranking and Number-evoked Global Attention
Poster Session 5 & Exhibit Hall
Shiwei Zhang ⋅ Qi Zhou ⋅ Wei Ke
|
Exhibit Hall I #121 | |
|
SegAnyPET: Universal Promptable Segmentation from Positron Emission Tomography Images
Poster Session 5 & Exhibit Hall
Yichi Zhang ⋅ Le Xue ⋅ Wenbo zhang ⋅ Lanlan Li ⋅ Yuchen Liu ⋅ Chen Jiang ⋅ Yuan Cheng ⋅ Yuan Qi
|
Exhibit Hall I #122 | |
|
Multi-View Slot Attention Using Paraphrased Texts for Face Anti-Spoofing
Poster Session 5 & Exhibit Hall
Jeongmin Yu ⋅ Susang Kim ⋅ Kisu Lee ⋅ Taekyoung Kwon ⋅ Won-Yong Shin ⋅ Ha Young Kim
|
Exhibit Hall I #123 | |
|
4D-Bench: Benchmarking Multi-modal Large Language Models for 4D Object Understanding
Poster Session 5 & Exhibit Hall
Wenxuan Zhu ⋅ Bing Li ⋅ Cheng Zheng ⋅ Jinjie Mai ⋅ Jun Chen ⋅ Letian Jiang ⋅ Abdullah Hamdi ⋅ Sara Rojas Martinez ⋅ Chia-Wen Lin ⋅ Mohamed Elhoseiny ⋅ Bernard Ghanem
|
Exhibit Hall I #124 | |
|
Uncertainty-Driven Expert Control: Enhancing the Reliability of Medical Vision-Language Models
Poster Session 5 & Exhibit Hall
Xiao Liang ⋅ Di Wang ⋅ Zhicheng Jiao ⋅ Ronghan Li ⋅ Pengfei Yang ⋅ Quan Wang ⋅ Tat-Seng Chua
|
Exhibit Hall I #125 | |
|
OuroMamba: A Data-Free Quantization Framework for Vision Mamba
Poster Session 5 & Exhibit Hall
Akshat Ramachandran ⋅ Mingyu Lee ⋅ Huan Xu ⋅ Souvik Kundu ⋅ Tushar Krishna
|
Exhibit Hall I #128 | |
|
Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers
Poster Session 5 & Exhibit Hall
Weiming Ren ⋅ Wentao Ma ⋅ Huan Yang ⋅ Cong Wei ⋅ Ge Zhang ⋅ Wenhu Chen
|
Exhibit Hall I #130 | |
|
SAMora: Enhancing SAM through Hierarchical Self-Supervised Pre-Training for Medical Images
Poster Session 5 & Exhibit Hall
Shuhang Chen ⋅ Hangjie Yuan ⋅ Pengwei Liu ⋅ Hanxue Gu ⋅ Tao Feng ⋅ Dong Ni
|
Exhibit Hall I #131 | |
|
FE-CLIP: Frequency Enhanced CLIP Model for Zero-Shot Anomaly Detection and Segmentation
Poster Session 5 & Exhibit Hall
Tao Gong ⋅ Qi Chu ⋅ Bin Liu ⋅ Zhou Wei ⋅ Nenghai Yu
|
Exhibit Hall I #132 | |
|
Referring Expression Comprehension for Small Objects
Poster Session 5 & Exhibit Hall
Kanoko Goto ⋅ Takumi Hirose ⋅ Mahiro Ukai ⋅ Shuhei Kurita ⋅ Nakamasa Inoue
|
Exhibit Hall I #133 | |
|
AID: Adapting Image2Video Diffusion Models for Instruction-guided Video Prediction
Poster Session 5 & Exhibit Hall
Zhen Xing ⋅ Qi Dai ⋅ Zejia Weng ⋅ Zuxuan Wu ⋅ Yu-Gang Jiang
|
Exhibit Hall I #134 | |
|
CutS3D: Cutting Semantics in 3D for 2D Unsupervised Instance Segmentation
Poster Session 5 & Exhibit Hall
Leon Sick ⋅ Dominik Engel ⋅ Sebastian Hartwig ⋅ Pedro Hermosilla ⋅ Timo Ropinski
|
Exhibit Hall I #136 | |
|
Text-guided Visual Prompt DINO for Generic Segmentation
Poster Session 5 & Exhibit Hall
Yuchen Guan ⋅ Chong Sun ⋅ Canmiao Fu ⋅ Zhipeng Huang ⋅ Chun Yuan ⋅ Chen Li
|
Exhibit Hall I #138 | |
|
ETVA: Evaluation of Text-to-Video Alignment via Fine-grained Question Generation and Answering
Poster Session 5 & Exhibit Hall
Kaisi Guan ⋅ Zhengfeng Lai ⋅ Yuchong Sun ⋅ Peng Zhang ⋅ Wei Liu ⋅ Xiaojiang Liu ⋅ Meng Cao ⋅ Ruihua Song
|
Exhibit Hall I #139 | |
|
GEMeX: A Large-Scale, Groundable, and Explainable Medical VQA Benchmark for Chest X-ray Diagnosis
Poster Session 5 & Exhibit Hall
Bo Liu ⋅ Ke Zou ⋅ Li-Ming Zhan ⋅ ZEXIN LU ⋅ Xiaoyu DONG ⋅ Chengqiang Xie ⋅ Yidi Chen ⋅ Jiannong Cao ⋅ Xiao-Ming Wu ⋅ Huazhu Fu
|
Exhibit Hall I #140 | |
|
Bias-Resilient Weakly Supervised Semantic Segmentation Using Normalizing Flows
Poster Session 5 & Exhibit Hall
Xianglin Qiu ⋅ Xiaoyang Wang ⋅ Zhen Zhang ⋅ Jimin XIAO
|
Exhibit Hall I #141 | |
|
MedSegFactory: Text-Guided Generation of Medical Image-Mask Pairs
Poster Session 5 & Exhibit Hall
Jiawei Mao ⋅ Yuhan Wang ⋅ Yucheng Tang ⋅ Daguang Xu ⋅ Kang Wang ⋅ Yang Yang ⋅ Zongwei Zhou ⋅ Yuyin Zhou
|
Exhibit Hall I #162 | |
|
Cracking Instance Jigsaw Puzzles: A Superior Alternative to Multiple Instance Learning for Whole Slide Image Analysis
Poster Session 5 & Exhibit Hall
Xiwen Chen ⋅ Peijie Qiu ⋅ Wenhui Zhu ⋅ Hao Wang ⋅ Huayu Li ⋅ XUANZHAO DONG ⋅ Xiaotong Sun ⋅ Xiaobing Yu ⋅ Yalin Wang ⋅ Abolfazl Razi ⋅ Aristedis Sotiras
|
Exhibit Hall I #144 | |
|
STDDNet: Harnessing Mamba for Video Polyp Segmentation via Spatial-aligned Temporal Modeling and Discriminative Dynamic Representation Learning
Poster Session 5 & Exhibit Hall
Guilian Chen ⋅ Huisi Wu ⋅ Jing Qin
|
Exhibit Hall I #145 | |
|
FLOSS: Free Lunch in Open-vocabulary Semantic Segmentation
Poster Session 5 & Exhibit Hall
Yasser Benigmim ⋅ Mohammad Fahes ⋅ Tuan-Hung Vu ⋅ Andrei Bursuc ⋅ Raoul de Charette
|
Exhibit Hall I #157 | |
|
Sparse-Dense Side-Tuner for efficient Video Temporal Grounding
Poster Session 5 & Exhibit Hall
David Pujol-Perich ⋅ Sergio Escalera ⋅ Albert Clapés
|
Exhibit Hall I #161 | |
|
Towards a Universal 3D Medical Multi-modality Generalization via Learning Personalized Invariant Representation
Poster Session 5 & Exhibit Hall
Zhaorui Tan ⋅ Xi Yang ⋅ Tan Pan ⋅ TIANYI LIU ⋅ Chen Jiang ⋅ Xin Guo ⋅ Qiufeng Wang ⋅ Anh Nguyen ⋅ Yuan Qi ⋅ Kaizhu Huang ⋅ Yuan Cheng
|
Exhibit Hall I #196 | |
|
DecAD: Decoupling Anomalies in Latent Space for Multi-Class Unsupervised Anomaly Detection
Poster Session 5 & Exhibit Hall
Xiaolei Wang ⋅ Xiaoyang Wang ⋅ Huihui Bai ⋅ ENG Gee LIM ⋅ Jimin XIAO
|
Exhibit Hall I #166 | |
|
Few-Shot Pattern Detection via Template Matching and Regression
Eunchan Jo ⋅ Dahyun Kang ⋅ Sanghyun Kim ⋅ Yunseon Choi ⋅ Minsu Cho
|
Exhibit Hall I #167 | |
|
Hierarchical Event Memory for Accurate and Low-latency Online Video Temporal Grounding
Poster Session 5 & Exhibit Hall
Minghang Zheng ⋅ Yuxin Peng ⋅ Benyuan Sun ⋅ Yi Yang ⋅ Yang Liu
|
Exhibit Hall I #168 | |
|
Learning Yourself: Class-Incremental Semantic Segmentation with Language-Inspired Bootstrapped Disentanglement
Poster Session 5 & Exhibit Hall
Ruitao Wu ⋅ Yifan Zhao ⋅ Jia Li
|
Exhibit Hall I #171 | |
|
Aligning Information Capacity Between Vision and Language via Dense-to-Sparse Feature Distillation for Image-Text matching
Poster Session 5 & Exhibit Hall
Yang Liu ⋅ Wentao Feng ⋅ Zhuoyao Liu ⋅ Shudong Huang ⋅ Jiancheng Lv
|
Exhibit Hall I #176 | |
|
RA-BUSSeg: Relation-aware Semi-supervised Breast Ultrasound Image Segmentation via Adjacent Propagation and Cross-layer Alignment
Poster Session 5 & Exhibit Hall
Wanting ZHANG ⋅ Zhenhui Ding ⋅ Guilian Chen ⋅ Huisi Wu ⋅ Jing Qin
|
Exhibit Hall I #177 | |
|
ExCap3D: Expressive 3D Scene Understanding via Object Captioning with Varying Detail
Poster Session 5 & Exhibit Hall
Chandan Yeshwanth ⋅ David Rozenberszki ⋅ Angela Dai
|
Exhibit Hall I #178 | |
|
DisCo: Towards Distinct and Coherent Visual Encapsulation in Video MLLMs
Poster Session 5 & Exhibit Hall
JIAHE ZHAO ⋅ rongkun Zheng ⋅ Yi Wang ⋅ Helin WANG ⋅ Hengshuang Zhao
|
Exhibit Hall I #179 | |
|
CC-OCR: A Comprehensive and Challenging OCR Benchmark for Evaluating Large Multimodal Models in Literacy
Poster Session 5 & Exhibit Hall
Zhibo Yang ⋅ Jun Tang ⋅ Zhaohai Li ⋅ Pengfei Wang ⋅ Jianqiang Wan ⋅ Humen Zhong ⋅ Xuejing Liu ⋅ Mingkun Yang ⋅ Peng Wang ⋅ Shuai Bai ⋅ Lianwen Jin ⋅ Junyang Lin
|
Exhibit Hall I #182 | |
|
Exploring Probabilistic Modeling Beyond Domain Generalization for Semantic Segmentation
Poster Session 5 & Exhibit Hall
I-Hsiang Chen ⋅ Hua-En Chang ⋅ Wei-Ting Chen ⋅ Jenq-Newng Hwang ⋅ Sy-Yen Kuo
|
Exhibit Hall I #183 | |
|
Prototypes are Balanced Units for Efficient and Effective Partially Relevant Video Retrieval
Poster Session 5 & Exhibit Hall
WonJun Moon ⋅ Cheol-Ho Cho ⋅ Woojin Jun ⋅ Minho Shim ⋅ Taeoh Kim ⋅ Inwoong Lee ⋅ Dongyoon Wee ⋅ Jae-Pil Heo
|
Exhibit Hall I #186 | |
|
VideoAds for Fast-Paced Video Understanding
Poster Session 5 & Exhibit Hall
Zheyuan Zhang ⋅ Wanying Dou ⋅ Linkai Peng ⋅ Hongyi Pan ⋅ Ulas Bagci ⋅ Boqing Gong
|
Exhibit Hall I #188 | |
|
Auto-Controlled Image Perception in MLLMs via Visual Perception Tokens
Poster Session 5 & Exhibit Hall
Runpeng Yu ⋅ Xinyin Ma ⋅ Xinchao Wang
|
Exhibit Hall I #189 | |
|
Refer to Any Segmentation Mask Group With Vision-Language Prompts
Poster Session 5 & Exhibit Hall
Shengcao Cao ⋅ Zijun Wei ⋅ Jason Kuen ⋅ Kangning Liu ⋅ Lingzhi Zhang ⋅ Jiuxiang Gu ⋅ HyunJoon Jung ⋅ Liangyan Gui ⋅ Yu-Xiong Wang
|
Exhibit Hall I #192 | |
|
Triad: Empowering LMM-based Anomaly Detection with Expert-guided Region-of-Interest Tokenizer and Manufacturing Process
Poster Session 5 & Exhibit Hall
Yuanze Li ⋅ Shihao Yuan ⋅ Haolin Wang ⋅ Qizhang Li ⋅ Ming Liu ⋅ Chen Xu ⋅ Guangming Shi ⋅ Wangmeng Zuo
|
Exhibit Hall I #198 | |
|
Bridging the Gap between Brain and Machine in Interpreting Visual Semantics: Towards Self-adaptive Brain-to-Text Decoding
Poster Session 5 & Exhibit Hall
Jiaxuan Chen ⋅ Yu Qi ⋅ Yueming Wang ⋅ Gang Pan
|
Exhibit Hall I #200 | |
|
DisTime: Distribution-based Time Representation for Video Large Language Models
Poster Session 5 & Exhibit Hall
yingsen zeng ⋅ Zepeng Huang ⋅ Yujie Zhong ⋅ Chengjian Feng ⋅ Jie Hu ⋅ Lin Ma ⋅ Yang Liu
|
Exhibit Hall I #202 | |
|
WeaveSeg: Iterative Contrast-weaving and Spectral Feature-refining for Nuclei Instance Segmentation
Jiajia Li ⋅ Huisi Wu ⋅ Jing Qin
|
Exhibit Hall I #204 | |
|
How Can Objects Help Video-Language Understanding?
Poster Session 5 & Exhibit Hall
Zitian Tang ⋅ Shijie Wang ⋅ Junho Cho ⋅ Jaewook Yoo ⋅ Chen Sun
|
Exhibit Hall I #205 | |
|
Everything is a Video: Unifying Modalities through Next-Frame Prediction
Poster Session 5 & Exhibit Hall
G Thomas Hudson ⋅ Dean Slack ⋅ Thomas Winterbottom ⋅ Jamie Stirling ⋅ Chenghao Xiao ⋅ Junjie Shentu ⋅ Noura Al Moubayed
|
Exhibit Hall I #206 | |
|
Talking to DINO: Bridging Self-Supervised Vision Backbones with Language for Open-Vocabulary Segmentation
Poster Session 5 & Exhibit Hall
Luca Barsellotti ⋅ Lorenzo Bianchi ⋅ Nicola Messina ⋅ Fabio Carrara ⋅ Marcella Cornia ⋅ Lorenzo Baraldi ⋅ Fabrizio Falchi ⋅ Rita Cucchiara
|
Exhibit Hall I #208 | |
|
CARIM: Caption-Based Autonomous Driving Scene Retrieval via Inclusive Text Matching
Poster Session 5 & Exhibit Hall
Minjoo Ki ⋅ Dae Jung Kim ⋅ Kisung Kim ⋅ Seon Joo Kim ⋅ Jinhan Lee
|
Exhibit Hall I #209 | |
|
Beyond Training: Dynamic Token Merging for Zero-Shot Video Understanding
Poster Session 5 & Exhibit Hall
Yiming Zhang ⋅ Zhuokai Zhao ⋅ Zhaorun Chen ⋅ Zenghui Ding ⋅ Xianjun Yang ⋅ Yining Sun
|
Exhibit Hall I #210 | |
|
Modeling Saliency Dataset Bias
Matthias Kümmerer ⋅ Harneet Singh Khanuja ⋅ Matthias Bethge
|
Exhibit Hall I #213 | |
|
Beyond Single Images: Retrieval Self-Augmented Unsupervised Camouflaged Object Detection
Poster Session 5 & Exhibit Hall
Ji Du ⋅ Xin WANG ⋅ Fangwei Hao ⋅ Mingyang Yu ⋅ Chunyuan Chen ⋅ Jiesheng Wu ⋅ Bin Wang ⋅ Jing Xu ⋅ Ping Li
|
Exhibit Hall I #218 | |
|
Advancing Visual Large Language Model for Multi-granular Versatile Perception
Poster Session 5 & Exhibit Hall
Wentao Xiang ⋅ Haoxian Tan ⋅ Cong Wei ⋅ Yujie Zhong ⋅ Dengjie Li ⋅ Yujiu Yang
|
Exhibit Hall I #220 | |
|
Controllable Latent Space Augmentation for Digital Pathology
Poster Session 5 & Exhibit Hall
Sofiène Boutaj ⋅ Marin Scalbert ⋅ Pierre Marza ⋅ Florent Couzinie-Devy ⋅ Maria Vakalopoulou ⋅ Stergios Christodoulidis
|
Exhibit Hall I #221 | |
|
PS3: A Multimodal Transformer Integrating Pathology Reports with Histology Images and Biological Pathways for Cancer Survival Prediction
Poster Session 5 & Exhibit Hall
Manahil Raza ⋅ Ayesha Azam ⋅ Talha Qaiser ⋅ Nasir Rajpoot
|
Exhibit Hall I #222 | |
|
Balanced Sharpness-Aware Minimization for Imbalanced Regression
Poster Session 2 & Exhibit Hall with Coffee Break
Yahao Liu ⋅ Qin Wang ⋅ Lixin Duan ⋅ Wen Li
|
Exhibit Hall I #114 | |
|
MIEB: Massive Image Embedding Benchmark
Poster Session 5 & Exhibit Hall
Chenghao Xiao ⋅ Isaac Chung ⋅ Imene Kerboua ⋅ Jamie Stirling ⋅ Xin Zhang ⋅ Márton Kardos ⋅ Roman Solomatin ⋅ Noura Al Moubayed ⋅ Kenneth Enevoldsen ⋅ Niklas Muennighoff
|
Exhibit Hall I #223 | |
|
Interpretable point cloud classification using multiple instance learning
Matt De Vries ⋅ Reed Naidoo ⋅ Olga Fourkioti ⋅ Lucas Dent ⋅ Nathan Curry ⋅ Chris Dunsby ⋅ Chris Bakal
|
Exhibit Hall I #225 | |
|
Fine-grained Abnormality Prompt Learning for Zero-shot Anomaly Detection
Poster Session 5 & Exhibit Hall
Jiawen Zhu ⋅ YEW-SOON ONG ⋅ Chunhua Shen ⋅ Guansong Pang
|
Exhibit Hall I #228 | |
|
Bidirectional Likelihood Estimation with Multi-Modal Large Language Models for Text-Video Retrieval
Dohwan Ko ⋅ Ji Soo Lee ⋅ Minhyuk Choi ⋅ Zihang Meng ⋅ Hyunwoo Kim
|
Exhibit Hall I #232 | |
|
Alleviating Textual Reliance in Medical Language-guided Segmentation via Prototype-driven Semantic Approximation
Poster Session 5 & Exhibit Hall
Shuchang Ye ⋅ Usman Naseem ⋅ Mingyuan Meng ⋅ jinman kim
|
Exhibit Hall I #237 | |
|
Controllable-LPMoE: Adapting to Challenging Object Segmentation via Dynamic Local Priors from Mixture-of-Experts
Poster Session 5 & Exhibit Hall
Yanguang Sun ⋅ Jiawei Lian ⋅ jian Yang ⋅ lei luo
|
Exhibit Hall I #238 | |
|
Progressive Test Time Energy Adaptation for Medical Image Segmentation
Xiaoran Zhang ⋅ Byung-Woo Hong ⋅ Hyoungseob Park ⋅ Daniel Pak ⋅ Anne-Marie Rickmann ⋅ Lawrence Staib ⋅ James Duncan ⋅ Alex Wong
|
Exhibit Hall I #239 | |
|
Revisiting Efficient Semantic Segmentation: Learning Offsets for Better Spatial and Class Feature Alignment
Poster Session 5 & Exhibit Hall
Shi-Chen Zhang ⋅ Yunheng Li ⋅ Yu-Huan Wu ⋅ Qibin Hou ⋅ Ming-Ming Cheng
|
Exhibit Hall I #241 | |
|
SignRep: Enhancing Self-Supervised Sign Representations
Poster Session 5 & Exhibit Hall
Ryan Wong ⋅ Necati Cihan Camgoz ⋅ Richard Bowden
|
Exhibit Hall I #282 | |
|
GUIOdyssey: A Comprehensive Dataset for Cross-App GUI Navigation on Mobile Devices
Poster Session 5 & Exhibit Hall
Quanfeng Lu ⋅ Wenqi Shao ⋅ Zitao Liu ⋅ Lingxiao Du ⋅ Fanqing Meng ⋅ Boxuan Li ⋅ Botong Chen ⋅ Siyuan Huang ⋅ Kaipeng Zhang ⋅ Ping Luo
|
Exhibit Hall I #245 | |
|
Learning Beyond Still Frames: Scaling Vision-Language Models with Video
Poster Session 5 & Exhibit Hall
Yiyuan Zhang ⋅ Handong Li ⋅ Jing Liu ⋅ Xiangyu Yue
|
Exhibit Hall I #247 | |
|
Is CLIP ideal? No. Can we fix it? Yes!
Poster Session 5 & Exhibit Hall
Raphaela Kang ⋅ Yue Song ⋅ Georgia Gkioxari ⋅ Pietro Perona
|
Exhibit Hall I #248 | |
|
HQ-CLIP: Leveraging Large Vision-Language Models to Create High-Quality Image-Text Datasets and CLIP Models
Poster Session 5 & Exhibit Hall
ZHIXIANG WEI ⋅ Guangting Wang ⋅ Xiaoxiao Ma ⋅ Ke Mei ⋅ Fengyun Rao ⋅ Huaian Chen ⋅ Yi Jin
|
Exhibit Hall I #249 | |
|
Dynamic Dictionary Learning for Remote Sensing Image Segmentation
Poster Session 5 & Exhibit Hall
Xuechao Zou ⋅ Yue Li ⋅ Shun Zhang ⋅ Kai Li ⋅ Shiying Wang ⋅ Pin Tao ⋅ Junliang Xing ⋅ congyan lang
|
Exhibit Hall I #250 | |
|
Temporal-aware Query Routing for Real-time Video Instance Segmentation
Poster Session 5 & Exhibit Hall
Zesen Cheng ⋅ Kehan Li ⋅ Yian Zhao ⋅ Hang Zhang ⋅ Chang Liu ⋅ Jie Chen
|
Exhibit Hall I #251 | |
|
Free-MoRef: Instantly Multiplexing Context Perception Capabilities of Video-MLLMs within Single Inference
Poster Session 5 & Exhibit Hall
KUO WANG ⋅ Quanlong Zheng ⋅ Junlin Xie ⋅ Yanhao Zhang ⋅ Jinguo Luo ⋅ Haonan Lu ⋅ Liang Lin ⋅ Fan Zhou ⋅ Guanbin Li
|
Exhibit Hall I #254 | |
|
Towards Fine-grained Interactive Segmentation in Images and Videos
Poster Session 5 & Exhibit Hall
Yuan Yao ⋅ Qiushi Yang ⋅ Miaomiao Cui ⋅ Liefeng Bo
|
Exhibit Hall I #255 | |
|
Learnable Retrieval Enhanced Visual-Text Alignment and Fusion for Radiology Report Generation
Poster Session 5 & Exhibit Hall
Qin Zhou ⋅ Guoyan Liang ⋅ Xindi Li ⋅ Jingyuan CHEN ⋅ Zhe Wang ⋅ Chang Yao ⋅ Sai Wu
|
Exhibit Hall I #257 | |
|
Generalizable Object Re-Identification via Visual In-Context Prompting
Poster Session 5 & Exhibit Hall
Zhizhong Huang ⋅ Xiaoming Liu
|
Exhibit Hall I #258 | |
|
TAB: Transformer Attention Bottlenecks enable User Intervention and Debugging in Vision-Language Models
Poster Session 5 & Exhibit Hall
Pooyan Rahmanzadehgervi ⋅ Hung Nguyen ⋅ Rosanne Liu ⋅ Long Mai ⋅ Anh Nguyen
|
Exhibit Hall I #259 | |
|
Anomaly Detection of Integrated Circuits Package Substrates Using the Large Vision Model SAIC: Dataset Construction, Methodology, and Application
Poster Session 5 & Exhibit Hall
Ruiyun Yu ⋅ Bingyang Guo ⋅ Haoyuan Li
|
Exhibit Hall I #260 | |
|
Streaming VideoLLMs for Real-Time Procedural Video Understanding
Poster Session 5 & Exhibit Hall
Dibyadip Chatterjee ⋅ Edoardo Remelli ⋅ Yale Song ⋅ Bugra Tekin ⋅ Abhay Mittal ⋅ Bharat Bhatnagar ⋅ Necati Cihan Camgoz ⋅ Shreyas Hampali ⋅ Eric Sauser ⋅ Shugao Ma ⋅ Angela Yao ⋅ Fadime Sener
|
Exhibit Hall I #262 | |
|
Prompt-driven Transferable Adversarial Attack on Person Re-Identification with Attribute-aware Textual Inversion
Poster Session 5 & Exhibit Hall
Yuan Bian ⋅ Min Liu ⋅ Yunqi Yi ⋅ Xueping Wang ⋅ Shuai Jiang ⋅ Yaonan Wang
|
Exhibit Hall I #263 | |
|
FrameFusion: Combining Similarity and Importance for Video Token Reduction on Large Vision Language Models
Poster Session 5 & Exhibit Hall
Tianyu Fu ⋅ Tengxuan Liu ⋅ Qinghao Han ⋅ Guohao Dai ⋅ Shengen Yan ⋅ Huazhong Yang ⋅ Xuefei Ning ⋅ Yu Wang
|
Exhibit Hall I #268 | |
|
Aligning Effective Tokens with Video Anomaly in Large Language Models
Poster Session 5 & Exhibit Hall
YINGXIAN Chen ⋅ Jiahui Liu ⋅ Ruidi Fan ⋅ Yanwei Li ⋅ Chirui CHANG ⋅ Shizhen Zhao ⋅ Wilton.W.T. Fok ⋅ Xiaojuan Qi ⋅ Yik WU
|
Exhibit Hall I #272 | |
|
No More Sibling Rivalry: Debiasing Human-Object Interaction Detection
Poster Session 5 & Exhibit Hall
Bin Yang ⋅ Yulin Zhang ⋅ Hong-Yu Zhou ⋅ Sibei Yang
|
Exhibit Hall I #273 | |
|
Borrowing Eyes for the Blind Spot: Overcoming Data Scarcity in Malicious Video Detection via Cross-Domain Retrieval Augmentation
Poster Session 5 & Exhibit Hall
Rongpei Hong ⋅ Jian Lang ⋅ Ting Zhong ⋅ Fan Zhou
|
Exhibit Hall I #275 | |
|
DASH: Detection and Assessment of Systematic Hallucinations of VLMs
Poster Session 5 & Exhibit Hall
Maximilian Augustin ⋅ Yannic Neuhaus ⋅ Matthias Hein
|
Exhibit Hall I #277 | |
|
Sim-DETR: Unlock DETR for Temporal Sentence Grounding
Poster Session 5 & Exhibit Hall
Jiajin Tang ⋅ Zhengxuan Wei ⋅ Yuchen Zhu ⋅ Cheng Shi ⋅ Guanbin Li ⋅ Liang Lin ⋅ Sibei Yang
|
Exhibit Hall I #278 | |
|
ViCTr: Vital Consistency Transfer for Pathology Aware Image Synthesis
Poster Session 5 & Exhibit Hall
Onkar Susladkar ⋅ Gayatri Deshmukh ⋅ Yalcin Tur ⋅ Gorkem Durak ⋅ Ulas Bagci
|
Exhibit Hall I #279 | |
|
DIH-CLIP: Unleashing the Diversity of Multi-Head Self-Attention for Training-Free Open-Vocabulary Semantic Segmentation
Poster Session 5 & Exhibit Hall
Songsong Duan ⋅ Xi Yang ⋅ Nannan Wang
|
Exhibit Hall I #281 | |
|
Plug-in Feedback Self-adaptive Attention in CLIP for Training-free Open-Vocabulary Segmentation
Poster Session 5 & Exhibit Hall
Zhixiang Chi ⋅ Yanan Wu ⋅ Li Gu ⋅ Huan Liu ⋅ Ziqiang Wang ⋅ Yang Zhang ⋅ Yang Wang ⋅ Konstantinos Plataniotis
|
Exhibit Hall I #283 | |
|
Feather the Throttle: Revisiting Visual Token Pruning for Vision-Language Model Acceleration
Poster Session 5 & Exhibit Hall
Mark Endo ⋅ Xiaohan Wang ⋅ Serena Yeung-Levy
|
Exhibit Hall I #284 | |
|
Intermediate Connectors and Geometric Priors for Language-Guided Affordance Segmentation on Unseen Object Categories
Poster Session 5 & Exhibit Hall
Yicong Li ⋅ Yiyang Chen ⋅ Zhenyuan Ma ⋅ Junbin Xiao ⋅ Xiang Wang ⋅ Angela Yao
|
Exhibit Hall I #285 | |
|
LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models
Poster Session 5 & Exhibit Hall
Yuzhang Shang ⋅ Mu Cai ⋅ Bingxin Xu ⋅ Yong Jae Lee ⋅ Yan Yan
|
Exhibit Hall I #287 | |
|
AURELIA: Test-time Reasoning Distillation in Audio-Visual LLMs
Poster Session 5 & Exhibit Hall
Sanjoy Chowdhury ⋅ Hanan Gani ⋅ Nishit Anand ⋅ Sayan Nag ⋅ Ruohan Gao ⋅ Mohamed Elhoseiny ⋅ Salman Khan ⋅ Dinesh Manocha
|
Exhibit Hall I #291 | |
|
HERMES: temporal-coHERent long-forM understanding with Episodes and Semantics
Poster Session 5 & Exhibit Hall
Gueter Josmy Faure ⋅ Jia-Fong Yeh ⋅ Min-Hung Chen ⋅ Hung-Ting Su ⋅ Shang-Hong Lai ⋅ Winston Hsu
|
Exhibit Hall I #292 | |
|
Cycle Consistency as Reward: Learning Image-Text Alignment without Human Preferences
Poster Session 5 & Exhibit Hall
Hyojin Bahng ⋅ Caroline Chan ⋅ Fredo Durand ⋅ Phillip Isola
|
Exhibit Hall I #294 | |
|
Griffon v2: Advancing Multimodal Perception with High-Resolution Scaling and Visual-Language Co-Referring
Poster Session 5 & Exhibit Hall
Yufei Zhan ⋅ Shurong Zheng ⋅ Yousong Zhu ⋅ Hongyin Zhao ⋅ Fan Yang ⋅ Ming Tang ⋅ Jinqiao Wang
|
Exhibit Hall I #295 | |
|
LVBench: An Extreme Long Video Understanding Benchmark
Weihan Wang ⋅ zehai he ⋅ Wenyi Hong ⋅ Yean Cheng ⋅ Xiaohan Zhang ⋅ Ji Qi ⋅ Ming Ding ⋅ Xiaotao Gu ⋅ Shiyu Huang ⋅ Bin Xu ⋅ Yuxiao Dong ⋅ Jie Tang
|
Exhibit Hall I #296 | |
|
Debiasing Trace Guidance: Top-down Trace Distillation and Bottom-up Velocity Alignment for Unsupervised Anomaly Detection
Xingjian Wang ⋅ Li Chai ⋅ Jiming Chen
|
#299 | |
|
Beyond [cls]: Exploring the True Potential of Masked Image Modeling Representations
Poster Session 5 & Exhibit Hall
Marcin Przewięźlikowski ⋅ Randall Balestriero ⋅ Wojciech Jasiński ⋅ Marek Śmieja ⋅ Bartosz Zieliński
|
Exhibit Hall I #343 | |
|
MultiADS: Defect-aware Supervision for Multi-type Anomaly Detection and Segmentation in Zero-Shot Learning
Poster Session 5 & Exhibit Hall
Ylli Sadikaj ⋅ Hongkuan Zhou ⋅ Lavdim Halilaj ⋅ Stefan Schmid ⋅ Steffen Staab ⋅ Claudia Plant
|
Exhibit Hall I #298 | |
|
ODDR: Outlier Detection & Dimension Reduction Based Defense Against Adversarial Patches
Poster Session 5 & Exhibit Hall
Nandish Chattopadhyay ⋅ Amira Guesmi ⋅ Muhammad Abdullah Hanif ⋅ Bassem ouni ⋅ Muhammad Shafique
|
Exhibit Hall I #300 | |
|
Similarity Memory Prior is All You Need for Medical Image Segmentation
Hao Tang ⋅ Zhiqing Guo ⋅ Liejun Wang ⋅ Chao Liu
|
Exhibit Hall I #301 | |
|
CalliReader: Contextualizing Chinese Calligraphy via an Embedding-Aligned Vision-Language Model
Poster Session 5 & Exhibit Hall
Yuxuan Luo ⋅ Jiaqi Tang ⋅ Chenyi Huang ⋅ Feiyang Hao ⋅ Zhouhui Lian
|
Exhibit Hall I #305 | |
|
Bringing RNNs Back to Efficient Open-Ended Video Understanding
Poster Session 5 & Exhibit Hall
Weili Xu ⋅ Enxin Song ⋅ Wenhao Chai ⋅ Xuexiang Wen ⋅ Tian Ye ⋅ Gaoang Wang
|
Exhibit Hall I #344 | |
|
Boosting Vision Semantic Density with Anatomy Normality Modeling for Medical Vision-language Pre-training
Poster Session 5 & Exhibit Hall
Weiwei Cao ⋅ Jianpeng Zhang ⋅ Zhongyi Shui ⋅ Sinuo Wang ⋅ Zeli Chen ⋅ Xi Li ⋅ Le Lu ⋅ Xianghua Ye ⋅ Qi Zhang ⋅ Tingbo Liang ⋅ Ling Zhang
|
Exhibit Hall I #306 | |
|
Accelerate 3D Object Detection Models via Zero-Shot Attention Key Pruning
Poster Session 5 & Exhibit Hall
Lizhen Xu ⋅ Xiuxiu Bai ⋅ Xiaojun Jia ⋅ Jianwu Fang ⋅ Shanmin Pang
|
Exhibit Hall I #310 | |
|
SparseMM: Head Sparsity Emerges from Visual Concept Responses in MLLMs
Poster Session 5 & Exhibit Hall
Jiahui Wang ⋅ Zuyan Liu ⋅ Yongming Rao ⋅ Jiwen Lu
|
Exhibit Hall I #319 | |
|
ReferEverything: Towards Segmenting Everything We Can Speak of in Videos
Poster Session 5 & Exhibit Hall
Anurag Bagchi ⋅ Zhipeng Bao ⋅ Yu-Xiong Wang ⋅ Pavel Tokmakov ⋅ Martial Hebert
|
Exhibit Hall I #323 | |
|
Continual Multiple Instance Learning with Enhanced Localization for Histopathological Whole Slide Image Analysis
Poster Session 5 & Exhibit Hall
Byung Hyun Lee ⋅ Wongi Jeong ⋅ Woojae Han ⋅ KYOUNGBUN LEE ⋅ Se Young Chun
|
Exhibit Hall I #324 | |
|
From Trial to Triumph: Advancing Long Video Understanding via Visual Context Sample Scaling and Self-reward Alignment
Poster Session 5 & Exhibit Hall
Yucheng Suo ⋅ Fan Ma ⋅ Linchao Zhu ⋅ Tianyi Wang ⋅ Fengyun Rao ⋅ Yi Yang
|
Exhibit Hall I #325 | |
|
Cross-Architecture Distillation Made Simple with Redundancy Suppression
Weijia Zhang ⋅ Yuehao Liu ⋅ Wu Ran ⋅ Chao Ma
|
Exhibit Hall I #326 | |
|
DC-TTA: Divide-and-Conquer Framework for Test-Time Adaptation of Interactive Segmentation
Poster Session 5 & Exhibit Hall
Jihun Kim ⋅ Hoyong Kwon ⋅ Hyeokjun Kweon ⋅ Wooseong Jeong ⋅ Kuk-Jin Yoon
|
Exhibit Hall I #328 | |
|
FIND: Few-Shot Anomaly Inspection with Normal-Only Multi-Modal Data
Poster Session 5 & Exhibit Hall
YITING LI ⋅ Fayao Liu ⋅ Jingyi Liao ⋅ Sichao Tian ⋅ Chuan-Sheng Foo ⋅ Xulei Yang
|
Exhibit Hall I #329 | |
|
VISO: Accelerating In-orbit Object Detection with Language-Guided Mask Learning and Sparse Inference
Poster Session 5 & Exhibit Hall
Meiqi Wang ⋅ Han Qiu
|
Exhibit Hall I #330 | |
|
Unsupervised Histopathological Image Semantic Segmentation with Overlapping Patches Consistency Constraint
Poster Session 5 & Exhibit Hall
Wentian Cai ⋅ Weizhao Weng ⋅ Zihao Huang ⋅ Yandan Chen ⋅ Siquan Huang ⋅ Ping Gao ⋅ Victor Leung ⋅ Ying Gao
|
Exhibit Hall I #333 | |
|
How Do Optical Flow and Textual Prompts Collaborate to Assist in Audio-Visual Semantic Segmentation?
Poster Session 5 & Exhibit Hall
Yujian Lee ⋅ Peng Gao ⋅ Yongqi Xu ⋅ Wentao Fan
|
Exhibit Hall I #334 | |
|
UINavBench: A Framework for Comprehensive Evaluation of Interactive Digital Agents
Poster Session 5 & Exhibit Hall
Harsh Agrawal ⋅ Eldon Schoop ⋅ Xinlei Pan ⋅ Ari Seff ⋅ Anuj Mahajan ⋅ Di Feng ⋅ Ruijia Cheng ⋅ Andres Romero Mier y Teran ⋅ Esteban Gomez ⋅ Abhishek Sundararajan ⋅ Forrest Huang ⋅ Amanda Swearngin ⋅ Mohana Moorthy ⋅ Jeffrey Nichols ⋅ Alexander Toshev
|
Exhibit Hall I #335 | |
|
VIPerson: Flexibly Generating Virtual Identity for Person Re-Identification
Poster Session 5 & Exhibit Hall
Xiao-Wen Zhang ⋅ Delong Zhang ⋅ Yi-Xing Peng ⋅ Zhi Ouyang ⋅ Jingke Meng ⋅ Wei-Shi Zheng
|
Exhibit Hall I #337 | |
|
Towards Robustness of Person Search against Corruptions
Poster Session 5 & Exhibit Hall
Woojung Son ⋅ Yoonki Cho ⋅ Guoyuan An ⋅ Chanmi Lee ⋅ Sung-eui Yoon
|
Exhibit Hall I #340 | |
|
Flow-MIL: Constructing Highly-expressive Latent Feature Space For Whole Slide Image Classification Using Normalizing Flow
Poster Session 5 & Exhibit Hall
Yingfan MA ⋅ Bohan An ⋅ Ao Shen ⋅ Mingzhi Yuan ⋅ Minghong Duan ⋅ Manning Wang
|
Exhibit Hall I #354 | |
|
HarmonySeg: Tubular Structure Segmentation with Deep-Shallow Feature Fusion and Growth-Suppression Balanced Loss
Poster Session 5 & Exhibit Hall
Ke Zhang ⋅ Yi Huang ⋅ Wei Liu ⋅ Yuanyuan Wang ⋅ Vishal Patel ⋅ Le Lu ⋅ Xu Han ⋅ Dakai Jin ⋅ Ke Yan
|
Exhibit Hall I #355 | |
|
CompCap: Improving Multimodal Large Language Models with Composite Captions
Poster Session 5 & Exhibit Hall
Xiaohui Chen ⋅ Satya Narayan Shukla ⋅ Mahmoud Azab ⋅ Aashu Singh ⋅ Qifan Wang ⋅ David Yang ⋅ ShengYun Peng ⋅ Hanchao Yu ⋅ Shen Yan ⋅ Xuewen Zhang ⋅ Baosheng He
|
Exhibit Hall I #356 | |
|
Stable Diffusion Models are Secretly Good at Visual In-Context Learning
Poster Session 5 & Exhibit Hall
Trevine Oorloff ⋅ Vishwanath Sindagi ⋅ Wele Gedara Chaminda Bandara ⋅ Ali Shafahi ⋅ Amin Ghiasi ⋅ Charan Prakash ⋅ Reza Ardekani
|
Exhibit Hall I #358 | |
|
Prior2Former - Evidential Modeling of Mask Transformers for Assumption-Free Open-World Panoptic Segmentation
Sebastian Schmidt ⋅ Julius Koerner ⋅ Dominik Fuchsgruber ⋅ Stefano Gasperini ⋅ Federico Tombari ⋅ Stephan Günnemann
|
Exhibit Hall I #362 | |
|
Seeing the Unseen: A Semantic Alignment and Context-Aware Prompt Framework for Open-Vocabulary Camouflaged Object Segmentation
Poster Session 5 & Exhibit Hall
Peng Ren ⋅ Tian Bai ⋅ Jing Sun ⋅ Fuming Sun
|
Exhibit Hall I #363 | |
|
ViLLa: Video Reasoning Segmentation with Large Language Model
Poster Session 5 & Exhibit Hall
rongkun Zheng ⋅ Lu Qi ⋅ Xi Chen ⋅ Yi Wang ⋅ Kun Wang ⋅ Hengshuang Zhao
|
Exhibit Hall I #364 | |
|
DynImg: Key Frames with Visual Prompts are Good Representation for Multi-Modal Video Understanding
Poster Session 5 & Exhibit Hall
Xiaoyi Bao ⋅ Chen-Wei Xie ⋅ Hao Tang ⋅ Tingyu Weng ⋅ Xiaofeng Wang ⋅ Yun Zheng ⋅ Xingang Wang
|
Exhibit Hall I #365 | |
|
Object-level Correlation for Few-Shot Segmentation
Poster Session 5 & Exhibit Hall
chunlin wen ⋅ Yu Zhang ⋅ Jie Fan ⋅ Hongyuan Zhu ⋅ Xiu-Shen Wei ⋅ Yijun Wang ⋅ Zhiqiang Kou ⋅ Shuzhou Sun
|
Exhibit Hall I #366 | |
|
Vision-Language Neural Graph Featurization for Extracting Retinal Lesions
Poster Session 5 & Exhibit Hall
Taimur Hassan ⋅ Anabia Sohail ⋅ Muzammal Naseer ⋅ Naoufel Werghi
|
Exhibit Hall I #367 | |
|
SSVQ: Unleashing the Potential of Vector Quantization with Sign-Splitting
Poster Session 5 & Exhibit Hall
Shuaiting Li ⋅ Juncan Deng ⋅ Chengxuan Wang ⋅ Kedong Xu ⋅ Rongtao Deng ⋅ Hong Gu ⋅ Haibin Shen ⋅ Kejie Huang
|
Exhibit Hall I #368 | |
|
RadGPT: Constructing 3D Image-Text Tumor Datasets
Poster Session 5 & Exhibit Hall
Pedro Bassi ⋅ Mehmet Yavuz ⋅ Ibrahim Ethem Hamamci ⋅ Sezgin Er ⋅ Xiaoxi Chen ⋅ Wenxuan Li ⋅ Bjoern Menze ⋅ Sergio Decherchi ⋅ Andrea Cavalli ⋅ Kang Wang ⋅ Yang Yang ⋅ Alan Yuille ⋅ Zongwei Zhou
|
Exhibit Hall I #369 | |
|
LawDIS: Language-Window-based Controllable Dichotomous Image Segmentation
Poster Session 5 & Exhibit Hall
Xinyu Yan ⋅ Meijun Sun ⋅ Ge-Peng Ji ⋅ Fahad Khan ⋅ Salman Khan ⋅ Deng-Ping Fan
|
Exhibit Hall I #385 | |
|
VideoMiner: Iteratively Grounding Key Frames of Hour-Long Videos via Tree-based Group Relative Policy Optimization
Poster Session 5 & Exhibit Hall
Xinye Cao ⋅ Hongcan Guo ⋅ Jiawen Qian ⋅ Guoshun Nan ⋅ Chao Wang ⋅ Yuqi Pan ⋅ Tianhao Hou ⋅ Xiaojuan Wang ⋅ Yutong Gao
|
Exhibit Hall I #374 | |
|
Flow4Agent: Long-form Video Understanding via Motion Prior from Optical Flow
Poster Session 5 & Exhibit Hall
Ruyang Liu ⋅ Shangkun Sun ⋅ Haoran Tang ⋅ Wei Gao ⋅ Ge Li
|
Exhibit Hall I #380 | |
|
An OpenMind for 3D Medical Vision Self-supervised Learning
Poster Session 5 & Exhibit Hall
Tassilo Wald ⋅ Constantin Ulrich ⋅ Jonathan Suprijadi ⋅ Sebastian Ziegler ⋅ Michal Nohel ⋅ Robin Peretzke ⋅ Gregor Koehler ⋅ Klaus Maier-Hein
|
Exhibit Hall I #382 | |
|
OmniSAM: Omnidirectional Segment Anything Model for UDA in Panoramic Semantic Segmentation
Ding Zhong ⋅ Xu Zheng ⋅ Chenfei Liao ⋅ Yuanhuiyi Lyu ⋅ Jialei Chen ⋅ Shengyang Wu ⋅ Linfeng Zhang ⋅ Xuming Hu
|
Exhibit Hall I #384 | |
|
ModalTune: Fine-Tuning Slide-Level Foundation Models with Multi-Modal Information for Multi-task Learning in Digital Pathology
Poster Session 5 & Exhibit Hall
Vishwesh Ramanathan ⋅ Tony Xu ⋅ Pushpak Pati ⋅ Faruk Ahmed ⋅ Maged Goubran ⋅ Anne Martel
|
Exhibit Hall I #386 | |
|
VFlowOpt: A Token Pruning Framework for LMMs with Visual Information Flow-Guided Optimization
Poster Session 5 & Exhibit Hall
Sihan Yang ⋅ Runsen Xu ⋅ Chenhang Cui ⋅ Tai Wang ⋅ Dahua Lin ⋅ Jiangmiao Pang
|
Exhibit Hall I #387 | |
|
Open-Vocabulary HOI Detection with Interaction-aware Prompt and Concept Calibration
Poster Session 5 & Exhibit Hall
Ting Lei ⋅ Shaofeng Yin ⋅ Qingchao Chen ⋅ Yuxin Peng ⋅ Yang Liu
|
Exhibit Hall I #389 | |
|
MINERVA: Evaluating Complex Video Reasoning
Poster Session 5 & Exhibit Hall
Arsha Nagrani ⋅ Sachit Menon ⋅ Ahmet Iscen ⋅ Shyamal Buch ⋅ Nilpa Jha ⋅ Ramin Mehran ⋅ Anja Hauth ⋅ Mikhail Sirotenko ⋅ Yukun Zhu ⋅ Carl Vondrick ⋅ Cordelia Schmid ⋅ Tobias Weyand
|
Exhibit Hall I #391 | |
|
Multi-Granular Spatio-Temporal Token Merging for Training-Free Acceleration of Video LLMs
Poster Session 5 & Exhibit Hall
Jeongseok Hyun ⋅ Sukjun Hwang ⋅ Su Ho Han ⋅ Taeoh Kim ⋅ Inwoong Lee ⋅ Dongyoon Wee ⋅ Joon-Young Lee ⋅ Seon Joo Kim ⋅ Minho Shim
|
Exhibit Hall I #393 | |
|
Scaling Tumor Segmentation: Best Lessons from Real and Synthetic Data
Poster Session 5 & Exhibit Hall
Qi Chen ⋅ Xinze Zhou ⋅ Chen Liu ⋅ Hao Chen ⋅ Wenxuan Li ⋅ Zekun Jiang ⋅ Ziyan Huang ⋅ Yuxuan Zhao ⋅ Dexin Yu ⋅ Junjun He ⋅ Yefeng Zheng ⋅ Ling Shao ⋅ Alan Yuille ⋅ Zongwei Zhou
|
Exhibit Hall I #394 | |
|
TAViS: Text-bridged Audio-Visual Segmentation with Foundation Models
Poster Session 5 & Exhibit Hall
Ziyang Luo ⋅ Nian Liu ⋅ Xuguang Yang ⋅ Salman Khan ⋅ Rao Anwer ⋅ Hisham Cholakkal ⋅ Fahad Khan ⋅ Junwei Han
|
Exhibit Hall I #395 | |
|
Teaching AI the Anatomy Behind the Scan: Addressing Anatomical Flaws in Medical Image Segmentation with Learnable Prior
Poster Session 5 & Exhibit Hall
Young Seok Jeon ⋅ Hongfei Yang ⋅ Huazhu Fu ⋅ Young Seok Jeon
|
Exhibit Hall I #396 | |
|
LIRA: Inferring Segmentation in Large Multi-modal Models with Local Interleaved Region Assistance
Poster Session 5 & Exhibit Hall
Zhang Li ⋅ Biao Yang ⋅ Qiang Liu ⋅ Shuo Zhang ⋅ Zhiyin Ma ⋅ Liang Yin ⋅ Deng Linger ⋅ Yabo Sun ⋅ Yuliang Liu ⋅ Xiang Bai
|
Exhibit Hall I #399 | |
|
SimMLM: A Simple Framework for Multi-modal Learning with Missing Modality
Poster Session 5 & Exhibit Hall
Sijie Li ⋅ Chen Chen ⋅ Jungong Han
|
Exhibit Hall I #400 | |
|
NAVER: A Neuro-Symbolic Compositional Automaton for Visual Grounding with Explicit Logic Reasoning
Poster Session 5 & Exhibit Hall
Zhixi Cai ⋅ Fucai Ke ⋅ Simindokht Jahangard ⋅ Maria Garcia de la Banda ⋅ Gholamreza Haffari ⋅ Peter Stuckey ⋅ Hamid Rezatofighi
|
Exhibit Hall I #401 | |
|
MDP3: A Training-free Approach for List-wise Frame Selection in Video-LLMs
Poster Session 5 & Exhibit Hall
Hui Sun ⋅ Shiyin Lu ⋅ Huanyu Wang ⋅ Qing-Guo Chen ⋅ Zhao Xu ⋅ Weihua Luo ⋅ Kaifu Zhang ⋅ Ming Li
|
Exhibit Hall I #402 | |
|
Emulating Self-attention with Convolution for Efficient Image Super-Resolution
Dongheon Lee ⋅ Seokju Yun ⋅ Youngmin Ro
|
Exhibit Hall I #437 | |
|
Token-Efficient VLM: High-Resolution Image Understanding via Dynamic Region Proposal
Yitong Jiang ⋅ Jinwei Gu ⋅ Tianfan Xue ⋅ Ka Chun Cheung ⋅ Pavlo Molchanov ⋅ Hongxu Yin ⋅ Sifei Liu
|
Exhibit Hall I #407 | |
|
Vision-Language Models Can't See the Obvious
Poster Session 5 & Exhibit Hall
YASSER ABDELAZIZ DAHOU DJILALI ⋅ Ngoc Huynh ⋅ Phúc Lê Khắc ⋅ Wamiq Para ⋅ Ankit Singh ⋅ Sanath Narayan
|
Exhibit Hall I #408 | |
|
Region-aware Anchoring Mechanism for Efficient Referring Visual Grounding
Poster Session 5 & Exhibit Hall
Shuyi Ouyang ⋅ Ziwei Niu ⋅ Hongyi Wang ⋅ Yen-wei Chen ⋅ Lanfen Lin
|
Exhibit Hall I #411 | |
|
VTimeCoT: Thinking by Drawing for Video Temporal Grounding and Reasoning
Poster Session 5 & Exhibit Hall
Jinglei Zhang ⋅ Yuanfan Guo ⋅ Rolandos Alexandros Potamias ⋅ Jiankang Deng ⋅ Hang Xu ⋅ Chao Ma
|
Exhibit Hall I #412 | |
|
Kaputt: A Large-Scale Dataset for Visual Defect Detection
Poster Session 5 & Exhibit Hall
Sebastian Höfer ⋅ Dorian Henning ⋅ Artemij Amiranashvili ⋅ Douglas Morrison ⋅ Mariliza Tzes ⋅ Ingmar Posner ⋅ Marc Matvienko ⋅ Alessandro Rennola ⋅ Anton Milan
|
Exhibit Hall I #414 | |
|
ChatReID: Open-ended Interactive Person Retrieval via Hierarchical Progressive Tuning for Vision Language Models
Poster Session 5 & Exhibit Hall
Ke Niu ⋅ Haiyang Yu ⋅ Mengyang Zhao ⋅ Teng Fu ⋅ Siyang Yi ⋅ Wei Lu ⋅ Bin Li ⋅ Xuelin Qian ⋅ Xiangyang Xue
|
Exhibit Hall I #416 | |
|
Auto-Vocabulary Semantic Segmentation
Poster Session 5 & Exhibit Hall
Osman Ülger ⋅ Maksymilian Kulicki ⋅ Yuki Asano ⋅ Martin R. Oswald
|
Exhibit Hall I #418 | |
|
Enrich and Detect: Video Temporal Grounding with Multimodal LLMs
Shraman Pramanick ⋅ Effrosyni Mavroudi ⋅ Yale Song ⋅ Rama Chellappa ⋅ Lorenzo Torresani ⋅ Triantafyllos Afouras
|
Exhibit Hall I #421 | |
|
Player-Centric Multimodal Prompt Generation for Large Language Model Based Identity-Aware Basketball Video Captioning
Poster Session 5 & Exhibit Hall
Zeyu Xi ⋅ Haoying Sun ⋅ Yaofei Wu ⋅ Junchi Yan ⋅ Haoran Zhang ⋅ Lifang Wu ⋅ Liang Wang ⋅ Chang Wen Chen
|
Exhibit Hall I #424 | |
|
Synchronizing Task Behavior: Aligning Multiple Tasks during Test-Time Training
Poster Session 5 & Exhibit Hall
Wooseong Jeong ⋅ Jegyeong Cho ⋅ Youngho Yoon ⋅ Kuk-Jin Yoon
|
Exhibit Hall I #425 | |
|
Conditional Latent Diffusion Models for Zero-Shot Instance Segmentation
Poster Session 5 & Exhibit Hall
Maximilian Ulmer ⋅ Wout Boerdijk ⋅ Rudolph Triebel ⋅ Maximilian Durner
|
Exhibit Hall I #427 | |
|
GenHancer: Imperfect Generative Models are Secretly Strong Vision-Centric Enhancers
Poster Session 5 & Exhibit Hall
Shijie Ma ⋅ Yuying Ge ⋅ Teng Wang ⋅ Yuxin Guo ⋅ Yixiao Ge ⋅ Ying Shan
|
Exhibit Hall I #431 | |
|
Breaking Grid Constraints: Dynamic Graph Reconstruction Network for Multi-organ Segmentation
Poster Session 5 & Exhibit Hall
Junhao Xiao ⋅ Yang Wei ⋅ Jingyu Wang ⋅ Yongchao Wang ⋅ Xiuli Bi ⋅ Bin Xiao
|
Exhibit Hall I #432 | |
|
MaskSAM: Auto-prompt SAM with Mask Classification for Volumetric Medical Image Segmentation
Poster Session 5 & Exhibit Hall
Bin Xie ⋅ Hao Tang ⋅ Bin Duan ⋅ Dawen Cai ⋅ Yan Yan ⋅ Gady Agam
|
Exhibit Hall I #433 | |
|
Large-scale Pre-training for Grounded Video Caption Generation
Poster Session 5 & Exhibit Hall
Evangelos Kazakos ⋅ Cordelia Schmid ⋅ Josef Sivic
|
Exhibit Hall I #434 | |
|
MEH: A Multi-Style Dataset and Toolkit for Advancing Egyptian Hieroglyph Recognition
Poster Session 5 & Exhibit Hall
Maksim Golyadkin ⋅ Rubanova Alexandrovna ⋅ Aleksandr Utkov ⋅ Dmitry Nikolotov ⋅ Ilya Makarov
|
Exhibit Hall I #439 | |
|
Hybrid-Tower: Fine-grained Pseudo-query Interaction and Generation for Text-to-Video Retrieval
Poster Session 5 & Exhibit Hall
Bangxiang Lan ⋅ Ruobing Xie ⋅ Ruixiang Zhao ⋅ Xingwu Sun ⋅ Zhanhui Kang ⋅ Gang Yang ⋅ Xirong Li
|
Exhibit Hall I #440 | |
|
Unbiased Missing-modality Multimodal Learning
Poster Session 5 & Exhibit Hall
Ruiting Dai ⋅ Chenxi Li ⋅ Yandong Yan ⋅ Lisi Mo ⋅ Ke Qin ⋅ Tao He
|
Exhibit Hall I #441 | |
|
ViM-VQ: Efficient Post-Training Vector Quantization for Visual Mamba
Poster Session 5 & Exhibit Hall
Juncan Deng ⋅ Shuaiting Li ⋅ Zeyu Wang ⋅ Kedong Xu ⋅ Hong Gu ⋅ Kejie Huang
|
Exhibit Hall I #442 | |
|
Axis-level Symmetry Detection with Group-Equivariant Representation
Poster Session 6 & Exhibit Hall with Coffee Break
Wongyun Yu ⋅ Ahyun Seo ⋅ Minsu Cho
|
Exhibit Hall I #7 | |
|
B-VLLM: A Vision Large Language Model with Balanced Spatio-Temporal Tokens
Poster Session 5 & Exhibit Hall
Zhuqiang Lu ⋅ Zhenfei Yin ⋅ Mengwei He ⋅ Zhihui Wang ⋅ Zicheng Liu ⋅ Zhiyong Wang ⋅ Kun Hu
|
Exhibit Hall I #445 | |
|
DiffTell: A High-Quality Dataset for Describing Image Manipulation Changes
Poster Session 5 & Exhibit Hall
Zonglin Di ⋅ Jing Shi ⋅ Yifei Fan ⋅ Hao Tan ⋅ Alexander Black ⋅ John Collomosse ⋅ Yang Liu
|
Exhibit Hall I #448 | |
|
YOLOE: Real-Time Seeing Anything
Poster Session 5 & Exhibit Hall
Ao Wang ⋅ Lihao Liu ⋅ Hui Chen ⋅ Zijia Lin ⋅ Jungong Han ⋅ Guiguang Ding
|
Exhibit Hall I #449 | |
|
Mixture-of-Scores: Robust Image-Text Data Valuation via Three Lines of Code
Poster Session 5 & Exhibit Hall
WU Sitong ⋅ Haoru Tan ⋅ Yukang Chen ⋅ Shaofeng Zhang ⋅ Jingyao Li ⋅ Bei Yu ⋅ Xiaojuan Qi ⋅ Jiaya Jia
|
Exhibit Hall I #450 | |
|
Benchmarking Burst Super-Resolution for Polarization Images: Noise Dataset and Analysis
Poster Session 6 & Exhibit Hall with Coffee Break
Inseung Hwang ⋅ Kiseok Choi ⋅ Hyunho Ha ⋅ Min H. Kim
|
Exhibit Hall I #17 | |
|
X2-Gaussian: 4D Radiative Gaussian Splatting for Continuous-time Tomographic Reconstruction
Poster Session 6 & Exhibit Hall with Coffee Break
Weihao Yu ⋅ Yuanhao Cai ⋅ Ruyi Zha ⋅ Zhiwen Fan ⋅ Chenxin Li ⋅ Yixuan Yuan
|
Exhibit Hall I #1 | |
|
HyperGCT: A Dynamic Hyper-GNN-Learned Geometric Constraint for 3D Registration
Poster Session 6 & Exhibit Hall with Coffee Break
Xiyu Zhang ⋅ Jiayi Ma ⋅ Jianwei Guo ⋅ Wei Hu ⋅ Zhaoshuai Qi ⋅ Fei HUI ⋅ Jiaqi Yang ⋅ Yanning Zhang
|
Exhibit Hall I #3 | |
|
AD-GS: Object-Aware B-Spline Gaussian Splatting for Self-Supervised Autonomous Driving
Poster Session 6 & Exhibit Hall with Coffee Break
Jiawei Xu ⋅ Kai Deng ⋅ Zexin Fan ⋅ Shenlong Wang ⋅ Jin Xie ⋅ jian Yang
|
Exhibit Hall I #5 | |
|
EvaGaussians: Event Stream Assisted Gaussian Splatting from Blurry Images
Poster Session 6 & Exhibit Hall with Coffee Break
Wangbo Yu ⋅ Chaoran Feng ⋅ Jianing Li ⋅ Jiye Tang ⋅ Jiashu Yang ⋅ Zhenyu Tang ⋅ Meng Cao ⋅ Xu Jia ⋅ Yuchao Yang ⋅ Li Yuan ⋅ Yonghong Tian
|
Exhibit Hall I #6 | |
|
All in One: Visual-Description-Guided Unified Point Cloud Segmentation
Poster Session 6 & Exhibit Hall with Coffee Break
Zongyan Han ⋅ Mohamed El Amine Boudjoghra ⋅ Jiahua Dong ⋅ Jinhong Wang ⋅ Rao Anwer
|
Exhibit Hall I #11 | |
|
Bolt3D: Generating 3D Scenes in Seconds
Poster Session 6 & Exhibit Hall with Coffee Break
Stanislaw Szymanowicz ⋅ Jason Y. Zhang ⋅ Pratul Srinivasan ⋅ Ruiqi Gao ⋅ Arthur Brussee ⋅ Aleksander Holynski ⋅ Ricardo Martin Brualla ⋅ Jonathan Barron ⋅ Philipp Henzler
|
Exhibit Hall I #12 | |
|
Semantic Causality-Aware Vision-Based 3D Occupancy Prediction
Poster Session 6 & Exhibit Hall with Coffee Break
Dubing Chen ⋅ Huan Zheng ⋅ Yucheng Zhou ⋅ Xianfei Li ⋅ Wenlong Liao ⋅ Tao He ⋅ Pai Peng ⋅ Jianbing Shen
|
Exhibit Hall I #15 | |
|
U-ViLAR: Uncertainty-Aware Visual Localization for Autonomous Driving via Differentiable Association and Registration
Poster Session 6 & Exhibit Hall with Coffee Break
Xiaofan Li ⋅ Zhihao Xu ⋅ Chenming Wu ⋅ Zhao Yang ⋅ Yumeng Zhang ⋅ Jiang-Jiang Liu ⋅ Haibao Yu ⋅ Xiaoqing Ye ⋅ YuAn Wang ⋅ Shirui Li ⋅ Xun Sun ⋅ Ji Wan ⋅ Jun Wang
|
Exhibit Hall I #16 | |
|
Large Scene Generation with Cube-Absorb Discrete Diffusion
Poster Session 6 & Exhibit Hall with Coffee Break
Qianjiang Hu ⋅ Wei Hu
|
Exhibit Hall I #44 | |
|
Group Inertial Poser: Multi-Person Pose and Global Translation from Sparse Inertial Sensors and Ultra-Wideband Ranging
Poster Session 6 & Exhibit Hall with Coffee Break
Ying Xue ⋅ Jiaxi Jiang ⋅ Rayan Armani ⋅ Dominik Hollidt ⋅ Yi-Chi Liao ⋅ Christian Holz
|
Exhibit Hall I #18 | |
|
RESCUE: Crowd Evacuation Simulation via Controlling SDM-United Characters
Xiaolin Liu ⋅ Tianyi zhou ⋅ Hongbo Kang ⋅ Jian Ma ⋅ Ziwen Wang ⋅ Jing Huang ⋅ Wenguo Weng ⋅ Yu-Kun Lai ⋅ Kun Li
|
Exhibit Hall I #22 | |
|
SG-LDM: Semantic-Guided LiDAR Generation via Latent-Aligned Diffusion
Poster Session 6 & Exhibit Hall with Coffee Break
Zhengkang Xiang ⋅ Zizhao Li ⋅ Amir Khodabandeh ⋅ Kourosh Khoshelham
|
Exhibit Hall I #23 | |
|
LookOut: Real-World Humanoid Egocentric Navigation
Poster Session 6 & Exhibit Hall with Coffee Break
Boxiao Pan ⋅ Adam Harley ⋅ Francis Engelmann ⋅ Karen Liu ⋅ Leonidas Guibas
|
Exhibit Hall I #24 | |
|
Occupancy Learning with Spatiotemporal Memory
Poster Session 6 & Exhibit Hall with Coffee Break
Ziyang Leng ⋅ Jiawei Yang ⋅ Wenlong Yi ⋅ Bolei Zhou
|
Exhibit Hall I #179 | |
|
PointGAC: Geometric-Aware Codebook for Masked Point Modeling
Poster Session 6 & Exhibit Hall with Coffee Break
Abiao Li ⋅ Chenlei Lv ⋅ Guofeng Mei ⋅ Yifan Zuo ⋅ Jian Zhang ⋅ Yuming Fang
|
Exhibit Hall I #25 | |
|
Statistical Confidence Rescoring for Robust 3D Scene Graph Generation from Multi-View Images
Poster Session 6 & Exhibit Hall with Coffee Break
Qi Xun Yeo ⋅ Yanyan Li ⋅ Gim Hee Lee
|
Exhibit Hall I #26 | |
|
PRM: Photometric Stereo based Large Reconstruction Model
Wenhang Ge ⋅ Jiantao Lin ⋅ Guibao SHEN ⋅ Jiawei Feng ⋅ Tao Hu ⋅ Xinli Xu ⋅ Ying-Cong Chen
|
Exhibit Hall I #27 | |
|
4D Gaussian Splatting SLAM
Poster Session 6 & Exhibit Hall with Coffee Break
Yanyan Li ⋅ Youxu Fang ⋅ Zunjie Zhu ⋅ Kunyi Li ⋅ Yong Ding ⋅ Federico Tombari
|
Exhibit Hall I #28 | |
|
Generalizable Non-Line-of-Sight Imaging with Learnable Physical Priors
Poster Session 6 & Exhibit Hall with Coffee Break
Shida Sun ⋅ Yue Li ⋅ Yueyi Zhang ⋅ Zhiwei Xiong
|
Exhibit Hall I #30 | |
|
Hi3DGen: High-fidelity 3D Geometry Generation from Images via Normal Bridging
Poster Session 6 & Exhibit Hall with Coffee Break
Chongjie Ye ⋅ Yushuang Wu ⋅ Ziteng Lu ⋅ Jiahao Chang ⋅ Xiaoyang Guo ⋅ Jiaqing Zhou ⋅ Hao Zhao ⋅ Xiaoguang Han
|
Exhibit Hall I #31 | |
|
SuperMat: Physically Consistent PBR Material Estimation at Interactive Rates
Poster Session 6 & Exhibit Hall with Coffee Break
Yijia Hong ⋅ Yuan-Chen Guo ⋅ Ran Yi ⋅ Yulong Chen ⋅ Yanpei Cao ⋅ Lizhuang Ma
|
Exhibit Hall I #34 | |
|
RI3D: Few-Shot Gaussian Splatting With Repair and Inpainting Diffusion Priors
Poster Session 6 & Exhibit Hall with Coffee Break
Avinash Paliwal ⋅ xilong zhou ⋅ Wei Ye ⋅ Jinhui Xiong ⋅ Rakesh Ranjan ⋅ Nima Kalantari
|
Exhibit Hall I #35 | |
|
Dual-S3D: Hierarchical Dual-Path Selective SSM-CNN for High-Fidelity Implicit Reconstruction
Poster Session 6 & Exhibit Hall with Coffee Break
Luoxi Zhang ⋅ Pragyan Shrestha ⋅ Yu Zhou ⋅ Chun Xie ⋅ Itaru Kitahara
|
Exhibit Hall I #36 | |
|
AlignDiff: Learning Physically-Grounded Camera Alignment via Diffusion
Poster Session 6 & Exhibit Hall with Coffee Break
Liuyue Xie ⋅ Jiancong Guo ⋅ Ozan Cakmakci ⋅ Andre Araujo ⋅ Laszlo A. A. Jeni ⋅ zhiheng jia
|
Exhibit Hall I #210 | |
|
FastPoint: Accelerating 3D Point Cloud Model Inference via Sample Point Distance Prediction
Poster Session 6 & Exhibit Hall with Coffee Break
Donghyun Lee ⋅ Dawoon Jeong ⋅ Jae W. Lee ⋅ Hongil Yoon
|
Exhibit Hall I #37 | |
|
RobuSTereo: Robust Zero-Shot Stereo Matching under Adverse Weather
Poster Session 6 & Exhibit Hall with Coffee Break
Yuran Wang ⋅ Yingping Liang ⋅ Yutao Hu ⋅ Ying Fu
|
Exhibit Hall I #39 | |
|
Gaussian Splatting with Discretized SDF for Relightable Assets
Poster Session 6 & Exhibit Hall with Coffee Break
Zuo-Liang Zhu ⋅ jian Yang ⋅ Beibei Wang
|
Exhibit Hall I #41 | |
|
MMGeo: Multimodal Compositional Geo-Localization for UAVs
Poster Session 6 & Exhibit Hall with Coffee Break
Yuxiang Ji ⋅ Boyong He ⋅ Zhuoyue Tan ⋅ Liaoni Wu
|
Exhibit Hall I #42 | |
|
AdaptiveAE: An Adaptive Exposure Strategy for HDR Capturing in Dynamic Scenes
Poster Session 6 & Exhibit Hall with Coffee Break
Tianyi Xu ⋅ Fan Zhang ⋅ Boxin Shi ⋅ Tianfan Xue ⋅ Yujin Wang
|
Exhibit Hall I #43 | |
|
SynAD: Enhancing Real-World End-to-End Autonomous Driving Models through Synthetic Data Integration
Poster Session 6 & Exhibit Hall with Coffee Break
Jongsuk Kim ⋅ Jae Young Lee ⋅ Gyojin Han ⋅ Dong-Jae Lee ⋅ Minki Jeong ⋅ Junmo Kim
|
Exhibit Hall I #45 | |
|
Benchmarking Egocentric Visual-Inertial SLAM at City Scale
Anusha Krishnan ⋅ Shaohui Liu ⋅ Paul-Edouard Sarlin ⋅ Oscar Gentilhomme ⋅ David Caruso ⋅ Maurizio Monge ⋅ Richard Newcombe ⋅ Jakob Engel ⋅ Marc Pollefeys
|
Exhibit Hall I #46 | |
|
Neural Shell Texture Splatting: More Details and Fewer Primitives
Poster Session 6 & Exhibit Hall with Coffee Break
Xin Zhang ⋅ Anpei Chen ⋅ Jincheng Xiong ⋅ Pinxuan Dai ⋅ Yujun Shen ⋅ Weiwei Xu
|
Exhibit Hall I #48 | |
|
Gaussian-based World Model: Gaussian Priors for Voxel-Based Occupancy Prediction and Future Motion Prediction
Poster Session 6 & Exhibit Hall with Coffee Break
Tuo Feng ⋅ Wenguan Wang ⋅ Yi Yang
|
Exhibit Hall I #49 | |
|
Momentum-GS: Momentum Gaussian Self-Distillation for High-Quality Large Scene Reconstruction
Poster Session 6 & Exhibit Hall with Coffee Break
JIXUAN FAN ⋅ Wanhua Li ⋅ Yifei Han ⋅ Tianru Dai ⋅ Yansong Tang
|
Exhibit Hall I #50 | |
|
JointDiT: Enhancing RGB-Depth Joint Modeling with Diffusion Transformers
Poster Session 6 & Exhibit Hall with Coffee Break
Kwon Byung-Ki ⋅ Qi Dai ⋅ Lee Hyoseok ⋅ Chong Luo ⋅ Tae-Hyun Oh
|
Exhibit Hall I #51 | |
|
A Real-world Display Inverse Rendering Dataset
Poster Session 6 & Exhibit Hall with Coffee Break
Seokjun Choi ⋅ Hoon-Gyu Chung ⋅ Yujin Jeon ⋅ Giljoo Nam ⋅ Seung-Hwan Baek
|
Exhibit Hall I #52 | |
|
RCTDistill: Cross-Modal Knowledge Distillation Framework for Radar-Camera 3D Object Detection with Temporal Fusion
Poster Session 6 & Exhibit Hall with Coffee Break
Geonho Bang ⋅ Minjae Seong ⋅ Jisong Kim ⋅ Geunju Baek ⋅ Daye Oh ⋅ Junhyung Kim ⋅ Junho Koh ⋅ Jun Won Choi
|
Exhibit Hall I #56 | |
|
Federated Domain Generalization with Domain-specific Soft Prompts Generation
Poster Session 1 & Exhibit Hall
Jianhan Wu ⋅ Xiaoyang Qu ⋅ Zhangcheng Huang ⋅ Jianzong Wang
|
Exhibit Hall I #215 | |
|
GCRayDiffusion: Pose-Free Surface Reconstruction via Geometric Consistent Ray Diffusion
Poster Session 6 & Exhibit Hall with Coffee Break
Li-Heng Chen ⋅ Zi-Xin Zou ⋅ Chang Liu ⋅ Tianjiao Jing ⋅ Yanpei Cao ⋅ Shi-Sheng Huang ⋅ Hongbo Fu ⋅ Hua Huang
|
Exhibit Hall I #58 | |
|
GSRecon: Efficient Generalizable Gaussian Splatting for Surface Reconstruction from Sparse Views
Poster Session 6 & Exhibit Hall with Coffee Break
Hang Yang ⋅ Le Hui ⋅ Jianjun Qian ⋅ Jin Xie ⋅ Jian Yang
|
Exhibit Hall I #59 | |
|
REPARO: Compositional 3D Assets Generation with Differentiable 3D Layout Alignment
Poster Session 6 & Exhibit Hall with Coffee Break
Haonan Han ⋅ Rui Yang ⋅ Huan Liao ⋅ Haonan Han ⋅ Zunnan Xu ⋅ Xiaoming Yu ⋅ Junwei Zha ⋅ Xiu Li ⋅ Wanhua Li
|
Exhibit Hall I #61 | |
|
Towards Safer and Understandable Driver Intention Prediction
Poster Session 6 & Exhibit Hall with Coffee Break
Mukilan Karuppasamy ⋅ Shankar Gangisetty ⋅ Shyam Nandan Rai ⋅ Carlo Masone ⋅ C.V. Jawahar
|
Exhibit Hall I #62 | |
|
V2XPnP: Vehicle-to-Everything Spatio-Temporal Fusion for Multi-Agent Perception and Prediction
Poster Session 6 & Exhibit Hall with Coffee Break
Zewei Zhou ⋅ Hao Xiang ⋅ Zhaoliang Zheng ⋅ Zhihao Zhao ⋅ Mingyue Lei ⋅ Yun Zhang ⋅ Tianhui Cai ⋅ Xinyi Liu ⋅ Johnson Liu ⋅ Maheswari Bajji ⋅ Xin Xia ⋅ Zhiyu Huang ⋅ Bolei Zhou ⋅ Jiaqi Ma
|
Exhibit Hall I #64 | |
|
InstaDrive: Instance-Aware Driving World Models for Realistic and Consistent Video Generation
Poster Session 6 & Exhibit Hall with Coffee Break
Zhuoran Yang ⋅ Xi Guo ⋅ Chenjing Ding ⋅ Chiyu Wang ⋅ Wei Wu ⋅ Yanyong Zhang
|
Exhibit Hall I #65 | |
|
NormalLoc: Visual Localization on Textureless 3D Models using Surface Normals
Poster Session 6 & Exhibit Hall with Coffee Break
Jiro Abe ⋅ Gaku Nakano ⋅ Kazumine Ogura
|
Exhibit Hall I #66 | |
|
EmbodiedSplat: Personalized Real-to-Sim-to-Real Navigation with Gaussian Splats from a Mobile Device
Poster Session 6 & Exhibit Hall with Coffee Break
Gunjan Chhablani ⋅ Xiaomeng Ye ⋅ Muhammad Zubair Irshad ⋅ Zsolt Kira
|
Exhibit Hall I #67 | |
|
FreeSplatter: Pose-free Gaussian Splatting for Sparse-view 3D Reconstruction
Poster Session 6 & Exhibit Hall with Coffee Break
Jiale Xu ⋅ Shenghua Gao ⋅ Ying Shan
|
Exhibit Hall I #68 | |
|
NGD: Neural Gradient Based Deformation for Monocular Garment Reconstruction
Poster Session 6 & Exhibit Hall with Coffee Break
Soham Dasgupta ⋅ Shanthika Naik ⋅ Preet Savalia ⋅ Sujay Kumar Ingle ⋅ Avinash Sharma
|
Exhibit Hall I #72 | |
|
Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations
Poster Session 6 & Exhibit Hall with Coffee Break
Xiang Xu ⋅ Lingdong Kong ⋅ Song Wang ⋅ Chuanwei Zhou ⋅ Qingshan Liu
|
Exhibit Hall I #76 | |
|
Lifting the Structural Morphing for Wide-Angle Images Rectification: Unified Content and Boundary Modeling
Poster Session 6 & Exhibit Hall with Coffee Break
Wenting Luan ⋅ Siqi Lu ⋅ Yongbin Zheng ⋅ Wanying XU ⋅ Lang Nie ⋅ Zongtan Zhou ⋅ Kang Liao
|
Exhibit Hall I #78 | |
|
Global Regulation and Excitation via Attention Tuning for Stereo Matching
Poster Session 6 & Exhibit Hall with Coffee Break
Jiahao LI ⋅ Xinhong Chen ⋅ Zhengmin JIANG ⋅ Qian Zhou ⋅ Yung-Hui Li ⋅ Jianping Wang
|
Exhibit Hall I #79 | |
|
UniOcc: A Unified Benchmark for Occupancy Forecasting and Prediction in Autonomous Driving
Poster Session 6 & Exhibit Hall with Coffee Break
Yuping Wang ⋅ Xiangyu Huang ⋅ Xiaokang Sun ⋅ Mingxuan Yan ⋅ Shuo Xing ⋅ Zhengzhong Tu ⋅ Jiachen Li
|
Exhibit Hall I #81 | |
|
Free4D: Tuning-free 4D Scene Generation with Spatial-Temporal Consistency
Poster Session 6 & Exhibit Hall with Coffee Break
Tianqi Liu ⋅ Zihao Huang ⋅ Zhaoxi Chen ⋅ Guangcong Wang ⋅ Shoukang Hu ⋅ Liao Shen ⋅ Huiqiang Sun ⋅ Zhiguo Cao ⋅ Wei Li ⋅ Ziwei Liu
|
Exhibit Hall I #82 | |
|
HiP-AD: Hierarchical and Multi-Granularity Planning with Deformable Attention for Autonomous Driving in a Single Decoder
Poster Session 6 & Exhibit Hall with Coffee Break
Yingqi Tang ⋅ Zhuoran Xu ⋅ Zhaotie Meng ⋅ Erkang Cheng
|
Exhibit Hall I #85 | |
|
RayletDF: Raylet Distance Fields for Generalizable 3D Surface Reconstruction from Point Clouds or Gaussians
Shenxing Wei ⋅ Jinxi Li ⋅ Yafei YANG ⋅ Siyuan Zhou ⋅ Bo Yang
|
Exhibit Hall I #86 | |
|
Semantic-guided Camera Ray Regression for Visual Localization
Poster Session 6 & Exhibit Hall with Coffee Break
Yesheng Zhang ⋅ Xu Zhao
|
Exhibit Hall I #88 | |
|
SketchSplat: 3D Edge Reconstruction via Differentiable Multi-view Sketch Splatting
Poster Session 6 & Exhibit Hall with Coffee Break
Haiyang Ying ⋅ Matthias Zwicker
|
Exhibit Hall I #89 | |
|
Polarimetric Neural Field via Unified Complex-Valued Wave Representation
Poster Session 6 & Exhibit Hall with Coffee Break
Chu Zhou ⋅ Yixin Yang ⋅ Junda Liao ⋅ Heng Guo ⋅ Boxin Shi ⋅ Imari Sato
|
Exhibit Hall I #90 | |
|
High-Precision 3D Measurement of Complex Textured Surfaces Using Multiple Filtering Approach
Poster Session 6 & Exhibit Hall with Coffee Break
Yuchong Chen ⋅ Jian Yu ⋅ Shaoyan Gai ⋅ Zeyu Cai ⋅ Feipeng Da
|
Exhibit Hall I #91 | |
|
AutoScape: Geometry-Consistent Long-Horizon Scene Generation
Poster Session 6 & Exhibit Hall with Coffee Break
Jiacheng Chen ⋅ Ziyu Jiang ⋅ Mingfu Liang ⋅ Bingbing Zhuang ⋅ Jong-Chyi Su ⋅ Sparsh Garg ⋅ Ying Wu ⋅ Manmohan Chandraker
|
Exhibit Hall I #94 | |
|
From Gallery to Wrist: Realistic 3D Bracelet Insertion in Videos
Poster Session 6 & Exhibit Hall with Coffee Break
Chenjian Gao ⋅ Lihe Ding ⋅ Rui Han ⋅ Zhanpeng Huang ⋅ Zibin Wang ⋅ Tianfan Xue
|
Exhibit Hall I #95 | |
|
Street Gaussians without 3D Object Tracker
Poster Session 6 & Exhibit Hall with Coffee Break
Ruida Zhang ⋅ Chengxi Li ⋅ Chenyangguang Zhang ⋅ Xingyu Liu ⋅ Haili Yuan ⋅ Yanyan Li ⋅ Xiangyang Ji ⋅ Gim Hee Lee
|
Exhibit Hall I #96 | |
|
HiNeuS: High-fidelity Neural Surface Mitigating Low-texture and Reflective Ambiguity
Yida Wang ⋅ Xueyang Zhang ⋅ Kun Zhan ⋅ Peng Jia ⋅ XianPeng Lang
|
Exhibit Hall I #98 | |
|
RGE-GS: Reward-Guided Expansive Driving Scene Reconstruction via Diffusion Priors
Poster Session 6 & Exhibit Hall with Coffee Break
Sicong Du ⋅ Jiarun Liu ⋅ Qifeng Chen ⋅ Hao-Xiang Chen ⋅ Tai-Jiang Mu ⋅ Sheng Yang
|
Exhibit Hall I #99 | |
|
Scene Coordinate Reconstruction Priors
Poster Session 6 & Exhibit Hall with Coffee Break
Wenjing Bian ⋅ Axel Barroso-Laguna ⋅ Tommaso Cavallari ⋅ Victor Prisacariu ⋅ Eric Brachmann
|
Exhibit Hall I #100 | |
|
Resonance: Learning to Predict Social-Aware Pedestrian Trajectories as Co-Vibrations
Poster Session 6 & Exhibit Hall with Coffee Break
Conghao Wong ⋅ Ziqian Zou ⋅ Beihao Xia
|
Exhibit Hall I #102 | |
|
I2-World: Intra-Inter Tokenization for Efficient Dynamic 4D Scene Forecasting
Poster Session 6 & Exhibit Hall with Coffee Break
Zhimin Liao ⋅ Ping Wei ⋅ Ruijie Zhang ⋅ Shuaijia Chen ⋅ Haoxuan Wang ⋅ Ziyang Ren
|
Exhibit Hall I #104 | |
|
InsideOut: Integrated RGB-Radiative Gaussian Splatting for Comprehensive 3D Object Representation
Poster Session 6 & Exhibit Hall with Coffee Break
Jungmin Lee ⋅ Seonghyuk Hong ⋅ Juyong Lee ⋅ Jaeyoon Lee ⋅ Jongwon Choi
|
Exhibit Hall I #105 | |
|
RIOcc: Efficient Cross-Modal Fusion Transformer with Collaborative Feature Refinement for 3D Semantic Occupancy Prediction
Poster Session 6 & Exhibit Hall with Coffee Break
Baojie Fan ⋅ Xiaotian Li ⋅ Yuhan Zhou ⋅ Yuyu Jiang ⋅ Jiandong Tian ⋅ Huijie Fan
|
Exhibit Hall I #108 | |
|
TeethGenerator: A two-stage framework for paired pre- and post-orthodontic 3D dental data generation
Poster Session 6 & Exhibit Hall with Coffee Break
Changsong Lei ⋅ Yaqian Liang ⋅ Shaofeng Wang ⋅ Jiajia Dai ⋅ Yong-Jin Liu
|
Exhibit Hall I #110 | |
|
Removing Out-of-Focus Reflective Flares via Color Alignment
Poster Session 2 & Exhibit Hall with Coffee Break
Fengbo Lan ⋅ Chang Wen Chen
|
Exhibit Hall I #445 | |
|
Towards Accurate and Efficient 3D Object Detection for Autonomous Driving: A Mixture of Experts Computing System on Edge
Poster Session 6 & Exhibit Hall with Coffee Break
Linshen Liu ⋅ Boyan Su ⋅ Junyue Jiang ⋅ Guanlin Wu ⋅ Cong Guo ⋅ Ceyu Xu ⋅ Hao Frank Yang
|
Exhibit Hall I #113 | |
|
Degradation-Modeled Multipath Diffusion for Tunable Metalens Photography
Jianing Zhang ⋅ Jiayi Zhu ⋅ Feiyu Ji ⋅ Xiaokang Yang ⋅ Xiaoyun Yuan
|
Exhibit Hall I #114 | |
|
GS-Occ3D: Scaling Vision-only Occupancy Reconstruction with Gaussian Splatting
Poster Session 6 & Exhibit Hall with Coffee Break
Baijun Ye ⋅ Minghui Qin ⋅ Saining Zhang ⋅ Moonjun Gong ⋅ Shaoting Zhu ⋅ Hao Zhao ⋅ Hang Zhao
|
Exhibit Hall I #115 | |
|
MetaScope: Optics-Driven Neural Network for Ultra-Micro Metalens Endoscopy
Wuyang Li ⋅ Wentao Pan ⋅ Xiaoyuan Liu ⋅ Zhendong Luo ⋅ Chenxin Li ⋅ Hengyu Liu ⋅ Din Tsai ⋅ Mu Chen ⋅ Yixuan Yuan
|
Exhibit Hall I #116 | |
|
CoLMDriver: LLM-based Negotiation Benefits Cooperative Autonomous Driving
Poster Session 6 & Exhibit Hall with Coffee Break
Changxing Liu ⋅ Genjia Liu ⋅ Zijun Wang ⋅ Jinchang Yang ⋅ Siheng Chen
|
Exhibit Hall I #117 | |
|
Free-running vs Synchronous: Single-Photon Lidar for High-flux 3D Imaging
Poster Session 6 & Exhibit Hall with Coffee Break
Ruangrawee Kitichotkul ⋅ Shashwath Bharadwaj ⋅ Joshua Rapp ⋅ Yanting Ma ⋅ Alexander Mehta ⋅ Vivek Goyal
|
Exhibit Hall I #119 | |
|
Mitigating Geometric Degradation in Fast DownSampling via FastAdapter for Point Cloud Segmentation
Poster Session 6 & Exhibit Hall with Coffee Break
Shuofeng Sun ⋅ Haibin Yan
|
Exhibit Hall I #120 | |
|
Noise2Score3D: Tweedie's Approach for Unsupervised Point Cloud Denoising
Poster Session 6 & Exhibit Hall with Coffee Break
Xiangbin Wei ⋅ Yuanfeng Wang ⋅ Ao XU ⋅ Lingyu Zhu ⋅ Dongyong Sun ⋅ Keren Li ⋅ Yang Li ⋅ Qi Qin
|
Exhibit Hall I #121 | |
|
ClaraVid: A Holistic Scene Reconstruction Benchmark From Aerial Perspective With Delentropy-Based Complexity Profiling
Poster Session 6 & Exhibit Hall with Coffee Break
Radu Beche ⋅ Sergiu Nedevschi
|
Exhibit Hall I #123 | |
|
Discontinuity-aware Normal Integration for Generic Central Camera Models
Francesco Milano ⋅ Manuel Lopez-Antequera ⋅ Naina Dhingra ⋅ Roland Siegwart ⋅ Robert Thiel
|
Exhibit Hall I #124 | |
|
SEHDR: Single-Exposure HDR Novel View Synthesis via 3D Gaussian Bracketing
Poster Session 6 & Exhibit Hall with Coffee Break
Yiyu Li ⋅ Haoyuan Wang ⋅ Ke Xu ⋅ Gerhard Hancke ⋅ Rynson W.H. Lau
|
Exhibit Hall I #126 | |
|
SL2A-INR: Single-Layer Learnable Activation for Implicit Neural Representation
Poster Session 6 & Exhibit Hall with Coffee Break
Reza Rezaeian ⋅ Moein Heidari ⋅ Reza Azad ⋅ Dorit Merhof ⋅ Hamid Soltanian-Zadeh ⋅ Ilker Hacihaliloglu
|
Exhibit Hall I #128 | |
|
TARS: Traffic-Aware Radar Scene Flow Estimation
Poster Session 6 & Exhibit Hall with Coffee Break
Jialong Wu ⋅ Marco Braun ⋅ Dominic Spata ⋅ Matthias Rottmann
|
Exhibit Hall I #129 | |
|
DoppDrive: Doppler-Driven Temporal Aggregation for Improved Radar Object Detection
Poster Session 6 & Exhibit Hall with Coffee Break
Yuval Haitman ⋅ Oded Bialer
|
Exhibit Hall I #130 | |
|
Leaps and Bounds: An Improved Point Cloud Winding Number Formulation for Fast Normal Estimation and Surface Reconstruction
Poster Session 6 & Exhibit Hall with Coffee Break
Chamin Hewa Koneputugodage ⋅ Dylan Campbell ⋅ Stephen Gould
|
Exhibit Hall I #133 | |
|
GVDepth: Zero-Shot Monocular Depth Estimation for Ground Vehicles based on Probabilistic Cue Fusion
Poster Session 6 & Exhibit Hall with Coffee Break
Karlo Koledic ⋅ Luka Petrovic ⋅ Ivan Marković ⋅ Ivan Petrovic
|
Exhibit Hall I #134 | |
|
Harnessing Text-to-Image Diffusion Models for Point Cloud Self-Supervised Learning
Poster Session 6 & Exhibit Hall with Coffee Break
Yiyang Chen ⋅ Shanshan Zhao ⋅ Lunhao Duan ⋅ Changxing Ding ⋅ Dacheng Tao
|
Exhibit Hall I #138 | |
|
OD-RASE: Ontology-Driven Risk Assessment and Safety Enhancement for Autonomous Driving
Poster Session 6 & Exhibit Hall with Coffee Break
Kota Shimomura ⋅ Masaki Nambata ⋅ Atsuya Ishikawa ⋅ Ryota Mimura ⋅ Takayuki Kawabuchi ⋅ Takayoshi Yamashita ⋅ Koki Inoue
|
Exhibit Hall I #139 | |
|
MDP-Omni: Parameter-free Multimodal Depth Prior-based Sampling for Omnidirectional Stereo Matching
Poster Session 6 & Exhibit Hall with Coffee Break
Eunjin Son ⋅ HyungGi Jo ⋅ Wookyong Kwon ⋅ Sang Jun Lee
|
Exhibit Hall I #140 | |
|
DistillDrive: End-to-End Multi-Mode Autonomous Driving Distillation by Isomorphic Hetero-Source Planning Model
Poster Session 6 & Exhibit Hall with Coffee Break
Rui Yu ⋅ Xianghang Zhang ⋅ Runkai Zhao ⋅ Huaicheng Yan ⋅ Meng Wang
|
Exhibit Hall I #141 | |
|
EDM: Efficient Deep Feature Matching
Xi Li ⋅ Tong Rao ⋅ Cihui Pan
|
Exhibit Hall I #142 | |
|
GS-ID: Illumination Decomposition on Gaussian Splatting via Adaptive Light Aggregation and Diffusion-Guided Material Priors
Poster Session 6 & Exhibit Hall with Coffee Break
Kang DU ⋅ Zhihao Liang ⋅ Yulin Shen ⋅ Zeyu Wang
|
Exhibit Hall I #146 | |
|
NeRF Is a Valuable Assistant for 3D Gaussian Splatting
Poster Session 6 & Exhibit Hall with Coffee Break
Shuangkang Fang ⋅ I-Chao Shen ⋅ Takeo Igarashi ⋅ Yufeng Wang ⋅ ZeSheng Wang ⋅ Yi Yang ⋅ Wenrui Ding ⋅ Shuchang Zhou
|
Exhibit Hall I #147 | |
|
UniGS: Modeling Unitary 3D Gaussians for Novel View Synthesis from Sparse-view Images
Poster Session 6 & Exhibit Hall with Coffee Break
Jiamin WU ⋅ Kenkun Liu ⋅ Xiaoke Jiang ⋅ Yuan Yao ⋅ Lei Zhang
|
Exhibit Hall I #148 | |
|
TOTP: Transferable Online Pedestrian Trajectory Prediction with Temporal-Adaptive Mamba Latent Diffusion
Poster Session 6 & Exhibit Hall with Coffee Break
Ziyang Ren ⋅ Ping Wei ⋅ Shangqi Deng ⋅ Haowen Tang ⋅ Jiapeng Li ⋅ Huan Li
|
Exhibit Hall I #150 | |
|
UnMix-NeRF: Spectral Unmixing Meets Neural Radiance Fields
Poster Session 6 & Exhibit Hall with Coffee Break
Fabian Perez ⋅ Sara Rojas Martinez ⋅ Carlos Hinojosa ⋅ Hoover Rueda-Chacón ⋅ Bernard Ghanem
|
Exhibit Hall I #152 | |
|
MaterialMVP: Illumination-Invariant Material Generation via Multi-view PBR Diffusion
Zebin He ⋅ Mx Yang ⋅ Shuhui Yang ⋅ Yixuan Tang ⋅ Tao Wang ⋅ Kaihao Zhang ⋅ Guanying Chen ⋅ Lliu Yuhong ⋅ Jie Jiang ⋅ Chunchao Guo ⋅ Wenhan Luo
|
Exhibit Hall I #153 | |
|
7DGS: Unified Spatial-Temporal-Angular Gaussian Splatting
Poster Session 6 & Exhibit Hall with Coffee Break
Zhongpai Gao ⋅ Benjamin Planche ⋅ Meng Zheng ⋅ Anwesa Choudhuri ⋅ Terrence Chen ⋅ Ziyan Wu
|
Exhibit Hall I #155 | |
|
StochasticSplats: Stochastic Rasterization for Sorting-Free 3D Gaussian Splatting
Poster Session 6 & Exhibit Hall with Coffee Break
Shakiba Kheradmand ⋅ Delio Vicini ⋅ George Kopanas ⋅ Dmitry Lagun ⋅ Kwang Moo Yi ⋅ Mark Matthews ⋅ Andrea Tagliasacchi
|
Exhibit Hall I #156 | |
|
TurboReg: TurboClique for Robust and Efficient Point Cloud Registration
Poster Session 6 & Exhibit Hall with Coffee Break
Shaocheng Yan ⋅ Pengcheng Shi ⋅ Zhenjun Zhao ⋅ Kaixin Wang ⋅ Kuang Cao ⋅ Ji Wu ⋅ Jiayuan Li
|
Exhibit Hall I #160 | |
|
Efficient Spiking Point Mamba for Point Cloud Analysis
Poster Session 6 & Exhibit Hall with Coffee Break
Peixi Wu ⋅ Bosong Chai ⋅ Menghua Zheng ⋅ Wei Li ⋅ Zhangchi Hu ⋅ Jie Chen ⋅ Zheyu Zhang ⋅ Hebei Li ⋅ Xiaoyan Sun
|
Exhibit Hall I #162 | |
|
SpatialSplat: Efficient Semantic 3D from Sparse Unposed Images
Poster Session 6 & Exhibit Hall with Coffee Break
Yu Sheng ⋅ Jiajun Deng ⋅ Xinran Zhang ⋅ Yu Zhang ⋅ Bei Hua ⋅ Yanyong Zhang ⋅ Jianmin Ji
|
Exhibit Hall I #163 | |
|
CoMoGaussian: Continuous Motion-Aware Gaussian Splatting from Motion-Blurred Images
Poster Session 6 & Exhibit Hall with Coffee Break
Jungho Lee ⋅ DongHyeong Kim ⋅ Dogyoon Lee ⋅ Suhwan Cho ⋅ Minhyeok Lee ⋅ Wonjoon Lee ⋅ Taeoh Kim ⋅ Dongyoon Wee ⋅ Sangyoun Lee
|
Exhibit Hall I #164 | |
|
Generalized and Efficient 2D Gaussian Splatting for Arbitrary-scale Super-Resolution
Poster Session 6 & Exhibit Hall with Coffee Break
Du Chen ⋅ Liyi Chen ⋅ Zhengqiang ZHANG ⋅ Lei Zhang
|
Exhibit Hall I #166 | |
|
Visual Surface Wave Elastography: Revealing Subsurface Physical Properties via Visible Surface Waves
Poster Session 6 & Exhibit Hall with Coffee Break
Alexander Ogren ⋅ Berthy Feng ⋅ Jihoon Ahn ⋅ Katherine Bouman ⋅ Chiara Daraio
|
Exhibit Hall I #167 | |
|
GaRe: Relightable 3D Gaussian Splatting for Outdoor Scenes from Unconstrained Photo Collections
Poster Session 6 & Exhibit Hall with Coffee Break
Haiyang Bai ⋅ Jiaqi Zhu ⋅ Songru Jiang ⋅ Wei Huang ⋅ Tao Lu ⋅ Yuanqi Li ⋅ Jie Guo ⋅ Runze Fu ⋅ Yanwen Guo ⋅ Lijun Chen
|
Exhibit Hall I #168 | |
|
PolarAnything: Diffusion-based Polarimetric Image Synthesis
Kailong Zhang ⋅ Youwei Lyu ⋅ Heng Guo ⋅ Si Li ⋅ Zhanyu Ma ⋅ Boxin Shi
|
Exhibit Hall I #169 | |
|
LightCity: An Urban Dataset for Outdoor Inverse Rendering and Reconstruction under Multi-illumination Conditions
Poster Session 6 & Exhibit Hall with Coffee Break
Jingjing Wang ⋅ Qirui Hu ⋅ Chong Bao ⋅ Yuke Zhu ⋅ Hujun Bao ⋅ Zhaopeng Cui ⋅ Guofeng Zhang
|
Exhibit Hall I #170 | |
|
ETA: Efficiency through Thinking Ahead, A Dual Approach to Self-Driving with Large Models
Poster Session 6 & Exhibit Hall with Coffee Break
Shadi Hamdan ⋅ Chonghao Sima ⋅ Zetong Yang ⋅ Hongyang Li ⋅ Fatma Guney
|
Exhibit Hall I #175 | |
|
MergeOcc: Bridge the Domain Gap between Different LiDARs for Robust Occupancy Prediction
Poster Session 6 & Exhibit Hall with Coffee Break
Zikun Xu ⋅ Shaobing Xu
|
Exhibit Hall I #176 | |
|
Feature Extraction and Representation of Pre-training Point Cloud Based on Diffusion Models
Poster Session 6 & Exhibit Hall with Coffee Break
Chang Qiu ⋅ Feipeng Da ⋅ Zilei Zhang
|
Exhibit Hall I #178 | |
|
Towards Open-World Generation of Stereo Images and Unsupervised Matching
Poster Session 6 & Exhibit Hall with Coffee Break
Feng Qiao ⋅ Zhexiao Xiong ⋅ Eric Xing ⋅ Nathan Jacobs
|
Exhibit Hall I #180 | |
|
LoD-Loc v2: Aerial Visual Localization over Low Level-of-Detail City Models using Explicit Silhouette Alignment
Poster Session 6 & Exhibit Hall with Coffee Break
Juelin Zhu ⋅ Shuaibang Peng ⋅ Long Wang ⋅ Hanlin Tan ⋅ Yu Liu ⋅ Maojun Zhang ⋅ Shen Yan
|
Exhibit Hall I #183 | |
|
LANGTRAJ: Diffusion Model and Dataset for Language-Conditioned Trajectory Simulation
Poster Session 6 & Exhibit Hall with Coffee Break
WEI-JER Chang ⋅ Masayoshi Tomizuka ⋅ Wei Zhan ⋅ Manmohan Chandraker ⋅ Francesco Pittaluga
|
Exhibit Hall I #184 | |
|
Temporal Overlapping Prediction: A Self-supervised Pre-training Method for LiDAR Moving Object Segmentation
Poster Session 6 & Exhibit Hall with Coffee Break
Ziliang Miao ⋅ Runjian Chen ⋅ Yixi Cai ⋅ Buwei He ⋅ Wenquan Zhao ⋅ Wenqi Shao ⋅ Bo Zhang ⋅ Fu Zhang
|
Exhibit Hall I #187 | |
|
VideoRFSplat: Direct Scene-Level Text-to-3D Gaussian Splatting Generation with Flexible Pose and Multi-View Joint Modeling
Poster Session 6 & Exhibit Hall with Coffee Break
Hyojun Go ⋅ Byeongjun Park ⋅ Hyelin Nam ⋅ Byung-Hoon Kim ⋅ Hyungjin Chung ⋅ Changick Kim
|
Exhibit Hall I #192 | |
|
S²M²: Scalable Stereo Matching Model for Reliable Depth Estimation
Poster Session 6 & Exhibit Hall with Coffee Break
JUNHONG MIN ⋅ YOUNGPIL JEON ⋅ Jimin Kim ⋅ Minyong Choi
|
Exhibit Hall I #194 | |
|
ACE-G: Improving Generalization of Scene Coordinate Regression Through Query Pre-Training
Poster Session 6 & Exhibit Hall with Coffee Break
Leonard Bruns ⋅ Axel Barroso-Laguna ⋅ Tommaso Cavallari ⋅ Áron Monszpart ⋅ Sowmya Munukutla ⋅ Victor Prisacariu ⋅ Eric Brachmann
|
Exhibit Hall I #196 | |
|
VistaDream: Sampling multiview consistent images for single-view scene reconstruction
Poster Session 6 & Exhibit Hall with Coffee Break
Haiping Wang ⋅ Yuan Liu ⋅ Ziwei Liu ⋅ Wenping Wang ⋅ Zhen Dong ⋅ Bisheng Yang
|
Exhibit Hall I #198 | |
|
Towards Visual Localization Interoperability: Cross-Feature for Collaborative Visual Localization and Mapping
Poster Session 6 & Exhibit Hall with Coffee Break
Alberto Jaenal ⋅ Paula Carbó Cubero ⋅ Jose Araujo ⋅ André Mateus
|
Exhibit Hall I #199 | |
|
MiDSummer: Multi-Guidance Diffusion for Controllable Zero-Shot Immersive Gaussian Splatting Scene Generation
Poster Session 6 & Exhibit Hall with Coffee Break
Anjun Hu ⋅ Richard Tomsett ⋅ Valentin Gourmet ⋅ Massimo Camplani ⋅ Jas Kandola ⋅ Hanting Xie
|
Exhibit Hall I #200 | |
|
Spatio-Spectral Pattern Illumination for Direct and Indirect Separation from a Single Hyperspectral Image
Shin Ishihara ⋅ Imari Sato
|
Exhibit Hall I #203 | |
|
Adversarial Exploitation of Data Diversity Improves Visual Localization
Poster Session 6 & Exhibit Hall with Coffee Break
Sihang Li ⋅ Siqi Tan ⋅ Bowen Chang ⋅ Jing Zhang ⋅ Chen Feng ⋅ Yiming Li
|
Exhibit Hall I #205 | |
|
GeoFormer: Geometry Point Encoder for 3D Object Detection with Graph-based Transformer
Poster Session 6 & Exhibit Hall with Coffee Break
Xin Jin ⋅ Haisheng Su ⋅ Cong Ma ⋅ Kai Liu ⋅ Wei Wu ⋅ Fei HUI ⋅ Junchi Yan
|
Exhibit Hall I #208 | |
|
Tile-wise vs. Image-wise: Random-Tile Loss and Training Paradigm for Gaussian Splatting
Poster Session 6 & Exhibit Hall with Coffee Break
Xiaoyu Zhang ⋅ Weihong Pan ⋅ Xiaojun Xiang ⋅ Hongjia Zhai ⋅ Liyang Zhou ⋅ Hanqing Jiang ⋅ Guofeng Zhang
|
Exhibit Hall I #212 | |
|
DriveArena: A Closed-loop Generative Simulation Platform for Autonomous Driving
Poster Session 6 & Exhibit Hall with Coffee Break
Xuemeng Yang ⋅ Licheng Wen ⋅ Tiantian Wei ⋅ Yukai Ma ⋅ Jianbiao Mei ⋅ Xin Li ⋅ Wenjie Lei ⋅ Daocheng Fu ⋅ Pinlong Cai ⋅ Min Dou ⋅ Liang He ⋅ Yong Liu ⋅ Botian Shi ⋅ Yu Qiao
|
Exhibit Hall I #213 | |
|
Explaining Human Preferences via Metrics for Structured 3D Reconstruction
Jack Langerman ⋅ Denis Rozumny ⋅ Yuzhong Huang ⋅ Dmytro Mishkin
|
Exhibit Hall I #214 | |
|
CoopTrack: Exploring End-to-End Learning for Efficient Cooperative Sequential Perception
Jiaru Zhong ⋅ Jiahao Wang ⋅ Jiahui Xu ⋅ Xiaofan Li ⋅ Zaiqing Nie ⋅ Haibao Yu
|
Exhibit Hall I #215 | |
|
UniVerse: Unleashing the Scene Prior of Video Diffusion Models for Robust Radiance Field Reconstruction
Poster Session 6 & Exhibit Hall with Coffee Break
Jin Cao ⋅ Hongrui Wu ⋅ Ziyong Feng ⋅ Hujun Bao ⋅ Xiaowei Zhou ⋅ Sida Peng
|
Exhibit Hall I #224 | |
|
RoCo-Sim: Enhancing Roadside Collaborative Perception through Foreground Simulation
Poster Session 6 & Exhibit Hall with Coffee Break
Yuwen Du ⋅ Anning Hu ⋅ Zichen Chao ⋅ Yifan Lu ⋅ Junhao Ge ⋅ Genjia Liu ⋅ Wei-Tao Wu ⋅ Lanjun Wang ⋅ Siheng Chen
|
Exhibit Hall I #217 | |
|
Inverse 3D Microscopy Rendering for Cell Shape Inference with Active Mesh
Sacha Ichbiah ⋅ Anshuman Sinha ⋅ Fabrice Delbary ⋅ Hervé Turlier
|
Exhibit Hall I #218 | |
|
GaussRender: Learning 3D Occupancy with Gaussian Rendering
Poster Session 6 & Exhibit Hall with Coffee Break
Loick Chambon ⋅ Eloi Zablocki ⋅ Alexandre Boulch ⋅ Mickael Chen ⋅ Matthieu Cord
|
Exhibit Hall I #222 | |
|
SA-Occ: Satellite-Assisted 3D Occupancy Prediction in Real World
Poster Session 6 & Exhibit Hall with Coffee Break
Chen Chen ⋅ Zhirui Wang ⋅ Taowei Sheng ⋅ Yi Jiang ⋅ Yundu Li ⋅ Peirui Cheng ⋅ Luning Zhang ⋅ Kaiqiang Chen ⋅ Yanfeng Hu ⋅ Xue Yang ⋅ Xian Sun
|
Exhibit Hall I #223 | |
|
UPP: Unified Point-Level Prompting for Robust Point Cloud Analysis
Poster Session 6 & Exhibit Hall with Coffee Break
Zixiang Ai ⋅ Zhenyu Cui ⋅ Yuxin Peng ⋅ Jiahuan Zhou
|
Exhibit Hall I #255 | |
|
ExploreGS: Explorable 3D Scene Reconstruction with Virtual Camera Samplings and Diffusion Priors
Poster Session 6 & Exhibit Hall with Coffee Break
Minsu Kim ⋅ Subin Jeon ⋅ In Cho ⋅ Mijin Yoo ⋅ Seon Joo Kim
|
Exhibit Hall I #225 | |
|
LaneDiffusion: Improving Centerline Graph Learning via Prior Injected BEV Feature Generation
Poster Session 6 & Exhibit Hall with Coffee Break
Zijie Wang ⋅ Weiming Zhang ⋅ Wei Zhang ⋅ Xiao Tan ⋅ hongxing liu ⋅ Yaowei Wang ⋅ Guanbin Li
|
Exhibit Hall I #226 | |
|
Bridging 3D Anomaly Localization and Repair via High-Quality Continuous Geometric Representation
Poster Session 6 & Exhibit Hall with Coffee Break
Bozhong Zheng ⋅ Jinye Gan ⋅ Xiaohao Xu ⋅ Xintao Chen ⋅ Wenqiao Li ⋅ Xiaonan Huang ⋅ Na Ni ⋅ Yingna Wu
|
Exhibit Hall I #227 | |
|
CuMPerLay: Learning Cubical Multiparameter Persistence Vectorizations
Poster Session 6 & Exhibit Hall with Coffee Break
Caner Korkmaz ⋅ Brighton Nuwagira ⋅ Baris Coskunuzer ⋅ Tolga Birdal
|
Exhibit Hall I #229 | |
|
SGAD: Semantic and Geometric-aware Descriptor for Local Feature Matching
Xiangzeng Liu ⋅ CHI WANG ⋅ Guanglu Shi ⋅ Xiaodong Zhang ⋅ Qiguang Miao ⋅ Miao Fan
|
Exhibit Hall I #230 | |
|
End-to-End Driving with Online Trajectory Evaluation via BEV World Model
Poster Session 6 & Exhibit Hall with Coffee Break
Yingyan Li ⋅ Yuqi Wang ⋅ Yang Liu ⋅ Jiawei He ⋅ Lue Fan ⋅ Zhaoxiang Zhang
|
Exhibit Hall I #234 | |
|
Planar Affine Rectification from Local Change of Scale and Orientation
Yuval Nissan ⋅ Marc Pollefeys ⋅ Daniel Barath
|
Exhibit Hall I #235 | |
|
ERNet: Efficient Non-Rigid Registration Network for Point Sequences
Poster Session 6 & Exhibit Hall with Coffee Break
Guangzhao He ⋅ Yuxi Xiao ⋅ Zhen Xu ⋅ Xiaowei Zhou ⋅ Sida Peng
|
Exhibit Hall I #236 | |
|
SeqGrowGraph: Learning Lane Topology as a Chain of Graph Expansions
Poster Session 6 & Exhibit Hall with Coffee Break
Mengwei Xie ⋅ Shuang Zeng ⋅ Xinyuan Chang ⋅ Xinran Liu ⋅ Zheng Pan ⋅ Mu Xu ⋅ Xing Wei
|
Exhibit Hall I #237 | |
|
Doppler-Aware LiDAR-RADAR Fusion for Weather-Robust 3D Detection
Poster Session 6 & Exhibit Hall with Coffee Break
Yujeong Chae ⋅ Heejun Park ⋅ Hyeonseong Kim ⋅ Kuk-Jin Yoon
|
Exhibit Hall I #240 | |
|
Egocentric Action-aware Inertial Localization in Point Clouds with Vision-Language Guidance
Poster Session 6 & Exhibit Hall with Coffee Break
Mingfang Zhang ⋅ Ryo Yonetani ⋅ Yifei Huang ⋅ Liangyang Ouyang ⋅ Ruicong Liu ⋅ Yoichi Sato
|
Exhibit Hall I #241 | |
|
Epona: Autoregressive Diffusion World Model for Autonomous Driving
Poster Session 6 & Exhibit Hall with Coffee Break
Kaiwen Zhang ⋅ Zhenyu Tang ⋅ Xiaotao Hu ⋅ Xingang Pan ⋅ Xiaoyang Guo ⋅ Yuan Liu ⋅ Jingwei Huang ⋅ Li Yuan ⋅ Qian Zhang ⋅ XIAOXIAO LONG ⋅ Xun Cao ⋅ Wei Yin
|
Exhibit Hall I #242 | |
|
Leveraging Local Patch Alignment to Seam-cutting for Large Parallax Image Stitching
Poster Session 6 & Exhibit Hall with Coffee Break
Tianli Liao ⋅ Chenyang Zhao ⋅ Lei Li ⋅ Heling Cao
|
Exhibit Hall I #246 | |
|
InfiniCube: Unbounded and Controllable Dynamic 3D Driving Scene Generation with World-Guided Video Models
Poster Session 6 & Exhibit Hall with Coffee Break
Yifan Lu ⋅ Xuanchi Ren ⋅ Jiawei Yang ⋅ Tianchang Shen ⋅ Jay Zhangjie Wu ⋅ Jun Gao ⋅ Yue Wang ⋅ Siheng Chen ⋅ Mike Chen ⋅ Sanja Fidler ⋅ Jiahui Huang
|
Exhibit Hall I #247 | |
|
SynCity: Training-Free Generation of 3D Cities
Poster Session 6 & Exhibit Hall with Coffee Break
Paul Engstler ⋅ Aleksandar Shtedritski ⋅ Iro Laina ⋅ Christian Rupprecht ⋅ Andrea Vedaldi
|
Exhibit Hall I #276 | |
|
PriorMotion: Generative Class-Agnostic Motion Prediction with Raster-Vector Motion Field Priors
Poster Session 6 & Exhibit Hall with Coffee Break
Kangan Qian ⋅ Jinyu Miao ⋅ Xinyu Jiao ⋅ Ziang Luo ⋅ Zheng Fu ⋅ Yining Shi ⋅ Yunlong Wang ⋅ Kun Jiang ⋅ Diange Yang
|
Exhibit Hall I #248 | |
|
MGSR: 2D/3D Mutual-boosted Gaussian Splatting for High-fidelity Surface Reconstruction under Various Light Conditions
Poster Session 6 & Exhibit Hall with Coffee Break
Qingyuan Zhou ⋅ Yuehu Gong ⋅ Weidong Yang ⋅ Jiaze Li ⋅ Yeqi Luo ⋅ Baixin Xu ⋅ Shuhao Li ⋅ Ben Fei ⋅ Ying He
|
Exhibit Hall I #249 | |
|
ArgMatch: Adaptive Refinement Gathering for Efficient Dense Matching
Poster Session 6 & Exhibit Hall with Coffee Break
Yuxin Deng ⋅ Kaining Zhang ⋅ Linfeng Tang ⋅ Jiaqi Yang ⋅ Jiayi Ma
|
Exhibit Hall I #256 | |
|
RoboTron-Sim: Improving Real-World Driving via Simulated Hard-Case
Poster Session 6 & Exhibit Hall with Coffee Break
Baihui Xiao ⋅ Chengjian Feng ⋅ Zhijian Huang ⋅ Feng yan ⋅ Yujie Zhong ⋅ Lin Ma
|
Exhibit Hall I #257 | |
|
SAFT: Shape and Appearance of Fabrics from Template via Differentiable Physical Simulations from Monocular Video
David Stotko ⋅ Reinhard Klein
|
Exhibit Hall I #283 | |
|
Thermal Polarimetric Multi-view Stereo
Takahiro Kushida ⋅ Kenichiro Tanaka
|
Exhibit Hall I #258 | |
|
StealthAttack: Robust 3D Gaussian Splatting Poisoning via Density-Guided Illusions
Poster Session 6 & Exhibit Hall with Coffee Break
Bo-Hsu Ke ⋅ You-Zhe Xie ⋅ Yu-Lun Liu ⋅ Wei-Chen Chiu
|
Exhibit Hall I #259 | |
|
LongSplat: Robust Unposed 3D Gaussian Splatting for Casual Long Videos
Poster Session 6 & Exhibit Hall with Coffee Break
Chin-Yang Lin ⋅ Cheng Sun ⋅ Fu-En Yang ⋅ Min-Hung Chen ⋅ Yen-Yu Lin ⋅ Yu-Lun Liu
|
Exhibit Hall I #260 | |
|
WonderTurbo: Generating Interactive 3D World in 0.72 Seconds
Poster Session 6 & Exhibit Hall with Coffee Break
Chaojun Ni ⋅ Xiaofeng Wang ⋅ Zheng Zhu ⋅ Weijie Wang ⋅ Haoyun Li ⋅ Guosheng Zhao ⋅ Jie Li ⋅ Wenkang Qin ⋅ Guan Huang ⋅ Wenjun Mei
|
Exhibit Hall I #261 | |
|
SFUOD: Source-Free Unknown Object Detection
Poster Session 1 & Exhibit Hall
Keon-Hee Park ⋅ Seun-An Choe ⋅ Gyeong-Moon Park
|
Exhibit Hall I #325 | |
|
MOSAIC: Generating Consistent, Privacy-Preserving Scenes from Multiple Depth Views in Multi-Room Environments
Poster Session 6 & Exhibit Hall with Coffee Break
Zhixuan Liu ⋅ Haokun Zhu ⋅ Rui Chen ⋅ Jonathan Francis ⋅ Soonmin Hwang ⋅ Ji Zhang ⋅ Jean Oh
|
Exhibit Hall I #264 | |
|
Coordinate-based Speed of Sound Recovery for Aberration-Corrected Photoacoustic Computed Tomography
Poster Session 6 & Exhibit Hall with Coffee Break
Tianao Li ⋅ Manxiu Cui ⋅ Cheng Ma ⋅ Emma Alexander
|
Exhibit Hall I #265 | |
|
GenFlow3D: Generative Scene Flow Estimation and Prediction on Point Cloud Sequences
Poster Session 6 & Exhibit Hall with Coffee Break
Hanlin Li ⋅ Wenming Weng ⋅ Yueyi Zhang ⋅ Zhiwei Xiong
|
Exhibit Hall I #267 | |
|
Generative Gaussian Splatting: Generating 3D Scenes with Video Diffusion Priors
Poster Session 6 & Exhibit Hall with Coffee Break
Katja Schwarz ⋅ Norman Müller ⋅ Peter Kontschieder
|
Exhibit Hall I #269 | |
|
Curve-Aware Gaussian Splatting for 3D Parametric Curve Reconstruction
Poster Session 6 & Exhibit Hall with Coffee Break
Zhirui Gao ⋅ Renjiao Yi ⋅ YaQiao Dai ⋅ Xuening Zhu ⋅ Wei Chen ⋅ Kai Xu ⋅ Chenyang Zhu
|
Exhibit Hall I #271 | |
|
RadarSplat: Radar Gaussian Splatting for High-Fidelity Data Synthesis and 3D Reconstruction of Autonomous Driving Scenes
Poster Session 6 & Exhibit Hall with Coffee Break
Pou-Chun Kung ⋅ Skanda Harisha ⋅ Ram Vasudevan ⋅ Aline Eid ⋅ Katherine A. Skinner
|
Exhibit Hall I #277 | |
|
Tree Skeletonization from 3D Point Clouds by Denoising Diffusion
Poster Session 6 & Exhibit Hall with Coffee Break
Elias Marks ⋅ Lucas Nunes ⋅ Federico Magistri ⋅ Matteo Sodano ⋅ Rodrigo Marcuzzi ⋅ Lars Zimmermann ⋅ Jens Behley ⋅ Cyrill Stachniss
|
Exhibit Hall I #278 | |
|
Splat-LOAM: Gaussian Splatting LiDAR Odometry and Mapping
Poster Session 6 & Exhibit Hall with Coffee Break
Emanuele Giacomini ⋅ Luca Di Giammarino ⋅ Lorenzo De Rebotti ⋅ Giorgio Grisetti ⋅ Martin R. Oswald
|
Exhibit Hall I #280 | |
|
Purge-Gate: Efficient Backpropagation-Free Test-Time Adaptation for Point Clouds via Token purging
Poster Session 6 & Exhibit Hall with Coffee Break
Moslem Yazdanpanah ⋅ Ali Bahri ⋅ Mehrdad Noori ⋅ Sahar Dastani ⋅ Gustavo Vargas Hakim ⋅ David OSOWIECHI ⋅ Ismail Ayed ⋅ Christian Desrosiers
|
Exhibit Hall I #281 | |
|
AAA-Gaussians: Anti-Aliased and Artifact-Free 3D Gaussian Rendering
Michael Steiner ⋅ Thomas Köhler ⋅ Lukas Radl ⋅ Felix Windisch ⋅ Dieter Schmalstieg ⋅ Markus Steinberger
|
Exhibit Hall I #282 | |
|
BridgeDepth: Bridging Monocular and Stereo Reasoning with Latent Alignment
Tongfan Guan ⋅ Jiaxin Guo ⋅ Chen Wang ⋅ Yun-Hui Liu
|
Exhibit Hall I #285 | |
|
Neural Inverse Rendering for High-Accuracy 3D Measurement of Moving Objects with Fewer Phase-Shifting Patterns
Poster Session 6 & Exhibit Hall with Coffee Break
Yuki Urakawa ⋅ Yoshihiro Watanabe
|
Exhibit Hall I #286 | |
|
FlowR: Flowing from Sparse to Dense 3D Reconstructions
Tobias Fischer ⋅ Samuel Rota Bulò ⋅ Yung-Hsu Yang ⋅ Nikhil Keetha ⋅ Lorenzo Porzi ⋅ Norman Müller ⋅ Katja Schwarz ⋅ Jonathon Luiten ⋅ Marc Pollefeys ⋅ Peter Kontschieder
|
Exhibit Hall I #289 | |
|
WorldScore: Unified Evaluation Benchmark for World Generation
Poster Session 6 & Exhibit Hall with Coffee Break
Haoyi Duan ⋅ Hong-Xing Yu ⋅ Sirui Chen ⋅ Li Fei-Fei ⋅ Jiajun Wu
|
Exhibit Hall I #290 | |
|
LightSwitch: Multi-view Relighting with Material-guided Diffusion
Poster Session 6 & Exhibit Hall with Coffee Break
Yehonathan Litman ⋅ Fernando De la Torre ⋅ Shubham Tulsiani
|
Exhibit Hall I #293 | |
|
Decoupled Diffusion Sparks Adaptive Scene Generation
Poster Session 6 & Exhibit Hall with Coffee Break
Yunsong Zhou ⋅ Naisheng Ye ⋅ William Ljungbergh ⋅ Tianyu Li ⋅ Jiazhi Yang ⋅ Zetong Yang ⋅ Hongzi Zhu ⋅ Christoffer Petersson ⋅ Hongyang Li
|
Exhibit Hall I #294 | |
|
Recover Biological Structure from Sparse-View Diffraction Images with Neural Volumetric Prior
Poster Session 6 & Exhibit Hall with Coffee Break
Renzhi He ⋅ Haowen Zhou ⋅ Yubei Chen ⋅ Yi Xue
|
Exhibit Hall I #295 | |
|
HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation
Poster Session 6 & Exhibit Hall with Coffee Break
Xin Zhou ⋅ DINGKANG LIANG ⋅ Sifan Tu ⋅ Xiwu Chen ⋅ Yikang Ding ⋅ Dingyuan Zhang ⋅ Feiyang Tan ⋅ Hengshuang Zhao ⋅ Xiang Bai
|
Exhibit Hall I #299 | |
|
Generative Active Learning for Long-tail Trajectory Prediction via Controllable Diffusion Model
Poster Session 6 & Exhibit Hall with Coffee Break
Daehee Park ⋅ Monu Surana ⋅ Pranav Desai ⋅ Ashish Mehta ⋅ Reuben John ⋅ Kuk-Jin Yoon
|
Exhibit Hall I #301 | |
|
QuickSplat: Fast 3D Surface Reconstruction via Learned Gaussian Initialization
Poster Session 6 & Exhibit Hall with Coffee Break
Yueh-Cheng Liu ⋅ Lukas Höllein ⋅ Matthias Nießner ⋅ Angela Dai
|
Exhibit Hall I #302 | |
|
SP2T: Sparse Proxy Attention for Dual-stream Point Transformer
Poster Session 6 & Exhibit Hall with Coffee Break
Jiaxu Wan ⋅ Hong Zhang ⋅ Ziqi He ⋅ Yangyan Deng ⋅ Qishu Wang ⋅ Ding Yuan ⋅ Yifan Yang
|
Exhibit Hall I #305 | |
|
Instant GaussianImage: A Generalizable and Self-Adaptive Image Representation via 2D Gaussian Splatting
Poster Session 6 & Exhibit Hall with Coffee Break
Zhaojie Zeng ⋅ Yuesong Wang ⋅ Chao Yang ⋅ Tao Guan ⋅ Lili Ju
|
Exhibit Hall I #306 | |
|
CF3: Compact and Fast 3D Feature Fields
Poster Session 6 & Exhibit Hall with Coffee Break
Hyunjoon Lee ⋅ Joonkyu Min ⋅ Jaesik Park
|
Exhibit Hall I #307 | |
|
When Anchors Meet Cold Diffusion: A Multi-Stage Approach to Lane Detection
Poster Session 6 & Exhibit Hall with Coffee Break
Bo-Lun Huang ⋅ Tzu-Hsiang Ni ⋅ Feng-Kai Huang ⋅ Hong-Han Shuai ⋅ Wen-Huang Cheng
|
Exhibit Hall I #308 | |
|
2D Gaussian Splatting-based Sparse-view Transparent Object Depth Reconstruction via Physics Simulation for Scene Update
Poster Session 6 & Exhibit Hall with Coffee Break
Jeongyun Kim ⋅ Seunghoon Jeong ⋅ Giseop Kim ⋅ Myung-Hwan Jeon ⋅ Eunji Jun ⋅ Ayoung Kim
|
Exhibit Hall I #309 | |
|
Faster and Better 3D Splatting via Group Training
Poster Session 6 & Exhibit Hall with Coffee Break
Chengbo Wang ⋅ Guozheng Ma ⋅ Yizhen Lao ⋅ Yifei Xue
|
Exhibit Hall I #313 | |
|
Sat2City: 3D City Generation from A Single Satellite Image with Cascaded Latent Diffusion
Poster Session 6 & Exhibit Hall with Coffee Break
Tongyan Hua ⋅ Lutao Jiang ⋅ Ying-Cong Chen ⋅ Wufan Zhao
|
Exhibit Hall I #314 | |
|
NeuFrameQ: Neural Frame Fields for Scalable and Generalizable Anisotropic Quadrangulation
Ying-Tian Liu ⋅ Jiajun Li ⋅ Yu-Tao Liu ⋅ Xin Yu ⋅ Yuan-Chen Guo ⋅ Yanpei Cao ⋅ Ding Liang ⋅ Ariel Shamir ⋅ Song-Hai Zhang
|
Exhibit Hall I #316 | |
|
RTMap: Real-Time Recursive Mapping with Change Detection and Localization
Poster Session 6 & Exhibit Hall with Coffee Break
Yuheng Du ⋅ Sheng Yang ⋅ Lingxuan Wang ⋅ Zhenghua.Hou Zhenghua.Hou ⋅ Chengying Cai ⋅ Zhitao Tan ⋅ Mingxia Chen ⋅ Shi-Sheng Huang ⋅ Qiang Li
|
Exhibit Hall I #318 | |
|
Controllable 3D Outdoor Scene Generation via Scene Graphs
Poster Session 6 & Exhibit Hall with Coffee Break
Yuheng Liu ⋅ Xinke Li ⋅ Yuning Zhang ⋅ Lu Qi ⋅ Xin Li ⋅ Wenping Wang ⋅ Chongshou Li ⋅ Xueting Li ⋅ Ming-Hsuan Yang
|
Exhibit Hall I #321 | |
|
PolGS: Polarimetric Gaussian Splatting for Fast Reflective Surface Reconstruction
Poster Session 6 & Exhibit Hall with Coffee Break
Yufei Han ⋅ Bowen Tie ⋅ Heng Guo ⋅ Youwei Lyu ⋅ Si Li ⋅ Boxin Shi ⋅ Yunpeng Jia ⋅ Zhanyu Ma
|
Exhibit Hall I #323 | |
|
Driving View Synthesis on Free-form Trajectories with Generative Prior
Poster Session 6 & Exhibit Hall with Coffee Break
Zeyu Yang ⋅ Zijie Pan ⋅ Yuankun Yang ⋅ Xiatian Zhu ⋅ Li Zhang
|
Exhibit Hall I #324 | |
|
Wasserstein Style Distribution Analysis and Transform for Stylized Image Generation
Xi Yu ⋅ Xiang Gu ⋅ Zhihao Shi ⋅ Jian Sun
|
Exhibit Hall I #250 | |
|
Constraint-Aware Feature Learning for Parametric Point Cloud
Poster Session 6 & Exhibit Hall with Coffee Break
Xi Cheng ⋅ Ruiqi Lei ⋅ Di Huang ⋅ Zhichao Liao ⋅ Fengyuan Piao ⋅ Yan Chen ⋅ Pingfa Feng ⋅ Long ZENG
|
Exhibit Hall I #327 | |
|
NeuraLeaf: Neural Parametric Leaf Models with Shape and Deformation Disentanglement
Yang Yang ⋅ Dongni Mao ⋅ Hiroaki Santo ⋅ Yasuyuki Matsushita ⋅ Fumio Okura
|
Exhibit Hall I #332 | |
|
ZeroStereo: Zero-shot Stereo Matching from Single Images
Poster Session 6 & Exhibit Hall with Coffee Break
Xianqi Wang ⋅ Hao Yang ⋅ Gangwei Xu ⋅ Junda Cheng ⋅ Min Lin ⋅ Yong Deng ⋅ Jinliang Zang ⋅ Yurui Chen ⋅ Xin Yang
|
Exhibit Hall I #333 | |
|
CVFusion: Cross-View Fusion of 4D Radar and Camera for 3D Object Detection
Poster Session 6 & Exhibit Hall with Coffee Break
Hanzhi Zhong ⋅ Zhiyu Xiang ⋅ Ruoyu Xu ⋅ Jingyun Fu ⋅ Peng Xu ⋅ Shaohong Wang ⋅ Zhihao Zhihao ⋅ Tianyu Pu ⋅ Eryun Liu
|
Exhibit Hall I #334 | |
|
Stochastic Gradient Estimation for Higher-Order Differentiable Rendering
Zican Wang ⋅ Michael Fischer ⋅ Tobias Ritschel
|
Exhibit Hall I #335 | |
|
CATSplat: Context-Aware Transformer with Spatial Guidance for Generalizable 3D Gaussian Splatting from A Single-View Image
Poster Session 6 & Exhibit Hall with Coffee Break
Wonseok Roh ⋅ Hwanhee Jung ⋅ JongWook Kim ⋅ Seunggwan Lee ⋅ Innfarn Yoo ⋅ Andreas Lugmayr ⋅ Seunggeun Chi ⋅ Karthik Ramani ⋅ Sangpil Kim
|
Exhibit Hall I #338 | |
|
Quadratic Gaussian Splatting: High Quality Surface Reconstruction with Second-order Geometric Primitives
Poster Session 6 & Exhibit Hall with Coffee Break
ziyu zhang ⋅ Binbin Huang ⋅ Hanqing Jiang ⋅ Liyang Zhou ⋅ Xiaojun Xiang ⋅ Shuhan Shen
|
Exhibit Hall I #341 | |
|
Uncertainty-Aware Diffusion-Guided Refinement of 3D Scenes
Poster Session 6 & Exhibit Hall with Coffee Break
Sarosij Bose ⋅ Arindam Dutta ⋅ Sayak Nag ⋅ Junge Zhang ⋅ Jiachen Li ⋅ Konstantinos Karydis ⋅ Amit Roy-Chowdhury
|
Exhibit Hall I #342 | |
|
Foresight in Motion: Reinforcing Trajectory Prediction with Reward Heuristics
Poster Session 6 & Exhibit Hall with Coffee Break
Muleilan Pei ⋅ Shaoshuai Shi ⋅ Xuesong Chen ⋅ Xu Liu ⋅ Shaojie Shen
|
Exhibit Hall I #345 | |
|
MAESTRO: Task-Relevant Optimization via Adaptive Feature Enhancement and Suppression for Multi-task 3D Perception
Poster Session 6 & Exhibit Hall with Coffee Break
ChangWon Kang ⋅ Jisong Kim ⋅ Hongjae Shin ⋅ Junseo Park ⋅ Jun Won Choi
|
Exhibit Hall I #346 | |
|
ToF-Splatting: Dense SLAM using Sparse Time-of-Flight Depth and Multi-Frame Integration
Poster Session 6 & Exhibit Hall with Coffee Break
Andrea Conti ⋅ Matteo Poggi ⋅ Valerio Cambareri ⋅ Martin R. Oswald ⋅ Stefano Mattoccia
|
Exhibit Hall I #349 | |
|
Joint Semantic and Rendering Enhancements in 3D Gaussian Modeling with Anisotropic Local Encoding
Poster Session 6 & Exhibit Hall with Coffee Break
Jingming He ⋅ Chongyi Li ⋅ Shiqi Wang ⋅ Sam Kwong
|
Exhibit Hall I #350 | |
|
Unsupervised Imaging Inverse Problems with Diffusion Distribution Matching
Poster Session 6 & Exhibit Hall with Coffee Break
Giacomo Meanti ⋅ Thomas Ryckeboer ⋅ Michael Arbel ⋅ Julien Mairal
|
Exhibit Hall I #351 | |
|
R-LiViT: A LiDAR-Visual-Thermal Dataset Enabling Vulnerable Road User Focused Roadside Perception
Poster Session 6 & Exhibit Hall with Coffee Break
Jonas Mirlach ⋅ Lei Wan ⋅ Andreas Wiedholz ⋅ Hannan Keen ⋅ Andreas Eich
|
Exhibit Hall I #352 | |
|
V2XScenes: A Multiple Challenging Traffic Conditions Dataset for Large-Range Vehicle-Infrastructure Collaborative Perception
Poster Session 6 & Exhibit Hall with Coffee Break
Bowen Wang ⋅ Yafei Wang ⋅ Wei Gong ⋅ Siheng Chen ⋅ Genjia Liu ⋅ Minhao Xiong ⋅ Chin Long Ng
|
Exhibit Hall I #353 | |
|
Robust 3D Object Detection using Probabilistic Point Clouds from Single-Photon LiDARs
Poster Session 6 & Exhibit Hall with Coffee Break
Bhavya Goyal ⋅ Felipe Gutierrez-Barragan ⋅ Wei Lin ⋅ Andreas Velten ⋅ Yin Li ⋅ Mohit Gupta
|
Exhibit Hall I #358 | |
|
SViM3D: Stable Video Material Diffusion for Single Image 3D Generation
Poster Session 6 & Exhibit Hall with Coffee Break
Andreas Engelhardt ⋅ Mark Boss ⋅ Vikram Voleti ⋅ Chun-Han Yao ⋅ Hendrik Lensch ⋅ Varun Jampani
|
Exhibit Hall I #359 | |
|
HouseCrafter: Lifting Floorplans to 3D Scenes with 2D Diffusion Models
YIWEN CHEN ⋅ Hieu (Hayden) Nguyen ⋅ Vikram Voleti ⋅ Varun Jampani ⋅ Huaizu Jiang
|
Exhibit Hall I #360 | |
|
G2D: Boosting Multimodal Learning with Gradient-Guided Distillation
Poster Session 1 & Exhibit Hall
Mohammed Rakib ⋅ Arunkumar Bagavathi
|
Exhibit Hall I #378 | |
|
Leveraging BEV Paradigm for Ground-to-Aerial Image Synthesis
Poster Session 6 & Exhibit Hall with Coffee Break
Junyan Ye ⋅ Jun He ⋅ Weijia Li ⋅ Zhutao Lv ⋅ Yi Lin ⋅ Jinhua Yu ⋅ Haote Yang ⋅ Conghui He
|
Exhibit Hall I #361 | |
|
EMD: Explicit Motion Modeling for High-Quality Street Gaussian Splatting
Poster Session 6 & Exhibit Hall with Coffee Break
Xiaobao Wei ⋅ Qingpo Wuwu ⋅ Zhongyu Zhao ⋅ Zhuangzhe Wu ⋅ Nan Huang ⋅ Ming Lu ⋅ ningning ma ⋅ Shanghang Zhang
|
Exhibit Hall I #362 | |
|
Perspective-aware 3D Gaussian Inpainting with Multi-view Consistency
Poster Session 6 & Exhibit Hall with Coffee Break
Yuxin CHENG ⋅ Binxiao Huang ⋅ Taiqiang Wu ⋅ Wenyong Zhou ⋅ Chenchen Ding ⋅ Zhengwu Liu ⋅ Graziano Chesi ⋅ Ngai Wong
|
Exhibit Hall I #366 | |
|
SparseRecon: Neural Implicit Surface Reconstruction from Sparse Views with Feature and Depth Consistencies
Poster Session 6 & Exhibit Hall with Coffee Break
Liang Han ⋅ Xu Zhang ⋅ Haichuan Song ⋅ Kanle Shi ⋅ Liang Han ⋅ Zhizhong Han
|
Exhibit Hall I #367 | |
|
SAM4D: Segment Anything in Camera and LiDAR Streams
Poster Session 6 & Exhibit Hall with Coffee Break
Jianyun Xu ⋅ Song Wang ⋅ Ziqian Ni ⋅ Chunyong Hu ⋅ Sheng Yang ⋅ Jianke Zhu ⋅ Qiang Li
|
Exhibit Hall I #369 | |
|
Representing 3D Shapes With 64 Latent Vectors for 3D Diffusion Models
Poster Session 6 & Exhibit Hall with Coffee Break
In Cho ⋅ Youngbeom Yoo ⋅ Subin Jeon ⋅ Seon Joo Kim
|
Exhibit Hall I #371 | |
|
LINR-PCGC: Lossless Implicit Neural Representations for Point Cloud Geometry Compression
Poster Session 6 & Exhibit Hall with Coffee Break
Wenjie Huang ⋅ Qi Yang ⋅ Shuting Xia ⋅ He Huang ⋅ Yiling Xu ⋅ Zhu Li
|
Exhibit Hall I #373 | |
|
Interaction-Merged Motion Planning: Effectively Leveraging Diverse Motion Datasets for Robust Planning
Giwon Lee ⋅ Wooseong Jeong ⋅ Daehee Park ⋅ Jaewoo Jeong ⋅ Kuk-Jin Yoon
|
Exhibit Hall I #376 | |
|
Communication-Efficient Multi-Vehicle Collaborative Semantic Segmentation via Sparse 3D Gaussian Sharing
Poster Session 6 & Exhibit Hall with Coffee Break
Tianyu Hong ⋅ Xiaobo Zhou ⋅ Wenkai Hu ⋅ Qi Xie ⋅ Zhihui Ke ⋅ Tie Qiu
|
Exhibit Hall I #377 | |
|
DATA: Domain-And-Time Alignment for High-Quality Feature Fusion in Collaborative Perception
Poster Session 6 & Exhibit Hall with Coffee Break
Chengchang Tian ⋅ Jianwei Ma ⋅ Yan Huang ⋅ Zhanye Chen ⋅ Honghao Wei ⋅ Hui Zhang ⋅ Wei Hong
|
Exhibit Hall I #379 | |
|
Hi-Gaussian: Hierarchical Gaussians under Normalized Spherical Projection for Single-View 3D Reconstruction
Poster Session 6 & Exhibit Hall with Coffee Break
Binjian Xie ⋅ Pengju Zhang ⋅ Hao Wei ⋅ Yihong Wu
|
Exhibit Hall I #381 | |
|
Towards More Diverse and Challenging Pre-training for Point Cloud Learning: Self-Supervised Cross Reconstruction with Decoupled Views
Poster Session 6 & Exhibit Hall with Coffee Break
Xiangdong Zhang ⋅ Shaofeng Zhang ⋅ Junchi Yan
|
Exhibit Hall I #384 | |
|
A Lesson in Splats: Teacher-Guided Diffusion for 3D Gaussian Splats Generation with 2D Supervision
Poster Session 6 & Exhibit Hall with Coffee Break
Chensheng Peng ⋅ Ido Sobol ⋅ Masayoshi Tomizuka ⋅ Kurt Keutzer ⋅ Chenfeng Xu ⋅ Or Litany
|
Exhibit Hall I #385 | |
|
MMAT-1M: A Large Reasoning Dataset for Multimodal Agent Tuning
Poster Session 1 & Exhibit Hall
Tianhong Gao ⋅ Yannian Fu ⋅ Weiqun Wu ⋅ Haixiao Yue ⋅ Shanshan Liu ⋅ Gang Zhang
|
Exhibit Hall I #131 | |
|
Extrapolated Urban View Synthesis Benchmark
Poster Session 6 & Exhibit Hall with Coffee Break
Xiangyu Han ⋅ Zhen Jia ⋅ Boyi Li ⋅ Yan Wang ⋅ Boris Ivanovic ⋅ Yurong You ⋅ Lingjie Liu ⋅ Yue Wang ⋅ Marco Pavone ⋅ Chen Feng ⋅ Yiming Li
|
Exhibit Hall I #386 | |
|
Heatmap Regression without Soft-Argmax for Facial Landmark Detection
Poster Session 6 & Exhibit Hall with Coffee Break
Chiao-An Yang ⋅ Raymond A. Yeh
|
Exhibit Hall I #387 | |
|
Demeter: A Parametric Model of Crop Plant Morphology from the Real World
Poster Session 6 & Exhibit Hall with Coffee Break
Tianhang Cheng ⋅ Albert Zhai ⋅ Evan Chen ⋅ Rui Zhou ⋅ Yawen Deng ⋅ Zitong Li ⋅ Kejie Zhao ⋅ Janice Shiu ⋅ Qianyu Zhao ⋅ Yide Xu ⋅ Xinlei Wang ⋅ Yuan Shen ⋅ Sheng Wang ⋅ Lisa Ainsworth ⋅ Kaiyu Guan ⋅ Shenlong Wang
|
Exhibit Hall I #388 | |
|
Mixed Signals: A Diverse Point Cloud Dataset for Heterogeneous LiDAR V2X Collaboration
Poster Session 6 & Exhibit Hall with Coffee Break
Katie Luo ⋅ Minh-Quan Dao ⋅ Zhenzhen Liu ⋅ Mark Campbell ⋅ Wei-Lun (Harry) Chao ⋅ Kilian Weinberger ⋅ Ezio Malis ⋅ Vincent FREMONT ⋅ Bharath Hariharan ⋅ Mao Shan ⋅ Stewart Worrall ⋅ Julie Stephany Berrio Perez
|
Exhibit Hall I #390 | |
|
Exploiting Vision Language Model for Training-Free 3D Point Cloud OOD Detection via Graph Score Propagation
Poster Session 6 & Exhibit Hall with Coffee Break
Tiankai Chen ⋅ Yushu Li ⋅ Adam Goodge ⋅ Fei Teng ⋅ Xulei Yang ⋅ Tianrui Li ⋅ Xun Xu
|
Exhibit Hall I #393 | |
|
FROSS: Faster-Than-Real-Time Online 3D Semantic Scene Graph Generation from RGB-D Images
Poster Session 6 & Exhibit Hall with Coffee Break
Hao-Yu Hou ⋅ Chun-Yi Lee ⋅ Motoharu Sonogashira ⋅ Yasutomo Kawanishi
|
Exhibit Hall I #395 | |
|
HUG: Hierarchical Urban Gaussian Splatting with Block-Based Reconstruction for Large-Scale Aerial Scenes
Poster Session 6 & Exhibit Hall with Coffee Break
Mai Su ⋅ Zhongtao Wang ⋅ Huishan Au ⋅ Yilong Li ⋅ Xizhe Cao ⋅ Chengwei Pan ⋅ Yisong Chen ⋅ Guoping Wang
|
Exhibit Hall I #397 | |
|
Unraveling the Effects of Synthetic Data on End-to-End Autonomous Driving
Poster Session 6 & Exhibit Hall with Coffee Break
Junhao Ge ⋅ Zuhong Liu ⋅ Longteng Fan ⋅ Yifan Jiang ⋅ Jiaqi Su ⋅ Yiming Li ⋅ Zhejun Zhang ⋅ Siheng Chen
|
Exhibit Hall I #399 | |
|
BANet: Bilateral Aggregation Network for Mobile Stereo Matching
Poster Session 6 & Exhibit Hall with Coffee Break
Gangwei Xu ⋅ Jiaxin Liu ⋅ Xianqi Wang ⋅ Junda Cheng ⋅ Yong Deng ⋅ Jinliang Zang ⋅ Yurui Chen ⋅ Xin Yang
|
Exhibit Hall I #400 | |
|
Puzzle Similarity: A Perceptually-guided Cross-Reference Metric for Artifact Detection in 3D Scene Reconstructions
Poster Session 6 & Exhibit Hall with Coffee Break
Nicolai Hermann ⋅ Jorge Condor ⋅ Piotr Didyk
|
Exhibit Hall I #401 | |
|
Authentic 4D Driving Simulation with a Video Generation Model
Poster Session 6 & Exhibit Hall with Coffee Break
Lening Wang ⋅ Wenzhao Zheng ⋅ Dalong Du ⋅ Yunpeng Zhang ⋅ Yilong Ren ⋅ Han Jiang ⋅ Zhiyong Cui ⋅ Haiyang Yu ⋅ Jie Zhou ⋅ Shanghang Zhang
|
Exhibit Hall I #402 | |
|
DONUT: A Decoder-Only Model for Trajectory Prediction
Poster Session 6 & Exhibit Hall with Coffee Break
Markus Knoche ⋅ Daan de Geus ⋅ Bastian Leibe
|
Exhibit Hall I #403 | |
|
Lidar Waveforms are Worth 40x128x33 Words
Dominik Scheuble ⋅ Hanno Holzhüter ⋅ Steven Peters ⋅ Mario Bijelic ⋅ Felix Heide
|
Exhibit Hall I #404 | |
|
Spherical Epipolar Rectification for Deep Two-View Absolute Depth Estimation
Poster Session 6 & Exhibit Hall with Coffee Break
Pierre-André Brousseau ⋅ Sébastien Roy
|
Exhibit Hall I #405 | |
|
PanoSplatt3R: Leveraging Perspective Pretraining for Generalized Unposed Wide-Baseline Panorama Reconstruction
Poster Session 6 & Exhibit Hall with Coffee Break
Jiahui Ren ⋅ Mochu Xiang ⋅ Jiajun Zhu ⋅ Yuchao Dai
|
Exhibit Hall I #408 | |
|
GaussianOcc: Fully Self-supervised and Efficient 3D Occupancy Estimation with Gaussian Splatting
Poster Session 6 & Exhibit Hall with Coffee Break
Wanshui Gan ⋅ Fang Liu ⋅ Hongbin Xu ⋅ Ningkai Mo ⋅ Naoto Yokoya
|
Exhibit Hall I #410 | |
|
GeoSplatting: Towards Geometry Guided Gaussian Splatting for Physically-based Inverse Rendering
Poster Session 6 & Exhibit Hall with Coffee Break
Kai Ye ⋅ Chong Gao ⋅ Guanbin Li ⋅ Wenzheng Chen ⋅ Baoquan Chen
|
Exhibit Hall I #411 | |
|
Wide2Long: Learning Lens Compression and Perspective Adjustment for Wide-Angle to Telephoto Translation
Poster Session 6 & Exhibit Hall with Coffee Break
Soumyadipta Banerjee ⋅ Jiaul Paik ⋅ Debashis Sen
|
Exhibit Hall I #412 | |
|
EVER: Exact Volumetric Ellipsoid Rendering for Real-time View Synthesis
Poster Session 2 & Exhibit Hall with Coffee Break
Alexander Mai ⋅ Peter Hedman ⋅ George Kopanas ⋅ Dor Verbin ⋅ David Futschik ⋅ Qiangeng Xu ⋅ Falko Kuester ⋅ Jonathan Barron ⋅ Yinda Zhang
|
Exhibit Hall I #381 | |
|
LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion
Poster Session 6 & Exhibit Hall with Coffee Break
Fangfu Liu ⋅ Hao Li ⋅ Jiawei Chi ⋅ Hanyang Wang ⋅ Minghui Yang ⋅ Fudong Wang ⋅ Yueqi Duan
|
Exhibit Hall I #413 | |
|
Leveraging 2D Priors and SDF Guidance for Urban Scene Rendering
Poster Session 6 & Exhibit Hall with Coffee Break
Siddharth Tourani ⋅ Jayaram Reddy ⋅ Akash Kumbar ⋅ Satyajit Tourani ⋅ Nishant Goyal ⋅ Madhava Krishna ⋅ Dinesh Reddy Narapureddy ⋅ Muhammad Haris Khan
|
Exhibit Hall I #417 | |
|
LBM: Latent Bridge Matching for Fast Image-to-Image Translation
Clément Chadebec ⋅ Onur Tasar ⋅ Sanjeev Sreetharan ⋅ Benjamin Aubin
|
Exhibit Hall I #420 | |
|
SparseLaneSTP: Leveraging Spatio-Temporal Priors with Sparse Transformers for 3D Lane Detection
Poster Session 6 & Exhibit Hall with Coffee Break
Maximilian Pittner ⋅ Joel Janai ⋅ Mario Faigle ⋅ Alexandru Condurache
|
Exhibit Hall I #421 | |
|
Relative Illumination Fields: Learning Medium and Light Independent Underwater Scenes
Poster Session 6 & Exhibit Hall with Coffee Break
Mengkun She ⋅ Felix Seegräber ⋅ David Nakath ⋅ Patricia Schöntag ⋅ Kevin Köser
|
Exhibit Hall I #422 | |
|
Super Resolved Imaging with Adaptive Optics
Robin Swanson ⋅ Esther Y. H. Lin ⋅ Masen Lamb ⋅ Suresh Sivanandam ⋅ Kiriakos N. Kutulakos
|
Exhibit Hall I #425 | |
|
HVPUNet: Hybrid-Voxel Point-cloud Upsampling Network
Poster Session 6 & Exhibit Hall with Coffee Break
Juhyung Ha ⋅ Vibhas Vats ⋅ Alimoor Reza ⋅ Soon-heung Jung ⋅ David Crandall
|
Exhibit Hall I #426 | |
|
Stealthy Backdoor Attack in Federated Learning via Adaptive Layer-wise Gradient Alignment
Poster Session 6 & Exhibit Hall with Coffee Break
Qingqian Yang ⋅ Peishen Yan ⋅ Xiaoyu Wu ⋅ Jiaru Zhang ⋅ Tao Song ⋅ Yang Hua ⋅ Hao Wang ⋅ Liangliang Wang ⋅ Haibing Guan
|
Exhibit Hall I #427 | |
|
VMem: Consistent Interactive Video Scene Generation with Surfel-Indexed View Memory
Runjia Li ⋅ Philip Torr ⋅ Andrea Vedaldi ⋅ Tomas Jakab
|
Exhibit Hall I #93 | |
|
Back on Track: Bundle Adjustment for Dynamic Scene Reconstruction
Poster Session 2 & Exhibit Hall with Coffee Break
Weirong Chen ⋅ Ganlin Zhang ⋅ Felix Wimbauer ⋅ Rui Wang ⋅ Nikita Araslanov ⋅ Andrea Vedaldi ⋅ Daniel Cremers
|
Exhibit Hall I #75 | |
|
Self-Ensembling Gaussian Splatting for Few-Shot Novel View Synthesis
Poster Session 2 & Exhibit Hall with Coffee Break
Chen Zhao ⋅ Xuan Wang ⋅ Tong Zhang ⋅ Saqib Javed ⋅ Mathieu Salzmann
|
Exhibit Hall I #147 | |
|
Importance-Based Token Merging for Efficient Image and Video Generation
Poster Session 2 & Exhibit Hall with Coffee Break
Haoyu Wu ⋅ Jingyi Xu ⋅ Hieu Le ⋅ Dimitris Samaras
|
Exhibit Hall I #303 | |
|
Knowledge Distillation for Learned Image Compression
Poster Session 2 & Exhibit Hall with Coffee Break
Yunuo Chen ⋅ Zezheng Lyu ⋅ Bing He ⋅ Ning Cao ⋅ Gang chen ⋅ Guo Lu ⋅ Wenjun Zhang
|
Exhibit Hall I #304 | |
|
Variance-Based Pruning for Accelerating and Compressing Trained Networks
Poster Session 2 & Exhibit Hall with Coffee Break
Uranik Berisha ⋅ Jens Mehnert ⋅ Alexandru Condurache
|
Exhibit Hall I #382 | |
|
LoftUp: Learning a Coordinate-Based Feature Upsampler for Vision Foundation Models
Poster Session 3 & Exhibit Hall
Haiwen Huang ⋅ Anpei Chen ⋅ Volodymyr Havrylov ⋅ Andreas Geiger ⋅ Dan Zhang
|
Exhibit Hall I #75 | |
|
MaskControl: Spatio-Temporal Control for Masked Motion Synthesis
Poster Session 3 & Exhibit Hall
Ekkasit Pinyoanuntapong ⋅ Muhammad Usama Saleem ⋅ Korrawe Karunratanakul ⋅ Pu Wang ⋅ Hongfei Xue ⋅ Chen Chen ⋅ chuan guo ⋅ Junli Cao ⋅ Jian Ren ⋅ Sergey Tulyakov
|
Exhibit Hall I #148 | |
|
RS-vHeat: Heat Conduction Guided Efficient Remote Sensing Foundation Model
Poster Session 3 & Exhibit Hall
Huiyang Hu ⋅ Peijin Wang ⋅ Hanbo Bi ⋅ Boyuan Tong ⋅ Zhaozhi Wang ⋅ Wenhui Diao ⋅ Hao Chang ⋅ Yingchao Feng ⋅ Ziqi Zhang ⋅ Yaowei Wang ⋅ Qixiang Ye ⋅ Kun Fu ⋅ Xian Sun
|
Exhibit Hall I #149 | |
|
HairCUP: Hair Compositional Universal Prior for 3D Gaussian Avatars
Poster Session 3 & Exhibit Hall
Byungjun Kim ⋅ Shunsuke Saito ⋅ Giljoo Nam ⋅ Tomas Simon ⋅ Jason Saragih ⋅ Hanbyul Joo ⋅ Junxuan Li
|
Exhibit Hall I #223 | |
|
Understanding Co-speech Gestures in-the-wild
Poster Session 3 & Exhibit Hall
Sindhu Hegde ⋅ K R Prajwal ⋅ Taein Kwon ⋅ Andrew Zisserman
|
Exhibit Hall I #302 | |
|
DPoser-X: Diffusion Model as Robust 3D Whole-body Human Pose Prior
Poster Session 3 & Exhibit Hall
Junzhe Lu ⋅ Jing Lin ⋅ Hongkun Dou ⋅ Ailing Zeng ⋅ Yue Deng ⋅ Xian Liu ⋅ Zhongang Cai ⋅ Lei Yang ⋅ YULUN ZHANG ⋅ Haoqian Wang ⋅ Ziwei Liu
|
Exhibit Hall I #377 | |
|
Towards a Unified Copernicus Foundation Model for Earth Vision
Poster Session 3 & Exhibit Hall
Yi Wang ⋅ Zhitong Xiong ⋅ Chenying Liu ⋅ Adam Stewart ⋅ Thomas Dujardin ⋅ Nikolaos Ioannis Bountos ⋅ Angelos Zavras ⋅ Franziska Gerken ⋅ Ioannis Papoutsis ⋅ Laura Leal-Taixé ⋅ Xiao Xiang Zhu
|
Exhibit Hall I #449 | |
|
Teeth Reconstruction and Performance Capture Using a Phone Camera
Poster Session 3 & Exhibit Hall
Weixi Zheng ⋅ Jingwang Ling ⋅ Zhibo Wang ⋅ Quan Wang ⋅ Feng Xu
|
Exhibit Hall I #450 | |
|
ReCamMaster: Camera-Controlled Generative Rendering from A Single Video
Poster Session 4 & Exhibit Hall with Coffee Break
Jianhong Bai ⋅ Menghan Xia ⋅ Xiao Fu ⋅ Xintao Wang ⋅ Lianrui Mu ⋅ Jinwen Cao ⋅ Zuozhu Liu ⋅ Haoji Hu ⋅ Xiang Bai ⋅ Pengfei Wan ⋅ Di ZHANG
|
Exhibit Hall I #74 | |
|
Spatially-Varying Autofocus
Poster Session 6 & Exhibit Hall with Coffee Break
Yingsi Qin ⋅ Aswin Sankaranarayanan ⋅ Matthew O'Toole
|
Exhibit Hall I #74 | |
|
SparseFlex: High-Resolution and Arbitrary-Topology 3D Shape Modeling
Poster Session 4 & Exhibit Hall with Coffee Break
Xianglong He ⋅ Zi-Xin Zou ⋅ Chia Hao Chen ⋅ Yuan-Chen Guo ⋅ Ding Liang ⋅ Chun Yuan ⋅ Wanli Ouyang ⋅ Yanpei Cao ⋅ Yangguang Li
|
Exhibit Hall I #75 | |
|
RePoseD: Efficient Relative Pose Estimation With Known Depth Information
Poster Session 4 & Exhibit Hall with Coffee Break
Yaqing Ding ⋅ Viktor Kocur ⋅ VACLAV VAVRA ⋅ Zuzana Berger Haladova ⋅ jian Yang ⋅ Torsten Sattler ⋅ Zuzana Kukelova
|
Exhibit Hall I #151 | |
|
Diving into the Fusion of Monocular Priors for Generalized Stereo Matching
Poster Session 4 & Exhibit Hall with Coffee Break
Chengtang Yao ⋅ Lidong Yu ⋅ Zhidan Liu ⋅ Jiaxi Zeng ⋅ Yuwei Wu ⋅ Yunde Jia
|
Exhibit Hall I #152 | |
|
Forecasting Continuous Non-Conservative Dynamical Systems in SO(3)
Poster Session 4 & Exhibit Hall with Coffee Break
Lennart Bastian ⋅ Mohammad Rashed ⋅ Nassir Navab ⋅ Tolga Birdal
|
Exhibit Hall I #226 | |
|
Dynamic Typography: Bringing Text to Life via Video Diffusion Prior
Poster Session 4 & Exhibit Hall with Coffee Break
Zichen Liu ⋅ Yihao Meng ⋅ Hao Ouyang ⋅ Yue Yu ⋅ Bolin Zhao ⋅ Daniel Cohen-Or ⋅ Huamin Qu
|
Exhibit Hall I #227 | |
|
Certifiably Optimal Anisotropic Rotation Averaging
Poster Session 4 & Exhibit Hall with Coffee Break
Carl Olsson ⋅ Yaroslava Lochman ⋅ Johan Malmport ⋅ Christopher Zach
|
Exhibit Hall I #305 | |
|
MIORe & VAR-MIORe: Benchmarks to Push the Boundaries of Restoration
Poster Session 5 & Exhibit Hall
George Ciubotariu ⋅ Zhuyun Zhou ⋅ Zongwei Wu ⋅ Radu Timofte
|
Exhibit Hall I #155 | |
|
MikuDance: Animating Character Art with Mixed Motion Dynamics
Poster Session 5 & Exhibit Hall
Jiaxu Zhang ⋅ Xianfang Zeng ⋅ Xin Chen ⋅ Wei Zuo ⋅ Gang YU ⋅ Zhigang Tu
|
Exhibit Hall I #156 | |
|
ROAR: Reducing Inversion Error in Generative Image Watermarking
Poster Session 5 & Exhibit Hall
Hanyi Wang ⋅ Han Fang ⋅ Shi-Lin Wang ⋅ Ee-Chien Chang
|
Exhibit Hall I #230 | |
|
Diffusion Transformer meets Multi-level Wavelet Spectrum for Single Image Super-Resolution
Poster Session 5 & Exhibit Hall
Peng Du ⋅ Hui Li ⋅ Han Xu ⋅ Paul Jeon ⋅ Dongwook Lee ⋅ Daehyun Ji ⋅ Ran Yang ⋅ Feng Zhu
|
Exhibit Hall I #303 | |
|
Automated Model Evaluation for Object Detection via Prediction Consistency and Reliability
Poster Session 5 & Exhibit Hall
Seungju Yoo ⋅ Hyuk Kwon ⋅ Joong-Won Hwang ⋅ Kibok Lee
|
Exhibit Hall I #304 | |
|
LOTS of Fashion! Multi-Conditioning for Image Generation via Sketch-Text Pairing
Poster Session 5 & Exhibit Hall
Federico Girella ⋅ Davide Talon ⋅ Ziyue Liu ⋅ Zanxi Ruan ⋅ Yiming Wang ⋅ Marco Cristani
|
Exhibit Hall I #376 | |
|
FlowEdit: Inversion-Free Text-Based Editing Using Pre-Trained Flow Models
Poster Session 5 & Exhibit Hall
Vladimir Kulikov ⋅ Matan Kleiner ⋅ Inbar Huberman-Spiegelglas ⋅ Tomer Michaeli
|
Exhibit Hall I #452 | |
|
LayerTracer: Cognitive-Aligned Layered SVG Synthesis via Diffusion Transformer
Poster Session 5 & Exhibit Hall
Yiren Song ⋅ Danze Chen ⋅ Mike Zheng Shou
|
Exhibit Hall I #453 | |
|
SuperDec: 3D Scene Decomposition with Superquadrics Primitives
Poster Session 6 & Exhibit Hall with Coffee Break
Elisabetta Fedele ⋅ Boyang Sun ⋅ Francis Engelmann ⋅ Marc Pollefeys ⋅ Leonidas Guibas
|
Exhibit Hall I #144 | |
|
E-SAM: Training-Free Segment Every Entity Model
Poster Session 6 & Exhibit Hall with Coffee Break
WEIMING ZHANG ⋅ Dingwen Xiao ⋅ Lei Chen ⋅ Lin Wang
|
Exhibit Hall I #219 | |
|
Online Reasoning Video Segmentation with Just-in-Time Digital Twins
Poster Session 6 & Exhibit Hall with Coffee Break
Yiqing Shen ⋅ Bohan Liu ⋅ Chenjia Li ⋅ Lalithkumar Seenivasan ⋅ Mathias Unberath
|
Exhibit Hall I #220 | |
|
Towards Foundational Models for Single-Chip Radar
Poster Session 6 & Exhibit Hall with Coffee Break
Tianshu Huang ⋅ Akarsh Prabhakara ⋅ Chuhan Chen ⋅ Jay Karhade ⋅ Deva Ramanan ⋅ Matthew O'Toole ⋅ Anthony Rowe
|
Exhibit Hall I #287 | |
|
Make Your Training Flexible: Towards Deployment-Efficient Video Models
Poster Session 5 & Exhibit Hall
Chenting Wang ⋅ Kunchang Li ⋅ Tianxiang Jiang ⋅ Xiangyu Zeng ⋅ Yi Wang ⋅ Limin Wang
|
Exhibit Hall I #383 | |
|
M2SFormer: Multi-Spectral and Multi-Scale Attention with Edge-Aware Difficulty Guidance for Image Forgery Localization
Ju-Hyeon Nam ⋅ Dong-Hyun Moon ⋅ Sang-Chul Lee
|
Exhibit Hall I #99 | |
|
Articulate3D: Holistic Understanding of 3D Scenes as Universal Scene Description
Poster Session 2 & Exhibit Hall with Coffee Break
Anna-Maria Halacheva ⋅ Yang Miao ⋅ Jan-Nico Zaech ⋅ Xi Wang ⋅ Luc Gool ⋅ Danda Pani Paudel
|
Exhibit Hall I #57 | |
|
What You Have is What You Track: Adaptive and Robust Multimodal Tracking
Poster Session 1 & Exhibit Hall
Yuedong Tan ⋅ Jiawei Shao ⋅ Eduard Zamfir ⋅ Ruanjun Li ⋅ Zhaochong An ⋅ Chao Ma ⋅ Danda Pani Paudel ⋅ Luc Gool ⋅ Radu Timofte ⋅ Zongwei Wu
|
Exhibit Hall I #321 | |
|
Low-Light Image Enhancement using Event-Based Illumination Estimation
Poster Session 2 & Exhibit Hall with Coffee Break
Lei Sun ⋅ Yuhan Bao ⋅ Jiajun Zhai ⋅ Jingyun Liang ⋅ YULUN ZHANG ⋅ Kaiwei Wang ⋅ Danda Pani Paudel ⋅ Luc Gool
|
Exhibit Hall I #156 | |
|
Multi-Modal Few-Shot Temporal Action Segmentation
Poster Session 3 & Exhibit Hall
Zijia Lu ⋅ Ehsan Elhamifar
|
Exhibit Hall I #387 | |
|
WildSAT: Learning Satellite Image Representations from Wildlife Observations
Poster Session 2 & Exhibit Hall with Coffee Break
Rangel Daroya ⋅ Elijah Cole ⋅ Oisin Mac Aodha ⋅ Grant Horn ⋅ Subhransu Maji
|
Exhibit Hall I #105 | |
|
Forgetting Through Transforming: Enabling Federated Unlearning via Class-Aware Representation Transformation
Poster Session 1 & Exhibit Hall
Qi Guo ⋅ Zhen Tian ⋅ Minghao Yao ⋅ Saiyu Qi ⋅ Yong Qi ⋅ Bingyi Liu
|
Exhibit Hall I #130 | |
|
SU-RGS: Relightable 3D Gaussian Splatting from Sparse Views under Unconstrained Illuminations
Poster Session 6 & Exhibit Hall with Coffee Break
Qi Zhang ⋅ Chi Huang ⋅ Qian Zhang ⋅ Nan Li ⋅ Wei Feng
|
Exhibit Hall I #206 | |
|
SpectralAR: Spectral Autoregressive Visual Generation
Poster Session 4 & Exhibit Hall with Coffee Break
Yuanhui Huang ⋅ Weiliang Chen ⋅ Wenzhao Zheng ⋅ Yueqi Duan ⋅ Jie Zhou ⋅ Jiwen Lu
|
Exhibit Hall I #91 | |
|
Sibai: A Few-Shot Meta-Classifier for Poisoning Detection in Federated Learning
Poster Session 1 & Exhibit Hall
Melanie Götz ⋅ Torsten Krauß ⋅ Alexandra Dmitrienko
|
Exhibit Hall I #352 | |
|
Gradient Extrapolation for Debiased Representation Learning
Poster Session 1 & Exhibit Hall
Ihab Asaad ⋅ Maha Shadaydeh ⋅ Joachim Denzler
|
Exhibit Hall I #355 | |
|
Supercharging Floorplan Localization with Semantic Rays
Poster Session 6 & Exhibit Hall with Coffee Break
Yuval Grader ⋅ Hadar Averbuch-Elor
|
Exhibit Hall I #232 | |
|
Learning Streaming Video Representation via Multitask Training
Poster Session 3 & Exhibit Hall
Yibin Yan ⋅ Jilan Xu ⋅ Shangzhe Di ⋅ Yikun Liu ⋅ Yudi Shi ⋅ Qirui Chen ⋅ Zeqian Li ⋅ Yifei Huang ⋅ Weidi Xie
|
Exhibit Hall I #224 | |
|
InstantEdit: Text-Guided Few-Step Image Editing with Piecewise Rectified Flow
Poster Session 4 & Exhibit Hall with Coffee Break
Yiming Gong ⋅ Zhen Zhu ⋅ Minjia Zhang
|
Exhibit Hall I #184 | |
|
World4Drive: End-to-End Autonomous Driving via Intention-aware Physical Latent World Model
Poster Session 6 & Exhibit Hall with Coffee Break
Yupeng Zheng ⋅ Pengxuan Yang ⋅ Zebin Xing ⋅ Qichao Zhang ⋅ Yuhang Zheng ⋅ Yinfeng Gao ⋅ Pengfei Li ⋅ Teng Zhang ⋅ Zhongpu Xia ⋅ Peng Jia ⋅ XianPeng Lang ⋅ Dongbin Zhao
|
Exhibit Hall I #378 | |
|
CoHD: A Counting-Aware Hierarchical Decoding Framework for Generalized Referring Expression Segmentation
Poster Session 5 & Exhibit Hall
Zhuoyan Luo ⋅ Yinghao Wu ⋅ Tianheng Cheng ⋅ Yong Liu ⋅ Yicheng Xiao ⋅ Hongfa Wang ⋅ Xiao-Ping Zhang ⋅ Yujiu Yang
|
Exhibit Hall I #271 | |
|
Scaling Transformer-Based Novel View Synthesis with Models Token Disentanglement and Synthetic Data
Poster Session 6 & Exhibit Hall with Coffee Break
Nithin Gopalakrishnan Nair ⋅ Srinivas Kaza ⋅ Xuan Luo ⋅ Jungyeon Park ⋅ Stephen Lombardi ⋅ Vishal Patel
|
Exhibit Hall I #372 | |
|
Learning to See in the Extremely Dark
Poster Session 2 & Exhibit Hall with Coffee Break
Hai Jiang ⋅ Binhao Guan ⋅ Zhen Liu ⋅ Xiaohong Liu ⋅ Jian Yu ⋅ Zheng Liu ⋅ Songchen Han ⋅ Shuaicheng Liu
|
Exhibit Hall I #250 | |
|
Customizing Domain Adapters for Domain Generalization
Poster Session 1 & Exhibit Hall
Yuyang Ji ⋅ Zeyi Huang ⋅ Haohan Wang ⋅ Yong Jae Lee
|
Exhibit Hall I #80 | |
|
BATCLIP: Bimodal Online Test-Time Adaptation for CLIP
Poster Session 1 & Exhibit Hall
Sarthak Kumar Maharana ⋅ Baoming Zhang ⋅ Leonid Karlinsky ⋅ Rogerio Feris ⋅ Yunhui Guo
|
Exhibit Hall I #139 | |
|
BillBoard Splatting (BBSplat): Learnable Textured Primitives for Novel View Synthesis
Poster Session 6 & Exhibit Hall with Coffee Break
David Svitov ⋅ Pietro Morerio ⋅ Lourdes Agapito ⋅ ALESSIO DEL BUE
|
Exhibit Hall I #29 | |
|
Vivid4D: Improving 4D Reconstruction from Monocular Video by Video Inpainting
Poster Session 3 & Exhibit Hall
Jiaxin Huang ⋅ Sheng Miao ⋅ Bangbang Yang ⋅ Yuewen Ma ⋅ Yiyi Liao
|
Exhibit Hall I #244 | |
|
MemoryTalker: Personalized Speech-Driven 3D Facial Animation via Audio-Guided Stylization
Poster Session 3 & Exhibit Hall
Hyung Kyu Kim ⋅ Sangmin Lee ⋅ HAK GU KIM
|
Exhibit Hall I #116 | |
|
DriveX: Omni Scene Modeling for Learning Generalizable World Knowledge in Autonomous Driving
Poster Session 6 & Exhibit Hall with Coffee Break
Chen Shi ⋅ Shaoshuai Shi ⋅ Kehua Sheng ⋅ Bo Zhang ⋅ Li Jiang
|
Exhibit Hall I #375 | |
|
MamV2XCalib: V2X-based Target-less Infrastructure Camera Calibration with State Space Model
Poster Session 6 & Exhibit Hall with Coffee Break
Yaoye Zhu ⋅ Zhe Wang ⋅ Yan Wang
|
Exhibit Hall I #191 | |
|
SiM3D: Single-instance Multiview Multimodal and Multisetup 3D Anomaly Detection Benchmark
Poster Session 5 & Exhibit Hall
Alex Costanzino ⋅ Pierluigi Zama Ramirez ⋅ Luigi Lella ⋅ Matteo Ragaglia ⋅ Alessandro Oliva ⋅ Giuseppe Lisanti ⋅ Luigi Stefano
|
Exhibit Hall I #108 | |
|
Image as an IMU: Estimating Camera Motion from a Single Motion-Blurred Image
Poster Session 1 & Exhibit Hall
Jerred Chen ⋅ Ronald Clark
|
Exhibit Hall I #228 | |
|
SUB: Benchmarking CBM Generalization via Synthetic Attribute Substitutions
Poster Session 5 & Exhibit Hall
Jessica Bader ⋅ Leander Girrbach ⋅ Stephan Alaniz ⋅ Zeynep Akata
|
Exhibit Hall I #320 | |
|
PARTE: Part-Guided Texturing for 3D Human Reconstruction from a Single Image
Poster Session 2 & Exhibit Hall with Coffee Break
Hyeongjin Nam ⋅ Donghwan Kim ⋅ Gyeongsik Moon ⋅ Kyoung Mu Lee
|
Exhibit Hall I #332 | |
|
Cross-Subject Mind Decoding from Inaccurate Representations
Poster Session 4 & Exhibit Hall with Coffee Break
Yangyang Xu ⋅ Bangzhen Liu ⋅ Wenqi Shao ⋅ Yong Du ⋅ Shengfeng He ⋅ Tingting Zhu
|
Exhibit Hall I #17 | |
|
Boosting MLLM Reasoning with Text-Debiased Hint-GRPO
Poster Session 1 & Exhibit Hall
Qihan Huang ⋅ Weilong Dai ⋅ Jinlong Liu ⋅ Wanggui He ⋅ Hao Jiang ⋅ Mingli Song ⋅ Jingyuan CHEN ⋅ Chang Yao ⋅ Jie Song
|
Exhibit Hall I #455 | |
|
Adaptive Dual Uncertainty Optimization: Boosting Monocular 3D Object Detection under Test-Time Shifts
Zixuan Hu ⋅ Dongxiao Li ⋅ Xinzhu Ma ⋅ SHIXIANG TANG ⋅ Xiaotong Li ⋅ Wenhan Yang ⋅ LINGYU DUAN
|
Exhibit Hall I #211 | |
|
SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation
Junsong Chen ⋅ Shuchen Xue ⋅ Yuyang Zhao ⋅ Jincheng YU ⋅ Sayak Paul ⋅ Junyu Chen ⋅ Han Cai ⋅ Enze Xie ⋅ Song Han
|
Exhibit Hall I #123 | |
|
AirCache: Activating Inter-modal Relevancy KV Cache Compression for Efficient Large Vision-Language Model Inference
Poster Session 5 & Exhibit Hall
Kai Huang ⋅ hao zou ⋅ Bochen Wang ⋅ Xi Ye ⋅ Zhen Xie ⋅ Hao Wang
|
Exhibit Hall I #390 | |
|
LLM-enhanced Action-aware Multi-modal Prompt Tuning for Image-Text Matching
Poster Session 5 & Exhibit Hall
Meng Tian ⋅ Shuo Yang ⋅ Xinxiao Wu
|
Exhibit Hall I #90 | |
|
UKBOB: One Billion MRI Labeled Masks for Generalizable 3D Medical Image Segmentation
Poster Session 5 & Exhibit Hall
Emmanuelle Bourigault ⋅ Amir Jamaludin ⋅ Abdullah Hamdi
|
Exhibit Hall I #169 | |
|
FlowStyler: Artistic Video Stylization via Transformation Fields Transports
Poster Session 3 & Exhibit Hall
YuNing Gong ⋅ Jiaming Chen ⋅ Xiaohua Ren ⋅ Yuanjun Liao ⋅ Yanci Zhang
|
Exhibit Hall I #21 | |
|
ShadowHack: Hacking Shadows via Luminance-Color Divide and Conquer
Poster Session 3 & Exhibit Hall
Jin Hu ⋅ Mingjia Li ⋅ Xiaojie Guo
|
Exhibit Hall I #131 | |
|
Harnessing Massive Satellite Imagery with Efficient Masked Image Modeling
Poster Session 2 & Exhibit Hall with Coffee Break
Fengxiang Wang ⋅ Hongzhen Wang ⋅ Di Wang ⋅ Zonghao Guo ⋅ Zhenyu Zhong ⋅ Long Lan ⋅ Wenjing Yang ⋅ Jing Zhang
|
Exhibit Hall I #180 | |
|
Beyond Losses Reweighting: Empowering Multi-Task Learning via the Generalization Perspective
Hoang Phan ⋅ Tung Lam Tran ⋅ Quyen Tran ⋅ Ngoc Tran ⋅ Tuan Truong ⋅ Qi Lei ⋅ Nhat Ho ⋅ Dinh Phung ⋅ Trung Le
|
Exhibit Hall I #222 | |
|
StableCodec: Taming One-Step Diffusion for Extreme Image Compression
Poster Session 4 & Exhibit Hall with Coffee Break
Tianyu Zhang ⋅ Xin Luo ⋅ Li Li ⋅ Dong Liu
|
Exhibit Hall I #239 | |
|
FastJSMA: Accelerating Jacobian-based Saliency Map Attacks through Gradient Decoupling
Poster Session 1 & Exhibit Hall
Zhenghao Gao ⋅ Shengjie Xu ⋅ Zijing Li ⋅ Meixi Chen ⋅ Chaojian Yu ⋅ Yuanjie Shao ⋅ Changxin Gao
|
Exhibit Hall I #133 | |
|
Toward Fair and Accurate Cross-Domain Medical Image Segmentation: A VLM-Driven Active Domain Adaptation Paradigm
Poster Session 5 & Exhibit Hall
Hongqiu Wang ⋅ Wu Chen ⋅ Xiangde Luo ⋅ Zhaohu Xing ⋅ Lihao Liu ⋅ Jing Qin ⋅ Shaozhi Wu ⋅ Lei Zhu
|
Exhibit Hall I #403 | |
|
Decouple to Reconstruct: High Quality UHD Restoration via Active Feature Disentanglement and Reversible Fusion
Poster Session 3 & Exhibit Hall
Yidi Liu ⋅ Dong Li ⋅ Yuxin Ma ⋅ Jie Huang ⋅ Wenlong Zhang ⋅ Xueyang Fu ⋅ Zheng-Jun Zha
|
Exhibit Hall I #153 | |
|
Federated Continuous Category Discovery and Learning
Poster Session 1 & Exhibit Hall
Lixu Wang ⋅ Chenxi Liu ⋅ Junfeng Guo ⋅ Qingqing Ye ⋅ Heng Huang ⋅ Haibo Hu ⋅ Wei Dong
|
Exhibit Hall I #221 | |
|
Are They the Same? Exploring Visual Correspondence Shortcomings of Multimodal LLMs
Poster Session 4 & Exhibit Hall with Coffee Break
Yikang Zhou ⋅ Tao Zhang ⋅ Shilin Xu ⋅ Shihao Chen ⋅ Qianyu Zhou ⋅ Yunhai Tong ⋅ Shunping Ji ⋅ Jiangning Zhang ⋅ Lu Qi ⋅ Xiangtai Li
|
Exhibit Hall I #266 | |
|
Consensus-Driven Active Model Selection
Justin Kay ⋅ Grant Horn ⋅ Subhransu Maji ⋅ Daniel Sheldon ⋅ Sara Beery
|
Exhibit Hall I #431 | |
|
BlueNeg: A 35mm Negative Film Dataset for Restoring Channel-Heterogeneous Deterioration
Poster Session 3 & Exhibit Hall
Hanyuan Liu ⋅ Chengze Li ⋅ Minshan Xie ⋅ Wang Zhenni ⋅ Jiawen Liang ⋅ Chi LEUNG ⋅ Tien-Tsin Wong
|
Exhibit Hall I #293 | |
|
Rethinking Key-frame-based Micro-expression Recognition: A Robust and Accurate Framework Against Key-frame Errors
Zheyuan Zhang ⋅ Weihao Tang ⋅ Hong Chen
|
Exhibit Hall I #213 | |
|
Make Me Happier: Evoking Emotions Through Image Diffusion Models
Poster Session 4 & Exhibit Hall with Coffee Break
Qing Lin ⋅ Jingfeng Zhang ⋅ YEW-SOON ONG ⋅ Mengmi Zhang
|
Exhibit Hall I #140 | |
|
Pretend Benign: A Stealthy Adversarial Attack by Exploiting Vulnerabilities in Cooperative Perception
Poster Session 5 & Exhibit Hall
Hongwei Lin ⋅ Dongyu Pan ⋅ Qiming Xia ⋅ Hai Wu ⋅ Cheng Wang ⋅ Siqi Shen ⋅ Chenglu Wen
|
Exhibit Hall I #14 | |
|
What we need is explicit controllability: Training 3D gaze estimator using only facial images
Poster Session 3 & Exhibit Hall
Tingwei Li ⋅ Jun Bao ⋅ Zhenzhong Kuang ⋅ Buyu Liu
|
Exhibit Hall I #132 | |
|
SemiVisBooster: Boosting Semi-Supervised Learning for Fine-Grained Classification through Pseudo-Label Semantic Guidance
Poster Session 1 & Exhibit Hall
Wenjin Zhang ⋅ Xinyu Li ⋅ Chenyang Gao ⋅ Ivan Marsic
|
Exhibit Hall I #104 | |
|
OpenAnimals: Revisiting Person Re-Identification for Animals Towards Better Generalization
Poster Session 3 & Exhibit Hall
Saihui Hou ⋅ Panjian Huang ⋅ Zengbin Wang ⋅ Yuan Liu ⋅ Zeyu Li ⋅ Man Zhang ⋅ Yongzhen Huang
|
Exhibit Hall I #411 | |
|
Enhancing Prompt Generation with Adaptive Refinement for Camouflaged Object Detection
Poster Session 5 & Exhibit Hall
Xuehan Chen ⋅ Guangyu Ren ⋅ Tianhong Dai ⋅ Tania Stathaki ⋅ Hengyan Liu
|
Exhibit Hall I #83 | |
|
Hypergraph Clustering Network with Partial Attribute Imputation
Poster Session 1 & Exhibit Hall
Qianqian Wang ⋅ Bowen Zhao ⋅ Zhengming Ding ⋅ Wei Feng ⋅ Quanxue Gao
|
Exhibit Hall I #248 | |
|
Easy3D: A Simple Yet Effective Method for 3D Interactive Segmentation
Poster Session 6 & Exhibit Hall with Coffee Break
Andrea Simonelli ⋅ Norman Müller ⋅ Peter Kontschieder
|
Exhibit Hall I #357 | |
|
SAMPLE: Semantic Alignment through Temporal-Adaptive Multimodal Prompt Learning for Event-Based Open-Vocabulary Action Recognition
Poster Session 3 & Exhibit Hall
Jing Wang ⋅ Rui Zhao ⋅ Ruiqin Xiong ⋅ Xingtao Wang ⋅ Xiaopeng Fan ⋅ Tiejun Huang
|
Exhibit Hall I #415 | |
|
Object-centric Video Question Answering with Visual Grounding and Referring
Poster Session 5 & Exhibit Hall
Haochen Wang ⋅ Qirui Chen ⋅ Cilin Yan ⋅ Jiayin Cai ⋅ Xiaolong Jiang ⋅ Yao Hu ⋅ Weidi Xie ⋅ Stratis Gavves
|
Exhibit Hall I #233 | |
|
DSO: Aligning 3D Generators with Simulation Feedback for Physical Soundness
Ruining Li ⋅ Chuanxia Zheng ⋅ Christian Rupprecht ⋅ Andrea Vedaldi
|
Exhibit Hall I #165 | |
|
EgoAgent: A Joint Predictive Agent Model in Egocentric Worlds
Poster Session 2 & Exhibit Hall with Coffee Break
Lu Chen ⋅ Yizhou Wang ⋅ SHIXIANG TANG ⋅ Qianhong Ma ⋅ Tong He ⋅ Wanli Ouyang ⋅ Xiaowei Zhou ⋅ Hujun Bao ⋅ Sida Peng
|
Exhibit Hall I #183 | |
|
VMBench: A Benchmark for Perception-Aligned Video Motion Generation
Poster Session 3 & Exhibit Hall
Xinran Ling ⋅ Chen Zhu ⋅ Meiqi Wu ⋅ Hangyu Li ⋅ Xiaokun Feng ⋅ Cundian Yang ⋅ Aiming Hao ⋅ Jiashu Zhu ⋅ Jiahong Wu ⋅ Xiangxiang Chu
|
Exhibit Hall I #290 | |
|
UAVScenes: A Multi-Modal Dataset for UAVs
Poster Session 6 & Exhibit Hall with Coffee Break
Sijie Wang ⋅ Siqi Li ⋅ Yawei Zhang ⋅ Shangshu Yu ⋅ Shenghai Yuan ⋅ Rui She ⋅ Quanjiang Guo ⋅ JinXuan Zheng ⋅ Ong Howe ⋅ Leonrich Chandra ⋅ Shrivarshann Srijeyan ⋅ Aditya Sivadas ⋅ Toshan Aggarwal ⋅ Heyuan Liu ⋅ Hongming Zhang ⋅ CHEN CHUJIE ⋅ JIANG JUNYU ⋅ Lihua Xie ⋅ Wee Peng Tay
|
Exhibit Hall I #407 | |
|
LIRA: Reasoning Reconstruction via Multimodal Large Language Models
Poster Session 1 & Exhibit Hall
Zhen Zhou ⋅ Tong Wang ⋅ Yunkai Ma ⋅ Xiao Tan ⋅ Fengshui Jing
|
Exhibit Hall I #159 | |
|
Move to Understand a 3D Scene: Bridging Visual Grounding and Exploration for Efficient and Versatile Embodied Navigation
ZIYU ZHU ⋅ Xilin Wang ⋅ Yixuan Li ⋅ Zhuofan Zhang ⋅ Xiaojian Ma ⋅ Yixin Chen ⋅ Baoxiong Jia ⋅ Wei Liang ⋅ Qian Yu ⋅ Zhidong Deng ⋅ Siyuan Huang ⋅ Qing Li
|
Exhibit Hall I #291 | |
|
NavMorph: A Self-Evolving World Model for Vision-and-Language Navigation in Continuous Environments
Poster Session 2 & Exhibit Hall with Coffee Break
Xuan Yao ⋅ Junyu Gao ⋅ Changsheng Xu
|
Exhibit Hall I #48 | |
|
TF-TI2I: Training-Free Text-and-Image-to-Image Generation via Multi-Modal Implicit-Context Learning In Text-to-Image Models
Poster Session 4 & Exhibit Hall with Coffee Break
Teng-Fang Hsiao ⋅ Bo-Kai Ruan ⋅ Yi-Lun Wu ⋅ Tzu-Ling Lin ⋅ Hong-Han Shuai
|
Exhibit Hall I #335 | |
|
Exploiting Frequency Dynamics for Enhanced Multimodal Event-based Action Recognition
Poster Session 2 & Exhibit Hall with Coffee Break
Meiqi Cao ⋅ Xiangbo Shu ⋅ Xin Jiang ⋅ Rui Yan ⋅ Yazhou Yao ⋅ Jinhui Tang
|
Exhibit Hall I #89 | |
|
Compression of 3D Gaussian Splatting with Optimized Feature Planes and Standard Video Codecs
Poster Session 6 & Exhibit Hall with Coffee Break
Soonbin Lee ⋅ Fangwen Shu ⋅ Yago Sanchez de la Fuente ⋅ Thomas Schierl ⋅ Cornelius Hellge
|
Exhibit Hall I #73 | |
|
GeoProg3D: Compositional Visual Reasoning for City-Scale 3D Language Fields
Poster Session 2 & Exhibit Hall with Coffee Break
Shunsuke Yasuki ⋅ Taiki Miyanishi ⋅ Nakamasa Inoue ⋅ Shuhei Kurita ⋅ Koya Sakamoto ⋅ Daichi Azuma ⋅ Masato Taki ⋅ Yutaka Matsuo
|
Exhibit Hall I #442 | |
|
GazeGaussian: High-Fidelity Gaze Redirection with 3D Gaussian Splatting
Xiaobao Wei ⋅ Peng Chen ⋅ Guangyu Li ⋅ Ming Lu ⋅ Hui Chen ⋅ Feng Tian
|
Exhibit Hall I #311 | |
|
Boosting Adversarial Transferability via Negative Hessian Trace Regularization
Poster Session 1 & Exhibit Hall
Yunfei Long ⋅ Zilin Tian ⋅ Liguo Zhang ⋅ Huosheng Xu
|
Exhibit Hall I #217 | |
|
FB-Diff: Fourier Basis-guided Diffusion for Temporal Interpolation of 4D Medical Imaging
Poster Session 6 & Exhibit Hall with Coffee Break
Xin You ⋅ Runze Yang ⋅ Chuyan Zhang ⋅ Zhongliang Jiang ⋅ JIE YANG ⋅ Nassir Navab
|
Exhibit Hall I #317 | |
|
How Far are AI-generated Videos from Simulating the 3D Visual World: A Learned 3D Evaluation Approach
Poster Session 3 & Exhibit Hall
Chirui CHANG ⋅ Jiahui Liu ⋅ Zhengzhe Liu ⋅ Xiaoyang Lyu ⋅ Yi-Hua Huang ⋅ Xin Tao ⋅ Pengfei Wan ⋅ Di ZHANG ⋅ Xiaojuan Qi
|
Exhibit Hall I #28 | |
|
SIC: Similarity-Based Interpretable Image Classification with Neural Networks
Poster Session 5 & Exhibit Hall
Tom Nuno Wolf ⋅ Emre Kavak ⋅ Fabian Bongratz ⋅ Christian Wachinger
|
Exhibit Hall I #419 | |
|
3DRealCar: An In-the-wild RGB-D Car Dataset with 360-degree Views
Poster Session 6 & Exhibit Hall with Coffee Break
Xiaobiao Du ⋅ Yida Wang ⋅ Haiyang Sun ⋅ Zhuojie Wu ⋅ Hongwei Sheng ⋅ Shuyun Wang ⋅ Jiaying Ying ⋅ Ming Lu ⋅ Tianqing Zhu ⋅ Kun Zhan ⋅ Xin Yu
|
Exhibit Hall I #171 | |
|
Describe, Don’t Dictate: Semantic Image Editing with Natural Language Intent
Poster Session 4 & Exhibit Hall with Coffee Break
En Ci ⋅ Shanyan Guan ⋅ Yanhao Ge ⋅ Yilin Zhang ⋅ Wei Li ⋅ Zhenyu Zhang ⋅ Jian Yang ⋅ Ying Tai
|
Exhibit Hall I #412 | |
|
Event-based Tiny Object Detection: A Benchmark Dataset and Baselines
Poster Session 2 & Exhibit Hall with Coffee Break
Nuo Chen ⋅ Chao Xiao ⋅ Yimian Dai ⋅ Shiman He ⋅ Miao Li ⋅ Wei An
|
Exhibit Hall I #205 | |
|
Depth AnyEvent: A Cross-Modal Distillation Paradigm for Event-Based Monocular Depth Estimation
Poster Session 4 & Exhibit Hall with Coffee Break
Luca Bartolomei ⋅ Enrico Mannocci ⋅ Fabio Tosi ⋅ Matteo Poggi ⋅ Stefano Mattoccia
|
Exhibit Hall I #458 | |
|
EmotiCrafter: Text-to-Emotional-Image Generation based on Valence-Arousal Model
Poster Session 4 & Exhibit Hall with Coffee Break
Shengqi Dang ⋅ Yi He ⋅ Long Ling ⋅ Ziqing Qian ⋅ Nanxuan Zhao ⋅ Nan Cao
|
Exhibit Hall I #31 | |
|
LD-RPS: Zero-Shot Unified Image Restoration via Latent Diffusion Recurrent Posterior Sampling
Poster Session 3 & Exhibit Hall
Li Huaqiu ⋅ Yong Wang ⋅ Tongwen Huang ⋅ Hailang Huang ⋅ Haoqian Wang ⋅ Xiangxiang Chu
|
Exhibit Hall I #346 | |
|
Towards Stabilized and Efficient Diffusion Transformers through Long-Skip-Connections with Spectral Constraints
Poster Session 4 & Exhibit Hall with Coffee Break
Guanjie Chen ⋅ Xinyu Zhao ⋅ Yucheng Zhou ⋅ Xiaoye Qu ⋅ Tianlong Chen ⋅ Yu Cheng
|
Exhibit Hall I #270 | |
|
Not All Frame Features Are Equal: Video-to-4D Generation via Decoupling Dynamic-Static Features
Liying Yang ⋅ Chen Liu ⋅ Zhenwei Zhu ⋅ Ajian Liu ⋅ Hui Ma ⋅ Jian Nong ⋅ Yanyan Liang
|
Exhibit Hall I #233 | |
|
Fuse Before Transfer: Knowledge Fusion for Heterogeneous Distillation
Poster Session 1 & Exhibit Hall
Guopeng Li ⋅ Qiang Wang ⋅ Ke Yan ⋅ Shouhong Ding ⋅ Yuan Gao ⋅ Gui-Song Xia
|
Exhibit Hall I #320 | |
|
CoSMIC: Continual Self-supervised Learning for Multi-Domain Medical Imaging via Conditional Mutual Information Maximization
Poster Session 5 & Exhibit Hall
Yihang Liu ⋅ Ying Wen ⋅ Longzhen Yang ⋅ Lianghua He ⋅ Heng Tao Shen
|
Exhibit Hall I #307 | |
|
Unsupervised Identification of Protein Compositions and Conformations via Implicit Content-Transformation Disentanglement
Poster Session 2 & Exhibit Hall with Coffee Break
Mostofa Rafid Uddin ⋅ Jana Armouti ⋅ Min Xu
|
Exhibit Hall I #232 | |
|
SplArt: Articulation Estimation and Part-Level Reconstruction with 3D Gaussian Splatting
Poster Session 2 & Exhibit Hall with Coffee Break
Shengjie Lin ⋅ Jiading Fang ⋅ Muhammad Zubair Irshad ⋅ Vitor Campagnolo Guizilini ⋅ Rares Ambrus ⋅ Greg Shakhnarovich ⋅ Matthew Walter
|
Exhibit Hall I #359 | |
|
Splat-based 3D Scene Reconstruction with Extreme Motion-blur
Poster Session 6 & Exhibit Hall with Coffee Break
Hyeonjoong Jang ⋅ Dongyoung Choi ⋅ Donggun Kim ⋅ Woohyun Kang ⋅ Min H. Kim
|
Exhibit Hall I #165 | |
|
Diffusion Curriculum: Synthetic-to-Real Data Curriculum via Image-Guided Diffusion
Poster Session 1 & Exhibit Hall
Yijun Liang ⋅ Shweta Bhardwaj ⋅ Tianyi Zhou
|
Exhibit Hall I #151 | |
|
Training-free and Adaptive Sparse Attention for Efficient Long Video Generation
Poster Session 4 & Exhibit Hall with Coffee Break
yifei xia ⋅ Suhan Ling ⋅ Fangcheng Fu ⋅ Yujie Wang ⋅ Huixia Li ⋅ Xuefeng Xiao ⋅ Bin CUI
|
Exhibit Hall I #104 | |
|
ForestFormer3D: A Unified Framework for End-to-End Segmentation of Forest LiDAR 3D Point Clouds
Poster Session 6 & Exhibit Hall with Coffee Break
Binbin Xiang ⋅ Maciej Wielgosz ⋅ Stefano Puliti ⋅ Kamil Král ⋅ Martin Krůček ⋅ Azim Missarov ⋅ Rasmus Astrup
|
Exhibit Hall I #356 | |
|
OV3D-CG: Open-vocabulary 3D Instance Segmentation with Contextual Guidance
Poster Session 2 & Exhibit Hall with Coffee Break
Mingquan Zhou ⋅ Chen He ⋅ Ruiping Wang ⋅ Xilin Chen
|
Exhibit Hall I #27 | |
|
AdsQA: Towards Advertisement Video Understanding
Poster Session 5 & Exhibit Hall
Xinwei Long ⋅ Kai Tian ⋅ Peng Xu ⋅ Guoli Jia ⋅ Jingxuan Li ⋅ Sa Yang ⋅ Yihua Shao ⋅ Kaiyan Zhang ⋅ Che Jiang ⋅ Hao Xu ⋅ Yang Liu ⋅ Jiaheng Ma ⋅ Bowen Zhou
|
Exhibit Hall I #339 | |
|
Memory-Efficient Generative Models via Product Quantization
Poster Session 4 & Exhibit Hall with Coffee Break
Jie Shao ⋅ Hanxiao Zhang ⋅ Hao Yu ⋅ Jianxin Wu
|
Exhibit Hall I #190 | |
|
ForgeLens: Data-Efficient Forgery Focus for Generalizable Forgery Image Detection
Poster Session 4 & Exhibit Hall with Coffee Break
Yingjian Chen ⋅ Lei Zhang ⋅ Yakun Niu
|
Exhibit Hall I #131 | |
|
Rethinking Discrete Tokens: Treating Them as Conditions for Continuous Autoregressive Image Synthesis
Poster Session 4 & Exhibit Hall with Coffee Break
Peng Zheng ⋅ Junke Wang ⋅ Yi Chang ⋅ Yizhou Yu ⋅ Rui Ma ⋅ Zuxuan Wu
|
Exhibit Hall I #240 | |
|
Multimodal Prompt Alignment for Facial Expression Recognition
Poster Session 3 & Exhibit Hall
Fuyan Ma ⋅ Yiran He ⋅ Bin Sun ⋅ Shutao Li
|
Exhibit Hall I #243 | |
|
CogCM: Cognition-Inspired Contextual Modeling for Audio-Visual Speech Enhancement
Poster Session 5 & Exhibit Hall
Feixiang Wang ⋅ Shuang Yang ⋅ Shiguang Shan ⋅ Xilin Chen
|
Exhibit Hall I #149 | |
|
Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition
Poster Session 1 & Exhibit Hall
Zhisheng Zhong ⋅ Chengyao Wang ⋅ Yuqi Liu ⋅ Senqiao Yang ⋅ Longxiang Tang ⋅ Yuechen Zhang ⋅ Jingyao Li ⋅ Tianyuan Qu ⋅ Yanwei Li ⋅ Yukang Chen ⋅ Shaozuo Yu ⋅ WU Sitong ⋅ Eric Lo ⋅ Shu Liu ⋅ Jiaya Jia
|
Exhibit Hall I #343 | |
|
AIM: Adaptive Inference of Multi-Modal LLMs via Token Merging and Pruning
Poster Session 5 & Exhibit Hall
Yiwu Zhong ⋅ Zhuoming Liu ⋅ Yin Li ⋅ Liwei Wang
|
Exhibit Hall I #36 | |
|
EDFFDNet: Towards Accurate and Efficient Unsupervised Multi-Grid Image Registration
Poster Session 2 & Exhibit Hall with Coffee Break
Haokai Zhu ⋅ Bo Qu ⋅ Si-Yuan Cao ⋅ Runmin Zhang ⋅ Shujie Chen ⋅ Bailin Yang ⋅ Hui-liang Shen
|
Exhibit Hall I #8 | |
|
Enhancing Mamba Decoder with Bidirectional Interaction in Multi-Task Dense Prediction
Poster Session 4 & Exhibit Hall with Coffee Break
Mang Cao ⋅ Sanping Zhou ⋅ Yizhe Li ⋅ Ye Deng ⋅ Wenli Huang ⋅ Le Wang
|
Exhibit Hall I #375 | |
|
Leveraging Debiased Cross-modal Attention Maps and Code-based Reasoning for Zero-shot Referring Expression Comprehension
Poster Session 5 & Exhibit Hall
Juntao Chen ⋅ Wen Shen ⋅ Zhihua Wei ⋅ Lijun Sun ⋅ Hongyun Zhang
|
Exhibit Hall I #57 | |
|
UST-SSM: Unified Spatio-Temporal State Space Models for Point Cloud Video Modeling
Poster Session 2 & Exhibit Hall with Coffee Break
Peiming Li ⋅ Ziyi Wang ⋅ Yulin Yuan ⋅ Hong Liu ⋅ Xiangming Meng ⋅ Junsong Yuan ⋅ Mengyuan Liu
|
Exhibit Hall I #162 | |
|
Improving Multimodal Learning via Imbalanced Learning
Poster Session 1 & Exhibit Hall
Shicai Wei ⋅ Chunbo Luo ⋅ Yang Luo
|
Exhibit Hall I #204 | |
|
SITE: towards Spatial Intelligence Thorough Evaluation
Poster Session 2 & Exhibit Hall with Coffee Break
Wenqi Wang ⋅ Reuben Tan ⋅ Pengyue Zhu ⋅ Jianwei Yang ⋅ Zhengyuan Yang ⋅ Lijuan Wang ⋅ Andrey Kolobov ⋅ Jianfeng Gao ⋅ Boqing Gong
|
Exhibit Hall I #379 | |
|
SHIFT: Smoothing Hallucinations by Information Flow Tuning for Multimodal Large Language Models
Poster Session 1 & Exhibit Hall
Sudong Wang ⋅ Yunjian Zhang ⋅ Yao Zhu ⋅ Enci Liu ⋅ Jianing Li ⋅ Yanwei Liu ⋅ Xiangyang Ji
|
Exhibit Hall I #338 | |
|
Stable Score Distillation
Poster Session 4 & Exhibit Hall with Coffee Break
Haiming Zhu ⋅ Yangyang Xu ⋅ Chenshu Xu ⋅ Tingrui Shen ⋅ Wenxi Liu ⋅ Yong Du ⋅ Jun Yu ⋅ Shengfeng He
|
Exhibit Hall I #164 | |
|
Synergistic Prompting for Robust Visual Recognition with Missing Modalities
Poster Session 1 & Exhibit Hall
Zhihui Zhang ⋅ Luanyuan Dai ⋅ Qika Lin ⋅ Yunfeng Diao ⋅ Guangyin Jin ⋅ Yufei Guo ⋅ Jing Zhang ⋅ Xiaoshuai Hao
|
Exhibit Hall I #170 | |
|
Hierarchical Visual Prompt Learning for Continual Video Instance Segmentation
Poster Session 3 & Exhibit Hall
Jiahua Dong ⋅ Hui Yin ⋅ Wenqi Liang ⋅ Hanbin Zhao ⋅ Henghui Ding ⋅ Nicu Sebe ⋅ Salman Khan ⋅ Fahad Khan
|
Exhibit Hall I #172 | |
|
Automated Red Teaming for Text-to-Image Models through Feedback-Guided Prompt Iteration with Vision-Language Models
Poster Session 4 & Exhibit Hall with Coffee Break
Wei Xu ⋅ Kangjie Chen ⋅ Jiawei Qiu ⋅ Yuyang zhang ⋅ Run Wang ⋅ Jin Mao ⋅ Tianwei Zhang ⋅ Lina Wang
|
Exhibit Hall I #353 | |
|
RAGD: Regional-Aware Diffusion Model for Text-to-Image Generation
Poster Session 4 & Exhibit Hall with Coffee Break
Chen Zhennan ⋅ Yajie Li ⋅ Haofan Wang ⋅ Zhibo Chen ⋅ Zhengkai Jiang ⋅ Jun Li ⋅ Qian Wang ⋅ Jian Yang ⋅ Ying Tai
|
Exhibit Hall I #426 | |
|
Enhancing Spatial Reasoning in Multimodal Large Language Models through Reasoning-based Segmentation
Poster Session 2 & Exhibit Hall with Coffee Break
Zhenhua Ning ⋅ Zhuotao Tian ⋅ Shaoshuai Shi ⋅ Daojing He ⋅ Guangming Lu ⋅ Wenjie Pei ⋅ Li Jiang
|
Exhibit Hall I #266 | |
|
Knowledge Distillation with Refined Logits
Poster Session 1 & Exhibit Hall
Wujie Sun ⋅ Defang Chen ⋅ Siwei Lyu ⋅ Genlang Chen ⋅ Chun Chen ⋅ Can Wang
|
Exhibit Hall I #96 | |
|
Dark-ISP: Enhancing RAW Image Processing for Low-Light Object Detection
Poster Session 2 & Exhibit Hall with Coffee Break
Jiasheng Guo ⋅ Xin Gao ⋅ Yuxiang Yan ⋅ Guanghao Li ⋅ Jian Pu
|
Exhibit Hall I #428 | |
|
BézierGS: Dynamic Urban Scene Reconstruction with Bézier Curve Gaussian Splatting
Poster Session 6 & Exhibit Hall with Coffee Break
Zipei Ma ⋅ Junzhe Jiang ⋅ Yurui Chen ⋅ Li Zhang
|
Exhibit Hall I #77 | |
|
Domain Generalizable Portrait Style Transfer
Poster Session 4 & Exhibit Hall with Coffee Break
Xinbo Wang ⋅ Wenju Xu ⋅ Qing Zhang ⋅ Wei-Shi Zheng
|
Exhibit Hall I #87 | |
|
PerLDiff: Controllable Street View Synthesis Using Perspective-Layout Diffusion Model
Poster Session 6 & Exhibit Hall with Coffee Break
Jinhua Zhang ⋅ Hualian Sheng ⋅ Sijia Cai ⋅ Bing Deng ⋅ Qiao Liang ⋅ Wen Li ⋅ Ying Fu ⋅ Jieping Ye ⋅ Shuhang Gu
|
Exhibit Hall I #154 | |
|
Diffusion Image Prior
Poster Session 6 & Exhibit Hall with Coffee Break
Hamadi Chihaoui ⋅ Paolo Favaro
|
Exhibit Hall I #288 | |
|
Text2VDM: Text to Vector Displacement Maps for Expressive and Interactive 3D Sculpting
Poster Session 4 & Exhibit Hall with Coffee Break
Hengyu Meng ⋅ Duotun Wang ⋅ Zhijing Shao ⋅ Ligang Liu ⋅ Zeyu Wang
|
Exhibit Hall I #191 | |
|
HERO: Human Reaction Generation from Videos
Poster Session 3 & Exhibit Hall
Chengjun Yu ⋅ Wei Zhai ⋅ Yuhang Yang ⋅ Yang Cao ⋅ Zheng-Jun Zha
|
Exhibit Hall I #24 | |
|
Towards Comprehensive Lecture Slides Understanding: Large-scale Dataset and Effective Method
Poster Session 1 & Exhibit Hall
Enming Zhang ⋅ Yuzhe Li ⋅ Yuliang Liu ⋅ Yingying Zhu ⋅ Xiang Bai
|
Exhibit Hall I #418 | |
|
A Unified Interpretation of Training-Time Out-of-Distribution Detection
Xu Cheng ⋅ Xin Jiang ⋅ Zechao Li
|
Exhibit Hall I #194 | |
|
VQ-SGen: A Vector Quantized Stroke Representation for Creative Sketch Generation
Poster Session 4 & Exhibit Hall with Coffee Break
Jiawei Wang ⋅ Zhiming Cui ⋅ Changjian Li
|
Exhibit Hall I #424 | |
|
G2PDiffusion: Cross-species Genotype-to-Phenotype Prediction via Evolutionary Diffusion
Poster Session 5 & Exhibit Hall
Mengdi Liu ⋅ Zhangyang Gao ⋅ Hong Chang ⋅ Stan Li ⋅ Shiguang Shan ⋅ Xilin Chen
|
Exhibit Hall I #86 | |
|
Mamba-3VL: Taming State Space Model for 3D Vision Language Learning
Poster Session 2 & Exhibit Hall with Coffee Break
Yuan Wang ⋅ Yuxin Chen ⋅ Zhongang Qi ⋅ Lijun Liu ⋅ Jile Jiao ⋅ Xuetao Feng ⋅ Yujia Liang ⋅ Ying Shan ⋅ Zhipeng Zhang
|
Exhibit Hall I #117 | |
|
Embodied Representation Alignment with Mirror Neurons
Poster Session 3 & Exhibit Hall
Wentao Zhu ⋅ Zhining Zhang ⋅ Yuwei Ren ⋅ Yin Huang ⋅ Hao Xu ⋅ Yizhou Wang
|
Exhibit Hall I #183 | |
|
Referring to Any Person
Poster Session 5 & Exhibit Hall
Qing Jiang ⋅ Lin Wu ⋅ Zhaoyang Zeng ⋅ Tianhe Ren ⋅ Yuda Xiong ⋅ Yihao Chen ⋅ Liu Qin ⋅ Lei Zhang
|
Exhibit Hall I #175 | |
|
Selective Contrastive Learning for Weakly Supervised Affordance Grounding
Poster Session 2 & Exhibit Hall with Coffee Break
WonJun Moon ⋅ Hyun Seok Seong ⋅ Jae-Pil Heo
|
Exhibit Hall I #18 | |
|
CoST: Efficient Collaborative Perception From Unified Spatiotemporal Perspective
Zongheng Tang ⋅ Yi Liu ⋅ Yifan Sun ⋅ Yulu Gao ⋅ Jinyu Chen ⋅ Runsheng Xu ⋅ Si Liu
|
Exhibit Hall I #97 | |
|
M2EIT: Multi-Domain Mixture of Experts for Robust Neural Inertial Tracking
Poster Session 6 & Exhibit Hall with Coffee Break
Yan Li ⋅ Yang Xu ⋅ Changhao Chen ⋅ Zhongchen Shi ⋅ Wei Chen ⋅ Liang Xie ⋅ Hongbo Chen ⋅ Erwei Yin
|
Exhibit Hall I #336 | |
|
MobileViCLIP: An Efficient Video-Text Model for Mobile Devices
Poster Session 5 & Exhibit Hall
Min Yang ⋅ Zihan Jia ⋅ Zhilin Dai ⋅ Sheng Guo ⋅ Limin Wang
|
Exhibit Hall I #97 | |
|
Task-Specific Zero-shot Quantization-Aware Training for Object Detection
Poster Session 5 & Exhibit Hall
Changhao Li ⋅ Xinrui Chen ⋅ Ji Wang ⋅ Kang Zhao ⋅ Jianfei Chen
|
Exhibit Hall I #288 | |
|
Bridging Domain Generalization to Multimodal Domain Generalization via Unified Representations
Poster Session 5 & Exhibit Hall
Hai Huang ⋅ Yan Xia ⋅ Sashuai Zhou ⋅ Hanting Wang ⋅ Shulei Wang ⋅ Zhou Zhao
|
Exhibit Hall I #253 | |
|
Addressing Representation Collapse in Vector Quantized Models with One Linear Layer
Poster Session 5 & Exhibit Hall
Yongxin Zhu ⋅ Bocheng Li ⋅ Yifei Xin ⋅ Zhihua Xia ⋅ Linli Xu
|
Exhibit Hall I #297 | |
|
DictAS: A Framework for Class-Generalizable Few-Shot Anomaly Segmentation via Dictionary Lookup
Poster Session 5 & Exhibit Hall
Zhen Qu ⋅ Xian Tao ⋅ Xinyi Gong ⋅ ShiChen Qu ⋅ Xiaopei Zhang ⋅ Xingang Wang ⋅ Fei Shen ⋅ Zhengtao Zhang ⋅ Mukesh Prasad ⋅ Guiguang Ding
|
Exhibit Hall I #67 | |
|
EVOLVE: Event-Guided Deformable Feature Transfer and Dual-Memory Refinement for Low-Light Video Object Segmentation
Poster Session 3 & Exhibit Hall
Jong Hyeon Baek ⋅ Jiwon oh ⋅ Yeong Jun Koh
|
Exhibit Hall I #119 | |
|
MATE: Motion-Augmented Temporal Consistency for Event-based Point Tracking
Poster Session 2 & Exhibit Hall with Coffee Break
Han Han ⋅ Wei Zhai ⋅ Yang Cao ⋅ Bin Li ⋅ Zheng-Jun Zha
|
Exhibit Hall I #313 | |
|
Asynchronous Event Error-Minimizing Noise for Safeguarding Event Dataset
Ruofei WANG ⋅ Peiqi Duan ⋅ Boxin Shi ⋅ Renjie Wan
|
Exhibit Hall I #13 | |
|
AG2aussian: Anchor-Graph Structured Gaussian Splatting for Instance-Level 3D Scene Understanding and Editing
Poster Session 6 & Exhibit Hall with Coffee Break
Zhaonan Wang ⋅ Manyi Li ⋅ Changhe Tu
|
Exhibit Hall I #201 | |
|
Vector Contrastive Learning For Pixel-Wise Pretraining In Medical Vision
Poster Session 5 & Exhibit Hall
Yuting He ⋅ Shuo Li
|
Exhibit Hall I #3 | |
|
InterGSEdit: Interactive 3D Gaussian Splatting Editing with 3D Geometry-Consistent Attention Prior
Poster Session 6 & Exhibit Hall with Coffee Break
Minghao Wen ⋅ Shengjie Wu ⋅ Kangkan Wang ⋅ Dong Liang
|
Exhibit Hall I #135 | |
|
CaO2: Rectifying Inconsistencies in Diffusion-Based Dataset Distillation
Poster Session 1 & Exhibit Hall
Haoxuan Wang ⋅ Zhenghao Zhao ⋅ Junyi Wu ⋅ Yuzhang Shang ⋅ Gaowen Liu ⋅ Yan Yan
|
Exhibit Hall I #443 | |
|
Differential-informed Sample Selection Accelerates Multimodal Contrastive Learning
Poster Session 1 & Exhibit Hall
Zihua Zhao ⋅ Feng Hong ⋅ Mengxi Chen ⋅ Pengyi Chen ⋅ Benyuan Liu ⋅ Jiangchao Yao ⋅ Ya Zhang ⋅ Yanfeng Wang
|
Exhibit Hall I #270 | |
|
InstaScene: Towards Complete 3D Instance Decomposition and Reconstruction from Cluttered Scenes
Poster Session 2 & Exhibit Hall with Coffee Break
Zesong Yang ⋅ Bangbang Yang ⋅ Wenqi Dong ⋅ Chenxuan Cao ⋅ Liyuan Cui ⋅ Yuewen Ma ⋅ Zhaopeng Cui ⋅ Hujun Bao
|
Exhibit Hall I #259 | |
|
Efficient Fine-Tuning of Large Models via Nested Low-Rank Adaptation
Poster Session 5 & Exhibit Hall
Lujun Li ⋅ Cheng Lin ⋅ Dezhi Li ⋅ You-Liang Huang ⋅ Wei Li ⋅ Tianyu Wu ⋅ Jie Zou ⋅ Wei Xue ⋅ Sirui Han ⋅ Yike Guo
|
Exhibit Hall I #231 | |
|
Dual-level Prototype Learning for Composite Degraded Image Restoration
Poster Session 3 & Exhibit Hall
Zhongze Wang ⋅ Haitao Zhao ⋅ Lujian Yao ⋅ Jingchao Peng ⋅ Kaijie Zhao
|
Exhibit Hall I #378 | |
|
Dynamic Reconstruction of Hand-Object Interaction with Distributed Force-aware Contact Representation
Poster Session 2 & Exhibit Hall with Coffee Break
Zhenjun Yu ⋅ Wenqiang Xu ⋅ Pengfei Xie ⋅ Yutong Li ⋅ Brian Anthony ⋅ Zhuorui Zhang ⋅ Cewu Lu
|
Exhibit Hall I #336 | |
|
Efficient Input-level Backdoor Defense on Text-to-Image Synthesis via Neuron Activation Variation
Shengfang ZHAI ⋅ Jiajun Li ⋅ Yue Liu ⋅ Huanran Chen ⋅ Zhihua Tian ⋅ Wenjie Qu ⋅ Qingni Shen ⋅ Ruoxi Jia ⋅ Yinpeng Dong ⋅ Jiaheng Zhang
|
Exhibit Hall I #28 | |
|
Decoupled Multi-Predictor Optimization for Inference-Efficient Model Tuning
Poster Session 1 & Exhibit Hall
Liwei Luo ⋅ Shuaitengyuan Li ⋅ Dongwei Ren ⋅ Qilong Wang ⋅ Pengfei Zhu ⋅ Qinghua Hu
|
Exhibit Hall I #337 | |
|
Detection, Pose Estimation and Segmentation for Multiple Bodies: Closing the Virtuous Circle
Poster Session 2 & Exhibit Hall with Coffee Break
Miroslav Purkrabek ⋅ Jiri Matas
|
Exhibit Hall I #374 | |
|
ACAM-KD: Adaptive and Cooperative Attention Masking for Knowledge Distillation
Poster Session 1 & Exhibit Hall
Qizhen Lan ⋅ Qing Tian
|
Exhibit Hall I #368 | |
|
GReg: Geometry-Aware Region Refinement for Sign Language Video Generation
Poster Session 4 & Exhibit Hall with Coffee Break
Tongkai Shi ⋅ Lianyu Hu ⋅ Fanhua Shang ⋅ Liqing Gao ⋅ Wei Feng
|
Exhibit Hall I #150 | |
|
Unsupervised Part Discovery via Descriptor-Based Masked Image Restoration with Optimized Constraints
Poster Session 2 & Exhibit Hall with Coffee Break
Jiahao Xia ⋅ Yike Wu ⋅ Wenjian Huang ⋅ Jianguo Zhang ⋅ Jian Zhang
|
Exhibit Hall I #343 | |
|
NETracer: A Topology-Aware Iterative Tracing Approach for Tubular Structure Extraction
Poster Session 5 & Exhibit Hall
Chao Liu ⋅ Yangbo Jiang ⋅ Nenggan Zheng
|
Exhibit Hall I #76 | |
|
Benchmarking Multimodal CoT Reward Model Stepwise by Visual Program
Poster Session 1 & Exhibit Hall
Minghe Gao ⋅ Xuqi Liu ⋅ Zhongqi Yue ⋅ Yang Wu ⋅ Shuang Chen ⋅ Juncheng Li ⋅ Siliang Tang ⋅ Fei Wu ⋅ Tat-Seng Chua ⋅ Yueting Zhuang
|
Exhibit Hall I #155 | |
|
MotionCtrl: A Real-time Controllable Vision-Language-Motion Model
Poster Session 3 & Exhibit Hall
Bin Cao ⋅ Sipeng Zheng ⋅ Ye Wang ⋅ Lujie Xia ⋅ Qianshan Wei ⋅ Qin Jin ⋅ Jing Liu ⋅ Zongqing Lu
|
Exhibit Hall I #211 | |
|
Visual Relation Diffusion for Human-Object Interaction Detection
Poster Session 5 & Exhibit Hall
Ping Cao ⋅ Yepeng Tang ⋅ Chunjie Zhang ⋅ Xiaolong Zheng ⋅ Chao Liang ⋅ Yunchao Wei ⋅ Yao Zhao
|
Exhibit Hall I #353 | |
|
Pinco: Position-induced Consistent Adapter for Diffusion Transformer in Foreground-conditioned Inpainting
Poster Session 4 & Exhibit Hall with Coffee Break
Guangben Lu ⋅ Yuzhen N/A ⋅ Zhimin Sun ⋅ Ran Yi ⋅ Yifan Qi ⋅ Yizhe Tang ⋅ Tianyi Wang ⋅ Lizhuang Ma ⋅ FangYuan Zou
|
Exhibit Hall I #35 | |
|
VLR-Driver: Large Vision-Language-Reasoning Models for Embodied Autonomous Driving
Poster Session 6 & Exhibit Hall with Coffee Break
Fanjie Kong ⋅ Yitong Li ⋅ Weihuang Chen ⋅ Chen Min ⋅ Yizhe Li ⋅ Zhiqiang Gao ⋅ Haoyang Li ⋅ Zhongyu Guo ⋅ Hongbin Sun
|
Exhibit Hall I #216 | |
|
Vid-Group: Temporal Video Grounding Pretraining from Unlabeled Videos in the Wild
Poster Session 5 & Exhibit Hall
Peijun Bao ⋅ Chenqi Kong ⋅ SIYUAN YANG ⋅ Zihao Shao ⋅ Xinghao Jiang ⋅ Boon Ng ⋅ Meng Er ⋅ Alex Kot
|
Exhibit Hall I #69 | |
|
AcZeroTS: Active Learning for Zero-shot Tissue Segmentation in Pathology Images
Poster Session 5 & Exhibit Hall
Jiao Tang ⋅ Junjie Zhou ⋅ Bo Qian ⋅ Peng Wan ⋅ Yingli Zuo ⋅ WEI SHAO ⋅ Daoqiang Zhang
|
Exhibit Hall I #349 | |
|
OneGT: One-Shot Geometry-Texture Neural Rendering for Head Avatars
Poster Session 3 & Exhibit Hall
Jinshu Chen ⋅ Bingchuan Li ⋅ Fan Zhang ⋅ Songtao Zhao ⋅ Qian HE
|
Exhibit Hall I #121 | |
|
METEOR: Multi-Encoder Collaborative Token Pruning for Efficient Vision Language Models
Poster Session 5 & Exhibit Hall
Yuchen Liu ⋅ Yaoming Wang ⋅ Bowen Shi ⋅ XIAOPENG ZHANG ⋅ Wenrui Dai ⋅ Chenglin Li ⋅ Hongkai Xiong ⋅ Qi Tian
|
Exhibit Hall I #159 | |
|
Unsupervised Visible-Infrared Person Re-identification under Unpaired Settings
Poster Session 3 & Exhibit Hall
Haoyu Yao ⋅ Bin Yang ⋅ Wenke Huang ⋅ Mang Ye ⋅ Bo Du
|
Exhibit Hall I #180 | |
|
Adaptive Prompt Learning via Gaussian Outlier Synthesis for Out-of-distribution Detection
Poster Session 1 & Exhibit Hall
Yongkang Zhang ⋅ Dongyu She ⋅ Zhong Zhou
|
Exhibit Hall I #299 | |
|
Bring Your Rear Cameras for Egocentric 3D Human Pose Estimation
Poster Session 2 & Exhibit Hall with Coffee Break
HIroyasu Akada ⋅ Jian Wang ⋅ Vladislav Golyanik ⋅ Christian Theobalt
|
Exhibit Hall I #420 | |
|
AMDANet: Attention-Driven Multi-Perspective Discrepancy Alignment for RGB-Infrared Image Fusion and Segmentation
Poster Session 3 & Exhibit Hall
Haifeng Zhong ⋅ Fan Tang ⋅ Zhuo Chen ⋅ Hyung Jin Chang ⋅ Yixing Gao
|
Exhibit Hall I #59 | |
|
Lay2Story: Extending Diffusion Transformers for Layout-Togglable Story Generation
Poster Session 4 & Exhibit Hall with Coffee Break
Ao Ma ⋅ Jiasong Feng ⋅ Ke Cao ⋅ Jing Wang ⋅ Yun Wang ⋅ Quanwei Zhang ⋅ Zhanjie Zhang
|
Exhibit Hall I #115 | |
|
OCK: Unsupervised Dynamic Video Prediction with Object-Centric Kinematics
Poster Session 3 & Exhibit Hall
YeonJi Song ⋅ Jaein Kim ⋅ Suhyung Choi ⋅ Jin-Hwa Kim ⋅ Byoung-Tak Zhang
|
Exhibit Hall I #127 | |
|
Unraveling the Smoothness Properties of Diffusion Models: A Gaussian Mixture Perspective
Poster Session 3 & Exhibit Hall
Yingyu Liang ⋅ Zhizhou Sha ⋅ Zhenmei Shi ⋅ Zhao Song ⋅ Mingda Wan ⋅ Yufa Zhou
|
Exhibit Hall I #134 | |
|
S$^3$E: Self-Supervised State Estimation for Radar-Inertial System
Poster Session 6 & Exhibit Hall with Coffee Break
Shengpeng Wang ⋅ Yulong Xie ⋅ Qing Liao ⋅ Wei Wang
|
Exhibit Hall I #190 | |
|
Prompt Guidance and Human Proximal Perception for HOT Prediction with Regional Joint Loss
Poster Session 5 & Exhibit Hall
Yuxiao Wang ⋅ Yu Lei ⋅ Zhenao WEI ⋅ WeiYing Xue ⋅ Xinyu Jiang ⋅ Nan Zhuang ⋅ Qi Liu
|
Exhibit Hall I #361 | |
|
Scalable Image Tokenization with Index Backpropagation Quantization
Poster Session 4 & Exhibit Hall with Coffee Break
Fengyuan Shi ⋅ Zhuoyan Luo ⋅ Yixiao Ge ⋅ Yujiu Yang ⋅ Ying Shan ⋅ Limin Wang
|
Exhibit Hall I #109 | |
|
BVINet: Unlocking Blind Video Inpainting with Zero Annotations
Poster Session 3 & Exhibit Hall
zhiliang wu ⋅ Kerui Chen ⋅ Kun Li ⋅ Hehe Fan ⋅ Yi Yang
|
Exhibit Hall I #379 | |
|
Coupling the Generator with Teacher for Effective Data-Free Knowledge Distillation
Poster Session 1 & Exhibit Hall
Xu Chen ⋅ Yang Li ⋅ Yahong Han ⋅ Guangquan Xu ⋅ Jialie Shen
|
Exhibit Hall I #195 | |
|
Video Color Grading via Look-Up Table Generation
Poster Session 4 & Exhibit Hall with Coffee Break
Seunghyun Shin ⋅ Dongmin Shin ⋅ Jisu Shin ⋅ Hae-Gon Jeon ⋅ Joon-Young Lee
|
Exhibit Hall I #408 | |
|
Structure-Guided Diffusion Models for High-Fidelity Portrait Shadow Removal
Poster Session 3 & Exhibit Hall
wanchang Yu ⋅ Qing Zhang ⋅ Rongjia Zheng ⋅ Wei-Shi Zheng
|
Exhibit Hall I #158 | |
|
FreeDNA: Endowing Domain Adaptation of Diffusion-Based Dense Prediction with Training-Free Domain Noise Alignment
Poster Session 1 & Exhibit Hall
Hang Xu ⋅ Jie Huang ⋅ Linjiang Huang ⋅ Dong Li ⋅ Yidi Liu ⋅ Feng Zhao
|
Exhibit Hall I #304 | |
|
ProbMED: A Probabilistic Framework for Medical Multimodal Binding
Poster Session 5 & Exhibit Hall
Yuan Gao ⋅ Sangwook Kim ⋅ Jianzhong You ⋅ Chris Mcintosh
|
Exhibit Hall I #34 | |
|
You Are Your Own Best Teacher: Achieving Centralized-level Performance in Federated Learning under Heterogeneous and Long-tailed Data
Poster Session 1 & Exhibit Hall
Shanshan Yan ⋅ Zexi Li ⋅ Chao Wu ⋅ Meng Pang ⋅ Yang Lu ⋅ Yan Yan ⋅ Hanzi Wang
|
Exhibit Hall I #253 | |
|
A Tiny Change, A Giant Leap: Long-Tailed Class-Incremental Learning via Geometric Prototype Alignment
Poster Session 1 & Exhibit Hall
xinyi lai ⋅ Luojun Lin ⋅ Weijie Chen ⋅ yuanlong yu
|
Exhibit Hall I #127 | |
|
CountSE: Soft Exemplar Open-set Object Counting
Shuai Liu ⋅ Peng Zhang ⋅ Shiwei Zhang ⋅ Wei Ke
|
Exhibit Hall I #163 | |
|
Sparfels: Fast Reconstruction from Sparse Unposed Imagery
Shubhendu Jena ⋅ Amine Ouasfi ⋅ Mae Younes ⋅ Adnane Boukhayma
|
Exhibit Hall I #266 | |
|
GaussianFlowOcc: Sparse and Weakly Supervised Occupancy Estimation using Gaussian Splatting and Temporal Flow
Poster Session 6 & Exhibit Hall with Coffee Break
Simon Boeder ⋅ Fabian Gigengack ⋅ Benjamin Risse
|
Exhibit Hall I #21 | |
|
Learning 4D Embodied World Models
Poster Session 2 & Exhibit Hall with Coffee Break
Haoyu Zhen ⋅ Qiao Sun ⋅ Hongxin Zhang ⋅ Junyan Li ⋅ Siyuan Zhou ⋅ Yilun Du ⋅ Chuang Gan
|
Exhibit Hall I #30 | |
|
MuGS: Multi-Baseline Generalizable Gaussian Splatting Reconstruction
Poster Session 6 & Exhibit Hall with Coffee Break
Yaopeng Lou ⋅ Liao Shen ⋅ Tianqi Liu ⋅ Jiaqi Li ⋅ Zihao Huang ⋅ Huiqiang Sun ⋅ Zhiguo Cao
|
Exhibit Hall I #83 | |
|
Region-Level Data Attribution for Text-to-Image Generative Models
Poster Session 4 & Exhibit Hall with Coffee Break
Trong Bang Nguyen ⋅ Phi Le Nguyen ⋅ Simon Lucey ⋅ Minh Hoai
|
Exhibit Hall I #376 | |
|
Trans-Adapter: A Plug-and-Play Framework for Transparent Image Inpainting
Poster Session 4 & Exhibit Hall with Coffee Break
Yuekun Dai ⋅ Haitian Li ⋅ Shangchen Zhou ⋅ Chen Change Loy
|
Exhibit Hall I #12 | |
|
Identity-aware Language Gaussian Splatting for Open-vocabulary 3D Semantic Segmentation
Poster Session 5 & Exhibit Hall
SungMin Jang ⋅ Wonjun Kim
|
Exhibit Hall I #62 | |
|
MolParser: End-to-end Visual Recognition of Molecule Structures in the Wild
Poster Session 5 & Exhibit Hall
Xi Fang ⋅ Jiankun Wang ⋅ Xiaochen Cai ⋅ Shang Chien ⋅ Shuwen Yang ⋅ Haoyi Tao ⋅ Nan wang ⋅ Lin Yao ⋅ Linfeng Zhang ⋅ Guolin Ke
|
Exhibit Hall I #443 | |
|
Fine-structure Preserved Real-world Image Super-resolution via Transfer VAE Training
Qiaosi Yi ⋅ Shuai Li ⋅ Rongyuan Wu ⋅ Lingchen Sun ⋅ Yuhui WU ⋅ Lei Zhang
|
Exhibit Hall I #228 | |
|
Ask and Remember: A Questions-Only Replay Strategy for Continual Visual Question Answering
Poster Session 4 & Exhibit Hall with Coffee Break
Imad Eddine MAROUF ⋅ Enzo Tartaglione ⋅ Stéphane Lathuilière ⋅ Joost van de Weijer
|
Exhibit Hall I #307 | |
|
Benefit From Seen: Enhancing Open-Vocabulary Object Detection by Bridging Visual and Textual Co-Occurrence Knowledge
Poster Session 5 & Exhibit Hall
Yanqi Li ⋅ Jianwei Niu ⋅ Tao Ren
|
Exhibit Hall I #216 | |
|
Towards Omnimodal Expressions and Reasoning in Referring Audio-Visual Segmentation
Poster Session 5 & Exhibit Hall
Kaining Ying ⋅ Henghui Ding ⋅ Guangquan Jie ⋅ Yu-Gang Jiang
|
Exhibit Hall I #261 | |
|
ImageGen-CoT: Enhancing Text-to-Image In-context Learning with Chain-of-Thought Reasoning
Poster Session 4 & Exhibit Hall with Coffee Break
Jiaqi Liao ⋅ Zhengyuan Yang ⋅ Linjie Li ⋅ Dianqi Li ⋅ Kevin Lin ⋅ Yu Cheng ⋅ Lijuan Wang
|
Exhibit Hall I #222 | |
|
CityGS-X: A Scalable Architecture for Efficient and Geometrically Accurate Large-Scale Scene Reconstruction
Poster Session 6 & Exhibit Hall with Coffee Break
Yuanyuan Gao ⋅ Hao Li ⋅ Jiaqi Chen ⋅ Zhihang Zhong ⋅ Zhengyu Zou ⋅ Dingwen Zhang ⋅ Xiao Sun ⋅ Junwei Han
|
Exhibit Hall I #239 | |
|
AIRA: Activation-Informed Low-Rank Adaptation for Large Models
Poster Session 1 & Exhibit Hall
Lujun Li ⋅ Dezhi Li ⋅ Cheng Lin ⋅ Wei Li ⋅ Wei Xue ⋅ Sirui Han ⋅ Yike Guo
|
Exhibit Hall I #156 | |
|
Robust Unfolding Network for HDR Imaging with Modulo Cameras
Poster Session 6 & Exhibit Hall with Coffee Break
Zhile Chen ⋅ Hui Ji
|
Exhibit Hall I #47 | |
|
Embodied Navigation with Auxiliary Task of Action Description Prediction
Poster Session 2 & Exhibit Hall with Coffee Break
Haru Kondoh ⋅ Asako Kanezaki
|
Exhibit Hall I #188 | |
|
IAP: Invisible Adversarial Patch Attack through Perceptibility-Aware Localization and Perturbation Optimization
Poster Session 3 & Exhibit Hall
Subrat Kishore Dutta ⋅ Xiao Zhang
|
Exhibit Hall I #448 | |
|
SweetTok: Semantic-Aware Spatial-Temporal Tokenizer for Compact Video Discretization
Poster Session 5 & Exhibit Hall
Zhentao Tan ⋅ Ben Xue ⋅ Jian Jia ⋅ Junhao Wang ⋅ Wencai Ye ⋅ Shaoyun Shi ⋅ Sun Mingjie ⋅ Wenjin Wu ⋅ Quan Chen ⋅ Peng Jiang
|
Exhibit Hall I #352 | |
|
Beyond Simple Edits: Composed Video Retrieval with Dense Modifications
Poster Session 5 & Exhibit Hall
Omkar Thawakar ⋅ Dmitry Demidov ⋅ Ritesh Thawkar ⋅ Rao Anwer ⋅ Mubarak Shah ⋅ Fahad Khan ⋅ Salman Khan
|
Exhibit Hall I #59 | |
|
GRAB: A Challenging GRaph Analysis Benchmark for Large Multimodal Models
Poster Session 1 & Exhibit Hall
Jonathan Roberts ⋅ Kai Han ⋅ Samuel Albanie
|
Exhibit Hall I #146 | |
|
Att-Adapter: A Robust and Precise Domain-Specific Multi-Attributes T2I Diffusion Adapter via Conditional Variational Autoencoder
Wonwoong Cho ⋅ Yan-Ying Chen ⋅ Matthew Klenk ⋅ David I. Inouye ⋅ Yanxia Zhang
|
Exhibit Hall I #69 | |
|
REDUCIO! Generating 1K Video within 16 Seconds using Extremely Compressed Motion Latents
Poster Session 4 & Exhibit Hall with Coffee Break
Rui Tian ⋅ Qi Dai ⋅ Jianmin Bao ⋅ Kai Qiu ⋅ Yifan Yang ⋅ Chong Luo ⋅ Zuxuan Wu ⋅ Yu-Gang Jiang
|
Exhibit Hall I #417 | |
|
DAP-MAE: Domain-Adaptive Point Cloud Masked Autoencoder for Effective Cross-Domain Learning
Ziqi Gao ⋅ Qiufu Li ⋅ Linlin Shen
|
Exhibit Hall I #324 | |
|
AllGCD: Leveraging All Unlabeled Data for Generalized Category Discovery
Poster Session 1 & Exhibit Hall
Xinzi Cao ⋅ Ke Chen ⋅ Feidiao Yang ⋅ Xiawu Zheng ⋅ Yutong Lu ⋅ Yonghong Tian
|
Exhibit Hall I #306 | |
|
Towards Long-Horizon Vision-Language-Action System: Reasoning, Acting and Memory
Poster Session 2 & Exhibit Hall with Coffee Break
Daixun Li ⋅ Yusi Zhang ⋅ Mingxiang Cao ⋅ donglai Liu ⋅ Weiying Xie ⋅ Tianlin Hui ⋅ Lunkai Lin ⋅ Zhiqiang Xie ⋅ Yunsong Li
|
Exhibit Hall I #171 | |
|
UniFuse: A Unified All-in-One Framework for Multi-Modal Medical Image Fusion Under Diverse Degradations and Misalignments
Poster Session 3 & Exhibit Hall
Dayong Su ⋅ Yafei Zhang ⋅ Huafeng Li ⋅ Jinxing Li ⋅ Yu Liu
|
Exhibit Hall I #399 | |
|
3DGS-LM: Faster Gaussian-Splatting Optimization with Levenberg-Marquardt
Poster Session 6 & Exhibit Hall with Coffee Break
Lukas Höllein ⋅ Aljaz Bozic ⋅ Michael Zollhöfer ⋅ Matthias Nießner
|
Exhibit Hall I #195 | |
|
OmniVTON: Training-Free Universal Virtual Try-On
Poster Session 4 & Exhibit Hall with Coffee Break
Zhaotong Yang ⋅ Yuhui Li ⋅ Shengfeng He ⋅ Xinzhe Li ⋅ Yangyang Xu ⋅ Junyu Dong ⋅ Yong Du
|
Exhibit Hall I #174 | |
|
FreeCus: Free Lunch Subject-driven Customization in Diffusion Transformers
Poster Session 4 & Exhibit Hall with Coffee Break
Yanbing Zhang ⋅ Zhe Wang ⋅ Qin Zhou ⋅ Mengping Yang
|
Exhibit Hall I #59 | |
|
GLEAM: Learning Generalizable Exploration Policy for Active Mapping in Complex 3D Indoor Scene
Poster Session 2 & Exhibit Hall with Coffee Break
Xiao Chen ⋅ Tai Wang ⋅ Quanyi Li ⋅ Tao Huang ⋅ Jiangmiao Pang ⋅ Tianfan Xue
|
Exhibit Hall I #50 | |
|
CopyrightShield: Enhancing Diffusion Model Security Against Copyright Infringement Attacks
Poster Session 4 & Exhibit Hall with Coffee Break
Zhixiang Guo ⋅ Siyuan Liang ⋅ Aishan Liu ⋅ Dacheng Tao
|
Exhibit Hall I #434 | |
|
CA2C: A Prior-Knowledge-Free Approach for Robust Label Noise Learning via Asymmetric Co-learning and Co-training
Poster Session 1 & Exhibit Hall
Mengmeng Sheng ⋅ Zeren Sun ⋅ Tianfei Zhou ⋅ Xiangbo Shu ⋅ Jinshan Pan ⋅ Yazhou Yao
|
Exhibit Hall I #77 | |
|
TCFG: Truncated Classifier-Free Guidance for Efficient and Scalable Text-to-Image Acceleration
Poster Session 4 & Exhibit Hall with Coffee Break
Xiaomeng Fu ⋅ Jia Li
|
Exhibit Hall I #351 | |
|
Point Cloud Self-supervised Learning via 3D to Multi-view Masked Learner
Poster Session 6 & Exhibit Hall with Coffee Break
Zhimin Chen ⋅ Xuewei Chen ⋅ Xiao Guo ⋅ Yingwei Li ⋅ Longlong Jing ⋅ Liang Yang ⋅ Bing Li
|
Exhibit Hall I #279 | |
|
FICGen: Frequency-Inspired Contextual Disentanglement for Layout-driven Degraded Image Generation
Poster Session 4 & Exhibit Hall with Coffee Break
Wenzhuang Wang ⋅ Yifan Zhao ⋅ Mingcan Ma ⋅ Ming Liu ⋅ Zhonglin Jiang ⋅ Yong Chen ⋅ Jia Li
|
Exhibit Hall I #404 | |
|
MSA2: Multi-task Framework with Structure-aware and Style-adaptive Character Representation for Open-set Chinese Text Recognition
Poster Session 5 & Exhibit Hall
Yangfu Li ⋅ Hongjian Zhan ⋅ Qi Liu ⋅ Li Sun ⋅ Yu-Jie Xiong ⋅ Yue Lu
|
Exhibit Hall I #311 | |
|
Local Dense Logit Relations for Enhanced Knowledge Distillation
Poster Session 1 & Exhibit Hall
Liuchi Xu ⋅ Kang Liu ⋅ Jinshuai Liu ⋅ Lu Wang ⋅ Lisheng XU ⋅ Jun Cheng
|
Exhibit Hall I #426 | |
|
FW-Merging: Scaling Model Merging with Frank-Wolfe Optimization
Poster Session 1 & Exhibit Hall
Hao Chen ⋅ Shell Xu Hu ⋅ Wayne Luk ⋅ Timothy Hospedales ⋅ Hongxiang Fan
|
Exhibit Hall I #315 | |
|
HIS-GPT: Towards 3D Human-In-Scene Multimodal Understanding
Poster Session 1 & Exhibit Hall
JIAHE ZHAO ⋅ RuiBing Hou ⋅ zejie tian ⋅ Hong Chang ⋅ Shiguang Shan
|
Exhibit Hall I #405 | |
|
VisRL: Intention-Driven Visual Perception via Reinforced Reasoning
Poster Session 1 & Exhibit Hall
Zhangquan Chen ⋅ Xufang Luo ⋅ Dongsheng Li
|
Exhibit Hall I #234 | |
|
Soft Local Completeness: Rethinking Completeness in XAI
Poster Session 5 & Exhibit Hall
Ziv Weiss Haddad ⋅ Oren Barkan ⋅ Yehonatan Elisha ⋅ Noam Koenigstein
|
Exhibit Hall I #377 | |
|
Controllable Feature Whitening for Hyperparameter-Free Bias Mitigation
Poster Session 1 & Exhibit Hall
Yooshin Cho ⋅ Hanbyel Cho ⋅ Janghyeon Lee ⋅ HyeongGwon Hong ⋅ Jaesung Ahn ⋅ Junmo Kim
|
Exhibit Hall I #427 | |
|
UMDATrack: Unified Multi-Domain Adaptive Tracking Under Adverse Weather Conditions
Poster Session 2 & Exhibit Hall with Coffee Break
Siyuan Yao ⋅ Rui Zhu ⋅ Ziqi Wang ⋅ Wenqi Ren ⋅ Yanyang Yan ⋅ Xiaochun Cao
|
Exhibit Hall I #135 | |
|
KV-Edit: Training-Free Image Editing for Precise Background Preservation
Poster Session 4 & Exhibit Hall with Coffee Break
Tianrui Zhu ⋅ Shiyi Zhang ⋅ Jiawei Shao ⋅ Yansong Tang
|
Exhibit Hall I #165 | |
|
FusionPhys: A Flexible Framework for Fusing Complementary Sensing Modalities in Remote Physiological Measurement
Poster Session 2 & Exhibit Hall with Coffee Break
Chenhang Ying ⋅ Huiyu Yang ⋅ Jieyi Ge ⋅ Zhaodong Sun ⋅ Xu Cheng ⋅ Kui Ren ⋅ Xiaobai Li
|
Exhibit Hall I #408 | |
|
You Think, You ACT: The New Task of Arbitrary Text to Motion Generation
Poster Session 3 & Exhibit Hall
Runqi Wang ⋅ Caoyuan Ma ⋅ Guopeng Li ⋅ Hanrui Xu ⋅ Yuke Li ⋅ Zheng Wang
|
Exhibit Hall I #189 | |
|
DiffVSR: Revealing an Effective Recipe for Taming Robust Video Super-Resolution Against Complex Degradations
Poster Session 4 & Exhibit Hall with Coffee Break
Xiaohui Li ⋅ Yihao Liu ⋅ Shuo Cao ⋅ Chen Ziyan ⋅ SHAOBIN ZHUANG ⋅ Xiangyu Chen ⋅ Yinan He ⋅ Yi Wang ⋅ Yu Qiao
|
Exhibit Hall I #40 | |
|
End-to-End Multi-Modal Diffusion Mamba
Poster Session 5 & Exhibit Hall
Chunhao Lu ⋅ Qiang Lu ⋅ Meichen Dong ⋅ Jake Luo
|
Exhibit Hall I #68 | |
|
PanoLlama: Generating Endless and Coherent Panoramas with Next-Token-Prediction LLMs
Teng Zhou ⋅ Xiaoyu Zhang ⋅ Yongchuan Tang
|
Exhibit Hall I #42 | |
|
Power of Cooperative Supervision: Multiple Teachers Framework for Advanced 3D Semi-Supervised Object Detection
Poster Session 2 & Exhibit Hall with Coffee Break
Jin-Hee Lee ⋅ Jae-keun Lee ⋅ Jeseok Kim ⋅ Kwon Soon
|
Exhibit Hall I #185 | |
|
Adapting In-Domain Few-Shot Segmentation to New Domains without Source Domain Retraining
Poster Session 5 & Exhibit Hall
Qi Fan ⋅ Kaiqi Liu ⋅ Nian Liu ⋅ Hisham Cholakkal ⋅ Rao Anwer ⋅ Wenbin Li ⋅ Yang Gao
|
Exhibit Hall I #151 | |
|
COVTrack: Continuous Open-Vocabulary Tracking via Adaptive Multi-Cue Fusion
Poster Session 3 & Exhibit Hall
Zekun Qian ⋅ Ruize Han ⋅ Zhixiang Wang ⋅ Junhui Hou ⋅ Wei Feng
|
Exhibit Hall I #5 | |
|
Dense Policy: Bidirectional Autoregressive Learning of Actions
Poster Session 3 & Exhibit Hall
Yue Su ⋅ Xinyu Zhan ⋅ Hongjie Fang ⋅ Han Xue ⋅ Hao-Shu Fang ⋅ Yong-Lu Li ⋅ Cewu Lu ⋅ Lixin Yang
|
Exhibit Hall I #422 | |
|
monoVLN: Bridging the Observation Gap between Monocular and Panoramic Vision and Language Navigation
Poster Session 2 & Exhibit Hall with Coffee Break
Ren-Jie Lu ⋅ Yu Zhou ⋅ hao cheng ⋅ Jingke Meng ⋅ Wei-Shi Zheng
|
Exhibit Hall I #418 | |
|
3D Mesh Editing using Masked LRMs
Poster Session 2 & Exhibit Hall with Coffee Break
William Gao ⋅ Dilin Wang ⋅ Yuchen Fan ⋅ Aljaz Bozic ⋅ Tuur Stuyck ⋅ Zhengqin Li ⋅ Zhao Dong ⋅ Rakesh Ranjan ⋅ Nikolaos Sarafianos
|
Exhibit Hall I #200 | |
|
DOGR: Towards Versatile Visual Document Grounding and Referring
Poster Session 1 & Exhibit Hall
Yinan Zhou ⋅ Yuxin Chen ⋅ Haokun Lin ⋅ Yichen Wu ⋅ Shuyu Yang ⋅ Zhongang Qi ⋅ Chen Ma ⋅ Li Zhu
|
Exhibit Hall I #334 | |
|
Supervised Exploratory Learning for Long-Tailed Visual Recognition
Poster Session 1 & Exhibit Hall
Zhongquan Jian ⋅ Yanhao Chen ⋅ Wangyancheng Wangyancheng ⋅ Junfeng Yao ⋅ Meihong Wang ⋅ Qingqiang Wu
|
Exhibit Hall I #169 | |
|
Membership Inference Attacks with False Discovery Rate Control
Poster Session 1 & Exhibit Hall
Chenxu Zhao ⋅ Wei Qian ⋅ Aobo Chen ⋅ Mengdi Huai
|
Exhibit Hall I #106 | |
|
ProbRes: Probabilistic Jump Diffusion for Open-World Egocentric Activity Recognition
Poster Session 3 & Exhibit Hall
Sanjoy Kundu ⋅ Shanmukha Vellamcheti ⋅ Sathyanarayanan Aakur
|
Exhibit Hall I #389 | |
|
MMAIF: Multi-task and Multi-degradation All-in-One for Image Fusion with Language Guidance
Poster Session 3 & Exhibit Hall
Zihan Cao ⋅ Yu Zhong ⋅ Ziqi Wang ⋅ Liang-Jian Deng
|
Exhibit Hall I #164 | |
|
Blind Video Super-Resolution based on Implicit Kernels
Poster Session 3 & Exhibit Hall
Qiang Zhu ⋅ Yuxuan Jiang ⋅ Shuyuan Zhu ⋅ Fan Zhang ⋅ David Bull ⋅ Bing Zeng
|
Exhibit Hall I #91 | |
|
TimeExpert: An Expert-Guided Video LLM for Video Temporal Grounding
Poster Session 5 & Exhibit Hall
Zuhao Yang ⋅ Yingchen Yu ⋅ Yunqing Zhao ⋅ Shijian Lu ⋅ Song Bai
|
Exhibit Hall I #420 | |
|
Kestrel: 3D Multimodal LLM for Part-Aware Grounded Description
Poster Session 2 & Exhibit Hall with Coffee Break
Mahmoud Ahmed ⋅ Junjie Fei ⋅ Jian Ding ⋅ Eslam Abdelrahman ⋅ Mohamed Elhoseiny
|
Exhibit Hall I #371 | |
|
DCHM: Depth-Consistent Human Modeling for Multiview Detection
Poster Session 2 & Exhibit Hall with Coffee Break
Jiahao Ma ⋅ Tianyu Wang ⋅ Miaomiao Liu ⋅ David Ahmedt Aristizabal ⋅ Chuong Nguyen
|
Exhibit Hall I #255 | |
|
Adversarial Robustness of Discriminative Self-Supervised Learning in Vision
Poster Session 1 & Exhibit Hall
Ömer Veysel Çağatan ⋅ Ömer TAL ⋅ M. Emre Gursoy
|
Exhibit Hall I #210 | |
|
HPSv3: Towards Wide-Spectrum Human Preference Score
Poster Session 4 & Exhibit Hall with Coffee Break
Yuhang Ma ⋅ Keqiang Sun ⋅ Xiaoshi Wu ⋅ Hongsheng Li
|
Exhibit Hall I #19 | |
|
Semantic Watermarking Reinvented: Enhancing Robustness and Generation Quality with Fourier Integrity
Poster Session 4 & Exhibit Hall with Coffee Break
Sung Ju Lee ⋅ Nam Ik Cho
|
Exhibit Hall I #370 | |
|
Dual-Process Image Generation
Poster Session 4 & Exhibit Hall with Coffee Break
Grace Luo ⋅ Jonathan Granskog ⋅ Aleksander Holynski ⋅ Trevor Darrell
|
Exhibit Hall I #295 | |
|
IntrinsicControlNet: Cross-distribution Image Generation with Real and Unreal
Poster Session 6 & Exhibit Hall with Coffee Break
Jiayuan Lu ⋅ Rengan Xie ⋅ Zixuan Xie ⋅ Zhizhen Wu ⋅ Dianbing Xi ⋅ Qi Ye ⋅ Rui Wang ⋅ Hujun Bao ⋅ Yuchi Huo
|
Exhibit Hall I #251 | |
|
Disentangling Instance and Scene Contexts for 3D Semantic Scene Completion
Poster Session 6 & Exhibit Hall with Coffee Break
Enyu Liu ⋅ En Yu ⋅ Sijia Chen ⋅ Wenbing Tao
|
Exhibit Hall I #221 | |
|
TrustMark: Robust Watermarking and Watermark Removal for Arbitrary Resolution Images
Poster Session 4 & Exhibit Hall with Coffee Break
Tu Bui ⋅ Shruti Agarwal ⋅ John Collomosse
|
Exhibit Hall I #358 | |
|
MeshMamba: State Space Models for Articulated 3D Mesh Generation and Reconstruction
Poster Session 2 & Exhibit Hall with Coffee Break
Yusuke Yoshiyasu ⋅ Leyuan Sun ⋅ Ryusuke Sagawa
|
Exhibit Hall I #144 | |
|
Domain-aware Category-level Geometry Learning Segmentation for 3D Point Clouds
Poster Session 6 & Exhibit Hall with Coffee Break
Pei He ⋅ Lingling Li ⋅ Licheng Jiao ⋅ Ronghua Shang ⋅ Fang Liu ⋅ Shuang Wang ⋅ Xu Liu ⋅ wenping ma
|
Exhibit Hall I #347 | |
|
Spatial-Temporal Aware Visuomotor Diffusion Policy Learning
Poster Session 2 & Exhibit Hall with Coffee Break
Zhenyang Liu ⋅ Yikai Wang ⋅ Kuanning Wang ⋅ Longfei Liang ⋅ Xiangyang Xue ⋅ Yanwei Fu
|
Exhibit Hall I #197 | |
|
GaussianReg: Rapid 2D/3D Registration for Emergency Surgery via Explicit 3D Modeling with Gaussian Primitives
Poster Session 5 & Exhibit Hall
Weihao Yu ⋅ Xiaoqing Guo ⋅ Xinyu Liu ⋅ Yifan Liu ⋅ Hao Zheng ⋅ Yawen Huang ⋅ Yixuan Yuan
|
Exhibit Hall I #158 | |
|
Learning Robust Image Watermarking with Lossless Cover Recovery
Poster Session 4 & Exhibit Hall with Coffee Break
jiale chen ⋅ Wei Wang ⋅ Chongyang Shi ⋅ Li Dong ⋅ Xiping Hu
|
Exhibit Hall I #16 | |
|
ArgoTweak: Towards Self-Updating HD Maps through Structured Priors
Poster Session 2 & Exhibit Hall with Coffee Break
Lena Wild ⋅ Rafael Valencia ⋅ Patric Jensfelt
|
Exhibit Hall I #100 | |
|
Event-aided Dense and Continuous Point Tracking: Everywhere and Anytime
Poster Session 2 & Exhibit Hall with Coffee Break
Zhexiong Wan ⋅ Jianqin Luo ⋅ Yuchao Dai ⋅ Gim Hee Lee
|
Exhibit Hall I #274 | |
|
Context-Aware Academic Emotion Dataset and Benchmark
Poster Session 3 & Exhibit Hall
Luming Zhao ⋅ Jingwen Xuan ⋅ Jiamin Lou ⋅ Yonghui Yu ⋅ Wenwu Yang
|
Exhibit Hall I #362 | |
|
FlowSeek: Optical Flow Made Easier with Depth Foundation Models and Motion Bases
Poster Session 2 & Exhibit Hall with Coffee Break
Matteo Poggi ⋅ Fabio Tosi
|
Exhibit Hall I #60 | |
|
TPG-INR: Target Prior-Guided Implicit 3D CT Reconstruction for Enhanced Sparse-view Imaging
QingleiCao QingleiCao ⋅ Ziyao Tang ⋅ Xiaoqin Tang
|
Exhibit Hall I #339 | |
|
NATRA: Noise-Agnostic Framework for Trajectory Prediction with Noisy Observations
Poster Session 6 & Exhibit Hall with Coffee Break
Rongqing Li ⋅ Changsheng Li ⋅ Ruilin Lv ⋅ Yuhang Li ⋅ Yang Gao ⋅ Xiaolu Zhang ⋅ JUN ZHOU
|
Exhibit Hall I #304 | |
|
MS3D: High-Quality 3D Generation via Multi-Scale Representation Modeling
Poster Session 6 & Exhibit Hall with Coffee Break
Guan Luo ⋅ Jianfeng Zhang
|
Exhibit Hall I #157 | |
|
General Compression Framework for Efficient Transformer Object Tracking
Poster Session 3 & Exhibit Hall
Lingyi Hong ⋅ Jinglun Li ⋅ Xinyu Zhou ⋅ Shilin Yan ⋅ Pinxue Guo ⋅ Kaixun Jiang ⋅ Zhaoyu Chen ⋅ Shuyong Gao ⋅ Runze Li ⋅ Xingdong Sheng ⋅ Wei Zhang ⋅ Hong Lu ⋅ Wenqiang Zhang
|
Exhibit Hall I #323 | |
|
UniDxMD: Towards Unified Representation for Cross-Modal Unsupervised Domain Adaptation in 3D Semantic Segmentation
Zhengyin Liang ⋅ Hui Yin ⋅ Min Liang ⋅ Qianqian Du ⋅ Ying Yang ⋅ Hua Huang
|
Exhibit Hall I #51 | |
|
FedXDS: Leveraging Model Attribution Methods to counteract Data Heterogeneity in Federated Learning
Poster Session 1 & Exhibit Hall
Maximilian Hoefler ⋅ Karsten Mueller ⋅ Wojciech Samek
|
Exhibit Hall I #429 | |
|
Visual Textualization for Image Prompted Object Detection
Poster Session 5 & Exhibit Hall
Yongjian Wu ⋅ Yang Zhou ⋅ Jiya Saiyin ⋅ Bingzheng Wei ⋅ Yan Xu
|
Exhibit Hall I #104 | |
|
TerraMind: Large-Scale Generative Multimodality for Earth Observation
Poster Session 2 & Exhibit Hall with Coffee Break
Johannes Jakubik ⋅ Felix Yang ⋅ Benedikt Blumenstiel ⋅ Erik Scheurer ⋅ Rocco Sedona ⋅ Stefano Maurogiovanni ⋅ Valerio Marsocci ⋅ Nikolaos Dionelis ⋅ Jente Bosmans ⋅ Niklas Kopp ⋅ Rahul Ramachandran ⋅ Paolo Fraccaro ⋅ Thomas Brunschwiler ⋅ Gabriele Cavallaro ⋅ Juan Moreno ⋅ Nicolas Longépé
|
Exhibit Hall I #221 | |
|
LLaVA-SP: Enhancing Visual Representation with Visual Spatial Tokens for MLLMs
Poster Session 5 & Exhibit Hall
Haoran Lou ⋅ Chunxiao Fan ⋅ Ziyan Liu ⋅ Yuexin Wu ⋅ Xinliang Wang
|
Exhibit Hall I #207 | |
|
Deep Space Weather Model: Long-Range Solar Flare Prediction from Multi-Wavelength Images
Poster Session 2 & Exhibit Hall with Coffee Break
Shunya Nagashima ⋅ Komei Sugiura
|
Exhibit Hall I #411 | |
|
ZIM: Zero-Shot Image Matting for Anything
Beomyoung Kim ⋅ Chanyong Shin ⋅ Joonhyun Jeong ⋅ Hyungsik Jung ⋅ Seyun Lee ⋅ Sewhan Chun ⋅ Dong-Hyun HWANG ⋅ Joonsang Yu
|
Exhibit Hall I #381 | |
|
Fusion Meets Diverse Conditions: A High-diversity Benchmark and Baseline for UAV-based Multimodal Object Detection with Condition Cues
Poster Session 6 & Exhibit Hall with Coffee Break
Chen Chen ⋅ Kangcheng Bin ⋅ Hu Ting ⋅ Jiahao Qi ⋅ Xingyue Liu ⋅ Tianpeng Liu ⋅ Zhen Liu ⋅ Yongxiang Liu ⋅ Ping Zhong
|
Exhibit Hall I #312 | |
|
EmbodiedOcc: Embodied 3D Occupancy Prediction for Vision-based Online Scene Understanding
Poster Session 6 & Exhibit Hall with Coffee Break
Yuqi Wu ⋅ Wenzhao Zheng ⋅ Sicheng Zuo ⋅ Yuanhui Huang ⋅ Jie Zhou ⋅ Jiwen Lu
|
Exhibit Hall I #159 | |
|
TextSSR: Diffusion-based Data Synthesis for Scene Text Recognition
Poster Session 4 & Exhibit Hall with Coffee Break
Xingsong Ye ⋅ Yongkun Du ⋅ Yunbo Tao ⋅ Zhineng Chen
|
Exhibit Hall I #247 | |
|
Omegance: A Single Parameter for Various Granularities in Diffusion-Based Synthesis
Poster Session 4 & Exhibit Hall with Coffee Break
Xinyu Hou ⋅ Zongsheng Yue ⋅ Xiaoming Li ⋅ Chen Change Loy
|
Exhibit Hall I #428 | |
|
DAViD: Data-efficient and Accurate Vision Models from Synthetic Data
Poster Session 2 & Exhibit Hall with Coffee Break
Fatemeh Saleh ⋅ Sadegh Aliakbarian ⋅ Charlie Hewitt ⋅ Lohit Petikam ⋅ Xiao Xian ⋅ Antonio Criminisi ⋅ Thomas J. Cashman ⋅ Tadas Baltrusaitis
|
Exhibit Hall I #31 | |
|
RareCLIP: Rarity-aware Online Zero-shot Industrial Anomaly Detection
Poster Session 5 & Exhibit Hall
Jianfang He ⋅ Min Cao ⋅ Silong Peng ⋅ Qiong Xie
|
Exhibit Hall I #438 | |
|
A Visual Leap in CLIP Compositionality Reasoning through Generation of Counterfactual Sets
Poster Session 5 & Exhibit Hall
Zexi Jia ⋅ Chuanwei Huang ⋅ Yeshuang Zhu ⋅ Hongyan Fei ⋅ Ying Deng ⋅ Zhiqiang Yuan ⋅ Jiapei Zhang ⋅ Jinchao Zhang ⋅ Jie Zhou
|
Exhibit Hall I #348 | |
|
MOSCATO: Predicting Multiple Object State Change Through Actions
Poster Session 3 & Exhibit Hall
Parnian Zameni ⋅ Yuhan Shen ⋅ Ehsan Elhamifar
|
Exhibit Hall I #151 | |
|
Skip-Vision: Efficient and Scalable Acceleration of Vision-Language Models via Adaptive Token Skipping
Poster Session 5 & Exhibit Hall
Weili Zeng ⋅ Ziyuan Huang ⋅ Kaixiang Ji ⋅ Yichao Yan
|
Exhibit Hall I #147 | |
|
Temporal Rate Reduction Clustering for Human Motion Segmentation
Poster Session 3 & Exhibit Hall
Xianghan Meng ⋅ Zhengyu Tong ⋅ Zhiyuan Huang ⋅ Chun-Guang Li
|
Exhibit Hall I #437 | |
|
HFD-Teacher: High-Frequency Depth Distillation from Depth Foundation Models for Enhanced Depth Completion
Poster Session 2 & Exhibit Hall with Coffee Break
Zhiyuan Yang ⋅ Anqi Cheng ⋅ Haiyue Zhu ⋅ Tianjiao Li ⋅ Pey Yuen Tao ⋅ Kezhi Mao
|
Exhibit Hall I #373 | |
|
LeanVAE: An Ultra-Efficient Reconstruction VAE for Video Diffusion Models
Poster Session 4 & Exhibit Hall with Coffee Break
Yu Cheng ⋅ Fajie Yuan
|
Exhibit Hall I #77 | |
|
DreamDance: Animating Human Images by Enriching 3D Geometry Cues from 2D Poses
Poster Session 3 & Exhibit Hall
Yatian Pang ⋅ Bin Zhu ⋅ Bin Lin ⋅ Mingzhe Zheng ⋅ Francis Tay ⋅ Ser-Nam Lim ⋅ Harry Yang ⋅ Li Yuan
|
Exhibit Hall I #381 | |
|
Separation for Better Integration: Disentangling Edge and Motion in Event-based Deblurring
Poster Session 3 & Exhibit Hall
Yufei Zhu ⋅ Hao Chen ⋅ Yongjian Deng ⋅ Wei You
|
Exhibit Hall I #445 | |
|
AnimeGamer: Infinite Anime Life Simulation with Next Game State Prediction
Poster Session 3 & Exhibit Hall
Junhao Cheng ⋅ Yuying Ge ⋅ Yixiao Ge ⋅ Jing Liao ⋅ Ying Shan
|
Exhibit Hall I #82 | |
|
OmniPaint: Mastering Object-Oriented Editing via Disentangled Insertion-Removal Inpainting
Poster Session 4 & Exhibit Hall with Coffee Break
Yongsheng Yu ⋅ Ziyun Zeng ⋅ Haitian Zheng ⋅ Jiebo Luo
|
Exhibit Hall I #235 | |
|
Diversity-Enhanced Distribution Alignment for Dataset Distillation
Poster Session 1 & Exhibit Hall
Hongcheng Li ⋅ Yucan Zhou ⋅ Xiaoyan Gu ⋅ Bo Li ⋅ Weiping Wang
|
Exhibit Hall I #348 | |
|
Height-Fidelity Dense Global Fusion for Multi-modal 3D Object Detection
Hanshi Wang ⋅ Jin Gao ⋅ Weiming Hu ⋅ Zhipeng Zhang
|
Exhibit Hall I #188 | |
|
SMSTracker: Tri-path Score Mask Sigma Fusion for Multi-Modal Tracking
Sixian Chan ⋅ Zedong Li ⋅ Xiaoqin Zhang ⋅ Wenhao Li ⋅ Shijian Lu ⋅ Chunhua Shen
|
Exhibit Hall I #447 | |
|
Two Losses, One Goal: Balancing Conflict Gradients for Semi-supervised Semantic Segmentation
Rui Sun ⋅ Huayu Mai ⋅ Wangkai Li ⋅ Yujia Chen ⋅ Yuan Wang
|
Exhibit Hall I #52 | |
|
Acknowledging Focus Ambiguity in Visual Questions
Poster Session 1 & Exhibit Hall
Chongyan Chen ⋅ Yu-Yun Tseng ⋅ Zhuoheng Li ⋅ Anush Venkatesh ⋅ Danna Gurari
|
Exhibit Hall I #107 | |
|
Guiding Noisy Label Conditional Diffusion Models with Score-based Discriminator Correction
Poster Session 4 & Exhibit Hall with Coffee Break
Dat Cong ⋅ Hieu Tran ⋅ Hoang Thanh-Tung
|
Exhibit Hall I #349 | |
|
Shape of Motion: 4D Reconstruction from a Single Video
Qianqian Wang ⋅ Vickie Ye ⋅ Hang Gao ⋅ Weijia Zeng ⋅ Jake Austin ⋅ Zhengqi Li ⋅ Angjoo Kanazawa
|
Exhibit Hall I #435 | |
|
VSSD: Vision Mamba with Non-Causal State Space Duality
Poster Session 3 & Exhibit Hall
Yuheng Shi ⋅ Mingjia Li ⋅ Minjing Dong ⋅ Chang Xu
|
Exhibit Hall I #77 | |
|
EditCLIP: Representation Learning for Image Editing
Poster Session 4 & Exhibit Hall with Coffee Break
Qian Wang ⋅ Aleksandar Cvejic ⋅ Abdelrahman Eldesokey ⋅ Peter Wonka
|
Exhibit Hall I #102 | |
|
CorrCLIP: Reconstructing Patch Correlations in CLIP for Open-Vocabulary Semantic Segmentation
Poster Session 6 & Exhibit Hall with Coffee Break
Dengke Zhang ⋅ Fagui Liu ⋅ Quan Tang
|
Exhibit Hall I #145 | |
|
mmCooper: A Multi-agent Multi-stage Communication-efficient and Collaboration-robust Cooperative Perception Framework
Poster Session 6 & Exhibit Hall with Coffee Break
Bingyi Liu ⋅ Jian Teng ⋅ Hongfei Xue ⋅ Enshu Wang ⋅ Chuanhui Zhu ⋅ Pu Wang ⋅ Libing Wu
|
Exhibit Hall I #354 | |
|
FreqPDE: Rethinking Positional Depth Embedding for Multi-View 3D Object Detection Transformers
Poster Session 6 & Exhibit Hall with Coffee Break
Junjie Zhang ⋅ Haisheng Su ⋅ Feixiang Song ⋅ Sanping Zhou ⋅ Wei Wu ⋅ Junchi Yan ⋅ Nanning Zheng
|
Exhibit Hall I #330 | |
|
GestureLSM: Latent Shortcut based Co-Speech Gesture Generation with Spatial-Temporal Modeling
Poster Session 3 & Exhibit Hall
Pinxin Liu ⋅ Luchuan Song ⋅ Junhua Huang ⋅ Haiyang Liu ⋅ Chenliang Xu
|
Exhibit Hall I #87 | |
|
OccluGaussian: Occlusion-Aware Gaussian Splatting for Large Scene Reconstruction and Rendering
Poster Session 6 & Exhibit Hall with Coffee Break
Shiyong Liu ⋅ Xiao Tang ⋅ Zhihao Li ⋅ Yingfan He ⋅ Chongjie Ye ⋅ Jianzhuang Liu ⋅ Binxiao Huang ⋅ Shunbo Zhou ⋅ Xiaofei Wu
|
Exhibit Hall I #186 | |
|
MagShield: Towards Better Robustness in Sparse Inertial Motion Capture Under Magnetic Disturbances
Poster Session 6 & Exhibit Hall with Coffee Break
Yunzhe Shao ⋅ Xinyu Yi ⋅ Lu Yin ⋅ Shihui Guo ⋅ Jun-Hai Yong ⋅ Feng Xu
|
Exhibit Hall I #414 | |
|
Towards Cross-modal Backward-compatible Representation Learning for Vision-Language Models
Poster Session 1 & Exhibit Hall
Young Kyun Jang ⋅ Ser-Nam Lim
|
Exhibit Hall I #161 | |
|
ILLUME: Illuminating Your LLMs to See, Draw, and Self-Enhance
Poster Session 5 & Exhibit Hall
Chunwei Wang ⋅ Guansong Lu ⋅ Junwei Yang ⋅ Runhui Huang ⋅ Jianhua Han ⋅ Lu Hou ⋅ Wei Zhang ⋅ Hang Xu
|
Exhibit Hall I #170 | |
|
DeFSS: Image-to-Mask Denoising Learning for Few-shot Segmentation
Poster Session 5 & Exhibit Hall
Zishu Qin ⋅ Junhao Xu ⋅ Weifeng Ge
|
Exhibit Hall I #227 | |
|
Unveiling the Invisible: Reasoning Complex Occlusions Amodally with AURA
Poster Session 5 & Exhibit Hall
Zhixuan Li ⋅ Hyunse Yoon ⋅ Sanghoon Lee ⋅ Weisi Lin
|
Exhibit Hall I #199 | |
|
VehicleMAE: View-asymmetry Mutual Learning for Vehicle Re-identification Pre-training via Masked AutoEncoders
Poster Session 1 & Exhibit Hall
Qi Wang ⋅ Zeyu Zhang ⋅ Dong Wang ⋅ Di Gai ⋅ Xin Xiong ⋅ Jiyang Xu ⋅ Ruihua Zhou
|
Exhibit Hall I #441 | |
|
SuperEdit: Rectifying and Facilitating Supervision for Instruction-Based Image Editing
Poster Session 4 & Exhibit Hall with Coffee Break
Ming Li ⋅ Xin Gu ⋅ Fan Chen ⋅ Xiaoying Xing ⋅ Longyin Wen ⋅ Chen Chen ⋅ Sijie Zhu
|
Exhibit Hall I #414 | |
|
MagicCity: Geometry-Aware 3D City Generation from Satellite Imagery with Multi-View Consistency
Poster Session 6 & Exhibit Hall with Coffee Break
Xingbo YAO ⋅ xuanmin Wang ⋅ Hao WU ⋅ Chengliang PING ⋅ ZHANG Doudou ⋅ Hui Xiong
|
Exhibit Hall I #57 | |
|
RARE: Refine Any Registration of Pairwise Point Clouds via Zero-Shot Learning
Poster Session 6 & Exhibit Hall with Coffee Break
Chengyu Zheng ⋅ Honghua Chen ⋅ Jin Huang ⋅ Mingqiang Wei
|
Exhibit Hall I #177 | |
|
Multi-scenario Overlapping Text Segmentation with Depth Awareness
Poster Session 4 & Exhibit Hall with Coffee Break
Yang Liu ⋅ Xudong Xie ⋅ Yuliang Liu ⋅ Xiang Bai
|
Exhibit Hall I #246 | |
|
Dataset Distillation via Vision-Language Category Prototype
YAWEN ZOU ⋅ Guang Li ⋅ Duo Su ⋅ Zi Wang ⋅ Jun YU ⋅ Chao Zhang
|
Exhibit Hall I #271 | |
|
ConceptSplit: Decoupled Multi-Concept Personalization of Diffusion Models via Token-wise Adaptation and Attention Disentanglement
Poster Session 4 & Exhibit Hall with Coffee Break
Habin Lim ⋅ Youngseob Won ⋅ Juwon Seo ⋅ Gyeong-Moon Park
|
Exhibit Hall I #339 | |
|
ULTHO: Ultra-Lightweight yet Efficient Hyperparameter Optimization in Deep Reinforcement Learning
Poster Session 1 & Exhibit Hall
Mingqi Yuan ⋅ Bo Li ⋅ Xin Jin ⋅ Wenjun Zeng
|
Exhibit Hall I #241 | |
|
Backdoor Defense via Enhanced Splitting and Trap Isolation
Poster Session 1 & Exhibit Hall
Hongrui Yu ⋅ Lu Qi ⋅ Wanyu Lin ⋅ Jian Chen ⋅ Hailong Sun ⋅ chengbin sun
|
Exhibit Hall I #152 | |
|
Learning Hierarchical Line Buffer for Image Processing
Poster Session 3 & Exhibit Hall
Jiacheng Li ⋅ Feiran Li ⋅ Daisuke Iso
|
Exhibit Hall I #106 | |
|
ATAS: Any-to-Any Self-Distillation for Enhanced Open-Vocabulary Dense Prediction
Poster Session 5 & Exhibit Hall
Soonwoo Cha ⋅ Jiwoo Song ⋅ Juan Yeo ⋅ Hyunbin Jin ⋅ Taesup Kim
|
Exhibit Hall I #55 | |
|
MUSE-VL: Modeling Unified VLM through Semantic Discrete Encoding
Poster Session 5 & Exhibit Hall
Rongchang Xie ⋅ Chen Du ⋅ Ping Song ⋅ Chang Liu
|
Exhibit Hall I #406 | |
|
A Plug-and-Play Physical Motion Restoration Approach for In-the-Wild High-Difficulty Motions
Youliang Zhang ⋅ Ronghui Li ⋅ Yachao Zhang ⋅ Liang Pan ⋅ Jingbo Wang ⋅ Yebin Liu ⋅ Xiu Li
|
Exhibit Hall I #310 | |
|
Humans as Checkerboards: Calibrating Camera Motion Scale for World-Coordinate Human Mesh Recovery
Poster Session 2 & Exhibit Hall with Coffee Break
Fengyuan Yang ⋅ Kerui Gu ⋅ Ha Linh Nguyen ⋅ Tze Ho Elden Tse ⋅ Angela Yao
|
Exhibit Hall I #98 | |
|
D3: Training-Free AI-Generated Video Detection Using Second-Order Features
Poster Session 3 & Exhibit Hall
Chende Zheng ⋅ Ruiqi suo ⋅ Chenhao Lin ⋅ Zhengyu Zhao ⋅ Le Yang ⋅ Shuai Liu ⋅ Minghui Yang ⋅ Cong Wang ⋅ Chao Shen
|
Exhibit Hall I #268 | |
|
χ: Symmetry Understanding of 3D Shapes via Chirality Disentanglement
Poster Session 6 & Exhibit Hall with Coffee Break
Weikang Wang ⋅ Tobias Weißberg ⋅ Nafie El Amrani ⋅ Florian Bernard
|
Exhibit Hall I #344 | |
|
Dirichlet-Constrained Variational Codebook Learning for Temporally Coherent Video Face Restoration
Baoyou Chen ⋅ Ce Liu ⋅ Weihao Yuan ⋅ Zilong Dong ⋅ Siyu Zhu
|
Exhibit Hall I #424 | |
|
VideoAuteur: Towards Long Narrative Video Generation
Poster Session 4 & Exhibit Hall with Coffee Break
Junfei Xiao ⋅ Feng Cheng ⋅ Lu Qi ⋅ Liangke Gui ⋅ Yang Zhao ⋅ Shanchuan Lin ⋅ Jiepeng Cen ⋅ Zhibei Ma ⋅ Alan Yuille ⋅ Lu Jiang
|
Exhibit Hall I #410 | |
|
StreamMind: Unlocking Full Frame Rate Streaming Video Dialogue through Event-Gated Cognition
Poster Session 3 & Exhibit Hall
Xin Ding ⋅ Hao Wu ⋅ Yifan Yang ⋅ Shiqi Jiang ⋅ Qianxi Zhang ⋅ Donglin Bai ⋅ Zhibo Chen ⋅ Ting Cao
|
Exhibit Hall I #325 | |
|
ViT-Split: Unleashing the Power of Vision Foundation Models via Efficient Splitting Heads
Poster Session 1 & Exhibit Hall
Yifan Li ⋅ Xin Li ⋅ Tianqin Li ⋅ Wenbin He ⋅ Yu Kong ⋅ Liu Ren
|
Exhibit Hall I #179 | |
|
Robust and Efficient 3D Gaussian Splatting for Urban Scene Reconstruction
Poster Session 6 & Exhibit Hall with Coffee Break
Zhensheng Yuan ⋅ Haozhi Huang ⋅ Zhen Xiong ⋅ Di Wang ⋅ Guanghua Yang
|
Exhibit Hall I #143 | |
|
Neural Architecture Search Driven by Locally Guided Diffusion for Personalized Federated Learning
Poster Session 1 & Exhibit Hall
PENG LIAO ⋅ Xilu Wang ⋅ Yaochu Jin ⋅ WenLi Du ⋅ Han Hu
|
Exhibit Hall I #395 | |
|
Hierarchical 3D Scene Graphs Construction Outdoors
Poster Session 6 & Exhibit Hall with Coffee Break
Jon Nyffeler ⋅ Federico Tombari ⋅ Daniel Barath
|
Exhibit Hall I #202 | |
|
Cycle-Consistent Learning for Joint Layout-to-Image Generation and Object Detection
Poster Session 2 & Exhibit Hall with Coffee Break
Xinhao Cai ⋅ Qiuxia Lai ⋅ Gensheng Pei ⋅ Xiangbo Shu ⋅ Yazhou Yao ⋅ Wenguan Wang
|
Exhibit Hall I #167 | |
|
From Objects to Events: Unlocking Complex Visual Understanding in Object Detectors via LLM-guided Symbolic Reasoning
Poster Session 5 & Exhibit Hall
Yuhui Zeng ⋅ Haoxiang Wu ⋅ Wenjie Nie ⋅ Xiawu Zheng ⋅ Guangyao Chen ⋅ Yunhang Shen ⋅ Jun Peng ⋅ Yonghong Tian ⋅ Rongrong Ji
|
Exhibit Hall I #429 | |
|
StyleSRN: Scene Text Image Super-Resolution with Text Style Embedding
Poster Session 4 & Exhibit Hall with Coffee Break
Shengrong Yuan ⋅ Runmin Wang ⋅ Ke Hao ⋅ Xu-Qi Ma ⋅ Changxin Gao ⋅ Li Liu ⋅ Nong Sang
|
Exhibit Hall I #364 | |
|
Frequency-Guided Diffusion for Training-Free Text-Driven Image Translation
Poster Session 4 & Exhibit Hall with Coffee Break
Zheng Gao ⋅ Jifei Song ⋅ Zhensong Zhang ⋅ Jiankang Deng ⋅ Ioannis Patras
|
Exhibit Hall I #413 | |
|
Preacher: Paper-to-Video Agentic System
Poster Session 4 & Exhibit Hall with Coffee Break
Jingwei Liu ⋅ Ling Yang ⋅ Hao Luo ⋅ Fan Wang ⋅ Hongyan Li ⋅ Mengdi Wang
|
Exhibit Hall I #214 | |
|
Where am I? Cross-View Geo-localization with Natural Language Descriptions
Poster Session 2 & Exhibit Hall with Coffee Break
Junyan Ye ⋅ Honglin Lin ⋅ Leyan Ou ⋅ Dairong Chen ⋅ Zihao Wang ⋅ Qi Zhu ⋅ Conghui He ⋅ Weijia Li
|
Exhibit Hall I #82 | |
|
Frequency-Semantic Enhanced Variational Autoencoder for Zero-Shot Skeleton-based Action Recognition
Poster Session 3 & Exhibit Hall
Wenhan Wu ⋅ Zhishuai Guo ⋅ Chen Chen ⋅ Hongfei Xue ⋅ Aidong Lu
|
Exhibit Hall I #105 | |
|
How Do Multimodal Large Language Models Handle Complex Multimodal Reasoning? Placing Them in An Extensible Escape Game
Poster Session 1 & Exhibit Hall
Ziyue Wang ⋅ Yurui Dong ⋅ Fuwen Luo ⋅ Minyuan Ruan ⋅ Zhili Cheng ⋅ Chi Chen ⋅ Peng Li ⋅ Yang Liu
|
Exhibit Hall I #451 | |
|
Towards Human-like Virtual Beings: Simulating Human Behavior in 3D Scenes
Poster Session 3 & Exhibit Hall
CHEN LIANG ⋅ Wenguan Wang ⋅ Yi Yang
|
Exhibit Hall I #69 | |
|
Cross-Category Subjectivity Generalization for Style-Adaptive Sketch Re-ID
Poster Session 5 & Exhibit Hall
Zechao Hu ⋅ Zhengwei Yang ⋅ Hao Li ⋅ Zheng Wang ⋅ Yixiong Zou
|
Exhibit Hall I #267 | |
|
S3R-GS: Streamlining the Pipeline for Large-Scale Street Scene Reconstruction
Poster Session 6 & Exhibit Hall with Coffee Break
Guangting Zheng ⋅ Jiajun Deng ⋅ Xiaomeng Chu ⋅ Yu Yuan ⋅ Houqiang Li ⋅ Yanyong Zhang
|
Exhibit Hall I #84 | |
|
The Source Image is the Best Attention for Infrared and Visible Image Fusion
Poster Session 3 & Exhibit Hall
Song Wang ⋅ Xie Han ⋅ Liqun Kuang ⋅ Boying Wang ⋅ Zhongyu Chen ⋅ Zherui Qiao ⋅ Fan Yang ⋅ Xiaoxia Liu ⋅ Bingyu Zhang ⋅ Zhixun Wang
|
Exhibit Hall I #331 | |
|
WSI-LLaVA: A Multimodal Large Language Model for Whole Slide Image
Poster Session 5 & Exhibit Hall
Yuci Liang ⋅ Xinheng Lyu ⋅ Meidan Ding ⋅ Wenting Chen ⋅ Xiaohan Xing ⋅ Jipeng Zhang ⋅ Sen Yang ⋅ Xiangjian He ⋅ Song Wu ⋅ Xiyue Wang ⋅ Linlin Shen
|
Exhibit Hall I #274 | |
|
Exploiting Diffusion Prior for Task-driven Image Restoration
Poster Session 3 & Exhibit Hall
Jaeha Kim ⋅ Junghun Oh ⋅ Kyoung Mu Lee
|
Exhibit Hall I #14 | |
|
Video2BEV: Transforming Drone Videos to BEVs for Video-based Geo-localization
Poster Session 6 & Exhibit Hall with Coffee Break
Hao Ju ⋅ Shaofei Huang ⋅ Si Liu ⋅ Zhedong Zheng
|
Exhibit Hall I #228 | |
|
CLIPer: Hierarchically Improving Spatial Representation of CLIP for Open-Vocabulary Semantic Segmentation
Poster Session 5 & Exhibit Hall
Lin Sun ⋅ Jiale Cao ⋅ Jin Xie ⋅ Xiaoheng Jiang ⋅ Yanwei Pang
|
Exhibit Hall I #321 | |
|
Adaptive Articulated Object Manipulation On The Fly with Foundation Model Reasoning and Part Grounding
Poster Session 3 & Exhibit Hall
Xiaojie Zhang ⋅ Yuanfei Wang ⋅ Ruihai Wu ⋅ Kunqi Xu ⋅ Yu Li ⋅ Liuyu Xiang ⋅ Hao Dong ⋅ Zhaofeng He
|
Exhibit Hall I #285 | |
|
Scaling Laws for Native Multimodal Models
Poster Session 1 & Exhibit Hall
Mustafa Shukor ⋅ Enrico Fini ⋅ Victor Guilherme Turrisi da Costa ⋅ Matthieu Cord ⋅ Joshua Susskind ⋅ Alaaeldin El-Nouby
|
Exhibit Hall I #227 | |
|
Unlearning the Noisy Correspondence Makes CLIP More Robust
Poster Session 1 & Exhibit Hall
Haochen Han ⋅ Alex Jinpeng Wang ⋅ Peijun Ye ⋅ Fangming Liu
|
Exhibit Hall I #424 | |
|
KDA: Knowledge Diffusion Alignment with Enhanced Context for Video Temporal Grounding
Poster Session 5 & Exhibit Hall
Ran Ran ⋅ Jiwei Wei ⋅ Shiyuan He ⋅ Zeyu Ma ⋅ Chaoning Zhang ⋅ Ning Xie ⋅ Yang Yang
|
Exhibit Hall I #331 | |
|
VisNumBench: Evaluating Number Sense of Multimodal Large Language Models
Poster Session 1 & Exhibit Hall
Tengjin Weng ⋅ Jingyi Wang ⋅ Wenhao Jiang ⋅ Zhong Ming
|
Exhibit Hall I #356 | |
|
STEP-DETR: Advancing DETR-based Semi-Supervised Object Detection with Super Teacher and Pseudo-Label Guided Text Queries
Poster Session 1 & Exhibit Hall
Tahira Shehzadi ⋅ Khurram Azeem Hashmi ⋅ Shalini Sarode ⋅ Didier Stricker ⋅ Muhammad Zeshan Afzal
|
Exhibit Hall I #283 | |
|
MPG-SAM 2: Adapting SAM 2 with Mask Priors and Global Context for Referring Video Object Segmentation
Poster Session 5 & Exhibit Hall
Fu Rong ⋅ Meng Lan ⋅ Qian Zhang ⋅ Lefei Zhang
|
Exhibit Hall I #392 | |
|
VoxelKP: A Voxel-based Network Architecture for Human Keypoint Estimation in LiDAR Data
Poster Session 6 & Exhibit Hall with Coffee Break
Jian Shi ⋅ Peter Wonka
|
Exhibit Hall I #343 | |
|
``Principal Components" Enable A New Language of Images
Poster Session 4 & Exhibit Hall with Coffee Break
Xin Wen ⋅ Bingchen Zhao ⋅ Ismail Elezi ⋅ Jiankang Deng ⋅ Xiaojuan Qi
|
Exhibit Hall I #168 | |
|
VAFlow: Video-to-Audio Generation with Cross-Modality Flow Matching
Poster Session 3 & Exhibit Hall
Xihua Wang ⋅ Xin Cheng ⋅ Yuyue Wang ⋅ Ruihua Song ⋅ Yunfeng Wang
|
Exhibit Hall I #167 | |
|
Seeing the Trees for the Forest: Rethinking Weakly-Supervised Medical Visual Grounding
Huy Ta ⋅ Duy Anh Huynh ⋅ Yutong Xie ⋅ Yuankai Qi ⋅ Qi Chen ⋅ Phi Le Nguyen ⋅ Sen Tran ⋅ Son Lam Phung ⋅ Anton Hengel ⋅ Zhibin Liao ⋅ Minh-Son To ⋅ Johan Verjans ⋅ Vu Phan
|
Exhibit Hall I #435 | |
|
Beyond Blur: A Fluid Perspective on Generative Diffusion Models
Poster Session 4 & Exhibit Hall with Coffee Break
Grzegorz Gruszczynski ⋅ Jakub Meixner ⋅ Michał Włodarczyk ⋅ Przemyslaw Musialski
|
Exhibit Hall I #280 | |
|
Revisiting Adversarial Patch Defenses on Object Detectors: Unified Evaluation, Large-Scale Dataset, and New Insights
Poster Session 5 & Exhibit Hall
Junhao Zheng ⋅ Jiahao Sun ⋅ Chenhao Lin ⋅ Zhengyu Zhao ⋅ Chen Ma ⋅ Chong Zhang ⋅ Cong Wang ⋅ Qian Wang ⋅ Chao Shen
|
Exhibit Hall I #346 | |
|
Structure Matters: Revisiting Boundary Refinement in Video Object Segmentation
Guanyi Qin ⋅ Ziyue Wang ⋅ Daiyun Shen ⋅ Haofeng Liu ⋅ Hantao Zhou ⋅ Junde Wu ⋅ Runze Hu ⋅ Yueming Jin
|
Exhibit Hall I #417 | |
|
A View-consistent Sampling Method for Regularized Training of Neural Radiance Fields
Poster Session 6 & Exhibit Hall with Coffee Break
Aoxiang Fan ⋅ Corentin Dumery ⋅ Nicolas Talabot ⋅ Pascal Fua
|
Exhibit Hall I #118 | |
|
GGTalker: Talking Head Systhesis with Generalizable Gaussian Priors and Identity-Specific Adaptation
Wentao Hu ⋅ Shunkai Li ⋅ Ziqiao Peng ⋅ Haoxian Zhang ⋅ Fan Shi ⋅ Xiaoqiang Liu ⋅ Pengfei Wan ⋅ Di ZHANG ⋅ Hui Tian
|
Exhibit Hall I #10 | |
|
MonoFusion: Sparse-View 4D Reconstruction via Monocular Fusion
Poster Session 2 & Exhibit Hall with Coffee Break
Zihan Wang ⋅ Jeff Tan ⋅ Tarasha Khurana ⋅ Neehar Peri ⋅ Deva Ramanan
|
Exhibit Hall I #305 | |
|
One Last Attention for Your Vision-Language Model
Poster Session 1 & Exhibit Hall
Liang Chen ⋅ Ghazi Shazan Ahmad ⋅ Tianjun Yao ⋅ Lingqiao Liu ⋅ Zhiqiang Shen
|
Exhibit Hall I #129 | |
|
Zeroth-Order Fine-Tuning of LLMs in Random Subspaces
Poster Session 1 & Exhibit Hall
Ziming Yu ⋅ Pan Zhou ⋅ Sike Wang ⋅ Jia Li ⋅ Mi Tian ⋅ Hua Huang
|
Exhibit Hall I #420 | |
|
IFAdapter: Instance Feature Control for Grounded Text-to-Image Generation
Poster Session 4 & Exhibit Hall with Coffee Break
YINWEI WU ⋅ Xianpan Zhou ⋅ bing ma ⋅ Xuefeng Su ⋅ Kai Ma ⋅ Xinchao Wang
|
Exhibit Hall I #101 | |
|
Towards Scalable Spatial Intelligence via 2D-to-3D Data Lifting
Xingyu Miao ⋅ Haoran Duan ⋅ Quanhao Qian ⋅ Jiuniu Wang ⋅ Yang Long ⋅ Ling Shao ⋅ Deli Zhao ⋅ Ran Xu ⋅ Gongjie Zhang
|
Exhibit Hall I #81 | |
|
Balancing Conservatism and Aggressiveness: Prototype-Affinity Hybrid Network for Few-Shot Segmentation
Poster Session 5 & Exhibit Hall
Tianyu Zou ⋅ Shengwu Xiong ⋅ Ruilin Yao ⋅ Yi Rong
|
Exhibit Hall I #71 | |
|
EYE3:Turn Anything into Naked-eye 3D
Poster Session 6 & Exhibit Hall with Coffee Break
Yingde Song ⋅ Zongyuan Yang ⋅ Baolin Liu ⋅ yongping xiong ⋅ Sai Chen ⋅ Lan Yi ⋅ Zhaohe Zhang ⋅ Xunbo Yu
|
Exhibit Hall I #303 | |
|
C2MIL: Synchronizing Semantic and Topological Causalities in Multiple Instance Learning for Robust and Interpretable Survival Analysis
Poster Session 5 & Exhibit Hall
Min Cen ⋅ Zhenfeng Zhuang ⋅ Yuzhe Zhang ⋅ Min Zeng ⋅ Baptiste Magnier ⋅ Lequan Yu ⋅ Hong Zhang ⋅ Liansheng Wang
|
Exhibit Hall I #430 | |
|
CVPT: Cross Visual Prompt Tuning
Poster Session 1 & Exhibit Hall
Lingyun Huang ⋅ Jianxu Mao ⋅ Junfei YI ⋅ Ziming Tao ⋅ Yaonan Wang
|
Exhibit Hall I #70 | |
|
Enhancing Partially Relevant Video Retrieval with Hyperbolic Learning
Poster Session 5 & Exhibit Hall
Jun Li ⋅ Jinpeng Wang ⋅ Chaolei Tan ⋅ Niu Lian ⋅ Long Chen ⋅ Yaowei Wang ⋅ Min zhang ⋅ Shu-Tao Xia ⋅ Bin Chen
|
Exhibit Hall I #309 | |
|
MobileIE: An Extremely Lightweight and Effective ConvNet for Real-Time Image Enhancement on Mobile Devices
Poster Session 5 & Exhibit Hall
HAILONG YAN ⋅ Ao Li ⋅ Xiangtao Zhang ⋅ Zhe Liu ⋅ Zenglin Shi ⋅ Ce Zhu ⋅ Le Zhang
|
Exhibit Hall I #201 | |
|
Pi-GPS: Enhancing Geometry Problem Solving by Unleashing the Power of Diagrammatic Information
Poster Session 1 & Exhibit Hall
Junbo Zhao ⋅ Ting Zhang ⋅ Jiayu Sun ⋅ Mi Tian ⋅ Hua Huang
|
Exhibit Hall I #135 | |
|
FixTalk: Taming Identity Leakage for High-Quality Talking Head Generation in Extreme Cases
Poster Session 1 & Exhibit Hall
Shuai Tan ⋅ Bill Gong ⋅ Bin Ji ⋅ Ye Pan
|
Exhibit Hall I #302 | |
|
Serialization based Point Cloud Oversegmentation
Poster Session 6 & Exhibit Hall with Coffee Break
chenghui Lu ⋅ Dilong Li ⋅ Jianlong Kwan ⋅ Ziyi Chen ⋅ Haiyan Guan
|
Exhibit Hall I #106 | |
|
NeurOp-Diff: Continuous Remote Sensing Image Super-Resolution via Neural Operator Diffusion
Zihao Xu ⋅ Yuzhi Tang ⋅ Bowen Xu ⋅ Qingquan Li
|
#235 | |
|
Di[M]O: Distilling Masked Diffusion Models into One-step Generator
Poster Session 4 & Exhibit Hall with Coffee Break
Yuanzhi Zhu ⋅ Xi WANG ⋅ Stéphane Lathuilière ⋅ Vicky Kalogeiton
|
Exhibit Hall I #356 | |
|
Reinforcement Learning-Guided Data Selection via Redundancy Assessment
Poster Session 1 & Exhibit Hall
Suorong Yang ⋅ Peijia Li ⋅ Furao Shen ⋅ Jian Zhao
|
Exhibit Hall I #86 | |
|
Φ-GAN:Physics-Inspired GAN for Generating SAR Images Under Limited Data
Poster Session 6 & Exhibit Hall with Coffee Break
Xidan Zhang ⋅ Yihan Zhuang ⋅ Qian Guo ⋅ Haodong Yang ⋅ Xuelin Qian ⋅ Gong Cheng ⋅ Junwei Han ⋅ Zhongling Huang
|
Exhibit Hall I #419 | |
|
One Perturbation is Enough: On Generating Universal Adversarial Perturbations against Vision-Language Pre-training Models
Poster Session 1 & Exhibit Hall
Hao Fang ⋅ Jiawei Kong ⋅ Wenbo Yu ⋅ Bin Chen ⋅ Jiawei Li ⋅ Hao Wu ⋅ Shu-Tao Xia ⋅ Ke Xu
|
Exhibit Hall I #383 | |
|
Recognizing Actions from Robotic View for Natural Human-Robot Interaction
Poster Session 3 & Exhibit Hall
Ziyi Wang ⋅ Peiming Li ⋅ Hong Liu ⋅ Zhichao Deng ⋅ Can Wang ⋅ Jun Liu ⋅ Junsong Yuan ⋅ Mengyuan Liu
|
Exhibit Hall I #397 | |
|
Addressing Text Embedding Leakage in Diffusion-based Image Editing
Poster Session 4 & Exhibit Hall with Coffee Break
Sunung Mun ⋅ Jinhwan Nam ⋅ Sunghyun Cho ⋅ Jungseul Ok
|
Exhibit Hall I #148 | |
|
Robin3D: Improving 3D Large Language Model via Robust Instruction Tuning
Poster Session 1 & Exhibit Hall
Weitai Kang ⋅ Haifeng Huang ⋅ Yuzhang Shang ⋅ Mubarak Shah ⋅ Yan Yan
|
Exhibit Hall I #363 | |
|
DDB: Diffusion Driven Balancing to Address Spurious Correlations
Poster Session 4 & Exhibit Hall with Coffee Break
Aryan Yazdan Parast ⋅ Basim Azam ⋅ Naveed Akhtar
|
Exhibit Hall I #253 | |
|
TurboVSR: Fantastic Video Upscalers and Where to Find Them
Zhongdao Wang ⋅ Guodongfang Zhao ⋅ Jingjing Ren ⋅ bailan feng ⋅ Shifeng Zhang ⋅ Wenbo Li
|
Exhibit Hall I #312 | |
|
CoralSRT: Revisiting Coral Reef Semantic Segmentation by Feature Rectifying via Self-supervised Guidance
Poster Session 5 & Exhibit Hall
Zheng Ziqiang ⋅ Wong Kwan ⋅ Binh-Son Hua ⋅ Jianbo Shi ⋅ Sai-Kit Yeung
|
Exhibit Hall I #16 | |
|
Learning Dense Feature Matching via Lifting Single 2D Image to 3D Space
Poster Session 2 & Exhibit Hall with Coffee Break
Yingping Liang ⋅ Yutao Hu ⋅ Wenqi Shao ⋅ Ying Fu
|
Exhibit Hall I #152 | |
|
Diagnosing Pretrained Models for Out-of-distribution Detection
Poster Session 1 & Exhibit Hall
Haipeng Xiong ⋅ Kai Xu ⋅ Angela Yao
|
Exhibit Hall I #166 | |
|
Seeing 3D Through 2D Lenses: 3D Few-Shot Class-Incremental Learning via Cross-Modal Geometric Rectification
Poster Session 2 & Exhibit Hall with Coffee Break
Tuo Xiang ⋅ Xuemiao Xu ⋅ Bangzhen Liu ⋅ Jinyi Li ⋅ Yong Li ⋅ Shengfeng He
|
Exhibit Hall I #164 | |
|
CHORDS: Diffusion Sampling Accelerator with Multi-core Hierarchical ODE Solvers
Poster Session 4 & Exhibit Hall with Coffee Break
Jiaqi Han ⋅ Haotian Ye ⋅ Puheng Li ⋅ Minkai Xu ⋅ James Zou ⋅ Stefano Ermon
|
Exhibit Hall I #431 | |
|
RayGaussX: Accelerating Gaussian-Based Ray Marching for Real-Time and High-Quality Novel View Synthesis
Poster Session 6 & Exhibit Hall with Coffee Break
Hugo Blanc ⋅ Jean-Emmanuel Deschaud ⋅ Alexis Paljic
|
Exhibit Hall I #275 | |
|
Adversarial Training for Probabilistic Robustness
Poster Session 1 & Exhibit Hall
YI ZHANG ⋅ Yuhang Chen ⋅ Zhen Chen ⋅ Wenjie Ruan ⋅ Xiaowei Huang ⋅ Siddartha Khastgir ⋅ Xingyu Zhao
|
Exhibit Hall I #149 | |
|
Learning to See Inside Opaque Liquid Containers using Speckle Vibrometry
Poster Session 2 & Exhibit Hall with Coffee Break
Matan Kichler ⋅ Shai Bagon ⋅ Mark Sheinin
|
Exhibit Hall I #417 | |
|
Scaling Omni-modal Pretraining with Multimodal Context: Advancing Universal Representation Learning Across Modalities
Poster Session 1 & Exhibit Hall
Yiyuan Zhang ⋅ Handong Li ⋅ Jing Liu ⋅ Xiangyu Yue
|
Exhibit Hall I #117 | |
|
LightBSR: Towards Lightweight Blind Super-Resolution via Discriminative Implicit Degradation Representation Learning
Poster Session 3 & Exhibit Hall
Jiang Yuan ⋅ ji ma ⋅ Bo Wang ⋅ Guanzhou Ke ⋅ Weiming Hu
|
Exhibit Hall I #181 | |
|
INSTINCT: Instance-Level Interaction Architecture for Query-Based Collaborative Perception
Poster Session 6 & Exhibit Hall with Coffee Break
yunjiang xu ⋅ Yupeng Ouyang ⋅ Lingzhi Li ⋅ Jin Wang ⋅ Benyuan Yang
|
Exhibit Hall I #70 | |
|
SPD: Shallow Backdoor Protecting Deep Backdoor Against Backdoor Detection
Poster Session 1 & Exhibit Hall
Shunjie Yuan ⋅ Xinghua Li ⋅ Xuelin Cao ⋅ Haiyan Zhang ⋅ Mengyao Zhu ⋅ Robert Deng
|
Exhibit Hall I #375 | |
|
Efficient Multi-Person Motion Prediction by Lightweight Spatial and Temporal Interactions
Poster Session 3 & Exhibit Hall
Yuanhong Zheng ⋅ Ruixuan Yu ⋅ Jian Sun
|
Exhibit Hall I #79 | |
|
VISION-XL: High Definition Video Inverse Problem Solver using Latent Image Diffusion Models
Poster Session 3 & Exhibit Hall
Taesung Kwon ⋅ Jong Ye
|
Exhibit Hall I #42 | |
|
Rethinking DPO-style Diffusion Aligning Frameworks
XUN WU ⋅ Shaohan Huang ⋅ Lingjie Jiang ⋅ Furu Wei
|
Exhibit Hall I #304 | |
|
Debiased Curriculum Adaptation for Safe Transfer Learning in Chest X-ray Classification
Poster Session 5 & Exhibit Hall
Mingyang Liu ⋅ Xinyang Chen ⋅ Yang Shu ⋅ Xiucheng Li ⋅ Weili Guan ⋅ Liqiang Nie
|
Exhibit Hall I #264 | |
|
PHATNet: A Physics-guided Haze Transfer Network for Domain-adaptive Real-world Image Dehazing
Poster Session 2 & Exhibit Hall with Coffee Break
Fu-Jen Tsai ⋅ Yan-Tsung Peng ⋅ Yen-Yu Lin ⋅ Chia-Wen Lin
|
Exhibit Hall I #53 | |
|
End-to-End Entity-Predicate Association Reasoning for Dynamic Scene Graph Generation
Poster Session 4 & Exhibit Hall with Coffee Break
LiWei Wang ⋅ YanDuo Zhang ⋅ Tao Lu ⋅ Fang Liu ⋅ Huiqin Zhang ⋅ Jiayi Ma ⋅ Huabing Zhou
|
Exhibit Hall I #272 | |
|
Breaking the Encoder Barrier for Seamless Video-Language Understanding
Poster Session 5 & Exhibit Hall
Handong Li ⋅ Yiyuan Zhang ⋅ Longteng Guo ⋅ Xiangyu Yue ⋅ Jing Liu
|
Exhibit Hall I #318 | |
|
CapeLLM: Support-Free Category-Agnostic Pose Estimation with Multimodal Large Language Models
Poster Session 5 & Exhibit Hall
Junho Kim ⋅ Hyungjin Chung ⋅ Byung-Hoon Kim
|
Exhibit Hall I #290 | |
|
GenFlowRL: Shaping Rewards with Generative Object-Centric Flow in Visual Reinforcement Learning
Poster Session 3 & Exhibit Hall
Kelin Yu ⋅ Sheng Zhang ⋅ Harshit Soora ⋅ Furong Huang ⋅ Heng Huang ⋅ Pratap Tokekar ⋅ Ruohan Gao
|
Exhibit Hall I #299 | |
|
Perceiving and Acting in First-Person: A Dataset and Benchmark for Egocentric Human-Object-Human Interactions
Poster Session 3 & Exhibit Hall
Liang Xu ⋅ Chengqun Yang ⋅ Zili Lin ⋅ Fei Xu ⋅ Yifan Liu ⋅ Congsheng Xu ⋅ Yiyi Zhang ⋅ Jie Qin ⋅ Xingdong Sheng ⋅ Yunhui Liu ⋅ Xin Jin ⋅ Yichao Yan ⋅ Wenjun Zeng ⋅ Xiaokang Yang
|
Exhibit Hall I #239 | |
|
Leveraging Panoptic Scene Graph for Evaluating Fine-Grained Text-to-Image Generation
Poster Session 4 & Exhibit Hall with Coffee Break
Xueqing Deng ⋅ Linjie Yang ⋅ Qihang Yu ⋅ Chenglin Yang ⋅ Liang-Chieh (Jay) Chen
|
Exhibit Hall I #21 | |
|
VisHall3D: Monocular Semantic Scene Completion from Reconstructing the Visible Regions to Hallucinating the Invisible Regions
Poster Session 6 & Exhibit Hall with Coffee Break
Haoang Lu ⋅ Yuanqi Su ⋅ Xiaoning Zhang ⋅ Longjun Gao ⋅ Yu Xue ⋅ Le Wang
|
Exhibit Hall I #382 | |
|
InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity
Liming Jiang ⋅ Qing Yan ⋅ Yumin Jia ⋅ Zichuan Liu ⋅ Hao Kang ⋅ Xin Lu
|
Exhibit Hall I #84 | |
|
Hierarchical Variational Test-Time Prompt Generation for Zero-Shot Generalization
Poster Session 1 & Exhibit Hall
Zhaoyang Wu ⋅ Fang Liu ⋅ Licheng Jiao ⋅ Shuo Li ⋅ Lingling Li ⋅ Xu Liu ⋅ Puhua Chen ⋅ wenping ma
|
Exhibit Hall I #211 | |
|
GaSLight: Gaussian Splats for Spatially-Varying Lighting in HDR
Poster Session 6 & Exhibit Hall with Coffee Break
Christophe Bolduc ⋅ Yannick Hold-Geoffroy ⋅ Jean-Francois Lalonde
|
Exhibit Hall I #423 | |
|
GUAVA: Generalizable Upper Body 3D Gaussian Avatar
Poster Session 3 & Exhibit Hall
Dongbin Zhang ⋅ Yunfei Liu ⋅ Lijian Lin ⋅ Ye Zhu ⋅ Yang Li ⋅ Minghan Qin ⋅ Yu Li ⋅ Haoqian Wang
|
Exhibit Hall I #396 | |
|
CO2-Net: A Physics-Informed Spatio-Temporal Model for Global Surface CO2 Reconstruction
Poster Session 2 & Exhibit Hall with Coffee Break
Hao Zheng ⋅ Yuting Zheng ⋅ Hanbo Huang ⋅ Chaofan Sun ⋅ Enhui Liao ⋅ Lin Liu ⋅ Yi Han ⋅ Hao Zhou ⋅ Shiyu Liang
|
Exhibit Hall I #112 | |
|
PoseAnchor: Robust Root Position Estimation for 3D Human Pose Estimation
Poster Session 2 & Exhibit Hall with Coffee Break
Jun-Hee Kim ⋅ Jumin Han ⋅ Seong-Whan Lee
|
Exhibit Hall I #193 | |
|
Semantic Alignment and Reinforcement for Data-Free Quantization of Vision Transformers
Poster Session 3 & Exhibit Hall
Yunshan Zhong ⋅ Yuyao Zhou ⋅ Yuxin Zhang ⋅ Wanchen Sui ⋅ Shen Li ⋅ Yong Li ⋅ Fei Chao ⋅ Rongrong Ji
|
Exhibit Hall I #234 | |
|
GS-LIVM: Real-Time Photo-Realistic LiDAR-Inertial-Visual Mapping with Gaussian Splatting
Poster Session 6 & Exhibit Hall with Coffee Break
Yusen XIE ⋅ Zhenmin Huang ⋅ Jin Wu ⋅ Jun Ma
|
Exhibit Hall I #207 | |
|
Salvaging the Overlooked: Leveraging Class-Aware Contrastive Learning for Multi-Class Anomaly Detection
Poster Session 5 & Exhibit Hall
Lei Fan ⋅ Junjie Huang ⋅ Donglin Di ⋅ Anyang Su ⋅ Tianyou Song ⋅ Maurice Pagnucco ⋅ Yang Song
|
Exhibit Hall I #150 | |
|
Boosting Multimodal Learning via Disentangled Gradient Learning
Poster Session 5 & Exhibit Hall
Shicai Wei ⋅ Chunbo Luo ⋅ Yang Luo
|
Exhibit Hall I #289 | |
|
Task Vector Quantization for Memory-Efficient Model Merging
Poster Session 5 & Exhibit Hall
Youngeun Kim ⋅ Seunghwan Lee ⋅ Aecheon Jung ⋅ Bogon Ryu ⋅ Sungeun Hong
|
Exhibit Hall I #29 | |
|
Weakly Supervised Visible-Infrared Person Re-Identification via Heterogeneous Expert Collaborative Consistency Learning
Poster Session 3 & Exhibit Hall
Yafei Zhang ⋅ Lingqi Kong ⋅ Huafeng Li ⋅ Jie Wen
|
Exhibit Hall I #250 | |
|
SurfaceSplat: Connecting Surface Reconstruction and Gaussian Splatting
Poster Session 6 & Exhibit Hall with Coffee Break
Zihui Gao ⋅ Jia-Wang Bian ⋅ Guosheng Lin ⋅ Hao Chen ⋅ Chunhua Shen
|
Exhibit Hall I #368 | |
|
The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single Transformer
Weixian Lei ⋅ Jiacong Wang ⋅ Haochen Wang ⋅ Xiangtai Li ⋅ Jun Hao Liew ⋅ Jiashi Feng ⋅ Zilong Huang
|
Exhibit Hall I #91 | |
|
CaliMatch: Adaptive Calibration for Improving Safe Semi-supervised Learning
Poster Session 1 & Exhibit Hall
Jinsoo Bae ⋅ Seoung Bum Kim ⋅ Hyungrok Do
|
Exhibit Hall I #264 | |
|
Amodal3R: Amodal 3D Reconstruction from Occluded 2D Images
Poster Session 2 & Exhibit Hall with Coffee Break
Tianhao Wu ⋅ Chuanxia Zheng ⋅ Frank Guan ⋅ Andrea Vedaldi ⋅ Tat-Jen Cham
|
Exhibit Hall I #392 | |
|
DC-AR: Efficient Masked Autoregressive Image Generation with Deep Compression Hybrid Tokenizer
Poster Session 4 & Exhibit Hall with Coffee Break
Yecheng Wu ⋅ Han Cai ⋅ Junyu Chen ⋅ Zhuoyang Zhang ⋅ Enze Xie ⋅ Jincheng YU ⋅ Junsong Chen ⋅ Jinyi Hu ⋅ Yao Lu ⋅ Song Han
|
Exhibit Hall I #301 | |
|
Language Decoupling with Fine-grained Knowledge Guidance for Referring Multi-object Tracking
Poster Session 5 & Exhibit Hall
guangyao Li ⋅ Siping Zhuang ⋅ Yajun Jian ⋅ Yan Yan ⋅ Hanzi Wang
|
Exhibit Hall I #360 | |
|
Neural Multi-View Self-Calibrated Photometric Stereo without Photometric Stereo Cues
Poster Session 6 & Exhibit Hall with Coffee Break
Xu Cao ⋅ Takafumi Taketomi
|
Exhibit Hall I #273 | |
|
Reminiscence Attack on Residuals: Exploiting Approximate Machine Unlearning for Privacy
Poster Session 1 & Exhibit Hall
Yaxin Xiao ⋅ Qingqing Ye ⋅ Li Hu ⋅ Huadi Zheng ⋅ Haibo Hu ⋅ Zi Liang ⋅ Haoyang LI ⋅ JIAOYIJIE JIAOYIJIE
|
Exhibit Hall I #282 | |
|
RayPose: Ray Bundling Diffusion for Template Views in Unseen 6D Object Pose Estimation
Poster Session 2 & Exhibit Hall with Coffee Break
Junwen Huang ⋅ Shishir Reddy Vutukur ⋅ Peter Yu ⋅ Nassir Navab ⋅ Slobodan Ilic ⋅ Benjamin Busam
|
Exhibit Hall I #385 | |
|
Hydra-NeXt: Robust Closed-Loop Driving with Open-Loop Training
Poster Session 6 & Exhibit Hall with Coffee Break
Zhenxin Li ⋅ Shihao Wang ⋅ Shiyi Lan ⋅ Zhiding Yu ⋅ Zuxuan Wu ⋅ Jose M. Alvarez
|
Exhibit Hall I #250 | |
|
CanFields: Consolidating Diffeomorphic Flows for Non-Rigid 4D Interpolation from Arbitrary-Length Sequences
Poster Session 6 & Exhibit Hall with Coffee Break
Miaowei Wang ⋅ Changjian Li ⋅ Amir Vaxman
|
Exhibit Hall I #374 | |
|
QR-LoRA: Efficient and Disentangled Fine-tuning via QR Decomposition for Customized Generation
Poster Session 4 & Exhibit Hall with Coffee Break
Jiahui Yang ⋅ Yongjia Ma ⋅ Donglin Di ⋅ Hao Li ⋅ Chen Wei ⋅ Xie Yan ⋅ Jianxun Cui ⋅ Xun Yang ⋅ Wangmeng Zuo
|
Exhibit Hall I #259 | |
|
X2I: Seamless Integration of Multimodal Understanding into Diffusion Transformer via Attention Distillation
Poster Session 4 & Exhibit Hall with Coffee Break
jian ma ⋅ Qirong Peng ⋅ Xu Guo ⋅ Chen Chen ⋅ Haonan Lu ⋅ Zhenyu Yang
|
Exhibit Hall I #177 | |
|
WaveMamba: Wavelet-Driven Mamba Fusion for RGB-Infrared Object Detection
Poster Session 3 & Exhibit Hall
Haodong Zhu ⋅ Wenhao Dong ⋅ Linlin Yang ⋅ Hong Li ⋅ Yuguang Yang ⋅ Yangyang Ren ⋅ Qingcheng Zhu ⋅ Zichao Feng ⋅ CHANGBI LI ⋅ Shaohui Lin ⋅ Runqi Wang ⋅ Xiaoyan Luo ⋅ Baochang Zhang
|
Exhibit Hall I #114 | |
|
Backdooring Self-Supervised Contrastive Learning by Noisy Alignment
Poster Session 1 & Exhibit Hall
Tuo Chen ⋅ Jie Gui ⋅ Minjing Dong ⋅ Ju Jia ⋅ Lanting Fang ⋅ Jian liu
|
Exhibit Hall I #342 | |
|
CounterPC: Counterfactual Feature Realignment for Unsupervised Domain Adaptation on Point Clouds
Feng Yang ⋅ Yichao Cao ⋅ Xiu Su ⋅ Dan Niu ⋅ Xuanpeng Li
|
Exhibit Hall I #4 | |
|
Multidimensional Byte Pair Encoding: Shortened Sequences for Improved Visual Data Generation
Poster Session 5 & Exhibit Hall
Tim Elsner ⋅ Paula Usinger ⋅ Julius Nehring-Wirxel ⋅ Gregor Kobsik ⋅ Victor Czech ⋅ Yanjiang He ⋅ Isaak Lim ⋅ Leif Kobbelt
|
Exhibit Hall I #142 | |
|
Robust Dataset Condensation using Supervised Contrastive Learning
Poster Session 1 & Exhibit Hall
Nicole Kim ⋅ Hwanjun Song
|
Exhibit Hall I #263 | |
|
SCAN: Bootstrapping Contrastive Pre-training for Data Efficiency
Poster Session 1 & Exhibit Hall
Yangyang Guo ⋅ Mohan Kankanhalli
|
Exhibit Hall I #340 | |
|
IDEATOR: Jailbreaking and Benchmarking Large Vision-Language Models Using Themselves
Poster Session 2 & Exhibit Hall with Coffee Break
Ruofan Wang ⋅ Juncheng Li ⋅ Yixu Wang ⋅ Bo Wang ⋅ Xiaosen Wang ⋅ Yan Teng ⋅ Yingchun Wang ⋅ Xingjun Ma ⋅ Yu-Gang Jiang
|
Exhibit Hall I #362 | |
|
AccidentalGS: 3D Gaussian Splatting from Accidental Camera Motion
Poster Session 6 & Exhibit Hall with Coffee Break
Mao Mao ⋅ Xujie Shen ⋅ Guyuan Chen ⋅ Boming Zhao ⋅ Jiarui Hu ⋅ Hujun Bao ⋅ Zhaopeng Cui
|
Exhibit Hall I #263 | |
|
GEOBench-VLM: Benchmarking Vision-Language Models for Geospatial Tasks
Muhammad Danish ⋅ Muhammad Akhtar Munir ⋅ Syed Shah ⋅ Kartik Kuckreja ⋅ Fahad Khan ⋅ Paolo Fraccaro ⋅ Alexandre Lacoste ⋅ Salman Khan
|
Exhibit Hall I #198 | |
|
Event-boosted Deformable 3D Gaussians for Dynamic Scene Reconstruction
Poster Session 6 & Exhibit Hall with Coffee Break
Wenhao Xu ⋅ Wenming Weng ⋅ Yueyi Zhang ⋅ Ruikang Xu ⋅ Zhiwei Xiong
|
Exhibit Hall I #348 | |
|
MagicMirror: ID-Preserved Video Generation in Video Diffusion Transformers
Poster Session 3 & Exhibit Hall
Yuechen Zhang ⋅ YaoYang Liu ⋅ Bin Xia ⋅ Bohao PENG ⋅ Zexin Yan ⋅ Eric Lo ⋅ Jiaya Jia
|
Exhibit Hall I #420 | |
|
MRGen: Segmentation Data Engine For Underrepresented MRI Modalities
Poster Session 5 & Exhibit Hall
Haoning Wu ⋅ Ziheng Zhao ⋅ Ya Zhang ⋅ Yanfeng Wang ⋅ Weidi Xie
|
Exhibit Hall I #10 | |
|
GAP: Gaussianize Any Point Clouds with Text Guidance
Poster Session 6 & Exhibit Hall with Coffee Break
Weiqi Zhang ⋅ Junsheng Zhou ⋅ Haotian Geng ⋅ Wenyuan Zhang ⋅ Liang Han
|
Exhibit Hall I #87 | |
|
DNF-Intrinsic: Deterministic Noise-Free Diffusion for Indoor Inverse Rendering
Poster Session 3 & Exhibit Hall
Rongjia Zheng ⋅ Qing Zhang ⋅ Chengjiang Long ⋅ Wei-Shi Zheng
|
Exhibit Hall I #31 | |
|
Cross-modal Ship Re-Identification via Optical and SAR Imagery: A Novel Dataset and Method
Poster Session 2 & Exhibit Hall with Coffee Break
Han Wang ⋅ Shengyang Li ⋅ Jian Yang ⋅ Yuxuan Liu ⋅ Yixuan Lv ⋅ Zhuang Zhou
|
Exhibit Hall I #268 | |
|
MoFRR: Mixture of Diffusion Models for Face Retouching Restoration
Poster Session 3 & Exhibit Hall
Jiaxin Liu ⋅ Qichao Ying ⋅ Zhenxing Qian ⋅ Sheng Li ⋅ Runqi Zhang ⋅ Jian liu ⋅ Xinpeng Zhang
|
Exhibit Hall I #267 | |
|
Adversarial Reconstruction Feedback for Robust Fine-grained Generalization
Poster Session 1 & Exhibit Hall
Shijie Wang ⋅ Jian Shi ⋅ Haojie Li
|
Exhibit Hall I #284 | |
|
Embodied Image Captioning: Self-supervised Learning Agents for Spatially Coherent Image Descriptions
Tommaso Galliena ⋅ Tommaso Apicella ⋅ Stefano Rosa ⋅ Pietro Morerio ⋅ ALESSIO DEL BUE ⋅ Lorenzo Natale
|
Exhibit Hall I #428 | |
|
Unified Adversarial Augmentation for Improving Palmprint Recognition
Poster Session 3 & Exhibit Hall
Jianlong Jin ⋅ Chenglong Zhao ⋅ Ruixin Zhang ⋅ Sheng Shang ⋅ Yang Zhao ⋅ Jun Wang ⋅ Jingyun Zhang ⋅ Shouhong Ding ⋅ Wei Jia ⋅ Yunsheng Wu
|
Exhibit Hall I #390 | |
|
Adding Additional Control to One-Step Diffusion with Joint Distribution Matching
Poster Session 1 & Exhibit Hall
Yihong Luo ⋅ Tianyang Hu ⋅ Yifan Song ⋅ Jiacheng Sun ⋅ Zhenguo Li ⋅ Jing Tang
|
Exhibit Hall I #373 | |
|
Enhancing Transferability of Targeted Adversarial Examples via Inverse Target Gradient Competition and Spatial Distance Stretching
Poster Session 1 & Exhibit Hall
Zhankai Li ⋅ Weiping Wang ⋅ jie li ⋅ Shigeng Zhang ⋅ Yunan Hu ⋅ Song Guo
|
Exhibit Hall I #345 | |
|
LDPose: Towards Inclusive Human Pose Estimation for Limb-Deficient Individuals in the Wild
Poster Session 2 & Exhibit Hall with Coffee Break
Jiaying Ying ⋅ Heming Du ⋅ Kaihao Zhang ⋅ Lincheng Li ⋅ Xin Yu
|
Exhibit Hall I #454 | |
|
OURO: A Self-Bootstrapped Framework for Enhancing Multimodal Scene Understanding
Poster Session 4 & Exhibit Hall with Coffee Break
Tianrun Xu ⋅ Guanyu Chen ⋅ Ye Li ⋅ Xi Yuxin ⋅ Zeyu Mu ⋅ Ruichen Wang ⋅ Tianren Zhang ⋅ Haichuan Gao ⋅ Feng Chen
|
Exhibit Hall I #322 | |
|
LEGION: Learning to Ground and Explain for Synthetic Image Detection
Hengrui Kang ⋅ Siwei Wen ⋅ Zichen Wen ⋅ Junyan Ye ⋅ Weijia Li ⋅ Peilin Feng ⋅ Baichuan Zhou ⋅ Bin Wang ⋅ Dahua Lin ⋅ Linfeng Zhang ⋅ Conghui He
|
Exhibit Hall I #389 | |
|
SMP-Attack: Boosting the Transferability of Feature Importance-based Adversarial Attack with Semantics-aware Multi-granularity Patchout
Poster Session 1 & Exhibit Hall
Wen Yang ⋅ Guodong Liu ⋅ Di Ming
|
Exhibit Hall I #417 | |
|
Spatial-Temporal Forgery Trace based Forgery Image Identification
Poster Session 4 & Exhibit Hall with Coffee Break
Yilin Wang ⋅ Zunlei Feng ⋅ Jiachi Wang ⋅ Hengrui Lou ⋅ Binjia Zhou ⋅ Jie Lei ⋅ Mingli Song ⋅ Yijun Bei
|
Exhibit Hall I #208 | |
|
Towards Annotation-Free Evaluation: KPAScore for Human Keypoint Detection
Poster Session 2 & Exhibit Hall with Coffee Break
Xiaoxiao Wang ⋅ Chunxiao Li ⋅ Peng Sun ⋅ Boming Miao ⋅ Yunjian Zhang ⋅ Yao Zhu
|
Exhibit Hall I #322 | |
|
Ultra High-Resolution Image Inpainting with Patch-Based Content Consistency Adapter
Poster Session 4 & Exhibit Hall with Coffee Break
JianHui Zhang ⋅ Shen Cheng ⋅ Qirui Sun ⋅ Jia Liu ⋅ Wang Luyang ⋅ chaoyu feng ⋅ Chen Fang ⋅ LEI LEI ⋅ Jue Wang ⋅ Shuaicheng Liu
|
Exhibit Hall I #201 | |
|
PROL : Rehearsal Free Continual Learning in Streaming Data via Prompt Online Learning
Poster Session 1 & Exhibit Hall
Muhammad Anwar Ma'sum ⋅ Mahardhika Pratama ⋅ Savitha Ramasamy ⋅ Lin Liu ⋅ H Habibullah ⋅ Ryszard Kowalczyk
|
Exhibit Hall I #225 | |
|
Dual Domain Control via Active Learning for Remote Sensing Domain Incremental Object Detection
Poster Session 1 & Exhibit Hall
Jiachen Sun ⋅ De Cheng ⋅ Xi Yang ⋅ Nannan Wang
|
Exhibit Hall I #354 | |
|
SUV: Suppressing Undesired Video Content via Semantic Modulation Based on Text Embeddings
Poster Session 4 & Exhibit Hall with Coffee Break
Xiang Lv ⋅ Mingwen Shao ⋅ Lingzhuang Meng ⋅ Chang Liu ⋅ Yecong Wan ⋅ Xinyuan Chen
|
Exhibit Hall I #333 | |
|
Enpowering Your Pansharpening Models with Generalizability: Unified Distribution is All You Need
Poster Session 3 & Exhibit Hall
Yongchuan Cui ⋅ Peng Liu ⋅ HUI ZHANG
|
Exhibit Hall I #174 | |
|
DiMPLe - Disentangled Multi-Modal Prompt Learning: Enhancing Out-Of-Distribution Alignment with Invariant and Spurious Feature Separation
Poster Session 1 & Exhibit Hall
Umaima Rahman ⋅ Mohammad Yaqub ⋅ Dwarikanath Mahapatra
|
Exhibit Hall I #145 | |
|
Beyond Low-Rank Tuning: Model Prior-Guided Rank Allocation for Effective Transfer in Low-Data and Large-Gap Regimes.
Poster Session 1 & Exhibit Hall
Chuyan Zhang ⋅ Kefan Wang ⋅ Yun Gu
|
Exhibit Hall I #310 | |
|
OracleFusion: Assisting the Decipherment of Oracle Bone Script with Structurally Constrained Semantic Typography
Poster Session 5 & Exhibit Hall
Li Caoshuo ⋅ Zengmao Ding ⋅ Xiaobin Hu ⋅ Bang Li ⋅ Donghao Luo ⋅ AndyPianWu AndyPianWu ⋅ Chaoyang Wang ⋅ Chengjie Wang ⋅ Taisong Jin ⋅ SevenShu SevenShu ⋅ Yunsheng Wu ⋅ Yongge Liu ⋅ Rongrong Ji
|
Exhibit Hall I #9 | |
|
COSMO: Combination of Selective Memorization for Low-cost Vision-and-Language Navigation
Poster Session 2 & Exhibit Hall with Coffee Break
Siqi Zhang ⋅ Yanyuan Qiao ⋅ Qunbo Wang ⋅ Zike Yan ⋅ Qi Wu ⋅ Zhihua Wei ⋅ Jing Liu
|
Exhibit Hall I #46 | |
|
CoStoDet-DDPM: Collaborative Training of Stochastic and Deterministic Models Improves Surgical Workflow Anticipation and Recognition
Poster Session 5 & Exhibit Hall
Kaixiang Yang ⋅ Xin Li ⋅ Qiang Li ⋅ Zhiwei Wang
|
Exhibit Hall I #371 | |
|
MixA-Q: Revisiting Activation Sparsity for Vision Transformers from a Mixed-Precision Quantization Perspective
Poster Session 5 & Exhibit Hall
Weitian Wang ⋅ Shubham rai ⋅ Cecilia De la Parra ⋅ Akash Kumar
|
Exhibit Hall I #219 | |
|
MEGA: Memory-Efficient 4D Gaussian Splatting for Dynamic Scenes
XINJIE ZHANG ⋅ Zhening Liu ⋅ Yifan Zhang ⋅ Xingtong Ge ⋅ Dailan He ⋅ Tongda Xu ⋅ Yan Wang ⋅ Zehong Lin ⋅ Shuicheng YAN ⋅ Jun Zhang
|
Exhibit Hall I #300 | |
|
TOGA: Temporally Grounded Open-Ended Video QA with Weak Supervision
Poster Session 5 & Exhibit Hall
Ayush Gupta ⋅ Anirban Roy ⋅ Rama Chellappa ⋅ Nathaniel D. Bastian ⋅ Alvaro Velasquez ⋅ Susmit Jha
|
Exhibit Hall I #357 | |
|
LaRender: Training-Free Occlusion Control in Image Generation via Latent Rendering
Poster Session 5 & Exhibit Hall
Xiaohang Zhan ⋅ Dingming Liu
|
Exhibit Hall I #75 | |
|
DynFaceRestore: Balancing Fidelity and Quality in Diffusion-Guided Blind Face Restoration with Dynamic Blur-Level Mapping and Guidance
Huu Phu Do ⋅ Yu-Wei Chen ⋅ Yi-Cheng Liao ⋅ Chi-Wei Hsiao ⋅ Han-Yang Wang ⋅ Wei-Chen Chiu ⋅ Ching-Chun Huang
|
Exhibit Hall I #39 | |
|
Generalized Few-Shot Point Cloud Segmentation via LLM-Assisted Hyper-Relation Matching
Poster Session 5 & Exhibit Hall
Zhaoyang Li ⋅ Yuan Wang ⋅ Guoxin Xiong ⋅ Wangkai Li ⋅ Yuwen Pan ⋅ Tianzhu Zhang
|
Exhibit Hall I #308 | |
|
Training-free Geometric Image Editing on Diffusion Models
Poster Session 4 & Exhibit Hall with Coffee Break
Hanshen Zhu ⋅ Zhen Zhu ⋅ Kaile Zhang ⋅ Yiming Gong ⋅ Yuliang Liu ⋅ Xiang Bai
|
Exhibit Hall I #407 | |
|
Boost 3D Reconstruction using Diffusion-based Monocular Camera Calibration
Poster Session 2 & Exhibit Hall with Coffee Break
Junyuan Deng ⋅ Wei Yin ⋅ Xiaoyang Guo ⋅ Qian Zhang ⋅ Xiaotao Hu ⋅ Weiqiang Ren ⋅ XIAOXIAO LONG ⋅ Ping Tan
|
Exhibit Hall I #196 | |
|
Monocular Facial Appearance Capture in the Wild
Poster Session 3 & Exhibit Hall
Yingyan Xu ⋅ Kate Gadola ⋅ Prashanth Chandran ⋅ Sebastian Weiss ⋅ Markus Gross ⋅ Gaspard Zoss ⋅ Derek Bradley
|
Exhibit Hall I #195 | |
|
Growing a Twig to Accelerate Large Vision-Language Models
Poster Session 5 & Exhibit Hall
Zhenwei Shao ⋅ Mingyang Wang ⋅ Zhou Yu ⋅ Wenwen Pan ⋅ Yan Yang ⋅ Tao Wei ⋅ Hongyuan Zhang ⋅ Ning Mao ⋅ Chen Wei ⋅ Jun Yu
|
Exhibit Hall I #25 | |
|
AMD: Adaptive Momentum and Decoupled Contrastive Learning Framework for Robust Long-Tail Trajectory Prediction
Poster Session 6 & Exhibit Hall with Coffee Break
Bin Rao ⋅ Haicheng Liao ⋅ Yanchen Guan ⋅ Chengyue Wang ⋅ Bonan Wang ⋅ Jiaxun Zhang ⋅ Zhenning Li
|
Exhibit Hall I #398 | |
|
FreeDance: Towards Harmonic Free-Number Group Dance Generation via a Unified Framework
Poster Session 3 & Exhibit Hall
Yiwen Zhao ⋅ Yang Wang ⋅ Liting Wen ⋅ Hengyuan Zhang ⋅ Xingqun Qi
|
Exhibit Hall I #51 | |
|
Deep Incomplete Multi-view Clustering with Distribution Dual-Consistency Recovery Guidance
Poster Session 1 & Exhibit Hall
Jiaqi Jin ⋅ Siwei Wang ⋅ Zhibin Dong ⋅ Xihong Yang ⋅ Xinwang Liu ⋅ En Zhu ⋅ Kunlun He
|
Exhibit Hall I #87 | |
|
Learning Visual Hierarchies in Hyperbolic Space for Image Retrieval
Poster Session 3 & Exhibit Hall
Ziwei Wang ⋅ Sameera Ramasinghe ⋅ Chenchen Xu ⋅ Julien Monteil ⋅ Loris Bazzani ⋅ Thalaiyasingam Ajanthan
|
Exhibit Hall I #376 | |
|
TemCoCo: Temporally Consistent Multi-modal Video Fusion with Visual-Semantic Collaboration
Poster Session 3 & Exhibit Hall
Gong Meiqi ⋅ Hao Zhang ⋅ Xunpeng Yi ⋅ Linfeng Tang ⋅ Jiayi Ma
|
Exhibit Hall I #407 | |
|
RetinexMCNet: A Memory Controller Dominated Network for Low-Light Video Enhancement Based on Retinex
Poster Session 2 & Exhibit Hall with Coffee Break
Meiao Wang ⋅ Xuejing Kang ⋅ Yaxi Lu ⋅ Jie Xu
|
Exhibit Hall I #440 | |
|
D2ST-Adapter: Disentangled-and-Deformable Spatio-Temporal Adapter for Few-shot Action Recognition
Poster Session 3 & Exhibit Hall
Wenjie Pei ⋅ Qizhong Tan ⋅ Guangming Lu ⋅ Jiandong Tian ⋅ Jun Yu
|
Exhibit Hall I #123 | |
|
Sliced Wasserstein Bridge for Open-Vocabulary Video Instance Segmentation
Zheyun Qin ⋅ Deng Yu ⋅ Chuanchen Luo ⋅ Zhumin Chen
|
Exhibit Hall I #233 | |
|
Frequency-Aware Autoregressive Modeling for Efficient High-Resolution Image Synthesis
Poster Session 4 & Exhibit Hall with Coffee Break
Zhuokun Chen ⋅ Jugang Fan ⋅ Zhuowei Yu ⋅ Bohan Zhuang ⋅ Mingkui Tan
|
Exhibit Hall I #215 | |
|
KinMo: Kinematic-aware Human Motion Understanding and Generation
Poster Session 3 & Exhibit Hall
Pengfei Zhang ⋅ Pinxin Liu ⋅ Pablo Garrido ⋅ Hyeongwoo Kim ⋅ Bindita Chaudhuri
|
Exhibit Hall I #111 | |
|
CODA: Repurposing Continuous VAEs for Discrete Tokenization
Poster Session 4 & Exhibit Hall with Coffee Break
Zeyu Liu ⋅ Zanlin Ni ⋅ Yeguo Hua ⋅ Xin Deng ⋅ Xiao Ma ⋅ Cheng Zhong ⋅ Gao Huang
|
Exhibit Hall I #386 | |
|
3D Gaussian Splatting Driven Multi-View Robust Physical Adversarial Camouflage Generation
Poster Session 6 & Exhibit Hall with Coffee Break
Tianrui Lou ⋅ Xiaojun Jia ⋅ Siyuan Liang ⋅ Jiawei Liang ⋅ Ming Zhang ⋅ Yanjun Xiao ⋅ Xiaochun Cao
|
Exhibit Hall I #389 | |
|
Head2Body: Body Pose Generation from Multi-sensory Head-mounted Inputs
Poster Session 2 & Exhibit Hall with Coffee Break
Minh Tran ⋅ Hongda Mao ⋅ Qingshuang Chen ⋅ Yelin Kim
|
Exhibit Hall I #172 | |
|
LLM-Assisted Semantic Guidance for Sparsely Annotated Remote Sensing Object Detection
Poster Session 5 & Exhibit Hall
Wei Liao ⋅ Chunyan Xu ⋅ Chenxu Wang ⋅ Zhen Cui
|
Exhibit Hall I #256 | |
|
From Reusing to Forecasting: Accelerating Diffusion Models with TaylorSeers
Poster Session 4 & Exhibit Hall with Coffee Break
Jiacheng Liu ⋅ Chang Zou ⋅ Yuanhuiyi Lyu ⋅ Junjie Chen ⋅ Linfeng Zhang
|
Exhibit Hall I #92 | |
|
DADM: Dual Alignment of Domain and Modality for Face Anti-spoofing
Poster Session 3 & Exhibit Hall
Yang JingYi ⋅ Xun Lin ⋅ Zitong YU ⋅ Liepiao Zhang ⋅ Xin Liu ⋅ Hui Li ⋅ Xiaochen Yuan ⋅ Xiaochun Cao
|
Exhibit Hall I #192 | |
|
Quantifying and Narrowing the Unknown: Interactive Text-to-Video Retrieval via Uncertainty Minimization
Poster Session 5 & Exhibit Hall
Bingqing Zhang ⋅ Zhuo Cao ⋅ Heming Du ⋅ Yang Li ⋅ Xue Li ⋅ Jiajun Liu ⋅ Sen Wang
|
Exhibit Hall I #217 | |
|
Gradient Decomposition and Alignment for Incremental Object Detection
Poster Session 1 & Exhibit Hall
Wenlong Luo ⋅ Shizhou Zhang ⋅ De Cheng ⋅ Yinghui Xing ⋅ Guoqiang Liang ⋅ PENG WANG ⋅ Yanning Zhang
|
Exhibit Hall I #421 | |
|
PacGDC: Label-Efficient Generalizable Depth Completion with Projection Ambiguity and Consistency
Poster Session 2 & Exhibit Hall with Coffee Break
Haotian Wang ⋅ Aoran Xiao ⋅ Xiaoqin Zhang ⋅ Meng Yang ⋅ Shijian Lu
|
Exhibit Hall I #253 | |
|
TruthPrInt: Mitigating Large Vision-Language Models Object Hallucination Via Latent Truthful-Guided Pre-Intervention
Poster Session 2 & Exhibit Hall with Coffee Break
Jinhao Duan ⋅ Fei Kong ⋅ Hao Cheng ⋅ James Diffenderfer ⋅ Bhavya Kailkhura ⋅ Lichao Sun ⋅ Xiaofeng Zhu ⋅ Xiaoshuang Shi ⋅ Kaidi Xu
|
Exhibit Hall I #220 | |
|
Adversarial Attention Perturbations for Large Object Detection Transformers
Poster Session 1 & Exhibit Hall
Zachary Yahn ⋅ Selim Tekin ⋅ Fatih Ilhan ⋅ Sihao Hu ⋅ Tiansheng Huang ⋅ Yichang Xu ⋅ Margaret Loper ⋅ Ling Liu
|
Exhibit Hall I #294 | |
|
MCAM: Multimodal Causal Analysis Model for Ego-Vehicle-Level Driving Video Understanding
Poster Session 2 & Exhibit Hall with Coffee Break
Tongtong Cheng ⋅ Rongzhen Li ⋅ Yixin Xiong ⋅ Tao Zhang ⋅ Jing Wang ⋅ Kai Liu
|
Exhibit Hall I #43 | |
|
When and Where do Data Poisons Attack Textual Inversion?
Poster Session 4 & Exhibit Hall with Coffee Break
Jeremy Styborski ⋅ Mingzhi Lyu ⋅ Jiayou Lu ⋅ Nupur Kapur ⋅ Adams Kong
|
Exhibit Hall I #436 | |
|
Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable Single-stage Image-to-3D Generation and Reconstruction
Poster Session 6 & Exhibit Hall with Coffee Break
Yuanhao Cai ⋅ He Zhang ⋅ Kai Zhang ⋅ Yixun Liang ⋅ Mengwei Ren ⋅ Fujun Luan ⋅ Qing Liu ⋅ Soo Ye Kim ⋅ Jianming Zhang ⋅ Zhifei Zhang ⋅ Yuqian Zhou ⋅ YULUN ZHANG ⋅ Xiaokang Yang ⋅ Zhe Lin ⋅ Alan Yuille
|
Exhibit Hall I #32 | |
|
SIMS: Simulating Stylized Human-Scene Interactions with Retrieval-Augmented Script Generation
Poster Session 3 & Exhibit Hall
Wenjia Wang ⋅ Liang Pan ⋅ Zhiyang Dou ⋅ Jidong Mei ⋅ Zhouyingcheng Liao ⋅ Yifan Wu ⋅ Yuke Lou ⋅ Jingbo Wang ⋅ Lei Yang ⋅ Taku Komura
|
Exhibit Hall I #388 | |
|
PBCAT: Patch-Based Composite Adversarial Training against Physically Realizable Attacks on Object Detection
Poster Session 5 & Exhibit Hall
Xiao Li ⋅ Yiming Zhu ⋅ Yifan Huang ⋅ Wei Zhang ⋅ Yingzhe He ⋅ Jie Shi ⋅ Xiaolin Hu
|
Exhibit Hall I #436 | |
|
SteerX: Creating Any Camera-Free 3D and 4D Scenes with Geometric Steering
Poster Session 6 & Exhibit Hall with Coffee Break
Byeongjun Park ⋅ Hyojun Go ⋅ Hyelin Nam ⋅ Byung-Hoon Kim ⋅ Hyungjin Chung ⋅ Changick Kim
|
Exhibit Hall I #252 | |
|
Engage for All: Making Ordinary Image Descriptions Appealing Again!
Poster Session 4 & Exhibit Hall with Coffee Break
Yuyan Chen ⋅ Yifan Jiang ⋅ Li Zhou ⋅ Jinghan Cao ⋅ Yu Guan ⋅ Ming Yang ⋅ Qingpei Guo
|
Exhibit Hall I #427 | |
|
Seam360GS: Seamless 360° Gaussian Splatting from Real-World Omnidirectional Images
Poster Session 6 & Exhibit Hall with Coffee Break
Changha Shin ⋅ Woong Oh Cho ⋅ Seon Joo Kim
|
Exhibit Hall I #409 | |
|
HiGarment: Cross-modal Harmony Based Diffusion Model for Flat Sketch to Realistic Garment Image
Poster Session 4 & Exhibit Hall with Coffee Break
Junyi Guo ⋅ Jingxuan Zhang ⋅ Fangyu Wu ⋅ Huanda Lu ⋅ Qiufeng Wang ⋅ Wenmian Yang ⋅ ENG Gee LIM ⋅ Dongming Lu
|
Exhibit Hall I #350 | |
|
AU-Blendshape for Fine-grained Stylized 3D Facial Expression Manipulation
Poster Session 3 & Exhibit Hall
Hao Li ⋅ Ju Dai ⋅ Feng Zhou ⋅ Kaida Ning ⋅ Lei Li ⋅ Junjun Pan
|
Exhibit Hall I #245 | |
|
LMM4LMM: Benchmarking and Evaluating Large-multimodal Image Generation with LMMs
Jiarui Wang ⋅ Huiyu Duan ⋅ Yu Zhao ⋅ Juntong Wang ⋅ Guangtao Zhai ⋅ Xiongkuo Min
|
Exhibit Hall I #233 | |
|
BokehDiff: Neural Lens Blur with One-Step Diffusion
Poster Session 2 & Exhibit Hall with Coffee Break
Chengxuan Zhu ⋅ Qingnan Fan ⋅ Qi Zhang ⋅ Jinwei Chen ⋅ Huaqi Zhang ⋅ Chao Xu ⋅ Boxin Shi
|
Exhibit Hall I #421 | |
|
VCA: Video Curious Agent for Long Video Understanding
Poster Session 5 & Exhibit Hall
Zeyuan Yang ⋅ Delin Chen ⋅ Xueyang Yu ⋅ Maohao Shen ⋅ Chuang Gan
|
Exhibit Hall I #35 | |
|
Geometry Distributions
Biao Zhang ⋅ Jing Ren ⋅ Peter Wonka
|
Exhibit Hall I #132 | |
|
Social Debiasing for Fair Multi-modal LLMs
Poster Session 1 & Exhibit Hall
Harry Cheng ⋅ Yangyang Guo ⋅ Qingpei Guo ⋅ Ming Yang ⋅ Tian Gan ⋅ Weili Guan ⋅ Liqiang Nie
|
Exhibit Hall I #157 | |
|
Hierarchy-Aware Pseudo Word Learning with Text Adaptation for Zero-Shot Composed Image Retrieval
Poster Session 5 & Exhibit Hall
Zhe Li ⋅ Lei Zhang ⋅ Zheren Fu ⋅ Kun Zhang ⋅ Zhendong Mao
|
Exhibit Hall I #423 | |
|
GaussianUpdate: Continual 3D Gaussian Splatting Update for Changing Environments
Poster Session 6 & Exhibit Hall with Coffee Break
Lin Zeng ⋅ Boming Zhao ⋅ Jiarui Hu ⋅ Xujie Shen ⋅ Ziqiang Dang ⋅ Hujun Bao ⋅ Zhaopeng Cui
|
Exhibit Hall I #103 | |
|
DALIP: Distribution Alignment-based Language-Image Pre-Training for Domain-Specific Data
Poster Session 1 & Exhibit Hall
Junjie Wu ⋅ Jiangtao Xie ⋅ Zhaolin Zhang ⋅ Qilong Wang ⋅ Qinghua Hu ⋅ Peihua Li ⋅ Sen Xu
|
Exhibit Hall I #190 | |
|
Long-term Traffic Simulation with Interleaved Autoregressive Motion and Scenario Generation
Poster Session 6 & Exhibit Hall with Coffee Break
Xiuyu Yang ⋅ Shuhan Tan ⋅ Philipp Kraehenbuehl
|
Exhibit Hall I #55 | |
|
Perspective-Invariant 3D Object Detection
Poster Session 6 & Exhibit Hall with Coffee Break
Alan Liang ⋅ Lingdong Kong ⋅ Dongyue Lu ⋅ Youquan Liu ⋅ Jian Fang ⋅ Huaici Zhao ⋅ Wei Tsang Ooi
|
Exhibit Hall I #291 | |
|
Probabilistic Inertial Poser (ProbIP): Uncertainty-aware Human Motion Modeling from Sparse Inertial Sensors
Poster Session 6 & Exhibit Hall with Coffee Break
Min Kim ⋅ Younho Jeon ⋅ Sungho Jo
|
Exhibit Hall I #112 | |
|
ARMO: Autoregressive Rigging for Multi-Category Objects
Poster Session 2 & Exhibit Hall with Coffee Break
mingze sun ⋅ Shiwei Mao ⋅ Keyi Chen ⋅ Yurun Chen ⋅ Shunlin Lu ⋅ Jingbo Wang ⋅ Junting Dong ⋅ Ruqi Huang
|
Exhibit Hall I #254 | |
|
Harnessing Vision Foundation Models for High-Performance, Training-Free Open Vocabulary Segmentation
Poster Session 5 & Exhibit Hall
Yuheng Shi ⋅ Minjing Dong ⋅ Chang Xu
|
Exhibit Hall I #347 | |
|
Aligning Constraint Generation with Design Intent in Parametric CAD
Poster Session 2 & Exhibit Hall with Coffee Break
Evan Casey ⋅ Tianyu Zhang ⋅ Shu Ishida ⋅ John Thompson ⋅ Amir Khasahmadi ⋅ Joseph Lambourne ⋅ Pradeep Kumar Jayaraman ⋅ Karl Willis
|
Exhibit Hall I #338 | |
|
Golden Noise for Diffusion Models: A Learning Framework
Poster Session 4 & Exhibit Hall with Coffee Break
zikai zhou ⋅ Shitong Shao ⋅ Lichen Bai ⋅ Shufei Zhang ⋅ zhiqiang xu ⋅ Bo Han ⋅ Zeke Xie
|
Exhibit Hall I #268 | |
|
STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution
Poster Session 4 & Exhibit Hall with Coffee Break
Rui Xie ⋅ Yinhong Liu ⋅ Penghao Zhou ⋅ Chen Zhao ⋅ Jun Zhou ⋅ Kai Zhang ⋅ Zhenyu Zhang ⋅ Jian Yang ⋅ Zhenheng Yang ⋅ Ying Tai
|
Exhibit Hall I #213 | |
|
Vision-Language Interactive Relation Mining for Open-Vocabulary Scene Graph Generation
Poster Session 4 & Exhibit Hall with Coffee Break
Yukuan Min ⋅ Muli Yang ⋅ Jinhao Zhang ⋅ Yuxuan Wang ⋅ Aming WU ⋅ Cheng Deng
|
Exhibit Hall I #179 | |
|
OrderChain: Towards General Instruct-Tuning for Stimulating the Ordinal Understanding Ability of MLLM
Poster Session 1 & Exhibit Hall
Jinhong Wang ⋅ Shuo Tong ⋅ Jintai CHEN ⋅ Jian liu ⋅ Dongqi Tang ⋅ Weiqiang Wang ⋅ Wentong Li ⋅ Hongxia Xu ⋅ Danny Chen ⋅ Jian Wu
|
Exhibit Hall I #323 | |
|
Joint Learning of Pose Regression and Denoising Diffusion with Score Scaling Sampling for Category-level 6D Pose Estimation
Poster Session 2 & Exhibit Hall with Coffee Break
Seunghyun Lee ⋅ Tae-Kyun Kim
|
Exhibit Hall I #68 | |
|
LHM: Large Animatable Human Reconstruction Model for Single Image to 3D in Seconds
Poster Session 3 & Exhibit Hall
Lingteng Qiu ⋅ Xiaodong Gu ⋅ Peihao Li ⋅ Qi Zuo ⋅ Weichao Shen ⋅ Junfei Zhang ⋅ Kejie Qiu ⋅ Weihao Yuan ⋅ Guanying Chen ⋅ Zilong Dong ⋅ Liefeng Bo
|
Exhibit Hall I #394 | |
|
UniVG: A Generalist Diffusion Model for Unified Image Generation and Editing
Poster Session 4 & Exhibit Hall with Coffee Break
Tsu-Jui Fu ⋅ Yusu Qian ⋅ Chen Chen ⋅ Wenze Hu ⋅ Zhe Gan ⋅ Yinfei Yang
|
Exhibit Hall I #217 | |
|
Distilling Diffusion Models to Efficient 3D LiDAR Scene Completion
Poster Session 2 & Exhibit Hall with Coffee Break
shengyuan zhang ⋅ An Zhao ⋅ Ling Yang ⋅ Zejian Li ⋅ Chenye Meng ⋅ Haoran Xu ⋅ Tianrun Chen ⋅ AnYang Wei ⋅ Perry GU ⋅ Lingyun Sun
|
Exhibit Hall I #225 | |
|
FOLDER: Accelerating Multi-Modal Large Language Models with Enhanced Performance
Poster Session 5 & Exhibit Hall
Haicheng Wang ⋅ Zhemeng Yu ⋅ Gabriele Spadaro ⋅ Chen Ju ⋅ Victor Quétu ⋅ Shuai Xiao ⋅ Enzo Tartaglione
|
Exhibit Hall I #359 | |
|
ViSpeak: Visual Instruction Feedback in Streaming Videos
Poster Session 5 & Exhibit Hall
Shenghao Fu ⋅ Qize Yang ⋅ Yuan-Ming Li ⋅ Yi-Xing Peng ⋅ Kun-Yu Lin ⋅ Xihan Wei ⋅ Jian-Fang Hu ⋅ Xiaohua Xie ⋅ Wei-Shi Zheng
|
Exhibit Hall I #185 | |
|
FedAGC: Federated Continual Learning with Asymmetric Gradient Correction
Poster Session 1 & Exhibit Hall
Chengchao Zhang ⋅ Fanhua Shang ⋅ Hongying Liu ⋅ Liang Wan ⋅ Wei Feng
|
Exhibit Hall I #357 | |
|
MoGA: 3D Generative Avatar Prior for Monocular Gaussian Avatar Reconstruction
Zijian Dong ⋅ Longteng Duan ⋅ Jie Song ⋅ Michael Black ⋅ Andreas Geiger
|
Exhibit Hall I #312 | |
|
ATCTrack: Aligning Target-Context Cues with Dynamic Target States for Robust Vision-Language Tracking
Xiaokun Feng ⋅ Shiyu Hu ⋅ Xuchen Li ⋅ Dailing Zhang ⋅ Meiqi Wu ⋅ Jing Zhang ⋅ Xiaotang Chen ⋅ Kaiqi Huang
|
Exhibit Hall I #5 | |
|
Federated Representation Angle Learning
Poster Session 1 & Exhibit Hall
Liping Yi ⋅ Han Yu ⋅ Gang Wang ⋅ xiaoguang Liu ⋅ Xiaoxiao Li
|
Exhibit Hall I #115 | |
|
MotionShot: Adaptive Motion Transfer across Arbitrary Objects for Text-to-Video Generation
Poster Session 3 & Exhibit Hall
Yanchen Liu ⋅ Yanan SUN ⋅ Zhening Xing ⋅ Junyao Gao ⋅ Kai Chen ⋅ Wenjie Pei
|
Exhibit Hall I #175 | |
|
GroundFlow: A Plug-in Module for Temporal Reasoning on 3D Point Cloud Sequential Grounding
Poster Session 6 & Exhibit Hall with Coffee Break
Zijun Lin ⋅ Shuting He ⋅ Cheston Tan ⋅ Bihan Wen
|
Exhibit Hall I #391 | |
|
Enhancing Adversarial Transferability by Balancing Exploration and Exploitation with Gradient-Guided Sampling
Poster Session 1 & Exhibit Hall
Zenghao Niu ⋅ Weicheng Xie ⋅ Siyang Song ⋅ Zitong YU ⋅ Feng Liu ⋅ Linlin Shen
|
Exhibit Hall I #361 | |
|
CWNet: Causal Wavelet Network for Low-Light Image Enhancement
Poster Session 2 & Exhibit Hall with Coffee Break
Tongshun Zhang ⋅ Pingping Liu ⋅ Yubing Lu ⋅ Mengen Cai ⋅ Zijian Zhang ⋅ Zhe Zhang ⋅ Qiuzhan Zhou
|
Exhibit Hall I #354 | |
|
InterSyn: Interleaved Learning for Dynamic Motion Synthesis in the Wild
Poster Session 3 & Exhibit Hall
Yiyi Ma ⋅ Yuanzhi Liang ⋅ Xiu Li ⋅ Chi Zhang ⋅ Xuelong Li
|
Exhibit Hall I #266 | |
|
GeoDistill: Geometry-Guided Self-Distillation for Weakly Supervised Cross-View Localization
Poster Session 6 & Exhibit Hall with Coffee Break
Shaowen Tong ⋅ Zimin Xia ⋅ Alexandre Alahi ⋅ Xuming He ⋅ Yujiao Shi
|
Exhibit Hall I #60 | |
|
BlinkTrack: Feature Tracking over 80 FPS via Events and Images
Poster Session 2 & Exhibit Hall with Coffee Break
Yichen Shen ⋅ Yijin Li ⋅ Shuo Chen ⋅ Guanglin Li ⋅ Zhaoyang Huang ⋅ Hujun Bao ⋅ Zhaopeng Cui ⋅ Guofeng Zhang
|
Exhibit Hall I #402 | |
|
DICE: Staleness-Centric Optimizations for Parallel Diffusion MoE Inference
Poster Session 4 & Exhibit Hall with Coffee Break
Jiajun Luo ⋅ Lizhuo Luo ⋅ Jianru Xu ⋅ Jiajun Song ⋅ Rongwei Lu ⋅ Chen Tang ⋅ Zhi Wang
|
Exhibit Hall I #55 | |
|
AerialVG: A Challenging Benchmark for Aerial Visual Grounding by Exploring Positional Relations
Poster Session 2 & Exhibit Hall with Coffee Break
Junli Liu ⋅ Qizhi Chen ⋅ Zhigang Wang ⋅ Yiwen Tang ⋅ Yiting Zhang ⋅ Chi Yan ⋅ Dong Wang ⋅ Xuelong Li ⋅ Bin Zhao
|
Exhibit Hall I #15 | |
|
The Curse of Conditions: Analyzing and Improving Optimal Transport for Conditional Flow-Based Generation
Poster Session 4 & Exhibit Hall with Coffee Break
Ho Kei Cheng ⋅ Alex Schwing
|
Exhibit Hall I #94 | |
|
Diffusion-based Source-biased Model for Single Domain Generalized Object Detection
Poster Session 1 & Exhibit Hall
Han Jiang ⋅ Wenfei Yang ⋅ Tianzhu Zhang ⋅ Yongdong Zhang
|
Exhibit Hall I #137 | |
|
ReconDreamer++: Harmonizing Generative and Reconstructive Models for Driving Scene Representation
Poster Session 6 & Exhibit Hall with Coffee Break
Guosheng Zhao ⋅ Xiaofeng Wang ⋅ Chaojun Ni ⋅ Zheng Zhu ⋅ Wenkang Qin ⋅ Guan Huang ⋅ Xingang Wang
|
Exhibit Hall I #193 | |
|
VLRMBench: A Comprehensive and Challenging Benchmark for Vision-Language Reward Models
Poster Session 1 & Exhibit Hall
JIACHENG RUAN ⋅ Wenzhen Yuan ⋅ Xian Gao ⋅ Ye Guo ⋅ Daoxin Zhang ⋅ Zhe Xu ⋅ Yao Hu ⋅ Ting Liu ⋅ yuzhuo fu
|
Exhibit Hall I #292 | |
|
Measuring the Impact of Rotation Equivariance on Aerial Object Detection
Poster Session 2 & Exhibit Hall with Coffee Break
Xiuyu Wu ⋅ Xinhao Wang ⋅ Xiubin Zhu ⋅ Lan Yang ⋅ Jiyuan Liu ⋅ Xingchen Hu
|
Exhibit Hall I #216 | |
|
Enhanced Pansharpening via Quaternion Spatial-Spectral Interactions
Poster Session 3 & Exhibit Hall
Dong Li ⋅ Chunhui Luo ⋅ Yuanfei Bao ⋅ Gang Yang ⋅ Jie Xiao ⋅ Xueyang Fu ⋅ Zheng-Jun Zha
|
Exhibit Hall I #85 | |
|
Monocular Semantic Scene Completion via Masked Recurrent Networks
Poster Session 6 & Exhibit Hall with Coffee Break
Xuzhi Wang ⋅ Xinran Wu ⋅ Song Wang ⋅ Lingdong Kong ⋅ Ziping Zhao
|
Exhibit Hall I #9 | |
|
Client2Vec: Improving Federated Learning by Distribution Shifts Aware Client Indexing
Poster Session 1 & Exhibit Hall
Yongxin Guo ⋅ Lin Wang ⋅ Xiaoying Tang ⋅ Tao Lin
|
Exhibit Hall I #126 | |
|
OpenRSD: Towards Open-prompts for Object Detection in Remote Sensing Images
Poster Session 2 & Exhibit Hall with Coffee Break
Ziyue Huang ⋅ Yongchao Feng ⋅ Ziqi Liu ⋅ Shuai Yang ⋅ Qingjie Liu ⋅ Yunhong Wang
|
Exhibit Hall I #317 | |
|
InsViE-1M: Effective Instruction-based Video Editing with Elaborate Dataset Construction
Poster Session 4 & Exhibit Hall with Coffee Break
Yuhui WU ⋅ Liyi Chen ⋅ Ruibin Li ⋅ Shihao Wang ⋅ Chenxi Xie ⋅ Lei Zhang
|
Exhibit Hall I #173 | |
|
PathDiff: Histopathology Image Synthesis with Unpaired Text and Mask Conditions
Poster Session 5 & Exhibit Hall
Mahesh Bhosale ⋅ Abdul Wasi ⋅ Yuanhao Zhai ⋅ Yunjie Tian ⋅ Samuel Border ⋅ Nan Xi ⋅ Pinaki Sarder ⋅ Junsong Yuan ⋅ David Doermann ⋅ Xuan Gong
|
Exhibit Hall I #246 | |
|
PhysRig: Differentiable Physics-Based Skinning and Rigging Framework for Realistic Articulated Object Modeling
Poster Session 2 & Exhibit Hall with Coffee Break
Hao Zhang ⋅ Haolan Xu ⋅ Chun Feng ⋅ Varun Jampani ⋅ Narendra Ahuja
|
Exhibit Hall I #150 | |
|
From Gaze to Movement: Predicting Visual Attention for Autonomous Driving Human-Machine Interaction based on Programmatic Imitation Learning
Poster Session 6 & Exhibit Hall with Coffee Break
Yexin Huang ⋅ Yongbin Lin ⋅ Lishengsa Yue ⋅ Zhihong Yao ⋅ Jie Wang
|
Exhibit Hall I #136 | |
|
ALOcc: Adaptive Lifting-based 3D Semantic Occupancy and Cost Volume-based Flow Predictions
Poster Session 1 & Exhibit Hall
Dubing Chen ⋅ Jin Fang ⋅ Wencheng Han ⋅ Xinjing Cheng ⋅ Junbo Yin ⋅ Cheng-zhong Xu ⋅ Fahad Khan ⋅ Jianbing Shen
|
Exhibit Hall I #389 | |
|
Optical Model-Driven Sharpness Mapping for Autofocus in Small Depth-of-Field and Severe Defocus Scenarios
Poster Session 2 & Exhibit Hall with Coffee Break
Chen-Liang Fan ⋅ Mingpei Cao ⋅ Chih-Chien Hung ⋅ Yuesheng Zhu
|
Exhibit Hall I #131 | |
|
HyPiDecoder: Hybrid Pixel Decoder for Efficient Segmentation and Detection
Poster Session 5 & Exhibit Hall
Fengzhe Zhou ⋅ Humphrey Shi
|
Exhibit Hall I #215 | |
|
UniCombine: Unified Multi-Conditional Combination with Diffusion Transformer
Poster Session 4 & Exhibit Hall with Coffee Break
Haoxuan Wang ⋅ Jinlong Peng ⋅ Qingdong He ⋅ Hao Yang ⋅ Ying Jin ⋅ Jiafu Wu ⋅ Xiaobin Hu ⋅ Yanjie Pan ⋅ Zhenye Gan ⋅ Mingmin Chi ⋅ Bo Peng ⋅ Yabiao Wang
|
Exhibit Hall I #330 | |
|
MMAD: Multi-label Micro-Action Detection in Videos
Poster Session 3 & Exhibit Hall
Kun Li ⋅ pengyu Liu ⋅ Dan Guo ⋅ Fei Wang ⋅ zhiliang wu ⋅ Hehe Fan ⋅ Meng Wang
|
Exhibit Hall I #305 | |
|
MP-HSIR: A Multi-Prompt Framework for Universal Hyperspectral Image Restoration
Poster Session 3 & Exhibit Hall
Zhehui Wu ⋅ Yong Chen ⋅ Naoto Yokoya ⋅ Wei He
|
Exhibit Hall I #283 | |
|
Perceive, Understand and Restore: Real-World Image Super-Resolution with Autoregressive Multimodal Generative Models
Poster Session 4 & Exhibit Hall with Coffee Break
Hongyang Wei ⋅ Shuaizheng Liu ⋅ Chun Yuan ⋅ Lei Zhang
|
Exhibit Hall I #359 | |
|
Learning to Generalize without Bias for Open-Vocabulary Action Recognition
Yating Yu ⋅ Congqi Cao ⋅ Yifan Zhang ⋅ Yanning Zhang
|
Exhibit Hall I #263 | |
|
Semantic Equitable Clustering: A Simple and Effective Strategy for Clustering Vision Tokens
Poster Session 1 & Exhibit Hall
Qihang Fan ⋅ Huaibo Huang ⋅ Mingrui Chen ⋅ Ran He
|
Exhibit Hall I #374 | |
|
TikZero: Zero-Shot Text-Guided Graphics Program Synthesis
Jonas Belouadi ⋅ Eddy Ilg ⋅ Margret Keuper ⋅ Hideki Tanaka ⋅ Masao Utiyama ⋅ Raj Dabre ⋅ Steffen Eger ⋅ Simone Paolo Ponzetto
|
Exhibit Hall I #278 | |
|
SAUCE: Selective Concept Unlearning in Vision-Language Models with Sparse Autoencoders
Poster Session 1 & Exhibit Hall
Jiahui Geng ⋅ Qing Li
|
Exhibit Hall I #279 | |
|
Training-Free Industrial Defect Generation with Diffusion Models
Poster Session 5 & Exhibit Hall
Ruyi Xu ⋅ Yen-Tzu Chiu ⋅ Tai-I Chen ⋅ Oscar Chew ⋅ Yung-Yu Chuang ⋅ Wen-Huang Cheng
|
Exhibit Hall I #413 | |
|
Feature Decomposition-Recomposition in Large Vision-Language Model for Few-Shot Class-Incremental Learning
Poster Session 1 & Exhibit Hall
Zongyao Xue ⋅ Meina Kan ⋅ Shiguang Shan ⋅ Xilin Chen
|
Exhibit Hall I #291 | |
|
When Schrödinger Bridge Meets Real-World Image Dehazing with Unpaired Training
Poster Session 2 & Exhibit Hall with Coffee Break
Yunwei Lan ⋅ Zhigao Cui ⋅ Xin Luo ⋅ Chang Liu ⋅ Nian Wang ⋅ Menglin Zhang ⋅ Yanzhao Su ⋅ Dong Liu
|
Exhibit Hall I #351 | |
|
TRCE: Towards Reliable Malicious Concept Erasure in Text-to-Image Diffusion Models
Poster Session 4 & Exhibit Hall with Coffee Break
Ruidong Chen ⋅ honglin guo ⋅ Lanjun Wang ⋅ Chenyu Zhang ⋅ Weizhi Nie ⋅ Anan Liu
|
Exhibit Hall I #388 | |
|
MagicMotion: Controllable Video Generation with Dense-to-Sparse Trajectory Guidance
Poster Session 3 & Exhibit Hall
Quanhao Li ⋅ Zhen Xing ⋅ Rui Wang ⋅ Hui Zhang ⋅ Qi Dai ⋅ Zuxuan Wu
|
Exhibit Hall I #198 | |
|
Ross3D: Reconstructive Visual Instruction Tuning with 3D-Awareness
Poster Session 2 & Exhibit Hall with Coffee Break
Haochen Wang ⋅ Yucheng Zhao ⋅ Tiancai Wang ⋅ Haoqiang Fan ⋅ Xiangyu Zhang ⋅ Zhaoxiang Zhang
|
Exhibit Hall I #400 | |
|
SEAL: Semantic Aware Image Watermarking
Poster Session 4 & Exhibit Hall with Coffee Break
Kasra Arabi ⋅ R. Teal Witter ⋅ Chinmay Hegde ⋅ Niv Cohen
|
Exhibit Hall I #124 | |
|
ArchiSet: Benchmarking Editable and Consistent Single-View 3D Reconstruction of Buildings with Specific Window-to-Wall Ratios
Poster Session 6 & Exhibit Hall with Coffee Break
Jun Yin ⋅ Pengyu Zeng ⋅ Licheng Shen ⋅ Miao Zhang ⋅ Jing Zhong ⋅ Yuxing Han ⋅ Shuai Lu
|
Exhibit Hall I #122 | |
|
SkySense V2: A Unified Foundation Model for Multi-modal Remote Sensing
Poster Session 2 & Exhibit Hall with Coffee Break
Yingying Zhang ⋅ Lixiang Ru ⋅ Kang Wu ⋅ Lei Yu ⋅ Lei Liang ⋅ Yansheng Li ⋅ Jingdong Chen
|
Exhibit Hall I #388 | |
|
DMesh++: An Efficient Differentiable Mesh for Complex Shapes
Poster Session 6 & Exhibit Hall with Coffee Break
Sanghyun Son ⋅ Matheus Gadelha ⋅ Yang Zhou ⋅ Matthew Fisher ⋅ Zexiang Xu ⋅ Yi-Ling Qiao ⋅ Ming Lin ⋅ Yi Zhou
|
Exhibit Hall I #181 | |
|
Advancing Textual Prompt Learning with Anchored Attributes
Poster Session 1 & Exhibit Hall
Zheng Li ⋅ Yibing Song ⋅ Ming-Ming Cheng ⋅ Xiang Li ⋅ jian Yang
|
Exhibit Hall I #336 | |
|
Meta-Unlearning on Diffusion Models: Preventing Relearning Unlearned Concepts
Poster Session 1 & Exhibit Hall
Hongcheng Gao ⋅ Tianyu Pang ⋅ Chao Du ⋅ Taihang Hu ⋅ Zhijie Deng ⋅ Min Lin
|
Exhibit Hall I #193 | |
|
AR-1-to-3: Single Image to Consistent 3D Object via Next-View Prediction
Poster Session 6 & Exhibit Hall with Coffee Break
Xuying Zhang ⋅ Yupeng Zhou ⋅ Kai Wang ⋅ Yikai Wang ⋅ Zhen Li ⋅ Daquan Zhou ⋅ Shaohui Jiao ⋅ Qibin Hou ⋅ Ming-Ming Cheng
|
Exhibit Hall I #151 | |
|
TR-PTS: Task-Relevant Parameter and Token Selection for Efficient Tuning
Poster Session 1 & Exhibit Hall
Siqi Luo ⋅ Haoran Yang ⋅ Yi Xin ⋅ Mingyang Yi ⋅ Guangyang Wu ⋅ Guangtao Zhai ⋅ Xiaohong Liu
|
Exhibit Hall I #409 | |
|
Benchmarking Multimodal Large Language Models Against Image Corruptions
Poster Session 2 & Exhibit Hall with Coffee Break
Xinkuan Qiu ⋅ Meina Kan ⋅ Yongbin Zhou ⋅ Shiguang Shan
|
Exhibit Hall I #375 | |
|
Harnessing Uncertainty-aware Bounding Boxes for Unsupervised 3D Object Detection
Poster Session 2 & Exhibit Hall with Coffee Break
Ruiyang Zhang ⋅ Hu Zhang ⋅ Zhedong Zheng
|
Exhibit Hall I #396 | |
|
DexH2R: A Benchmark for Dynamic Dexterous Grasping in Human-to-Robot Handover
Poster Session 3 & Exhibit Hall
Youzhuo Wang ⋅ jiayi ye ⋅ Chuyang Xiao ⋅ Yiming Zhong ⋅ Heng Tao ⋅ Hang Yu ⋅ Yumeng Liu ⋅ Jingyi Yu ⋅ Yuexin Ma
|
Exhibit Hall I #254 | |
|
Latent Expression Generation for Referring Image Segmentation and Grounding
Poster Session 5 & Exhibit Hall
Seonghoon Yu ⋅ Junbeom Hong ⋅ Joonseok Lee ⋅ Jeany Son
|
Exhibit Hall I #146 | |
|
LiON-LoRA: Rethinking LoRA Fusion to Unify Controllable Spatial and Temporal Generation for Video Diffusion
Poster Session 3 & Exhibit Hall
Yisu Zhang ⋅ Chenjie Cao ⋅ Chaohui Yu ⋅ Jianke Zhu
|
Exhibit Hall I #430 | |
|
BASIC: Boosting Visual Alignment with Intrinsic Refined Embeddings in Multimodal Large Language Models
Poster Session 5 & Exhibit Hall
Jianting Tang ⋅ Yubo Wang ⋅ Haoyu Cao ⋅ Linli Xu
|
Exhibit Hall I #73 | |
|
Bridging Diffusion Models and 3D Representations: A 3D Consistent Super-Resolution Framework
Poster Session 3 & Exhibit Hall
Yi-Ting Chen ⋅ Ting-Hsuan Liao ⋅ Pengsheng Guo ⋅ Alex Schwing ⋅ Jia-Bin Huang
|
Exhibit Hall I #328 | |
|
Deterministic Object Pose Confidence Region Estimation
Poster Session 4 & Exhibit Hall with Coffee Break
Jinghao Wang ⋅ Zhang Li ⋅ Zi Wang ⋅ Banglei Guan ⋅ Yang Shang ⋅ Qifeng Yu
|
Exhibit Hall I #383 | |
|
Online Language Splatting
Poster Session 6 & Exhibit Hall with Coffee Break
Saimouli Katragadda ⋅ Cho-Ying Wu ⋅ Yuliang Guo ⋅ Xinyu Huang ⋅ Guoquan Huang ⋅ Liu Ren
|
Exhibit Hall I #111 | |
|
JailbreakDiffBench: A Comprehensive Benchmark for Jailbreaking Diffusion Models
Poster Session 4 & Exhibit Hall with Coffee Break
Xiaolong Jin ⋅ Zixuan Weng ⋅ Hanxi Guo ⋅ Chenlong Yin ⋅ Siyuan Cheng ⋅ Guangyu Shen ⋅ Xiangyu Zhang
|
Exhibit Hall I #149 | |
|
UIPro: Unleashing Superior Interaction Capability For GUI Agents
Poster Session 1 & Exhibit Hall
Hongxin Li ⋅ Jingran Su ⋅ Jingfan CHEN ⋅ Zheng Ju ⋅ Yuntao Chen ⋅ Li Qing ⋅ Zhaoxiang Zhang
|
Exhibit Hall I #143 | |
|
SALAD -- Semantics-Aware Logical Anomaly Detection
Poster Session 5 & Exhibit Hall
Matic Fučka ⋅ Vitjan Zavrtanik ⋅ Danijel Skocaj
|
Exhibit Hall I #191 | |
|
FineMotion: A Dataset and Benchmark with both Spatial and Temporal Annotation for Fine-grained Motion Generation and Editing
Poster Session 3 & Exhibit Hall
Bizhu Wu ⋅ Jinheng Xie ⋅ Meidan Ding ⋅ Zhe Kong ⋅ Jianfeng Ren ⋅ Ruibin Bai ⋅ Rong Qu ⋅ Linlin Shen
|
Exhibit Hall I #360 | |
|
FiffDepth: Feed-forward Transformation of Diffusion-Based Generators for Detailed Depth Estimation
Poster Session 2 & Exhibit Hall with Coffee Break
Yunpeng Bai ⋅ Qixing Huang
|
Exhibit Hall I #94 | |
|
Perception-as-Control: Fine-grained Controllable Image Animation with 3D-aware Motion Representation
Poster Session 3 & Exhibit Hall
Yingjie Chen ⋅ Yifang Men ⋅ Yuan Yao ⋅ Miaomiao Cui ⋅ Liefeng Bo
|
Exhibit Hall I #412 | |
|
OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models
gaojie lin ⋅ Jianwen Jiang ⋅ Jiaqi Yang ⋅ Zerong Zheng ⋅ Chao Liang ⋅ ZHANG YUAN ⋅ Jingtu Li
|
Exhibit Hall I #361 | |
|
Knowledge Transfer from Interaction Learning
Poster Session 1 & Exhibit Hall
Yilin Gao ⋅ Kangyi Chen ⋅ Zhongxing Peng ⋅ Hengjie Lu ⋅ Shugong Xu
|
Exhibit Hall I #333 | |
|
WIR3D: Visually-Informed and Geometry-Aware 3D Shape Abstraction
Poster Session 4 & Exhibit Hall with Coffee Break
Richard Liu ⋅ Daniel Fu ⋅ Noah Tan ⋅ Itai Lang ⋅ Rana Hanocka
|
Exhibit Hall I #384 | |
|
GSV3D: Gaussian Splatting-based Geometric Distillation with Stable Video Diffusion for Single-Image 3D Object Generation
Poster Session 2 & Exhibit Hall with Coffee Break
Ye Tao ⋅ jiawei zhang ⋅ Yahao Shi ⋅ Dongqing Zou ⋅ Bin Zhou
|
Exhibit Hall I #257 | |
|
Multi-modal Segment Anything Model for Camouflaged Scene Segmentation
Poster Session 5 & Exhibit Hall
Guangyu Ren ⋅ Hengyan Liu ⋅ Michalis Lazarou ⋅ Tania Stathaki
|
Exhibit Hall I #8 | |
|
Shot-by-Shot: Film-Grammar-Aware Training-Free Audio Description Generation
Poster Session 4 & Exhibit Hall with Coffee Break
Junyu Xie ⋅ Tengda Han ⋅ Max Bain ⋅ Arsha Nagrani ⋅ Eshika Khandelwal ⋅ Gül Varol ⋅ Weidi Xie ⋅ Andrew Zisserman
|
Exhibit Hall I #155 | |
|
MagicColor: Multi-instance Sketch Colorization
Poster Session 4 & Exhibit Hall with Coffee Break
yinhan Zhang ⋅ Yue Ma ⋅ Bingyuan Wang ⋅ Qifeng Chen ⋅ Zeyu Wang
|
Exhibit Hall I #30 | |
|
Synthesizing Near-Boundary OOD Samples for Out-of-Distribution Detection
Jinglun Li ⋅ Kaixun Jiang ⋅ Zhaoyu Chen ⋅ Bo Lin ⋅ Yao Tang ⋅ Weifeng Ge ⋅ Wenqiang Zhang
|
Exhibit Hall I #422 | |
|
Cassic: Towards Content-Adaptive State-Space Models for Learned Image Compression
Poster Session 4 & Exhibit Hall with Coffee Break
Shiyu Qin ⋅ Jinpeng Wang ⋅ Yimin Zhou ⋅ Bin Chen ⋅ Tianci Luo ⋅ Baoyi An ⋅ Tao Dai ⋅ Shu-Tao Xia ⋅ Yaowei Wang
|
Exhibit Hall I #80 | |
|
Know Your Attention Maps: Class-specific Token Masking for Weakly Supervised Semantic Segmentation
Poster Session 5 & Exhibit Hall
Joëlle Hanna ⋅ Damian Borth
|
Exhibit Hall I #373 | |
|
Can We Achieve Efficient Diffusion Without Self-Attention? Distilling Self-Attention into Convolutions
Poster Session 4 & Exhibit Hall with Coffee Break
ZiYi Dong ⋅ Chengxing Zhou ⋅ Weijian Deng ⋅ Pengxu Wei ⋅ Xiangyang Ji ⋅ Liang Lin
|
Exhibit Hall I #241 | |
|
Ultra-Precision 6DoF Pose Estimation Using 2-D Interpolated Discrete Fourier Transform
Poster Session 2 & Exhibit Hall with Coffee Break
Guowei Shi ⋅ Zian Mao ⋅ Peisen Huang
|
Exhibit Hall I #72 | |
|
RealGeneral: Unifying Visual Generation via Temporal In-Context Learning with Video Models
Poster Session 4 & Exhibit Hall with Coffee Break
Yijing Lin ⋅ Mengqi Huang ⋅ Shuhan Zhuang ⋅ Zhendong Mao
|
Exhibit Hall I #10 | |
|
PixelStitch: Structure-Preserving Pixel-Wise Bidirectional Warps for Unsupervised Image Stitching
Poster Session 6 & Exhibit Hall with Coffee Break
Hengzhe Jin ⋅ Lang Nie ⋅ Chunyu Lin ⋅ Xiaomei Feng ⋅ Yao Zhao
|
Exhibit Hall I #328 | |
|
A Differentiable Wave Optics Model for End-to-End Computational Imaging System Optimization
Poster Session 6 & Exhibit Hall with Coffee Break
Chi-Jui Ho ⋅ Yash Belhe ⋅ Steve Rotenberg ⋅ Ravi Ramamoorthi ⋅ Tzu-Mao Li ⋅ Nicholas Antipa
|
Exhibit Hall I #320 | |
|
Towards a Universal Image Degradation Model via Content-Degradation Disentanglement
Poster Session 3 & Exhibit Hall
Wenbo Yang ⋅ Zhongling Wang ⋅ Zhou Wang
|
Exhibit Hall I #279 | |
|
Intra-view and Inter-view Correlation Guided Multi-view Novel Class Discovery
Poster Session 1 & Exhibit Hall
Xinhang Wan ⋅ Jiyuan Liu ⋅ Qian Qu ⋅ Suyuan Liu ⋅ Chuyu Zhang ⋅ Fangdi Wang ⋅ Xinwang Liu ⋅ En Zhu ⋅ Kunlun He
|
Exhibit Hall I #385 | |
|
HUST: High-Fidelity Unbiased Skin Tone Estimation via Texture Quantization
Poster Session 3 & Exhibit Hall
Zimin Ran ⋅ Xingyu Ren ⋅ Xiang An ⋅ Kaicheng Yang ⋅ Ziyong Feng ⋅ Jing Yang ⋅ Rolandos Alexandros Potamias ⋅ Linchao Zhu ⋅ Jiankang Deng
|
Exhibit Hall I #332 | |
|
One Polyp Identifies All: One-Shot Polyp Segmentation with SAM via Cascaded Priors and Iterative Prompt Evolution
Poster Session 5 & Exhibit Hall
Xinyu Mao ⋅ Xiaohan Xing ⋅ Fei MENG ⋅ Jianbang LIU ⋅ Fan BAI ⋅ Qiang Nie ⋅ Max Meng
|
Exhibit Hall I #410 | |
|
FDPT: Federated Discrete Prompt Tuning for Black-Box Visual-Language Models
Poster Session 1 & Exhibit Hall
Jiaqi Wu ⋅ Simin Chen ⋅ Jing Tang ⋅ Yuzhe YANG ⋅ Yiming Chen ⋅ Lixu Wang ⋅ Song Lin ⋅ Zehua Wang ⋅ Wei Chen ⋅ Zijian Tian
|
Exhibit Hall I #224 | |
|
Leveraging the Power of MLLMs for Gloss-Free Sign Language Translation
Poster Session 5 & Exhibit Hall
Jungeun Kim ⋅ Hyeongwoo Jeon ⋅ Jongseong Bae ⋅ Ha Young Kim
|
Exhibit Hall I #117 | |
|
CATP-LLM: Empowering Large Language Models for Cost-Aware Tool Planning
Poster Session 2 & Exhibit Hall with Coffee Break
Duo Wu ⋅ Jinghe Wang ⋅ Yuan Meng ⋅ Yanning Zhang ⋅ Le Sun ⋅ Zhi Wang
|
Exhibit Hall I #346 | |
|
Dynamic Group Detection using VLM-augmented Temporal Groupness Graph
Poster Session 3 & Exhibit Hall
Kaname Yokoyama ⋅ Chihiro Nakatani ⋅ Norimichi Ukita
|
Exhibit Hall I #43 | |
|
CanonSwap: High-Fidelity and Consistent Video Face Swapping via Canonical Space Modulation
Poster Session 3 & Exhibit Hall
Xiangyang Luo ⋅ Ye Zhu ⋅ Yunfei Liu ⋅ Lijian Lin ⋅ Cong Wan ⋅ Zijian Cai ⋅ Yu Li ⋅ Shao-Lun Huang
|
Exhibit Hall I #6 | |
|
MedVSR: Medical Video Super-Resolution with Cross State-Space Propagation
Poster Session 3 & Exhibit Hall
Xinyu Liu ⋅ Guolei Sun ⋅ Cheng Wang ⋅ Yixuan Yuan ⋅ Ender Konukoglu
|
Exhibit Hall I #160 | |
|
Learning Deblurring Texture Prior from Unpaired Data with Diffusion Model
Poster Session 3 & Exhibit Hall
Chengxu Liu ⋅ Lu Qi ⋅ Jinshan Pan ⋅ Xueming Qian ⋅ Ming-Hsuan Yang
|
Exhibit Hall I #395 | |
|
Top2Pano: Learning to Generate Indoor Panoramas from Top-Down View
Poster Session 6 & Exhibit Hall with Coffee Break
Zitong Zhang ⋅ Suranjan Gautam ⋅ Rui Yu
|
Exhibit Hall I #365 | |
|
ConformalSAM: Unlocking the Potential of Foundational Segmentation Models in Semi-Supervised Semantic Segmentation with Conformal Prediction
Poster Session 5 & Exhibit Hall
Danhui Chen ⋅ Ziquan Liu ⋅ Chuxi Yang ⋅ Dan Wang ⋅ Yan Yan ⋅ Yi Xu ⋅ Xiangyang Ji
|
Exhibit Hall I #398 | |
|
Underwater Visual SLAM with Depth Uncertainty and Medium Modeling
Rui Liu ⋅ Sheng Fan ⋅ Wenguan Wang ⋅ Yi Yang
|
Exhibit Hall I #83 | |
|
Generalization-Preserved Learning: Closing the Backdoor to Catastrophic Forgetting in Continual Deepfake Detection
Poster Session 1 & Exhibit Hall
Xueyi Zhang ⋅ Peiyin Zhu ⋅ Chengwei Zhang ⋅ Zhiyuan Yan ⋅ Jikang Cheng ⋅ Mingrui Lao ⋅ Siqi Cai ⋅ Yanming Guo
|
Exhibit Hall I #353 | |
|
Open-Vocabulary Octree-Graph for 3D Scene Understanding
Poster Session 2 & Exhibit Hall with Coffee Break
Zhigang Wang ⋅ Yifei Su ⋅ Chenhui Li ⋅ Dong Wang ⋅ Yan Huang ⋅ Xuelong Li ⋅ Bin Zhao
|
Exhibit Hall I #189 | |
|
LangBridge: Interpreting Image as a Combination of Language Embeddings
Poster Session 5 & Exhibit Hall
Jiaqi Liao ⋅ Yuwei Niu ⋅ Fanqing Meng ⋅ Hao Li ⋅ Changyao Tian ⋅ Yinuo Du ⋅ Yuwen Xiong ⋅ Dianqi Li ⋅ Xizhou Zhu ⋅ Li Yuan ⋅ Jifeng Dai ⋅ Yu Cheng
|
Exhibit Hall I #372 | |
|
IGD: Instructional Graphic Design with Multimodal Layer Generation
Poster Session 4 & Exhibit Hall with Coffee Break
Yadong Qu ⋅ Shancheng Fang ⋅ Yuxin Wang ⋅ Xiaorui Wang ⋅ Zhineng Chen ⋅ Hongtao Xie ⋅ Yongdong Zhang
|
Exhibit Hall I #320 | |
|
ResGS: Residual Densification of 3D Gaussian for Efficient Detail Recovery
Poster Session 6 & Exhibit Hall with Coffee Break
Yanzhe Lyu ⋅ Kai Cheng ⋅ Kang Xin ⋅ Xuejin Chen
|
Exhibit Hall I #325 | |
|
DASH: 4D Hash Encoding with Self-Supervised Decomposition for Real-Time Dynamic Scene Rendering
Poster Session 6 & Exhibit Hall with Coffee Break
Jie Chen ⋅ Zhangchi Hu ⋅ Peixi Wu ⋅ Huyue Zhu ⋅ Hebei Li ⋅ Xiaoyan Sun
|
Exhibit Hall I #158 | |
|
Bridging the Gap Between Ideal and Real-world Evaluation: Benchmarking AI-Generated Image Detection in Challenging Scenarios
Poster Session 5 & Exhibit Hall
Chunxiao Li ⋅ Xiaoxiao Wang ⋅ Meiling Li ⋅ Boming Miao ⋅ Peng Sun ⋅ Yunjian Zhang ⋅ Xiangyang Ji ⋅ Yao Zhu
|
Exhibit Hall I #54 | |
|
You Share Beliefs, I Adapt: Progressive Heterogeneous Collaborative Perception
Poster Session 6 & Exhibit Hall with Coffee Break
hao si ⋅ Ehsan Javanmardi ⋅ Manabu Tsukada
|
Exhibit Hall I #270 | |
|
Learning Normal Flow Directly From Events
Poster Session 2 & Exhibit Hall with Coffee Break
Dehao Yuan ⋅ Levi Burner ⋅ Jiayi Wu ⋅ Minghui Liu ⋅ Jingxi Chen ⋅ Yiannis Aloimonos ⋅ Cornelia Fermuller
|
Exhibit Hall I #277 | |
|
UniMLVG: Unified Framework for Multi-view Long Video Generation with Comprehensive Control Capabilities for Autonomous Driving
Poster Session 6 & Exhibit Hall with Coffee Break
Rui Chen ⋅ Zehuan Wu ⋅ Yichen Liu ⋅ Yuxin Guo ⋅ Jingcheng Ni ⋅ Haifeng Xia ⋅ Siyu Xia
|
Exhibit Hall I #69 | |
|
EVT: Efficient View Transformation for Multi-Modal 3D Object Detection
Poster Session 6 & Exhibit Hall with Coffee Break
Yongjin Lee ⋅ Hyeon-Mun Jeong ⋅ Yurim Jeon ⋅ Sanghyun Kim
|
Exhibit Hall I #185 | |
|
An Inversion-based Measure of Memorization for Diffusion Models
Poster Session 4 & Exhibit Hall with Coffee Break
Zhe Ma ⋅ Qingming Li ⋅ Xuhong Zhang ⋅ Tianyu Du ⋅ Ruixiao Lin ⋅ Zonghui Wang ⋅ Shouling Ji ⋅ Wenzhi CHEN
|
Exhibit Hall I #198 | |
|
TLB-VFI: Temporal-Aware Latent Brownian Bridge Diffusion for Video Frame Interpolation
Poster Session 4 & Exhibit Hall with Coffee Break
Zonglin Lyu ⋅ Chen Chen
|
Exhibit Hall I #130 | |
|
Face Retouching with Diffusion Data Generation and Spectral Restorement
Poster Session 3 & Exhibit Hall
Zhidan Xu ⋅ Xiaoqin Zhang ⋅ Shijian Lu
|
Exhibit Hall I #444 | |
|
HumanSAM: Classifying Human-centric Forgery Videos in Human Spatial, Appearance, and Motion Anomaly
Poster Session 3 & Exhibit Hall
Chang Liu ⋅ Yunfan Ye ⋅ Fan Zhang ⋅ Qingyang Zhou ⋅ Yuchuan Luo ⋅ Zhiping Cai
|
Exhibit Hall I #380 | |
|
OcRFDet: Object-Centric Radiance Fields for Multi-View 3D Object Detection in Autonomous Driving
Poster Session 6 & Exhibit Hall with Coffee Break
Mingqian Ji ⋅ Jian Yang ⋅ Shanshan Zhang
|
Exhibit Hall I #20 | |
|
Contrastive Flow Matching
Poster Session 1 & Exhibit Hall
George Stoica ⋅ Vivek Ramanujan ⋅ Xiang Fan ⋅ Ali Farhadi ⋅ Ranjay Krishna ⋅ Judy Hoffman
|
Exhibit Hall I #103 | |
|
Class Token as Proxy: Optimal Transport-assisted Proxy Learning for Weakly Supervised Semantic Segmentation
Poster Session 5 & Exhibit Hall
Jian Wang ⋅ Tianhong Dai ⋅ Bingfeng Zhang ⋅ Siyue Yu ⋅ ENG Gee LIM ⋅ Jimin XIAO
|
Exhibit Hall I #173 | |
|
Neural Compression for 3D Geometry Sets
Poster Session 6 & Exhibit Hall with Coffee Break
Siyu Ren ⋅ Junhui Hou ⋅ Weiyao Lin ⋅ Wenping Wang
|
Exhibit Hall I #54 | |
|
Learnable Logit Adjustment for Imbalanced Semi-Supervised Learning under Class Distribution Mismatch
Poster Session 1 & Exhibit Hall
lee hyuck ⋅ Taemin Park ⋅ Heeyoung Kim
|
Exhibit Hall I #245 | |
|
SMARTIES: Spectrum-Aware Multi-Sensor Auto-Encoder for Remote Sensing Images
Poster Session 2 & Exhibit Hall with Coffee Break
Gencer Sumbul ⋅ Chang Xu ⋅ Emanuele Dalsasso ⋅ Devis Tuia
|
Exhibit Hall I #51 | |
|
Dataset Ownership Verification for Pre-trained Masked Models
Poster Session 1 & Exhibit Hall
Yuechen Xie ⋅ Jie Song ⋅ Yicheng Shan ⋅ Xiaoyan Zhang ⋅ Yuanyu Wan ⋅ Shengxuming Zhang ⋅ Jiarui Duan ⋅ Mingli Song
|
Exhibit Hall I #289 | |
|
CARL: Causality-guided Architecture Representation Learning for an Interpretable Performance Predictor
Poster Session 5 & Exhibit Hall
Han Ji ⋅ Yuqi Feng ⋅ Jiahao Fan ⋅ Yanan Sun
|
Exhibit Hall I #302 | |
|
From Holistic to Localized: Local Enhanced Adapters for Efficient Visual Instruction Fine-Tuning
Poster Session 1 & Exhibit Hall
Pengkun Jiao ⋅ Bin Zhu ⋅ Jingjing Chen ⋅ Chong-Wah Ngo ⋅ Yu-Gang Jiang
|
Exhibit Hall I #251 | |
|
DiffPCI: Large Motion Point Cloud frame Interpolation with Diffusion Model
Poster Session 6 & Exhibit Hall with Coffee Break
tianyu zhang ⋅ Haobo Jiang ⋅ jian Yang ⋅ Jin Xie
|
Exhibit Hall I #254 | |
|
GLEAM: Enhanced Transferable Adversarial Attacks for Vision-Language Pre-training Models via Global-Local Transformations
Poster Session 1 & Exhibit Hall
Yunqi Liu ⋅ Xiaohui Cui ⋅ Ouyang Xue
|
Exhibit Hall I #148 | |
|
Disentangled World Models: Learning to Transfer Semantic Knowledge from Distracting Videos for Reinforcement Learning
Poster Session 1 & Exhibit Hall
Qi Wang ⋅ Zhipeng Zhang ⋅ Baao Xie ⋅ Xin Jin ⋅ Yunbo Wang ⋅ Shiyu Wang ⋅ Liaomo Zheng ⋅ Xiaokang Yang ⋅ Wenjun Zeng
|
Exhibit Hall I #239 | |
|
Stepping Out of Similar Semantic Space for Open-Vocabulary Segmentation
Poster Session 5 & Exhibit Hall
Yong Liu ⋅ Song-Li Wu ⋅ Sule Bai ⋅ Jiahao Wang ⋅ Yitong Wang ⋅ Yansong Tang
|
Exhibit Hall I #269 | |
|
MultiModal Action Conditioned Video Simulation
Poster Session 3 & Exhibit Hall
Yichen Li ⋅ Antonio Torralba
|
Exhibit Hall I #393 | |
|
ClearSight: Human Vision-Inspired Solutions for Event-Based Motion Deblurring
Poster Session 2 & Exhibit Hall with Coffee Break
Xiaopeng LIN ⋅ Yulong Huang ⋅ Hongwei Ren ⋅ Zunchang Liu ⋅ Hongxiang Huang ⋅ Yue Zhou ⋅ Haotian FU ⋅ Bojun Cheng
|
Exhibit Hall I #230 | |
|
PBFG: A New Physically-Based Dataset and Removal of Lens Flares and Glares
Poster Session 2 & Exhibit Hall with Coffee Break
Jie Zhu ⋅ Sungkil Lee
|
Exhibit Hall I #40 | |
|
Correspondence as Video: Test-Time Adaption on SAM2 for Reference Segmentation in the Wild
Poster Session 2 & Exhibit Hall with Coffee Break
Haoran Wang ⋅ Zekun Li ⋅ Jian Zhang ⋅ Lei Qi ⋅ Yinghuan Shi
|
Exhibit Hall I #296 | |
|
An Information-Theoretic Regularizer for Lossy Neural Image Compression
Poster Session 4 & Exhibit Hall with Coffee Break
ZHANG YINGWEN ⋅ Meng Wang ⋅ Xihua Sheng ⋅ Peilin CHEN ⋅ Junru Li ⋅ Li Zhang ⋅ Shiqi Wang
|
Exhibit Hall I #64 | |
|
Knowledge-Guided Part Segmentation
Poster Session 2 & Exhibit Hall with Coffee Break
Xuejian Gou ⋅ Fang Liu ⋅ Licheng Jiao ⋅ Shuo Li ⋅ Lingling Li ⋅ Hao Wang ⋅ Xu Liu ⋅ Puhua Chen ⋅ wenping ma
|
Exhibit Hall I #44 | |
|
ASGS: Single-Domain Generalizable Open-Set Object Detection via Adaptive Subgraph Searching
Poster Session 5 & Exhibit Hall
Yuxuan Yuan ⋅ Luyao Tang ⋅ Chaoqi Chen ⋅ Yixin Chen ⋅ Yue Huang ⋅ Xinghao Ding
|
Exhibit Hall I #105 | |
|
DADet: Safeguarding Image Conditional Diffusion Models against Adversarial and Backdoor Attacks via Diffusion Anomaly Detection
Hongwei Yu ⋅ Xinlong Ding ⋅ Jiawei Li ⋅ Jinlong Wang ⋅ Yudong Zhang ⋅ Rongquan Wang ⋅ Huimin Ma ⋅ Jiansheng Chen
|
Exhibit Hall I #242 | |
|
Unbiased Region-Language Alignment for Open-Vocabulary Dense Prediction
Poster Session 5 & Exhibit Hall
Yunheng Li ⋅ Yuxuan Li ⋅ Quan-Sheng Zeng ⋅ Wenhai Wang ⋅ Qibin Hou ⋅ Ming-Ming Cheng
|
Exhibit Hall I #378 | |
|
Rethinking Layered Graphic Design Generation with a Top-Down Approach
Poster Session 4 & Exhibit Hall with Coffee Break
Jingye Chen ⋅ Zhaowen Wang ⋅ Nanxuan Zhao ⋅ Li Zhang ⋅ Difan Liu ⋅ Jimei Yang ⋅ Qifeng Chen
|
Exhibit Hall I #189 | |
|
LEGO-Maker: A Semantic-Driven Algorithm for Text-to-3D Generation
Poster Session 4 & Exhibit Hall with Coffee Break
Yifei Zhang ⋅ Lei Chen
|
Exhibit Hall I #23 | |
|
ReME: A Data-Centric Framework for Training-Free Open-Vocabulary Segmentation
Xiwei Xuan ⋅ Ziquan Deng ⋅ Kwan-Liu Ma
|
Exhibit Hall I #109 | |
|
MonoMobility: Zero-Shot 3D Mobility Analysis from Monocular Videos
Poster Session 2 & Exhibit Hall with Coffee Break
Hongyi Zhou ⋅ Xiaogang Wang ⋅ Yulan Guo ⋅ Kai Xu
|
Exhibit Hall I #355 | |
|
PCR-GS: COLMAP-Free 3D Gaussian Splatting via Pose Co-Regularizations
Poster Session 6 & Exhibit Hall with Coffee Break
YU WEI ⋅ Jiahui Zhang ⋅ Xiaoqin Zhang ⋅ Ling Shao ⋅ Shijian Lu
|
Exhibit Hall I #172 | |
|
Performing Defocus Deblurring by Modeling its Formation Process
Poster Session 2 & Exhibit Hall with Coffee Break
Zhengbo Zhang ⋅ Lin Geng Foo ⋅ Hossein Rahmani ⋅ Jun Liu ⋅ De Wen Soh
|
Exhibit Hall I #71 | |
|
CasP: Improving Semi-Dense Feature Matching Pipeline Leveraging Cascaded Correspondence Priors for Guidance
Peiqi Chen ⋅ Lei Yu ⋅ Yi Wan ⋅ Yingying Pei ⋅ Xinyi Liu ⋅ YongxiangYao YongxiangYao ⋅ Yingying Zhang ⋅ Lixiang Ru ⋅ Liheng Zhong ⋅ Jingdong Chen ⋅ Ming Yang ⋅ Yongjun Zhang
|
Exhibit Hall I #322 | |
|
OmniDiff: A Comprehensive Benchmark for Fine-grained Image Difference Captioning
Poster Session 5 & Exhibit Hall
Yuan Liu ⋅ Saihui Hou ⋅ Saijie Hou ⋅ Jiabao Du ⋅ Shibei Meng ⋅ Yongzhen Huang
|
Exhibit Hall I #152 | |
|
Toward Long-Tailed Online Anomaly Detection through Class-Agnostic Concepts
Poster Session 5 & Exhibit Hall
Chiao-An Yang ⋅ Kuan-Chuan Peng ⋅ Raymond A. Yeh
|
Exhibit Hall I #341 | |
|
PLMP - Point-Line Minimal Problems for Projective SfM
Kim Kiehn ⋅ Albin Ahlbäck ⋅ Kathlén Kohn
|
Exhibit Hall I #333 | |
|
More Reliable Pseudo-labels, Better Performance: A Generalized Approach to Single Positive Multi-label Learning
Poster Session 1 & Exhibit Hall
Luong Tran ⋅ Thieu Vo ⋅ Anh Nguyen ⋅ Sang Dinh ⋅ Van Nguyen
|
Exhibit Hall I #118 | |
|
SpiLiFormer: Enhancing Spiking Transformers with Lateral Inhibition
Poster Session 5 & Exhibit Hall
Zeqi Zheng ⋅ Yanchen Huang ⋅ Yingchao Yu ⋅ Zizheng Zhu ⋅ Junfeng Tang ⋅ Zhaofei Yu ⋅ Yaochu Jin
|
Exhibit Hall I #444 | |
|
SparseVILA: Decoupling Visual Sparsity for Efficient VLM Inference
Poster Session 5 & Exhibit Hall
Samir Khaki ⋅ Junxian Guo ⋅ Jiaming Tang ⋅ Shang Yang ⋅ Yukang Chen ⋅ Konstantinos Plataniotis ⋅ Yao Lu ⋅ Song Han ⋅ Zhijian Liu
|
Exhibit Hall I #375 | |
|
Active Perception Meets Rule-Guided RL: A Two-Phase Approach for Precise Object Navigation in Complex Environments
Poster Session 2 & Exhibit Hall with Coffee Break
Liang Qin ⋅ Min Wang ⋅ Peiwei Li ⋅ Wengang Zhou ⋅ Houqiang Li
|
Exhibit Hall I #243 | |
|
INTER: Mitigating Hallucination in Large Vision-Language Models by Interaction Guidance Sampling
Poster Session 1 & Exhibit Hall
Xin Dong ⋅ Shichao Dong ⋅ Jin Wang ⋅ Jing Huang ⋅ Li Zhou ⋅ Zenghui Sun ⋅ Lihua Jing ⋅ Jinsong Lan ⋅ Xiaoyong Zhu ⋅ Bo Zheng
|
Exhibit Hall I #233 | |
|
UNIS: A Unified Framework for Achieving Unbiased Neural Implicit Surfaces in Volume Rendering
Poster Session 6 & Exhibit Hall with Coffee Break
Junkai Deng ⋅ Hanting Niu ⋅ Jiaze Li ⋅ Fei Hou ⋅ Ying He
|
Exhibit Hall I #284 | |
|
Loss Functions for Predictor-based Neural Architecture Search
Poster Session 1 & Exhibit Hall
Han Ji ⋅ Yuqi Feng ⋅ Jiahao Fan ⋅ Yanan Sun
|
Exhibit Hall I #144 | |
|
Advancing Text-to-3D Generation with Linearized Lookahead Variational Score Distillation
Poster Session 4 & Exhibit Hall with Coffee Break
Yu Lei ⋅ Bingde Liu ⋅ Qingsong Xie ⋅ Haonan Lu ⋅ Zhijie Deng
|
Exhibit Hall I #448 | |
|
Decoding Correlation-Induced Misalignment in the Stable Diffusion Workflow for Text-to-Image Generation
Poster Session 4 & Exhibit Hall with Coffee Break
Yunze Tong ⋅ Fengda Zhang ⋅ Didi Zhu ⋅ Jun Xiao ⋅ Kun Kuang
|
Exhibit Hall I #317 | |
|
Steering Guidance for Personalized Text-to-Image Diffusion Models
Poster Session 4 & Exhibit Hall with Coffee Break
Sunghyun Park ⋅ Seokeon Choi ⋅ Hyoungwoo Park ⋅ Sungrack Yun
|
Exhibit Hall I #97 | |
|
ONLY: One-Layer Intervention Sufficiently Mitigates Hallucinations in Large Vision-Language Models
Poster Session 1 & Exhibit Hall
Zifu Wan ⋅ Ce Zhang ⋅ Silong Yong ⋅ Martin Ma ⋅ Simon Stepputtis ⋅ Louis-Philippe Morency ⋅ Deva Ramanan ⋅ Katia Sycara ⋅ Yaqi Xie
|
Exhibit Hall I #298 | |
|
M-SpecGene: Generalized Foundation Model for RGBT Multispectral Vision
Poster Session 2 & Exhibit Hall with Coffee Break
Kailai Zhou ⋅ Fuqiang Yang ⋅ Shixian Wang ⋅ Bihan Wen ⋅ Chongde Zi ⋅ Linsen Chen ⋅ Qiu Shen ⋅ Xun Cao
|
Exhibit Hall I #267 | |
|
SpatialCrafter: Unleashing the Imagination of Video Diffusion Models for Scene Reconstruction from Limited Observations
Poster Session 6 & Exhibit Hall with Coffee Break
Songchun Zhang ⋅ Huiyao Xu ⋅ Sitong Guo ⋅ Zhongwei Xie ⋅ Hujun Bao ⋅ Weiwei Xu ⋅ Changqing Zou
|
Exhibit Hall I #297 | |
|
Seeing Through Deepfakes: A Human-Inspired Framework for Multi-Face Detection
Poster Session 3 & Exhibit Hall
Juan Hu ⋅ Shaojing Fan ⋅ Terence Sim
|
Exhibit Hall I #425 | |
|
Snakes and Ladders: Two Steps Up for VideoMamba
Poster Session 5 & Exhibit Hall
Hui Lu ⋅ Albert Ali Salah ⋅ Ronald Poppe
|
Exhibit Hall I #415 | |
|
Efficient Visual Place Recognition Through Multimodal Semantic Knowledge Integration
Poster Session 2 & Exhibit Hall with Coffee Break
Sitao Zhang ⋅ Hongda Mao ⋅ Qingshuang Chen ⋅ Yelin Kim
|
Exhibit Hall I #54 | |
|
COME: Dual Structure-Semantic Learning with Collaborative MoE for Universal Lesion Detection Across Heterogeneous Ultrasound Datasets
Poster Session 5 & Exhibit Hall
Lingyu Chen ⋅ Yawen Zeng ⋅ Yue Wang ⋅ Peng Wan ⋅ Guo-chen Ning ⋅ Hongen Liao ⋅ Daoqiang Zhang ⋅ Fang Chen
|
Exhibit Hall I #154 | |
|
Hybrid Layout Control for Diffusion Transformer: Fewer Annotations, Superior Aesthetics
Poster Session 4 & Exhibit Hall with Coffee Break
Keming Wu ⋅ Junwen Chen ⋅ Zhanhao Liang ⋅ Yinuo Wang ⋅ Ji Li ⋅ Chao Zhang ⋅ Bin Wang ⋅ Yuhui Yuan
|
Exhibit Hall I #291 | |
|
PLAN: Proactive Low-Rank Allocation for Continual Learning
Poster Session 1 & Exhibit Hall
XIEQUN WANG ⋅ Zhan Zhuang ⋅ Yu Zhang
|
Exhibit Hall I #268 | |
|
Leveraging Spatial Invariance to Boost Adversarial Transferability
Poster Session 1 & Exhibit Hall
Zihan Zhou ⋅ LI LI ⋅ Yanli Ren ⋅ Chuan Qin ⋅ Guorui Feng
|
Exhibit Hall I #125 | |
|
AnyPortal: Zero-Shot Consistent Video Background Replacement
Poster Session 4 & Exhibit Hall with Coffee Break
Wenshuo Gao ⋅ Xicheng Lan ⋅ Shuai Yang
|
Exhibit Hall I #394 | |
|
Textured 3D Regenerative Morphing with 3D Diffusion Prior
Poster Session 4 & Exhibit Hall with Coffee Break
Songlin Yang ⋅ Yushi LAN ⋅ Honghua Chen ⋅ Xingang Pan
|
Exhibit Hall I #26 | |
|
Inference-Time Diffusion Model Distillation
Poster Session 1 & Exhibit Hall
Geon Yeong Park ⋅ Sang Wan Lee ⋅ Jong Ye
|
Exhibit Hall I #377 | |
|
Marigold-DC: Zero-Shot Monocular Depth Completion with Guided Diffusion
Poster Session 2 & Exhibit Hall with Coffee Break
Massimiliano Viola ⋅ Kevin Qu ⋅ Nando Metzger ⋅ Bingxin Ke ⋅ Alexander Becker ⋅ Konrad Schindler ⋅ Anton Obukhov
|
Exhibit Hall I #32 | |
|
Transformer-based Tooth Alignment Prediction with Occlusion and Collision Constraints
Poster Session 6 & Exhibit Hall with Coffee Break
DongZhenXing DongZhenXing ⋅ Jiazhou Chen
|
Exhibit Hall I #40 | |
|
EC-Flow: Enabling Versatile Robotic Manipulation from Action-Unlabeled Videos via Embodiment-Centric Flow
Poster Session 3 & Exhibit Hall
Yixiang Chen ⋅ Peiyan Li ⋅ Yan Huang ⋅ Jiabing Yang ⋅ Kehan Chen ⋅ Liang Wang
|
Exhibit Hall I #184 | |
|
CityNav: A Large-Scale Dataset for Real-World Aerial Navigation
Poster Session 2 & Exhibit Hall with Coffee Break
Jungdae Lee ⋅ Taiki Miyanishi ⋅ Shuhei Kurita ⋅ Koya Sakamoto ⋅ Daichi Azuma ⋅ Yutaka Matsuo ⋅ Nakamasa Inoue
|
Exhibit Hall I #84 | |
|
Scene Graph Guided Generation: Enable Accurate Relations Generation in Text-to-Image Models via Textural Rectification
Poster Session 4 & Exhibit Hall with Coffee Break
Guibao SHEN ⋅ Luozhou Wang ⋅ Jiantao Lin ⋅ Wenhang Ge ⋅ CHAOZHE ZHANG ⋅ Xin Tao ⋅ Di ZHANG ⋅ Pengfei Wan ⋅ Guangyong Chen ⋅ Yijun Li ⋅ Ying-Cong Chen
|
Exhibit Hall I #51 | |
|
ReMP-AD: Retrieval-enhanced Multi-modal Prompt Fusion for Few-Shot Industrial Visual Anomaly Detection
Poster Session 5 & Exhibit Hall
Hongchi Ma ⋅ Guanglei Yang ⋅ Debin Zhao ⋅ Yanli JI ⋅ Wangmeng Zuo
|
Exhibit Hall I #58 | |
|
GMMamba: Group Masking Mamba for Whole Slide Image Classification
Poster Session 3 & Exhibit Hall
Tingting Zheng ⋅ Hongxun Yao ⋅ Kui Jiang ⋅ Yi Xiao ⋅ Sicheng Zhao
|
Exhibit Hall I #301 | |
|
TimeFormer: Capturing Temporal Relationships of Deformable 3D Gaussians for Robust Reconstruction
Poster Session 2 & Exhibit Hall with Coffee Break
Dadong Jiang ⋅ Zhi Hou ⋅ Zhihui Ke ⋅ Xianghui Yang ⋅ Xiaobo Zhou ⋅ Tie Qiu
|
Exhibit Hall I #348 | |
|
Hierarchy UGP: Hierarchy Unified Gaussian Primitive for Large-Scale Dynamic Scene Reconstruction
Poster Session 6 & Exhibit Hall with Coffee Break
Hongyang Sun ⋅ Qinglin Yang ⋅ Jiawei Wang ⋅ Zhen Xu ⋅ Chen Liu ⋅ Yida Wang ⋅ Kun Zhan ⋅ Hujun Bao ⋅ Xiaowei Zhou ⋅ Sida Peng
|
Exhibit Hall I #149 | |
|
SciVid: Cross-Domain Evaluation of Video Models in Scientific Applications
Poster Session 5 & Exhibit Hall
Yana Hasson ⋅ Pauline Luc ⋅ Liliane Momeni ⋅ Maks Ovsjanikov ⋅ Guillaume Le Moing ⋅ Alina Kuznetsova ⋅ Ira Ktena ⋅ Jennifer J. Sun ⋅ Skanda Koppula ⋅ Dilara Gokay ⋅ Joseph Heyward ⋅ Etienne Pot ⋅ Andrew Zisserman
|
Exhibit Hall I #187 | |
|
Backdoor Mitigation by Distance-Driven Detoxification
Shaokui Wei ⋅ Jiayin Liu ⋅ Hongyuan Zha
|
Exhibit Hall I #419 | |
|
Democratizing High-Fidelity Co-Speech Gesture Video Generation
Poster Session 3 & Exhibit Hall
Xu Yang ⋅ Shaoli Huang ⋅ Shenbo Xie ⋅ Xuelin Chen ⋅ Yifei Liu ⋅ Changxing Ding
|
Exhibit Hall I #403 | |
|
UnrealZoo: Enriching Photo-realistic Virtual Worlds for Embodied AI
Fangwei Zhong ⋅ Kui Wu ⋅ Churan Wang ⋅ Hao Chen ⋅ Hai Ci ⋅ Zhoujun Li ⋅ Yizhou Wang
|
Exhibit Hall I #69 | |
|
Region-based Cluster Discrimination for Visual Representation Learning
Yin Xie ⋅ Kaicheng Yang ⋅ Xiang An ⋅ Kun Wu ⋅ Yongle Zhao ⋅ Weimo Deng ⋅ Zimin Ran ⋅ Yumeng Wang ⋅ Ziyong Feng ⋅ Roy Miles ⋅ Ismail Elezi ⋅ Jiankang Deng
|
Exhibit Hall I #162 | |
|
CMB-ML: A Cosmic Microwave Background Dataset for the Oldest Possible Computer Vision Task
Poster Session 2 & Exhibit Hall with Coffee Break
James Amato ⋅ Yunan Xie ⋅ Leonel Medina-Varela ⋅ Ammar Aljerwi ⋅ Adam McCutcheon ⋅ T. Rippentrop ⋅ Kristian Gonzalez ⋅ Jacques Delabrouille ⋅ Mustapha Ishak ⋅ Nicholas Ruozzi
|
Exhibit Hall I #413 | |
|
Adapt Foundational Segmentation Models with Heterogeneous Searching Space
Poster Session 5 & Exhibit Hall
Li Yi ⋅ Jie Hu ⋅ Songan Zhang ⋅ GUANNAN JIANG
|
Exhibit Hall I #336 | |
|
Think Twice: Test-Time Reasoning for Robust CLIP Zero-Shot Classification
Poster Session 1 & Exhibit Hall
Shenyu Lu ⋅ Zhaoying Pan ⋅ Xiaoqian Wang
|
Exhibit Hall I #269 | |
|
Rethinking Detecting Salient and Camouflaged Objects in Unconstrained Scenes
Poster Session 5 & Exhibit Hall
Zhangjun Zhou ⋅ Yiping Li ⋅ Chunlin Zhong ⋅ Jianuo Huang ⋅ Jialun Pei ⋅ Hua Li ⋅ He Tang
|
Exhibit Hall I #242 | |
|
Counting Stacked Objects
Poster Session 5 & Exhibit Hall
Corentin Dumery ⋅ Noa Ette ⋅ Aoxiang Fan ⋅ Ren Li ⋅ Jingyi Xu ⋅ Hieu Le ⋅ Pascal Fua
|
Exhibit Hall I #74 | |
|
Joint Self-Supervised Video Alignment and Action Segmentation
Poster Session 3 & Exhibit Hall
Ali Shah Ali ⋅ Syed Ahmed Mahmood ⋅ Mubin Saeed ⋅ Andrey Konin ⋅ Zeeshan Zia ⋅ Quoc-Huy Tran
|
Exhibit Hall I #76 | |
|
TrackAny3D: Transferring Pretrained 3D Models for Category-unified 3D Point Cloud Tracking
Poster Session 6 & Exhibit Hall with Coffee Break
Mengmeng Wang ⋅ Haonan Wang ⋅ Yulong Li ⋅ Xiangjie Kong ⋅ Jiaxin Du ⋅ Feng Xia ⋅ Guojiang Shen
|
Exhibit Hall I #340 | |
|
Allowing Oscillation Quantization: Overcoming Solution Space Limitation in Low Bit-Width Quantization
Poster Session 5 & Exhibit Hall
Weiying Xie ⋅ Zihan Meng ⋅ Jitao Ma ⋅ Wenjin Guo ⋅ Haowei Li ⋅ Haonan Qin ⋅ Leyuan Fang ⋅ Yunsong Li
|
Exhibit Hall I #451 | |
|
MOVE: Motion-Guided Few-Shot Video Object Segmentation
Poster Session 3 & Exhibit Hall
Kaining Ying ⋅ Hengrui Hu ⋅ Henghui Ding
|
Exhibit Hall I #154 | |
|
SDFormer: Vision-based 3D Semantic Scene Completion via SAM-assisted Dual-channel Voxel Transformer
Poster Session 6 & Exhibit Hall with Coffee Break
Yujie Xue ⋅ Huilong Pi ⋅ Jiapeng Zhang ⋅ Qin Yunchuan ⋅ Zhuo Tang ⋅ Kenli Li ⋅ Ruihui Li
|
Exhibit Hall I #204 | |
|
Enhancing Numerical Prediction of MLLMs with Soft Labeling
Poster Session 1 & Exhibit Hall
Pei Wang ⋅ Zhaowei Cai ⋅ Hao Yang ⋅ Davide Modolo ⋅ Ashwin Swaminathan
|
Exhibit Hall I #318 | |
|
TopoTTA: Topology-Enhanced Test-Time Adaptation for Tubular Structure Segmentation
Poster Session 5 & Exhibit Hall
Jiale Zhou ⋅ Wenhan Wang ⋅ Shikun Li ⋅ Xiaolei Qu ⋅ Xin Guo ⋅ Yizhong Liu ⋅ Wenzhong Tang ⋅ Xun Lin ⋅ Yefeng Zheng
|
Exhibit Hall I #405 | |
|
RealCam-I2V: Real-World Image-to-Video Generation with Interactive Complex Camera Control
Poster Session 6 & Exhibit Hall with Coffee Break
Teng Li ⋅ Guangcong Zheng ⋅ Rui Jiang ⋅ Shuigenzhan Shuigenzhan ⋅ Tao Wu ⋅ Yehao Lu ⋅ Yining Lin ⋅ Chuanyun Deng ⋅ Yepan Xiong ⋅ Min Chen ⋅ Lin Cheng ⋅ Xi Li
|
Exhibit Hall I #392 | |
|
ReAL-AD: Towards Human-Like Reasoning in End-to-End Autonomous Driving
Poster Session 6 & Exhibit Hall with Coffee Break
Yuhang Lu ⋅ Jiadong Tu ⋅ Yuexin Ma ⋅ Xinge Zhu
|
Exhibit Hall I #296 | |
|
TAD-E2E: A Large-scale End-to-end Autonomous Driving Dataset
Poster Session 6 & Exhibit Hall with Coffee Break
Chang Liu ⋅ mingxuzhu mingxuzhu ⋅ Zheyuan Zhang ⋅ Linna Song ⋅ xiao zhao ⋅ Luo Qingliang ⋅ Qi Wang ⋅ Chufan Guo ⋅ Kuifeng Su
|
Exhibit Hall I #182 | |
|
MotionDiff: Training-free Zero-shot Interactive Motion Editing via Flow-assisted Multi-view Diffusion
Poster Session 3 & Exhibit Hall
Yikun Ma ⋅ Yiqing Li ⋅ Jiawei Wu ⋅ Xing Luo ⋅ Zhi Jin
|
Exhibit Hall I #421 | |
|
FPEM: Face Prior Enhanced Facial Attractiveness Prediction for Live Videos with Face Retouching
Hui Li ⋅ Xiaoyu Ren ⋅ Hongjiu Yu ⋅ Ying Chen ⋅ Kai Li ⋅ L Wang ⋅ Xiongkuo Min ⋅ Huiyu Duan ⋅ Guangtao Zhai ⋅ Xu Liu
|
Exhibit Hall I #136 | |
|
VAGUE: Visual Contexts Clarify Ambiguous Expressions
Poster Session 1 & Exhibit Hall
Heejeong Nam ⋅ Jinwoo Ahn ⋅ Keummin Ka ⋅ Jiwan Chung ⋅ Youngjae Yu
|
Exhibit Hall I #136 | |
|
Overcoming Dual Drift for Continual Long-Tailed Visual Question Answering
Poster Session 1 & Exhibit Hall
Feifei Zhang ⋅ Zhihao Wang ⋅ Xi Zhang ⋅ Changsheng Xu
|
Exhibit Hall I #414 | |
|
Photolithography Overlay Map Generation with Implicit Knowledge Distillation Diffusion Transformer
Poster Session 4 & Exhibit Hall with Coffee Break
YuanFu Yang ⋅ Hsiu-Hui Hsiao
|
Exhibit Hall I #37 | |
|
Does Your Vision-Language Model Get Lost in the Long Video Sampling Dilemma?
Poster Session 5 & Exhibit Hall
Tianyuan Qu ⋅ Longxiang Tang ⋅ Bohao PENG ⋅ Senqiao Yang ⋅ Bei Yu ⋅ Jiaya Jia
|
Exhibit Hall I #103 | |
|
What's Making That Sound Right Now? Video-centric Audio-Visual Localization
Poster Session 5 & Exhibit Hall
hahyeon choi ⋅ Junhoo Lee ⋅ Nojun Kwak
|
Exhibit Hall I #28 | |
|
STD-GS: Exploring Frame-Event Interaction for SpatioTemporal-Disentangled Gaussian Splatting to Reconstruct High-Dynamic Scene
Poster Session 6 & Exhibit Hall with Coffee Break
Hanyu Zhou ⋅ Haonan Wang ⋅ Haoyue Liu ⋅ Yuxing Duan ⋅ Luxin Yan ⋅ Gim Hee Lee
|
Exhibit Hall I #8 | |
|
Robust Machine Unlearning for Quantized Neural Networks via Adaptive Gradient Reweighting with Similar Labels
Poster Session 5 & Exhibit Hall
Yujia Tong ⋅ Yuze Wang ⋅ Jingling Yuan ⋅ Chuang Hu
|
Exhibit Hall I #77 | |
|
Zero-Shot Vision Encoder Grafting via LLM Surrogates
Poster Session 1 & Exhibit Hall
Kaiyu Yue ⋅ Vasu Singla ⋅ Menglin Jia ⋅ John Kirchenbauer ⋅ Rifaa Qadri ⋅ Zikui Cai ⋅ Abhinav Bhatele ⋅ Furong Huang ⋅ Tom Goldstein
|
Exhibit Hall I #400 | |
|
OV-SCAN: Semantically Consistent Alignment for Novel Object Discovery in Open-Vocabulary 3D Object Detection
Poster Session 2 & Exhibit Hall with Coffee Break
Adrian Chow ⋅ Evelien Riddell ⋅ Yimu Wang ⋅ Sean Sedwards ⋅ Krzysztof Czarnecki
|
Exhibit Hall I #279 | |
|
FullDiT: Video Generative Foundation Models with Multimodal Control via Full Attention
Poster Session 4 & Exhibit Hall with Coffee Break
Xuan Ju ⋅ Weicai Ye ⋅ Quande Liu ⋅ Qiulin Wang ⋅ Xintao Wang ⋅ Pengfei Wan ⋅ Di ZHANG ⋅ Kun Gai ⋅ Qiang Xu
|
Exhibit Hall I #81 | |
|
SC-Lane: Slope-aware and Consistent Road Height Estimation Framework for 3D Lane Detection
Poster Session 6 & Exhibit Hall with Coffee Break
Chaesong Park ⋅ Eunbin Seo ⋅ JihyeonHwang JihyeonHwang ⋅ Jongwoo Lim
|
Exhibit Hall I #355 | |
|
Exploring the Visual Feature Space for Multimodal Neural Decoding
Poster Session 1 & Exhibit Hall
Weihao Xia ⋅ Cengiz Oztireli
|
Exhibit Hall I #410 | |
|
RAGNet: Large-scale Reasoning-based Affordance Segmentation Benchmark towards General Grasping
Poster Session 3 & Exhibit Hall
Dongming Wu ⋅ Yanping Fu ⋅ Saike Huang ⋅ Yingfei Liu ⋅ Fan Jia ⋅ Nian Liu ⋅ Feng Dai ⋅ Tiancai Wang ⋅ Rao Anwer ⋅ Fahad Khan ⋅ Jianbing Shen
|
Exhibit Hall I #186 | |
|
GeoMan: Temporally Consistent Human Geometry Estimation using Image-to-Video Diffusion
Poster Session 2 & Exhibit Hall with Coffee Break
Gwanghyun Kim ⋅ Xueting Li ⋅ Ye Yuan ⋅ Koki Nagano ⋅ Tianye Li ⋅ Jan Kautz ⋅ Se Young Chun ⋅ Umar Iqbal
|
Exhibit Hall I #229 | |
|
Stereo Any Video: Temporally Consistent Stereo Matching
Junpeng Jing ⋅ Weixun Luo ⋅ Ye Mao ⋅ Krystian Mikolajczyk
|
Exhibit Hall I #98 | |
|
Keyframe-oriented Vision Token Pruning: Enhancing Efficiency of Large Vision Language Models on Long-Form Video Processing
Poster Session 5 & Exhibit Hall
Yudong Liu ⋅ Jingwei Sun ⋅ Yueqian Lin ⋅ Jingyang Zhang ⋅ Ming Yin ⋅ Qinsi Wang ⋅ Jianyi Zhang ⋅ Hai Li ⋅ Yiran Chen
|
Exhibit Hall I #95 | |
|
When Confidence Fails: Revisiting Pseudo-Label Selection in Semi-supervised Semantic Segmentation
Pan Liu ⋅ Jinshi Liu
|
Exhibit Hall I #194 | |
|
Bridging Local Inductive Bias and Long-Range Dependencies with Pixel-Mamba for End-to-end Whole Slide Image Analysis
Poster Session 5 & Exhibit Hall
Zhongwei Qiu ⋅ Hanqing Chao ⋅ Tiancheng Lin ⋅ Wanxing Chang ⋅ Zijiang Yang ⋅ Wenpei Jiao ⋅ Yixuan Shen ⋅ Yunshuo Zhang ⋅ Yelin Yang ⋅ Wenbin Liu ⋅ Hui Jiang ⋅ Yun Bian ⋅ Ke Yan ⋅ Dakai Jin ⋅ Le Lu
|
Exhibit Hall I #276 | |
|
Neuroverse3D: Developing In-Context Learning Universal Model for Neuroimaging in 3D
Poster Session 5 & Exhibit Hall
Jiesi Hu ⋅ Hanyang Peng ⋅ Yanwu Yang ⋅ Xutao Guo ⋅ Yang Shang ⋅ Pengcheng Shi ⋅ Chenfei Ye ⋅ Ting Ma
|
Exhibit Hall I #180 | |
|
Heavy Labels Out! Dataset Distillation with Label Space Lightening
Poster Session 2 & Exhibit Hall with Coffee Break
Ruonan Yu ⋅ Songhua Liu ⋅ Zigeng Chen ⋅ Jingwen Ye ⋅ Xinchao Wang
|
Exhibit Hall I #226 | |
|
Taming Flow Matching with Unbalanced Optimal Transport into Fast Pansharpening
Poster Session 1 & Exhibit Hall
Zihan Cao ⋅ Yu Zhong ⋅ Liang-Jian Deng
|
Exhibit Hall I #258 | |
|
Revisiting Pool-based Prompt Learning for Few-shot Class-incremental Learning
Poster Session 1 & Exhibit Hall
Yongwei Jiang ⋅ Yixiong Zou ⋅ Yuhua Li ⋅ Ruixuan Li
|
Exhibit Hall I #114 | |
|
CarGait: Cross-Attention based Re-ranking for Gait recognition
Poster Session 3 & Exhibit Hall
Gavriel Habib ⋅ Noa Barzilay ⋅ Or Shimshi ⋅ Rami Ben-Ari ⋅ Nir Darshan
|
Exhibit Hall I #177 | |
|
Incremental Few-Shot Semantic Segmentation via Multi-Level Switchable Visual Prompts
Poster Session 5 & Exhibit Hall
Maoxian Wan ⋅ Kaige Li ⋅ Qichuan Geng ⋅ Weimin Shi ⋅ Zhong Zhou
|
Exhibit Hall I #404 | |
|
ZeroKey: Point-Level Reasoning and Zero-Shot 3D Keypoint Detection from Large Language Models
Poster Session 5 & Exhibit Hall
Bingchen Gong ⋅ Diego Gomez ⋅ Abdullah Hamdi ⋅ Abdelrahman Eldesokey ⋅ Ahmed Abdelreheem ⋅ Peter Wonka ⋅ Maks Ovsjanikov
|
Exhibit Hall I #214 | |
|
SVIP: Semantically Contextualized Visual Patches for Zero-Shot Learning
Poster Session 1 & Exhibit Hall
Zhi Chen ⋅ Zecheng Zhao ⋅ Jingcai Guo ⋅ Jingjing Li ⋅ Zi Huang
|
Exhibit Hall I #311 | |
|
Flash-VStream: Efficient Real-Time Understanding for Long Video Streams
Poster Session 5 & Exhibit Hall
Haoji Zhang ⋅ Yiqin Wang ⋅ Yansong Tang ⋅ Yong Liu ⋅ Jiashi Feng ⋅ Xiaojie Jin
|
Exhibit Hall I #118 | |
|
MR-FIQA: Face Image Quality Assessment with Multi-Reference Representations from Synthetic Data Generation
Poster Session 3 & Exhibit Hall
Fu-Zhao Ou ⋅ Chongyi Li ⋅ Shiqi Wang ⋅ Sam Kwong
|
Exhibit Hall I #274 | |
|
Learnable Fractional Reaction-Diffusion Dynamics for Under-Display ToF Imaging and Beyond
Poster Session 2 & Exhibit Hall with Coffee Break
Xin Qiao ⋅ Matteo Poggi ⋅ Xing Wei ⋅ Pengchao Deng ⋅ Yanhui Zhou ⋅ Stefano Mattoccia
|
Exhibit Hall I #99 | |
|
Gait-X: Exploring X modality for Generalized Gait Recognition
Poster Session 3 & Exhibit Hall
Zengbin Wang ⋅ Saihui Hou ⋅ Junjie Li ⋅ Xu Liu ⋅ Chunshui Cao ⋅ Yongzhen Huang ⋅ Siye Wang ⋅ Man Zhang
|
Exhibit Hall I #308 | |
|
Scendi Score: Prompt‑Aware Diversity Evaluation via Schur Complement of CLIP Embeddings
Azim Ospanov ⋅ Mohammad Jalali ⋅ Farzan Farnia
|
Exhibit Hall I #195 | |
|
Discretized Gaussian Representation for Tomographic Reconstruction
Poster Session 6 & Exhibit Hall with Coffee Break
Shaokai Wu ⋅ Yuxiang Lu ⋅ Yapan Guo ⋅ Wei Ji ⋅ Suizhi Huang ⋅ Fengyu Yang ⋅ Shalayiding Sirejiding ⋅ Qichen He ⋅ Jing Tong ⋅ Yanbiao Ji ⋅ Yue Ding ⋅ Hongtao Lu
|
Exhibit Hall I #33 | |
|
Wave-MambaAD: Wavelet-driven State Space Model for Multi-class Unsupervised Anomaly Detection
Poster Session 5 & Exhibit Hall
Qiao Zhang ⋅ Mingwen Shao ⋅ Xinyuan Chen ⋅ Xiang Lv ⋅ Kai Xu
|
Exhibit Hall I #101 | |
|
3D Test-time Adaptation via Graph Spectral Driven Point Shift
Poster Session 6 & Exhibit Hall with Coffee Break
Xin Wei ⋅ Qin Yang ⋅ Yijie Fang ⋅ Mingrui Zhu ⋅ Nannan Wang
|
Exhibit Hall I #197 | |
|
Task-Decoupled Bézier Surface Constraint for Uneven Low-Light Image Enhancement
Poster Session 2 & Exhibit Hall with Coffee Break
Xingxiang Zhou ⋅ Xiangdong Su ⋅ Haoran Zhang ⋅ Wei Chen ⋅ Guanglai Gao
|
Exhibit Hall I #173 | |
|
EMoTive: Event-guided Trajectory Modeling for 3D Motion Estimation
Poster Session 2 & Exhibit Hall with Coffee Break
Zengyu Wan ⋅ Wei Zhai ⋅ Yang Cao ⋅ Zheng-Jun Zha
|
Exhibit Hall I #406 | |
|
Text-to-Any-Skeleton Motion Generation Without Retargeting
Poster Session 3 & Exhibit Hall
Qingyuan Liu ⋅ Ke Lv ⋅ Kun Dong ⋅ Jian Xue ⋅ Zehai Niu ⋅ Jinbao Wang
|
Exhibit Hall I #275 | |
|
Completing 3D Partial Assemblies with View-Consistent 2D-3D Correspondence
Poster Session 2 & Exhibit Hall with Coffee Break
Weihao Wang ⋅ Yu Lan ⋅ Mingyu You ⋅ Bin He
|
Exhibit Hall I #256 | |
|
Aligning Global Semantics and Local Textures in Generative Video Enhancement
Poster Session 4 & Exhibit Hall with Coffee Break
Zhikai Chen ⋅ Fuchen Long ⋅ Zhaofan Qiu ⋅ Ting Yao ⋅ Wengang Zhou ⋅ Jiebo Luo ⋅ Tao Mei
|
Exhibit Hall I #210 | |
|
Simulating Dual-Pixel Images From Ray Tracing For Depth Estimation
Poster Session 6 & Exhibit Hall with Coffee Break
Fengchen He ⋅ Dayang Zhao ⋅ Hao Xu ⋅ Tingwei Quan ⋅ Shaoqun zeng
|
Exhibit Hall I #132 | |
|
Robust 3D-Masked Part-level Editing in 3D Gaussian Splatting with Regularized Score Distillation Sampling
Poster Session 2 & Exhibit Hall with Coffee Break
Hayeon Kim ⋅ Ji Ha Jang ⋅ Se Young Chun
|
Exhibit Hall I #45 | |
|
Learning Robust Stereo Matching in the Wild with Selective Mixture-of-Experts
Yun Wang ⋅ Longguang Wang ⋅ Chenghao Zhang ⋅ Yongjian Zhang ⋅ Zhanjie Zhang ⋅ Ao Ma ⋅ Chenyou Fan ⋅ Tin Lun Lam ⋅ Junjie Hu
|
Exhibit Hall I #137 | |
|
Global-Aware Monocular Semantic Scene Completion with State Space Models
Poster Session 6 & Exhibit Hall with Coffee Break
Shijie Li ⋅ Zhongyao Cheng ⋅ Rong Li ⋅ Shuai Li ⋅ Juergen Gall ⋅ Xun Xu ⋅ Xulei Yang
|
Exhibit Hall I #80 | |
|
DIMO: Diverse 3D Motion Generation for Arbitrary Objects
Linzhan Mou ⋅ Jiahui Lei ⋅ Chen Wang ⋅ Lingjie Liu ⋅ Kostas Daniilidis
|
Exhibit Hall I #410 | |
|
GTR: Guided Thought Reinforcement Prevents Thought Collapse in RL-based VLM Agent Training
Poster Session 4 & Exhibit Hall with Coffee Break
Tong Wei ⋅ Yijun Yang ⋅ Junliang Xing ⋅ Yuanchun Shi ⋅ Zongqing Lu ⋅ Deheng Ye
|
Exhibit Hall I #379 | |
|
LoRAverse: A Submodular Framework to Retrieve Diverse Adapters for Diffusion Models
Poster Session 4 & Exhibit Hall with Coffee Break
Mert Sonmezer ⋅ Matthew Zheng ⋅ Pinar Yanardag
|
Exhibit Hall I #286 | |
|
Autoregressive Denoising Score Matching is a Good Video Anomaly Detector
Poster Session 3 & Exhibit Hall
hanwen Zhang ⋅ Congqi Cao ⋅ Qinyi Lv ⋅ Lingtong Min ⋅ Yanning Zhang
|
Exhibit Hall I #193 | |
|
MagicDrive-V2: High-Resolution Long Video Generation for Autonomous Driving with Adaptive Control
Poster Session 6 & Exhibit Hall with Coffee Break
Ruiyuan Gao ⋅ Kai Chen ⋅ Bo Xiao ⋅ Lanqing HONG ⋅ Zhenguo Li ⋅ Qiang Xu
|
Exhibit Hall I #329 | |
|
PVChat: Personalized Video Chat with One-Shot Learning
Poster Session 5 & Exhibit Hall
YUFEI SHI ⋅ Weilong Yan ⋅ Gang Xu ⋅ Yumeng Li ⋅ Yucheng Chen ⋅ ZhenXi Li ⋅ Fei Yu ⋅ Ming Li ⋅ Si Yong Yeo
|
Exhibit Hall I #332 | |
|
AIM: Amending Inherent Interpretability via Self-Supervised Masking
Eyad Alshami ⋅ Shashank Agnihotri ⋅ Bernt Schiele ⋅ Margret Keuper
|
Exhibit Hall I #85 | |
|
From Panels to Prose: Generating Literary Narratives from Comics
Poster Session 5 & Exhibit Hall
Ragav Sachdeva ⋅ Andrew Zisserman
|
Exhibit Hall I #193 | |
|
MVGBench: a Comprehensive Benchmark for Multi-view Generation Models
Poster Session 2 & Exhibit Hall with Coffee Break
Xianghui Xie ⋅ Jan Lenssen ⋅ Gerard Pons-Moll
|
Exhibit Hall I #299 | |
|
A Constrained Optimization Approach for Gaussian Splatting from Coarsely-posed Images and Noisy Lidar Point Clouds
Jizong Peng ⋅ Tze Ho Elden Tse ⋅ Kai Xu ⋅ Wenchao Gao ⋅ Angela Yao
|
Exhibit Hall I #273 | |
|
Conditional Visual Autoregressive Modeling for Pathological Image Restoration
Poster Session 4 & Exhibit Hall with Coffee Break
Ziyi Liu ⋅ Zhe Xu ⋅ Jiabo MA ⋅ Wenqiang Li ⋅ Ruixuan Wang ⋅ Bo Du ⋅ Hao Chen
|
Exhibit Hall I #281 | |
|
Text-IRSTD: Leveraging Semantic Text to Promote Infrared Small Target Detection in Complex Scenes
Poster Session 3 & Exhibit Hall
Feng Huang ⋅ Shuyuan Zheng ⋅ Zhaobing Qiu ⋅ Huanxian Liu ⋅ huanxin Bai ⋅ Liqiong Chen
|
Exhibit Hall I #58 | |
|
Amodal Depth Anything: Amodal Depth Estimation in the Wild
Poster Session 2 & Exhibit Hall with Coffee Break
Zhenyu Li ⋅ Mykola Lavreniuk ⋅ Jian Shi ⋅ Shariq Bhat ⋅ Peter Wonka
|
Exhibit Hall I #436 | |
|
SEGA: A Stepwise Evolution Paradigm for Content-Aware Layout Generation with Design Prior
Bo Zhao ⋅ Haoran Wang ⋅ Jinghui Wang ⋅ Hanzhang Wang ⋅ Huan Yang ⋅ Wei Ji ⋅ Hao Liu ⋅ Xinyan Xiao
|
Exhibit Hall I #425 | |
|
RobustSplat: Decoupling Densification and Dynamics for Transient-Free 3DGS
Poster Session 6 & Exhibit Hall with Coffee Break
Chuanyu Fu ⋅ Yuqi Zhang ⋅ Kunbin Yao ⋅ Guanying Chen ⋅ Yuan Xiong ⋅ Chuan Huang ⋅ Shuguang Cui ⋅ Xiaochun Cao
|
Exhibit Hall I #233 | |
|
High-Resolution Spatiotemporal Modeling with Global-Local State Space Models for Video-Based Human Pose Estimation
Poster Session 2 & Exhibit Hall with Coffee Break
Runyang Feng ⋅ Hyung Jin Chang ⋅ Tze Ho Elden Tse ⋅ Boeun Kim ⋅ Yi Chang ⋅ Yixing Gao
|
Exhibit Hall I #367 | |
|
MCOP: Multi-UAV Collaborative Occupancy Prediction
Poster Session 6 & Exhibit Hall with Coffee Break
Zefu Lin ⋅ Wenbo Chen ⋅ Xiaojuan Jin ⋅ Yuran Yang ⋅ Lue Fan ⋅ YIXIN ZHANG ⋅ Yufeng Zhang ⋅ Zhaoxiang Zhang
|
Exhibit Hall I #244 | |
|
Bayesian-Inspired Space-Time Superpixels
Poster Session 2 & Exhibit Hall with Coffee Break
Kent Gauen ⋅ Stanley Chan
|
Exhibit Hall I #34 | |
|
From Easy to Hard: Progressive Active Learning Framework for Infrared Small Target Detection with Single Point Supervision
Poster Session 1 & Exhibit Hall
Chuang Yu ⋅ Jinmiao Zhao ⋅ Yunpeng Liu ⋅ Sicheng Zhao ⋅ Yimian Dai ⋅ Xiangyu Yue
|
Exhibit Hall I #238 | |
|
Mitigating Catastrophic Overfitting in Fast Adversarial Training via Label Information Elimination
Poster Session 1 & Exhibit Hall
Chao Pan ⋅ Ke Tang ⋅ Li Qing ⋅ Xin Yao
|
Exhibit Hall I #276 | |
|
Consistency Trajectory Matching for One-Step Generative Super-Resolution
Poster Session 3 & Exhibit Hall
Weiyi You ⋅ Mingyang Zhang ⋅ Leheng Zhang ⋅ Xingyu Zhou ⋅ Kexuan Shi ⋅ Shuhang Gu
|
Exhibit Hall I #258 | |
|
MotionLab: Unified Human Motion Generation and Editing via the Motion-Condition-Motion Paradigm
Poster Session 3 & Exhibit Hall
Ziyan Guo ⋅ Zeyu HU ⋅ Na Zhao ⋅ De Wen Soh
|
Exhibit Hall I #363 | |
|
Scoring, Remember, and Reference: Catching Camouflaged Objects in Videos
Poster Session 3 & Exhibit Hall
Yuang Feng ⋅ Shuyong Gao ⋅ Fuzhen Yan ⋅ Yicheng Song ⋅ Lingyi Hong ⋅ Junjie Hu ⋅ Wenqiang Zhang
|
Exhibit Hall I #286 | |
|
Geometric Alignment and Prior Modulation for View-Guided Point Cloud Completion on Unseen Categories
Poster Session 6 & Exhibit Hall with Coffee Break
Jingqiao Xiu ⋅ Yicong Li ⋅ Na Zhao ⋅ Han Fang ⋅ Xiang Wang ⋅ Angela Yao
|
Exhibit Hall I #262 | |
|
FRET: Feature Redundancy Elimination for Test Time Adaptation
Poster Session 1 & Exhibit Hall
Linjing You ⋅ Jiabao Lu ⋅ Xiayuan Huang ⋅ Xiangli Nie
|
Exhibit Hall I #192 | |
|
Motion-2-to-3: Leveraging 2D Motion Data for 3D Motion Generations
Poster Session 3 & Exhibit Hall
Ruoxi Guo ⋅ Huaijin Pi ⋅ Zehong Shen ⋅ Qing Shuai ⋅ zechenhu zechenhu ⋅ Zhumei Wang ⋅ Yajiao Dong ⋅ Ruizhen Hu ⋅ Taku Komura ⋅ Sida Peng ⋅ Xiaowei Zhou
|
Exhibit Hall I #405 | |
|
AnyBimanual: Transferring Unimanual Policy for General Bimanual Manipulation
Poster Session 3 & Exhibit Hall
Guanxing Lu ⋅ Tengbo Yu ⋅ Haoyuan Deng ⋅ Season Chen ⋅ Yansong Tang ⋅ Ziwei Wang
|
Exhibit Hall I #344 | |
|
SPA: Efficient User-Preference Alignment against Uncertainty in Medical Image Segmentation
Poster Session 5 & Exhibit Hall
Jiayuan Zhu ⋅ Junde Wu ⋅ Cheng Ouyang ⋅ Konstantinos Kamnitsas ⋅ Alison Noble
|
Exhibit Hall I #370 | |
|
Signs as Tokens: A Retrieval-Enhanced Multilingual Sign Language Generator
Poster Session 5 & Exhibit Hall
Ronglai Zuo ⋅ Rolandos Alexandros Potamias ⋅ Evangelos Ververas ⋅ Jiankang Deng ⋅ Stefanos Zafeiriou
|
Exhibit Hall I #379 | |
|
A₀ : An Affordance-Aware Hierarchical Model for General Robotic Manipulation
Poster Session 3 & Exhibit Hall
Rongtao Xu ⋅ Jian Zhang ⋅ Minghao Guo ⋅ Youpeng Wen ⋅ Haoting Yang ⋅ Min Lin ⋅ Jianzheng Huang ⋅ Zhe Li ⋅ Kaidong Zhang ⋅ Liqiong Wang ⋅ Yuxuan Kuang ⋅ Meng Cao ⋅ Feng Zheng ⋅ Xiaodan Liang
|
Exhibit Hall I #329 | |
|
FVGen: Accelerating Novel-View Synthesis with Adversarial Video Diffusion Distillation
Poster Session 6 & Exhibit Hall with Coffee Break
Wenbin Teng ⋅ Gonglin Chen ⋅ Haiwei Chen ⋅ Yajie Zhao
|
Exhibit Hall I #131 | |
|
PVMamba: Parallelizing Vision Mamba via Dynamic State Aggregation
Poster Session 3 & Exhibit Hall
Fei Xie ⋅ Zhongdao Wang ⋅ Weijia Zhang ⋅ Chao Ma
|
Exhibit Hall I #20 | |
|
Controllable and Expressive One-Shot Video Head Swapping
Poster Session 3 & Exhibit Hall
Chaonan Ji ⋅ Jinwei Qi ⋅ Peng Zhang ⋅ Bang Zhang ⋅ Liefeng Bo
|
Exhibit Hall I #22 | |
|
When Pixel Difference Patterns Meet ViT: PiDiViT for Few-Shot Object Detection
Poster Session 5 & Exhibit Hall
Hongliang Zhou ⋅ Yongxiang Liu ⋅ Canyu Mo ⋅ Weijie Li ⋅ Bowen Peng ⋅ Li Liu
|
Exhibit Hall I #422 | |
|
Boosting Adversarial Transferability via Residual Perturbation Attack
Poster Session 1 & Exhibit Hall
Jinjia Peng ⋅ Zeze Tao ⋅ Huibing Wang ⋅ Meng Wang ⋅ Yang Wang
|
Exhibit Hall I #110 | |
|
What Makes for Text to 360-degree Panorama Generation with Stable Diffusion?
Poster Session 4 & Exhibit Hall with Coffee Break
Jinhong Ni ⋅ Chang-Bin Zhang ⋅ Qiang Zhang ⋅ Jing Zhang
|
Exhibit Hall I #160 | |
|
Learning Normals of Noisy Points by Local Gradient-Aware Surface Filtering
Poster Session 6 & Exhibit Hall with Coffee Break
Qing Li ⋅ Huifang Feng ⋅ Xun Gong ⋅ Liang Han
|
Exhibit Hall I #396 | |
|
ORION: A Holistic End-to-End Autonomous Driving Framework by Vision-Language Instructed Action Generation
Poster Session 6 & Exhibit Hall with Coffee Break
Haoyu Fu ⋅ Diankun Zhang ⋅ Zongchuang Zhao ⋅ Jianfeng Cui ⋅ DINGKANG LIANG ⋅ Chong Zhang ⋅ Dingyuan Zhang ⋅ Hongwei Xie ⋅ BING WANG ⋅ Xiang Bai
|
Exhibit Hall I #10 | |
|
Learning Pixel-adaptive Multi-layer Perceptrons for Real-time Image Enhancement
Poster Session 3 & Exhibit Hall
Junyu Lou ⋅ Xiaorui Zhao ⋅ Kexuan Shi ⋅ Shuhang Gu
|
Exhibit Hall I #386 | |
|
Kaleidoscopic Background Attack: Disrupting Pose Estimation with Multi-Fold Radial Symmetry Textures
Xinlong Ding ⋅ Hongwei Yu ⋅ Jiawei Li ⋅ Feifan Li ⋅ Yu Shang ⋅ Bochao Zou ⋅ Huimin Ma ⋅ Jiansheng Chen
|
Exhibit Hall I #364 | |
|
CULTURE3D: A Large-Scale and Diverse Dataset of Cultural Landmarks and Terrains for Gaussian-Based Scene Rendering
Poster Session 6 & Exhibit Hall with Coffee Break
xinyi zheng ⋅ Steve Zhang ⋅ Weizhe Lin ⋅ Fan Zhang ⋅ Walterio Mayol-Cuevas ⋅ Yunze Liu ⋅ Junxiao Shen
|
Exhibit Hall I #418 | |
|
NullSwap: Proactive Identity Cloaking Against Deepfake Face Swapping
Poster Session 3 & Exhibit Hall
Tianyi Wang ⋅ Shuaicheng Niu ⋅ Harry Cheng ⋅ xiao zhang ⋅ Yinglong Wang
|
Exhibit Hall I #74 | |
|
Forensic-MoE: Exploring Comprehensive Synthetic Image Detection Traces with Mixture of Experts
Poster Session 4 & Exhibit Hall with Coffee Break
Mingqi Fang ⋅ Ziguang Li ⋅ Lingyun Yu ⋅ Quanwei Yang ⋅ Hongtao Xie ⋅ Yongdong Zhang
|
Exhibit Hall I #276 | |
|
RoboTron-Mani: All-in-One Multimodal Large Model for Robotic Manipulation
Poster Session 3 & Exhibit Hall
Feng yan ⋅ Fanfan Liu ⋅ Yiyang Huang ⋅ ZechaoGuan ZechaoGuan ⋅ Liming Zheng ⋅ Yufeng Zhong ⋅ Chengjian Feng ⋅ Lin Ma
|
Exhibit Hall I #348 | |
|
Information-Bottleneck Driven Binary Neural Network for Change Detection
Poster Session 2 & Exhibit Hall with Coffee Break
Kaijie Yin ⋅ Zhiyuan Zhang ⋅ Shu Kong ⋅ Tian Gao ⋅ Cheng-zhong Xu ⋅ Hui Kong
|
Exhibit Hall I #202 | |
|
Entropy-Adaptive Diffusion Policy Optimization with Dynamic Step Alignment
Poster Session 1 & Exhibit Hall
Renye Yan ⋅ Jikang Cheng ⋅ Yaozhong Gan ⋅ Shikun Sun ⋅ You Wu ⋅ Yunfan Yang ⋅ Ling Liang ⋅ JinLong Lin ⋅ Yeshuang Zhu ⋅ Jie Zhou ⋅ Jinchao Zhang ⋅ Junliang Xing ⋅ Yimao Cai ⋅ Ru Huang
|
Exhibit Hall I #174 | |
|
Time-Aware Auto White Balance in Mobile Photography
Poster Session 2 & Exhibit Hall with Coffee Break
Mahmoud Afifi ⋅ Luxi Zhao ⋅ Abhijith Punnappurath ⋅ Mohamed Abdelsalam ⋅ Ran Zhang ⋅ Michael Brown
|
Exhibit Hall I #2 | |
|
ViewSRD: 3D Visual Grounding via Structured Multi-View Decomposition
Poster Session 2 & Exhibit Hall with Coffee Break
Ronggang Huang ⋅ Haoxin Yang ⋅ Yan Cai ⋅ Xuemiao Xu ⋅ Huaidong Zhang ⋅ Shengfeng He
|
Exhibit Hall I #441 | |
|
Physical Degradation Model-Guided Interferometric Hyperspectral Reconstruction with Unfolding Transformer
Poster Session 3 & Exhibit Hall
Yuansheng Li ⋅ Yunhao Zou ⋅ Linwei Chen ⋅ Ying Fu
|
Exhibit Hall I #358 | |
|
VPR-Cloak: A First Look at Privacy Cloak Against Visual Place Recognition
Poster Session 2 & Exhibit Hall with Coffee Break
Shuting Dong ⋅ Mingzhi Chen ⋅ Feng Lu ⋅ Hao Yu ⋅ Guanghao Li ⋅ Zhe Wu ⋅ Ming Tang ⋅ Chun Yuan
|
Exhibit Hall I #204 | |
|
Evidential Knowledge Distillation
Poster Session 1 & Exhibit Hall
Liangyu Xiang ⋅ Junyu Gao ⋅ Changsheng Xu
|
Exhibit Hall I #259 | |
|
Pruning All-Rounder: Rethinking and Improving Inference Efficiency for Large Vision Language Models
Poster Session 5 & Exhibit Hall
Wei Suo ⋅ Ji Ma ⋅ Mengyang Sun ⋅ Lin Wu ⋅ PENG WANG ⋅ Yanning Zhang
|
Exhibit Hall I #42 | |
|
Align Your Rhythm: Generating Highly Aligned Dance Poses with Gating-Enhanced Rhythm-Aware Feature Representation
Poster Session 3 & Exhibit Hall
Congyi Fan ⋅ Jian Guan ⋅ Xuanjia Zhao ⋅ Dongli Xu ⋅ Youtian Lin ⋅ Tong Ye ⋅ Pengming Feng ⋅ Haiwei Pan
|
Exhibit Hall I #300 | |
|
HOMO-Feature: Cross-Arbitrary-Modal Image Matching with Homomorphism of Organized Major Orientation
Poster Session 3 & Exhibit Hall
Chenzhong Gao ⋅ Wei Li ⋅ Desheng Weng
|
Exhibit Hall I #49 | |
|
OCSplats: Observation Completeness Quantification and Label Noise Separation in 3DGS
Poster Session 6 & Exhibit Hall with Coffee Break
Han Ling ⋅ Yinghui Sun ⋅ Xian Xu ⋅ Quansen Sun
|
Exhibit Hall I #92 | |
|
GSOT3D: Towards Generic 3D Single Object Tracking in the Wild
Poster Session 2 & Exhibit Hall with Coffee Break
Yifan Jiao ⋅ Yunhao Li ⋅ Junhua Ding ⋅ Qing Yang ⋅ Song Fu ⋅ Heng Fan ⋅ Libo Zhang
|
Exhibit Hall I #42 | |
|
GWM: Towards Scalable Gaussian World Models for Robotic Manipulation
Poster Session 2 & Exhibit Hall with Coffee Break
Guanxing Lu ⋅ Baoxiong Jia ⋅ Puhao Li ⋅ Yixin Chen ⋅ Ziwei Wang ⋅ Yansong Tang ⋅ Siyuan Huang
|
Exhibit Hall I #399 | |
|
Dynamic-DINO: Fine-Grained Mixture of Experts Tuning for Real-time Open-Vocabulary Object Detection
Poster Session 5 & Exhibit Hall
Yehao Lu ⋅ Minghe Weng ⋅ Zekang Xiao ⋅ Rui Jiang ⋅ Wei Su ⋅ Guangcong Zheng ⋅ Luping Luping ⋅ Xi Li
|
Exhibit Hall I #99 | |
|
WAVE: Warp-Based View Guidance for Consistent Novel View Synthesis Using a Single Image
Poster Session 3 & Exhibit Hall
Jiwoo Park ⋅ Tae Choi ⋅ Youngjun Jun ⋅ Seong Jae Hwang
|
Exhibit Hall I #179 | |
|
DanceEditor: Towards Iterative Editable Music-driven Dance Generation with Open-Vocabulary Descriptions
Poster Session 3 & Exhibit Hall
Hengyuan Zhang ⋅ Zhe Li ⋅ Xingqun Qi ⋅ Mengze Li ⋅ Muyi Sun ⋅ Siye Wang ⋅ Man Zhang ⋅ Sirui Han
|
Exhibit Hall I #202 | |
|
TAG-WM: Tamper-Aware Generative Image Watermarking via Diffusion Inversion Sensitivity
Poster Session 4 & Exhibit Hall with Coffee Break
Yuzhuo Chen ⋅ Zehua Ma ⋅ Han Fang ⋅ Weiming Zhang ⋅ Nenghai Yu
|
Exhibit Hall I #176 | |
|
HORT: Monocular Hand-held Objects Reconstruction with Transformers
Poster Session 2 & Exhibit Hall with Coffee Break
Zerui Chen ⋅ Rolandos Alexandros Potamias ⋅ Shizhe Chen ⋅ Cordelia Schmid
|
Exhibit Hall I #96 | |
|
Multimodal Latent Diffusion Model for Complex Sewing Pattern Generation
Poster Session 4 & Exhibit Hall with Coffee Break
Shengqi Liu ⋅ Yuhao Cheng ⋅ Zhuo Chen ⋅ Xingyu Ren ⋅ Wenhan Zhu ⋅ Lincheng Li ⋅ Mengxiao Bi ⋅ Xiaokang Yang ⋅ Yichao Yan
|
Exhibit Hall I #264 | |
|
Enhancing Reward Models for High-quality Image Generation: Beyond Text-Image Alignment
Poster Session 4 & Exhibit Hall with Coffee Break
ying ba ⋅ Tianyu Zhang ⋅ Yalong Bai ⋅ Wenyi Mo ⋅ Tao Liang ⋅ Bing Su ⋅ Ji-Rong Wen
|
Exhibit Hall I #397 | |
|
Lightweight and Fast Real-time Image Enhancement via Decomposition of the Spatial-aware Lookup Tables
Poster Session 3 & Exhibit Hall
Wontae Kim ⋅ Keuntek Lee ⋅ Nam Ik Cho
|
Exhibit Hall I #178 | |
|
Diffusion-based 3D Hand Motion Recovery with Intuitive Physics
Poster Session 2 & Exhibit Hall with Coffee Break
Yufei Zhang ⋅ Zijun Cui ⋅ Jeffrey Kephart ⋅ Qiang Ji
|
Exhibit Hall I #214 | |
|
HumanOLAT: A Large-Scale Dataset for Full-Body Human Relighting and Novel-View Synthesis
Poster Session 6 & Exhibit Hall with Coffee Break
Timo Teufel ⋅ xilong zhou ⋅ Umar Iqbal ⋅ Pramod Rao ⋅ Pulkit Gera ⋅ Jan Kautz ⋅ Vladislav Golyanik ⋅ Christian Theobalt
|
Exhibit Hall I #424 | |
|
Devil is in the Uniformity: Exploring Diverse Learners within Transformer for Image Restoration
Poster Session 3 & Exhibit Hall
Shihao Zhou ⋅ Dayu Li ⋅ Jinshan Pan ⋅ Juncheng Zhou ⋅ Jinglei Shi ⋅ Jufeng Yang
|
Exhibit Hall I #216 | |
|
Tensor-aggregated LoRA in Federated Fine-tuning
Poster Session 1 & Exhibit Hall
Zhixuan Li ⋅ Binqian Xu ⋅ Xiangbo Shu ⋅ Jiachao Zhang ⋅ Yazhou Yao ⋅ Guo-Sen Xie ⋅ Jinhui Tang
|
Exhibit Hall I #91 | |
|
LVAgent: Long Video Understanding by Multi-Round Dynamical Collaboration of MLLM Agents
Poster Session 5 & Exhibit Hall
Boyu Chen ⋅ Zhengrong Yue ⋅ Siran Chen ⋅ Zikang Wang ⋅ Yang Liu ⋅ Peng Li ⋅ Yali Wang
|
Exhibit Hall I #41 | |
|
Aligning Vision to Language: Annotation-Free Multimodal Knowledge Graph Construction for Enhanced LLMs Reasoning
Poster Session 1 & Exhibit Hall
Junming Liu ⋅ Siyuan Meng ⋅ Yanting Gao ⋅ Song Mao ⋅ Pinlong Cai ⋅ Guohang Yan ⋅ Yirong Chen ⋅ Zilin Bian ⋅ DING WANG ⋅ Botian Shi
|
Exhibit Hall I #84 | |
|
EMatch: A Unified Framework for Event-based Optical Flow and Stereo Matching
Poster Session 2 & Exhibit Hall with Coffee Break
Pengjie Zhang ⋅ Lin Zhu ⋅ Xiao Wang ⋅ Lizhi Wang ⋅ Hua Huang
|
Exhibit Hall I #78 | |
|
Liberated-GS: 3D Gaussian Splatting Independent from SfM Point Clouds
Poster Session 6 & Exhibit Hall with Coffee Break
Weihong Pan ⋅ Xiaoyu Zhang ⋅ Hongjia Zhai ⋅ Xiaojun Xiang ⋅ Hanqing Jiang ⋅ Guofeng Zhang
|
Exhibit Hall I #189 | |
|
Unlocking the Potential of Diffusion Priors in Blind Face Restoration
Poster Session 3 & Exhibit Hall
Yunqi Miao ⋅ Zhiyu Qu ⋅ Mingqi Gao ⋅ Changrui Chen ⋅ Jifei Song ⋅ Jungong Han ⋅ Jiankang Deng
|
Exhibit Hall I #327 | |
|
DyWA: Dynamics-adaptive World Action Model for Generalizable Non-prehensile Manipulation
Poster Session 3 & Exhibit Hall
Jiangran Lyu ⋅ Ziming Li ⋅ Xuesong Shi ⋅ Chaoyi Xu ⋅ Yizhou Wang ⋅ He Wang
|
Exhibit Hall I #99 | |
|
Self-Supervised Sparse Sensor Fusion for Long Range Perception
Poster Session 6 & Exhibit Hall with Coffee Break
Edoardo Palladin ⋅ Samuel Brucker ⋅ Filippo Ghilotti ⋅ Praveen Narayanan ⋅ Mario Bijelic ⋅ Felix Heide
|
Exhibit Hall I #268 | |
|
Joint Asymmetric Loss for Learning with Noisy Labels
Poster Session 1 & Exhibit Hall
Jialiang Wang ⋅ Xianming Liu ⋅ Xiong Zhou ⋅ Gangfeng Hu ⋅ Deming Zhai ⋅ Junjun Jiang ⋅ Xiangyang Ji
|
Exhibit Hall I #176 | |
|
SAME: Learning Generic Language-Guided Visual Navigation with State-Adaptive Mixture of Experts
Poster Session 2 & Exhibit Hall with Coffee Break
Gengze Zhou ⋅ Yicong Hong ⋅ Zun Wang ⋅ Chongyang Zhao ⋅ Mohit Bansal ⋅ Qi Wu
|
Exhibit Hall I #261 | |
|
Implicit Counterfactual Learning for Audio-Visual Segmentation
Poster Session 5 & Exhibit Hall
Mingfeng Zha ⋅ Tianyu Li ⋅ Guoqing Wang ⋅ Peng Wang ⋅ Yangyang Wu ⋅ Yang Yang ⋅ Heng Tao Shen
|
Exhibit Hall I #240 | |
|
DC-ControlNet: Decoupling Inter- and Intra-Element Conditions in Image Generation with Diffusion Models
Poster Session 4 & Exhibit Hall with Coffee Break
hongji yang ⋅ Wencheng Han ⋅ Yucheng Zhou ⋅ Jianbing Shen
|
Exhibit Hall I #401 | |
|
STaR: Seamless Spatial-Temporal Aware Motion Retargeting with Penetration and Consistency Constraints
Poster Session 3 & Exhibit Hall
Xiaohang Yang ⋅ Qing Wang ⋅ Jiahao Yang ⋅ Gregory Slabaugh ⋅ Shanxin Yuan
|
Exhibit Hall I #277 | |
|
FALCON: Resolving Visual Redundancy and Fragmentation in High-resolution Multimodal Large Language Models via Visual Registers
Poster Session 5 & Exhibit Hall
Renshan Zhang ⋅ Rui Shao ⋅ Gongwei Chen ⋅ Miao Zhang ⋅ Kaiwen Zhou ⋅ Weili Guan ⋅ Liqiang Nie
|
Exhibit Hall I #351 | |
|
Visual-Oriented Fine-Grained Knowledge Editing for MultiModal Large Language Models
Poster Session 1 & Exhibit Hall
Zhen Zeng ⋅ Leijiang Gu ⋅ Xun Yang ⋅ Zhangling Duan ⋅ Zenglin Shi ⋅ Meng Wang
|
Exhibit Hall I #229 | |
|
Competitive Distillation: A Simple Learning Strategy for Improving Visual Classification
Poster Session 1 & Exhibit Hall
Daqian Shi ⋅ Xiaolei Diao ⋅ Xu Chen ⋅ Cedric John
|
Exhibit Hall I #275 | |
|
AIComposer: Any Style and Content Image Composition via Feature Integration
Poster Session 4 & Exhibit Hall with Coffee Break
Haowen Li ⋅ Zhenfeng Fan ⋅ Zhang Wen ⋅ Zhengzhou Zhu ⋅ Yunjin Li
|
Exhibit Hall I #187 | |
|
Rethink Sparse Signals for Pose-guided Text-to-image Generation
Poster Session 4 & Exhibit Hall with Coffee Break
Wenjie Xuan ⋅ Jing Zhang ⋅ Juhua Liu ⋅ Bo Du ⋅ Dacheng Tao
|
Exhibit Hall I #96 | |
|
VPO: Aligning Text-to-Video Generation Models with Prompt Optimization
Poster Session 4 & Exhibit Hall with Coffee Break
Jiale Cheng ⋅ Ruiliang Lyu ⋅ Xiaotao Gu ⋅ Xiao Liu ⋅ Jiazheng Xu ⋅ Yida Lu ⋅ Jiayan Teng ⋅ Zhuoyi Yang ⋅ Yuxiao Dong ⋅ Jie Tang ⋅ Hongning Wang ⋅ Minlie Huang
|
Exhibit Hall I #70 | |
|
Stylized-Face: A Million-level Stylized Face Dataset for Face Recognition
Poster Session 3 & Exhibit Hall
Zhengyuan Peng ⋅ Jianqing Xu ⋅ Yuge Huang ⋅ Jinkun Hao ⋅ Shouhong Ding ⋅ zhizhong zhang ⋅ Xin TAN ⋅ Lizhuang Ma
|
Exhibit Hall I #287 | |
|
Uncover Treasures in DCT: Advancing JPEG Quality Enhancement by Exploiting Latent Correlations
Poster Session 4 & Exhibit Hall with Coffee Break
jing Yang ⋅ Qunliang Xing ⋅ Mai Xu ⋅ Minglang Qiao
|
Exhibit Hall I #260 | |
|
From One to More: Contextual Part Latents for 3D Generation
Poster Session 2 & Exhibit Hall with Coffee Break
Shaocong Dong ⋅ Lihe Ding ⋅ Xiao Chen ⋅ Yaokun Li ⋅ Yuxin WANG ⋅ Yucheng Wang ⋅ Qi WANG ⋅ Jaehyeok Kim ⋅ Chenjian Gao ⋅ Zhanpeng Huang ⋅ Zibin Wang ⋅ Tianfan Xue ⋅ Dan Xu
|
Exhibit Hall I #301 | |
|
Single-Scanline Relative Pose Estimation for Rolling Shutter Cameras
Poster Session 2 & Exhibit Hall with Coffee Break
Petr Hruby ⋅ Marc Pollefeys
|
Exhibit Hall I #199 | |
|
Unified Multi-Agent Trajectory Modeling with Masked Trajectory Diffusion
Poster Session 6 & Exhibit Hall with Coffee Break
songru Yang ⋅ Zhenwei Shi ⋅ Zhengxia Zou
|
Exhibit Hall I #274 | |
|
SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion
Poster Session 5 & Exhibit Hall
Ahmed Nassar ⋅ Matteo Omenetti ⋅ Maksym Lysak ⋅ Nikolaos Livathinos ⋅ Christoph Auer ⋅ Lucas Morin ⋅ Rafael Teixeira de Lima ⋅ Yusik Kim ⋅ A. Said Gurbuz ⋅ Michele Dolfi ⋅ Peter Staar
|
Exhibit Hall I #203 | |
|
Beyond Walking: A Large-Scale Image-Text Benchmark for Text-based Person Anomaly Search
Shuyu Yang ⋅ Yaxiong Wang ⋅ Li Zhu ⋅ Zhedong Zheng
|
Exhibit Hall I #162 | |
|
Images as Noisy Labels: Unleashing the Potential of the Diffusion Model for Open-Vocabulary Semantic Segmentation
Fan Li ⋅ Xuanbin Wang ⋅ Xuan Wang ⋅ Zhaoxiang Zhang ⋅ yuelei xu
|
Exhibit Hall I #417 | |
|
ContextFace: Generating Facial Expressions from Emotional Contexts
Poster Session 3 & Exhibit Hall
minjung kim ⋅ Minsang Kim ⋅ Seung Jun Baek
|
Exhibit Hall I #129 | |
|
Agreement aware and dissimilarity oriented GLOM
Poster Session 5 & Exhibit Hall
Ru Zeng ⋅ Yan Song ⋅ Yang ZHANG ⋅ yanlinghu yanlinghu ⋅ Hui Yu
|
Exhibit Hall I #426 | |
|
The Best of Both Worlds: Integrating Language Models and Diffusion Models for Video Generation
Poster Session 4 & Exhibit Hall with Coffee Break
Aoxiong Yin ⋅ Kai Shen ⋅ Yichong Leng ⋅ Xu Tan ⋅ Xinyu Zhou ⋅ Juncheng Li ⋅ Siliang Tang
|
Exhibit Hall I #67 | |
|
LLaFEA: Frame-Event Complementary Fusion for Fine-Grained Spatiotemporal Understanding in LMMs
Poster Session 5 & Exhibit Hall
Hanyu Zhou ⋅ Gim Hee Lee
|
Exhibit Hall I #235 | |
|
Bridging Class Imbalance and Partial Labeling via Spectral-Balanced Energy Propagation for Skeleton-based Action Recognition
Poster Session 3 & Exhibit Hall
Yandan Wang ⋅ Chenqi Guo ⋅ Yinglong Ma ⋅ Jiangyan Chen ⋅ Yuan Gao ⋅ Weiming Dong
|
Exhibit Hall I #15 | |
|
MeasureXpert: Automatic Anthropometric Measurement Extraction from Two Unregistered, Partial, Posed, and Dressed Body Scans
Poster Session 2 & Exhibit Hall with Coffee Break
Ran Zhao ⋅ Xinxin Dai ⋅ Pengpeng Hu ⋅ Vasile Palade ⋅ Adrian Munteanu
|
Exhibit Hall I #430 | |
|
ForeSight: Multi-View Streaming Joint Object Detection and Trajectory Forecasting
Poster Session 6 & Exhibit Hall with Coffee Break
Sandro Papais ⋅ Letian Wang ⋅ Brian Cheong ⋅ Steven Waslander
|
Exhibit Hall I #71 | |
|
DeRIS: Decoupling Perception and Cognition for Enhanced Referring Image Segmentation through Loopback Synergy
Poster Session 5 & Exhibit Hall
Ming Dai ⋅ Wenxuan Cheng ⋅ Jiang-Jiang Liu ⋅ Sen Yang ⋅ Wenxiao Cai ⋅ Yanpeng Sun ⋅ Wankou Yang
|
Exhibit Hall I #13 | |
|
ResidualViT for Efficient Temporally Dense Video Encoding
Mattia Soldan ⋅ Fabian Caba Heilbron ⋅ Bernard Ghanem ⋅ Josef Sivic ⋅ Bryan Russell
|
Exhibit Hall I #236 | |
|
VideoOrion: Tokenizing Object Dynamics in Videos
Poster Session 5 & Exhibit Hall
Yicheng Feng ⋅ Yijiang Li ⋅ Wanpeng Zhang ⋅ Sipeng Zheng ⋅ Hao Luo ⋅ Zihao Yue ⋅ Zongqing Lu
|
Exhibit Hall I #56 | |
|
SeaS: Few-shot Industrial Anomaly Image Generation with Separation and Sharing Fine-tuning
Poster Session 5 & Exhibit Hall
Zhewei Dai ⋅ Shilei Zeng ⋅ Haotian Liu ⋅ Xurui Li ⋅ Feng Xue ⋅ Yu Zhou
|
Exhibit Hall I #315 | |
|
MoSiC: Optimal-Transport Motion Trajectory for Dense Self-Supervised Learning
Poster Session 2 & Exhibit Hall with Coffee Break
Mohammadreza Salehi ⋅ Shashanka Venkataramanan ⋅ Ioana Simion ⋅ Stratis Gavves ⋅ Cees Snoek ⋅ Yuki Asano
|
Exhibit Hall I #142 | |
|
Exploring Weather-aware Aggregation and Adaptation for Semantic Segmentation under Adverse Conditions
Poster Session 3 & Exhibit Hall
Yuwen Pan ⋅ Rui Sun ⋅ Wangkai Li ⋅ Tianzhu Zhang
|
Exhibit Hall I #371 | |
|
MUG: Pseudo Labeling Augmented Audio-Visual Mamba Network for Audio-Visual Video Parsing
Poster Session 5 & Exhibit Hall
Langyu Wang ⋅ Langyu Wang ⋅ Yingying Chen ⋅ Yiyuan Zhang ⋅ Ming Tang ⋅ Jinqiao Wang
|
Exhibit Hall I #80 | |
|
Randomized Autoregressive Visual Generation
Poster Session 4 & Exhibit Hall with Coffee Break
Qihang Yu ⋅ Ju He ⋅ Xueqing Deng ⋅ Xiaohui Shen ⋅ Liang-Chieh (Jay) Chen
|
Exhibit Hall I #340 | |
|
CleanPose: Category-Level Object Pose Estimation via Causal Learning and Knowledge Distillation
Poster Session 2 & Exhibit Hall with Coffee Break
Xiao Lin ⋅ Yun Peng ⋅ Liuyi Wang ⋅ xianyou zhong ⋅ Minghao Zhu ⋅ Jingwei Yang ⋅ Yi Feng ⋅ Chengju Liu ⋅ Qijun Chen
|
Exhibit Hall I #91 | |
|
Unsupervised RGB-D Point Cloud Registration for Scenes with Low Overlap and Photometric Inconsistency
Poster Session 6 & Exhibit Hall with Coffee Break
yejun Shou ⋅ Haocheng Wang ⋅ Lingfeng Shen ⋅ Qian Zheng ⋅ Gang Pan ⋅ Yanlong Cao
|
Exhibit Hall I #14 | |
|
Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment
Lijie Liu ⋅ Tianxiang Ma ⋅ Bingchuan Li ⋅ Zhuowei Chen ⋅ Jiawei Liu ⋅ Gen Li ⋅ SiYu Zhou ⋅ Qian HE ⋅ Xinglong Wu
|
Exhibit Hall I #6 | |
|
TokensGen: Harnessing Condensed Tokens for Long Video Generation
Poster Session 4 & Exhibit Hall with Coffee Break
Wenqi Ouyang ⋅ Zeqi Xiao ⋅ Danni Yang ⋅ Yifan Zhou ⋅ Shuai Yang ⋅ Lei Yang ⋅ Jianlou Si ⋅ Xingang Pan
|
Exhibit Hall I #318 | |
|
Gradient-Reweighted Adversarial Camouflage for Physical Object Detection Evasion
Poster Session 3 & Exhibit Hall
Jiawei Liang ⋅ Siyuan Liang ⋅ Tianrui Lou ⋅ Ming Zhang ⋅ liwenjin liwenjin ⋅ Dunqiu fan ⋅ Xiaochun Cao
|
Exhibit Hall I #364 | |
|
Progressive Homeostatic and Plastic Prompt Tuning for Audio-Visual Multi-Task Incremental Learning
Poster Session 1 & Exhibit Hall
Jiong Yin ⋅ Liang Li ⋅ Jiehua Zhang ⋅ Yuhan Gao ⋅ Chenggang Yan ⋅ Xichun Sheng
|
Exhibit Hall I #183 | |
|
MixA: A Mixed Attention approach with Stable Lightweight Linear Attention to enhance Efficiency of Vision Transformers at the Edge
Poster Session 5 & Exhibit Hall
Sabbir Ahmed ⋅ Jingtao Li ⋅ Weiming Zhuang ⋅ Chen Chen ⋅ Lingjuan Lyu
|
Exhibit Hall I #129 | |
|
Transparent Vision: A Theory of Hierarchical Invariant Representations
Poster Session 1 & Exhibit Hall
Shuren Qi ⋅ Yushu Zhang ⋅ CHAO WANG ⋅ Zhihua Xia ⋅ Xiaochun Cao ⋅ FENGLEI FAN
|
Exhibit Hall I #319 | |
|
AutoPrompt: Automated Red-Teaming of Text-to-Image Models via LLM-Driven Adversarial Prompts
Poster Session 4 & Exhibit Hall with Coffee Break
Yufan Liu ⋅ Wanqian Zhang ⋅ Huashan Chen ⋅ Lin Wang ⋅ Xiaojun Jia ⋅ Zheng Lin ⋅ Weiping Wang
|
Exhibit Hall I #256 | |
|
Q-Frame: Query-aware Frame Selection and Multi-Resolution Adaptation for Video-LLMs
Poster Session 5 & Exhibit Hall
Shaojie Zhang ⋅ Jiahui Yang ⋅ Jianqin Yin ⋅ Zhenbo Luo ⋅ Jian Luan
|
Exhibit Hall I #211 | |
|
LongAnimation: Long Animation Generation with Dynamic Global-Local Memory
Poster Session 3 & Exhibit Hall
Nan Chen ⋅ Mengqi Huang ⋅ Yihao Meng ⋅ Zhendong Mao
|
Exhibit Hall I #3 | |
|
OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation
Poster Session 4 & Exhibit Hall with Coffee Break
Junyuan Zhang ⋅ Qintong Zhang ⋅ Bin Wang ⋅ Linke Ouyang ⋅ Zichen Wen ⋅ Ying Li ⋅ Ka-Ho Chow ⋅ Conghui He ⋅ Wentao Zhang
|
Exhibit Hall I #245 | |
|
Efficient Event Camera Data Pretraining with Adaptive Prompt Fusion
Poster Session 2 & Exhibit Hall with Coffee Break
Quanmin Liang ⋅ Qiang Li ⋅ Shuai Liu ⋅ Xinzi Cao ⋅ Jinyi Lu ⋅ Feidiao Yang ⋅ Wei Zhang ⋅ Kai Huang ⋅ Yonghong Tian
|
Exhibit Hall I #342 | |
|
Lightweight Gradient-Aware Upscaling of 3D Gaussian Splatting Images
Poster Session 6 & Exhibit Hall with Coffee Break
Simon Niedermayr ⋅ Christoph Neuhauser ⋅ Rüdiger Westermann
|
Exhibit Hall I #109 | |
|
RoBridge: A Hierarchical Architecture Bridging Cognition and Execution for General Robotic Manipulation
Poster Session 3 & Exhibit Hall
Kaidong Zhang ⋅ Rongtao Xu ⋅ Ren Pengzhen ⋅ Junfan Lin ⋅ Hefeng Wu ⋅ Liang Lin ⋅ Xiaodan Liang
|
Exhibit Hall I #432 | |
|
SEGS-SLAM: Structure-enhanced 3D Gaussian Splatting SLAM with Appearance Embedding
Poster Session 6 & Exhibit Hall with Coffee Break
Tianci Wen ⋅ Zhiang Liu ⋅ Yongchun Fang
|
Exhibit Hall I #326 | |
|
BoxDreamer: Dreaming Box Corners for Generalizable Object Pose Estimation
Poster Session 2 & Exhibit Hall with Coffee Break
Yuanhong Yu ⋅ Xingyi He ⋅ Chen Zhao ⋅ Junhao Yu ⋅ Jiaqi Yang ⋅ Ruizhen Hu ⋅ Yujun Shen ⋅ Xing Zhu ⋅ Xiaowei Zhou ⋅ Sida Peng
|
Exhibit Hall I #409 | |
|
Looking in the Mirror: A Faithful Counterfactual Explanation Method for Interpreting Deep Image Classification Models
Poster Session 1 & Exhibit Hall
Townim Chowdhury ⋅ Vu Phan ⋅ Kewen Liao ⋅ Nanyu Dong ⋅ Minh-Son To ⋅ Anton Hengel ⋅ Johan Verjans ⋅ Zhibin Liao
|
Exhibit Hall I #203 | |
|
FLSeg: Enhancing Privacy and Robustness in Federated Learning under Heterogeneous Data via Model Segmentation
Poster Session 1 & Exhibit Hall
Zichun Su ⋅ Zhi Lu ⋅ Yutong Wu ⋅ Renfei Shen ⋅ Songfeng Lu
|
Exhibit Hall I #364 | |
|
Self-Calibrating Gaussian Splatting for Large Field-of-View Reconstruction
Youming Deng ⋅ Wenqi Xian ⋅ Guandao Yang ⋅ Leonidas Guibas ⋅ Gordon Wetzstein ⋅ Steve Marschner ⋅ Paul Debevec
|
Exhibit Hall I #38 | |
|
Trial-Oriented Visual Rearrangement
Poster Session 2 & Exhibit Hall with Coffee Break
Yuyi Liu ⋅ Xinhang Song ⋅ Tianliang Qi ⋅ Shuqiang Jiang
|
Exhibit Hall I #282 | |
|
MSQ: Memory-Efficient Bit Sparsification Quantization
Poster Session 5 & Exhibit Hall
Seokho Han ⋅ Seoyeon Yoon ⋅ Jinhee Kim ⋅ Dongwei Wang ⋅ Kang Jeon ⋅ Huanrui Yang ⋅ Jong Hwan Ko
|
Exhibit Hall I #195 | |
|
SuMa: A Subspace Mapping Approach for Robust and Effective Concept Erasure in Text-to-Image Diffusion Models
Poster Session 4 & Exhibit Hall with Coffee Break
Kien Nguyen ⋅ Anh Tran ⋅ Cuong Pham
|
Exhibit Hall I #450 | |
|
LVFace: Progressive Cluster Optimization for Large Vision Models in Face Recognition
Jinghan You ⋅ Shanglin Li ⋅ Yuanrui Sun ⋅ Jiangchuanwei Wei ⋅ Mingyu Guo ⋅ Chao Feng ⋅ Jiao Ran
|
Exhibit Hall I #173 | |
|
Timestep-Aware Diffusion Model for Extreme Image Rescaling
Poster Session 4 & Exhibit Hall with Coffee Break
Ce Wang ⋅ Zhenyu Hu ⋅ Wanjie Sun ⋅ Zhenzhong Chen
|
Exhibit Hall I #66 | |
|
Recovering Parametric Scenes from Very Few Time-of-Flight Pixels
Poster Session 6 & Exhibit Hall with Coffee Break
Carter Sifferman ⋅ Yiquan Li ⋅ Yiming Li ⋅ Fangzhou Mu ⋅ Michael Gleicher ⋅ Mohit Gupta ⋅ Yin Li
|
Exhibit Hall I #315 | |
|
SRefiner: Soft-Braid Attention for Multi-Agent Trajectory Refinement
Liwen Xiao ⋅ Zhiyu Pan ⋅ Zhicheng Wang ⋅ Zhiguo Cao ⋅ Wei Li
|
Exhibit Hall I #82 | |
|
Generating Physically Stable and Buildable Brick Structures from Text
Poster Session 4 & Exhibit Hall with Coffee Break
Ava Pun ⋅ Kangle Deng ⋅ Ruixuan Liu ⋅ Deva Ramanan ⋅ Changliu Liu ⋅ Jun-Yan Zhu
|
Exhibit Hall I #306 | |
|
An Empirical Study of Autoregressive Pre-training from Videos
Poster Session 4 & Exhibit Hall with Coffee Break
Jathushan Rajasegaran ⋅ Ilija Radosavovic ⋅ Rahul Ravishankar ⋅ Yossi Gandelsman ⋅ Christoph Feichtenhofer ⋅ Jitendra Malik
|
Exhibit Hall I #405 | |
|
Rethinking Few Shot CLIP Benchmarks: A Critical Analysis in the Inductive Setting
Poster Session 1 & Exhibit Hall
Alexey Kravets ⋅ Da Chen ⋅ Vinay Namboodiri
|
Exhibit Hall I #172 | |
|
TACO: Taming Diffusion for in-the-wild Video Amodal Completion
Poster Session 3 & Exhibit Hall
Ruijie Lu ⋅ Yixin Chen ⋅ Yu Liu ⋅ Jiaxiang Tang ⋅ Junfeng Ni ⋅ Diwen Wan ⋅ Gang Zeng ⋅ Siyuan Huang
|
Exhibit Hall I #342 | |
|
STI-Bench: Are MLLMs Ready for Precise Spatial-Temporal World Understanding?
Poster Session 2 & Exhibit Hall with Coffee Break
Yun Li ⋅ Yiming Zhang ⋅ Tao Lin ⋅ Xiangrui Liu ⋅ Wenxiao Cai ⋅ Zheng Liu ⋅ Bo Zhao
|
Exhibit Hall I #56 | |
|
Debiased Teacher for Day-to-Night Domain Adaptive Object Detection
Poster Session 1 & Exhibit Hall
Yiming Cui ⋅ Liang Li ⋅ Haibing YIN ⋅ Yuhan Gao ⋅ Yaoqi Sun ⋅ Chenggang Yan
|
Exhibit Hall I #237 | |
|
Towards Effective Foundation Model Adaptation for Extreme Cross-Domain Few-Shot Learning
Poster Session 1 & Exhibit Hall
Fei Zhou ⋅ Peng Wang ⋅ Lei Zhang ⋅ Wei Wei ⋅ Chen Ding ⋅ Guosheng Lin ⋅ Yanning Zhang
|
Exhibit Hall I #430 | |
|
SpikePack: Enhanced Information Flow in Spiking Neural Networks with High Hardware Compatibility
Poster Session 5 & Exhibit Hall
Guobin Shen ⋅ Jindong Li ⋅ Tenglong Li ⋅ Dongcheng Zhao ⋅ Yi Zeng
|
Exhibit Hall I #338 | |
|
AV-Flow: Transforming Text to Audio-Visual Human-like Interactions
Poster Session 3 & Exhibit Hall
Aggelina Chatziagapi ⋅ Louis-Philippe Morency ⋅ Hongyu Gong ⋅ Michael Zollhöfer ⋅ Dimitris Samaras ⋅ Alexander Richard
|
Exhibit Hall I #402 | |
|
Stronger, Steadier & Superior: Geometric Consistency in Depth VFM Forges Domain Generalized Semantic Segmentation
Poster Session 2 & Exhibit Hall with Coffee Break
Siyu Chen ⋅ Ting Han ⋅ Changshe Zhang ⋅ Xin Luo ⋅ Meiliu Wu ⋅ Guorong Cai ⋅ Jinhe Su
|
Exhibit Hall I #308 | |
|
Efficient Adaptation of Pre-trained Vision Transformer underpinned by Approximately Orthogonal Fine-Tuning Strategy
Poster Session 1 & Exhibit Hall
Yiting Yang ⋅ Hao Luo ⋅ Yuan Sun ⋅ Qingsen Yan ⋅ Haokui Zhang ⋅ Wei Dong ⋅ Guoqing Wang ⋅ Peng Wang ⋅ Yang Yang ⋅ Heng Tao Shen
|
Exhibit Hall I #458 | |
|
FA: Forced Prompt Learning of Vision-Language Models for Out-of-Distribution Detection
Poster Session 1 & Exhibit Hall
Xinhua Lu ⋅ Runhe Lai ⋅ Yanqi Wu ⋅ Kanghao Chen ⋅ Wei-Shi Zheng ⋅ Ruixuan Wang
|
Exhibit Hall I #100 | |
|
Compression-Aware One-Step Diffusion Model for JPEG Artifact Removal
Poster Session 4 & Exhibit Hall with Coffee Break
Jinpei Guo ⋅ Zheng Chen ⋅ Wenbo Li ⋅ Yong Guo ⋅ YULUN ZHANG
|
Exhibit Hall I #4 | |
|
ConstStyle: Robust Domain Generalization with Unified Style Transformation
Poster Session 1 & Exhibit Hall
Nam Duong Tran ⋅ Nam Nguyen Phuong ⋅ Hieu Pham ⋅ Phi Le Nguyen ⋅ My Thai
|
Exhibit Hall I #293 | |
|
CCL-LGS: Contrastive Codebook Learning for 3D Language Gaussian Splatting
Poster Session 2 & Exhibit Hall with Coffee Break
Lei Tian ⋅ Xiaomin Li ⋅ Liqian Ma ⋅ Hao Yin ⋅ Zirui Zheng ⋅ Hefei Huang ⋅ Taiqing Li ⋅ Huchuan Lu ⋅ Xu Jia
|
Exhibit Hall I #453 | |
|
DreamActor-M1: Holistic, Expressive and Robust Human Image Animation with Hybrid Guidance
Poster Session 3 & Exhibit Hall
Yuxuan Luo ⋅ Zhengkun Rong ⋅ Lizhen Wang ⋅ Longhao Zhang ⋅ Tianshu Hu
|
Exhibit Hall I #97 | |
|
Learning Few-Step Diffusion Models by Trajectory Distribution Matching
Poster Session 4 & Exhibit Hall with Coffee Break
Yihong Luo ⋅ Tianyang Hu ⋅ Jiacheng Sun ⋅ Yujun Cai ⋅ Jing Tang
|
Exhibit Hall I #271 | |
|
Aether: Geometric-Aware Unified World Modeling
Poster Session 2 & Exhibit Hall with Coffee Break
Haoyi Zhu ⋅ Yifan Wang ⋅ Jianjun Zhou ⋅ Wenzheng Chang ⋅ Yang Zhou ⋅ Zizun Li ⋅ Junyi Chen ⋅ Chunhua Shen ⋅ Jiangmiao Pang ⋅ Tong He
|
Exhibit Hall I #331 | |
|
ConsistentCity: Semantic Flow-guided Occupancy DiT for Temporally Consistent Driving Scene Synthesis
Poster Session 6 & Exhibit Hall with Coffee Break
Benjin Zhu ⋅ Xiaogang Wang ⋅ Hongsheng Li
|
Exhibit Hall I #161 | |
|
CLOT: Closed Loop Optimal Transport for Unsupervised Action Segmentation
Poster Session 3 & Exhibit Hall
Elena Bueno-Benito ⋅ Mariella Dimiccoli
|
Exhibit Hall I #66 | |
|
Dual-Temporal Exemplar Representation Network for Video Semantic Segmentation
Poster Session 3 & Exhibit Hall
Xiaolong Xu ⋅ Lei Zhang ⋅ Jiayi Li ⋅ Lituan Wang ⋅ Yifan Guan ⋅ Yu Yan ⋅ Leyi Zhang ⋅ Hao Song
|
Exhibit Hall I #71 | |
|
Gaussian Variation Field Diffusion for High-fidelity Video-to-4D Synthesis
Poster Session 3 & Exhibit Hall
Bowen Zhang ⋅ Sicheng Xu ⋅ Chuxin Wang ⋅ Jiaolong Yang ⋅ Feng Zhao ⋅ Dong Chen ⋅ Baining Guo
|
Exhibit Hall I #236 | |
|
Unified Open-World Segmentation with Multi-Modal Prompts
Poster Session 5 & Exhibit Hall
Yang Liu ⋅ Yufei Yin ⋅ Chenchen Jing ⋅ Muzhi Zhu ⋅ Hao Chen ⋅ Yuling Xi ⋅ Bo Feng ⋅ Hao Wang ⋅ Shiyu Li ⋅ Chunhua Shen
|
Exhibit Hall I #165 | |
|
Neurons: Emulating the Human Visual Cortex Improves Fidelity and Interpretability in fMRI-to-Video Reconstruction
Poster Session 4 & Exhibit Hall with Coffee Break
Haonan Wang ⋅ Qixiang ZHANG ⋅ Lehan Wang ⋅ Xuanqi Huang ⋅ Xiaomeng Li
|
Exhibit Hall I #334 | |
|
Outdoor Monocular SLAM with Global Scale-Consistent 3D Gaussian Pointmaps
Poster Session 6 & Exhibit Hall with Coffee Break
Chong Cheng ⋅ Sicheng Yu ⋅ Zijian Wang ⋅ Yifan Zhou ⋅ Hao Wang
|
Exhibit Hall I #125 | |
|
LayerAnimate: Layer-level Control for Animation
Poster Session 3 & Exhibit Hall
Yuxue Yang ⋅ Lue Fan ⋅ Zuzeng Lin ⋅ Feng Wang ⋅ Zhaoxiang Zhang
|
Exhibit Hall I #81 | |
|
AnimateAnyMesh: A Feed-Forward 4D Foundation Model for Text-Driven Universal Mesh Animation
Poster Session 3 & Exhibit Hall
zijie wu ⋅ Chaohui Yu ⋅ Fan Wang ⋅ Xiang Bai
|
Exhibit Hall I #335 | |
|
Rethinking the Embodied Gap in Vision-and-Language Navigation: A Holistic Study of Physical and Visual Disparities
Poster Session 2 & Exhibit Hall with Coffee Break
Liuyi Wang ⋅ Xinyuan Xia ⋅ Hui Zhao ⋅ Hanqing Wang ⋅ Tai Wang ⋅ Yilun Chen ⋅ Chengju Liu ⋅ Qijun Chen ⋅ Jiangmiao Pang
|
Exhibit Hall I #416 | |
|
SuperEvent: Cross-Modal Learning of Event-based Keypoint Detection for SLAM
Yannick Burkhardt ⋅ Simon Schaefer ⋅ Stefan Leutenegger
|
Exhibit Hall I #366 | |
|
RogSplat: Robust Gaussian Splatting via Generative Priors
Poster Session 6 & Exhibit Hall with Coffee Break
Hanyang Kong ⋅ Xingyi Yang ⋅ Xinchao Wang
|
Exhibit Hall I #97 | |
|
From Imitation to Innovation: The Emergence of AI's Unique Artistic Styles and the Challenge of Copyright Protection
Poster Session 4 & Exhibit Hall with Coffee Break
Zexi Jia ⋅ Chuanwei Huang ⋅ Hongyan Fei ⋅ Yeshuang Zhu ⋅ Zhiqiang Yuan ⋅ Ying Deng ⋅ Jiapei Zhang ⋅ Jinchao Zhang ⋅ Jie Zhou
|
Exhibit Hall I #393 | |
|
Intra-modal and Cross-modal Synchronization for Audio-visual Deepfake Detection and Temporal Localization
Poster Session 3 & Exhibit Hall
Ashutosh Anshul ⋅ Shreyas Gopal ⋅ Deepu Rajan ⋅ Eng Chng
|
Exhibit Hall I #359 | |
|
MinCD-PnP: Learning 2D-3D Correspondences with Approximate Blind PnP
Poster Session 6 & Exhibit Hall with Coffee Break
Pei An ⋅ Jiaqi Yang ⋅ Muyao Peng ⋅ You Yang ⋅ Qiong Liu ⋅ Xiaolin Wu ⋅ Liangliang Nan
|
Exhibit Hall I #174 | |
|
SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation
Shiqi Huang ⋅ Shuting He ⋅ Huaiyuan Qin ⋅ Bihan Wen
|
Exhibit Hall I #241 | |
|
Mind the Cost of Scaffold! Benign Clients May Even Become Accomplices of Backdoor Attack
Poster Session 1 & Exhibit Hall
Xingshuo Han ⋅ Xuanye Zhang ⋅ Xiang Lan ⋅ Haozhao Wang ⋅ Shengmin Xu ⋅ Shen Ren ⋅ Jason Zeng ⋅ Ming Wu ⋅ Michael Heinrich ⋅ Tianwei Zhang
|
Exhibit Hall I #140 | |
|
How To Make Your Cell Tracker Say "I dunno!"
Poster Session 2 & Exhibit Hall with Coffee Break
Richard D Paul ⋅ Johannes Seiffarth ⋅ David Rügamer ⋅ Hanno Scharr ⋅ Katharina Nöh
|
Exhibit Hall I #178 | |
|
InstructSeg: Unifying Instructed Visual Segmentation with Multi-modal Large Language Models
Poster Session 5 & Exhibit Hall
Cong Wei ⋅ Yujie Zhong ⋅ yingsen zeng ⋅ Haoxian Tan ⋅ Yong Liu ⋅ Hongfa Wang ⋅ Yujiu Yang
|
Exhibit Hall I #37 | |
|
Bilateral Collaboration with Large Vision-Language Models for Open Vocabulary Human-Object Interaction Detection
Poster Session 5 & Exhibit Hall
Yupeng Hu ⋅ Changxing Ding ⋅ Chang Sun ⋅ Shaoli Huang ⋅ Xiangmin Xu
|
Exhibit Hall I #31 | |
|
Open-ended Hierarchical Streaming Video Understanding with Vision Language Models
Poster Session 5 & Exhibit Hall
Hyolim Kang ⋅ Yunsu Park ⋅ Youngbeom Yoo ⋅ Yeeun Choi ⋅ Seon Joo Kim
|
Exhibit Hall I #87 | |
|
V2M4: 4D Mesh Animation Reconstruction from a Single Monocular Video
Poster Session 3 & Exhibit Hall
Jianqi Chen ⋅ Biao Zhang ⋅ Xiangjun Tang ⋅ Peter Wonka
|
Exhibit Hall I #155 | |
|
CoTMR: Chain-of-Thought Multi-Scale Reasoning for Training-Free Zero-Shot Composed Image Retrieval
Poster Session 5 & Exhibit Hall
Zelong Sun ⋅ Dong Jing ⋅ Zhiwu Lu
|
Exhibit Hall I #270 | |
|
Towards a 3D Transfer-based Black-box Attack via Critical Feature Guidance
Poster Session 6 & Exhibit Hall with Coffee Break
Shuchao Pang ⋅ Zhenghan Chen ⋅ Shen Zhang ⋅ Liming Lu ⋅ Siyuan Liang ⋅ Anan Du ⋅ Yongbin Zhou
|
Exhibit Hall I #211 | |
|
DepthSync: Diffusion Guidance-Based Depth Synchronization for Scale- and Geometry-Consistent Video Depth Estimation
Poster Session 2 & Exhibit Hall with Coffee Break
Yue-Jiang Dong ⋅ Wang Zhao ⋅ Jiale Xu ⋅ Ying Shan ⋅ Song-Hai Zhang
|
Exhibit Hall I #37 | |
|
A Token-level Text Image Foundation Model for Document Understanding
Poster Session 5 & Exhibit Hall
Tongkun Guan ⋅ Zining Wang ⋅ Pei Fu ⋅ Zhentao Guo ⋅ Wei Shen ⋅ Kai zhou ⋅ Tiezhu Yue ⋅ Chen Duan ⋅ Hao Sun ⋅ Qianyi Jiang ⋅ Junfeng Luo ⋅ Xiaokang Yang
|
Exhibit Hall I #322 | |
|
Learning 3D Object Spatial Relationships from Pre-trained 2D Diffusion Models
Poster Session 2 & Exhibit Hall with Coffee Break
Sangwon Baik ⋅ Hyeonwoo Kim ⋅ Hanbyul Joo
|
Exhibit Hall I #320 | |
|
MonoMVSNet: Monocular Priors Guided Multi-View Stereo Network
Poster Session 6 & Exhibit Hall with Coffee Break
Jianfei Jiang ⋅ Qiankun Liu ⋅ Haochen Yu ⋅ Hongyuan Liu ⋅ Liyong Wang ⋅ Jiansheng Chen ⋅ Huimin Ma
|
Exhibit Hall I #298 | |
|
Instance-Level Video Depth in Groups Beyond Occlusions
Poster Session 2 & Exhibit Hall with Coffee Break
Yuan Liang ⋅ Yang Zhou ⋅ Ziming Sun ⋅ Tianyi Xiang ⋅ Guiqing Li ⋅ Shengfeng He
|
Exhibit Hall I #241 | |
|
Flow Stochastic Segmentation Networks
Poster Session 3 & Exhibit Hall
Fabio De Sousa Ribeiro ⋅ Omar Todd ⋅ Charles Jones ⋅ Avinash Kori ⋅ Raghav Mehta ⋅ Ben Glocker
|
Exhibit Hall I #447 | |
|
Animate Anyone 2: High-Fidelity Character Image Animation with Environment Affordance
Poster Session 3 & Exhibit Hall
Li Hu ⋅ wang yuan ⋅ Zhen Shen ⋅ Xin Gao ⋅ Dechao Meng ⋅ Li'an Zhuo ⋅ Peng Zhang ⋅ Bang Zhang ⋅ Liefeng Bo
|
Exhibit Hall I #19 | |
|
Future-Aware Interaction Network For Motion Forecasting
Poster Session 2 & Exhibit Hall with Coffee Break
Shijie Li ⋅ Chunyu Liu ⋅ Xun Xu ⋅ Si Yong Yeo ⋅ Xulei Yang
|
Exhibit Hall I #234 | |
|
ScanEdit: Hierarchically-Guided Functional 3D Scan Editing
Poster Session 6 & Exhibit Hall with Coffee Break
Mohamed El Amine Boudjoghra ⋅ Ivan Laptev ⋅ Angela Dai
|
Exhibit Hall I #231 | |
|
Latent Diffusion Models with Masked AutoEncoders
Poster Session 4 & Exhibit Hall with Coffee Break
Junho Lee ⋅ Jeongwoo Shin ⋅ Hyungwook Choi ⋅ Joonseok Lee
|
Exhibit Hall I #243 | |
|
DreamCube: RGB-D Panorama Generation via Multi-plane Synchronization
Poster Session 6 & Exhibit Hall with Coffee Break
Yukun Huang ⋅ Yanning Zhou ⋅ Jianan Wang ⋅ Kaiyi Huang ⋅ Xihui Liu
|
Exhibit Hall I #19 | |
|
From Enhancement to Understanding: Build a Generalized Bridge for Low-light Vision via Semantically Consistent Unsupervised Fine-tuning
Poster Session 3 & Exhibit Hall
Sen Wang ⋅ Shao Zeng ⋅ Tianjun Gu ⋅ zhizhong zhang ⋅ Ruixin Zhang ⋅ Shouhong Ding ⋅ Jingyun Zhang ⋅ Jun Wang ⋅ Xin TAN ⋅ Yuan Xie ⋅ Lizhuang Ma
|
Exhibit Hall I #357 | |
|
Inpaint4Drag: Repurposing Inpainting Models for Drag-Based Image Editing via Bidirectional Warping
Poster Session 4 & Exhibit Hall with Coffee Break
Jingyi Lu ⋅ Kai Han
|
Exhibit Hall I #328 | |
|
MotionAgent: Fine-grained Controllable Video Generation via Motion Field Agent
Poster Session 3 & Exhibit Hall
Xinyao Liao ⋅ Xianfang Zeng ⋅ Liao Wang ⋅ Gang YU ⋅ Guosheng Lin ⋅ Chi Zhang
|
Exhibit Hall I #122 | |
|
Unified Video Generation via Next-Set Prediction in Continuous Domain
Poster Session 4 & Exhibit Hall with Coffee Break
Zhanzhou Feng ⋅ Qingpei Guo ⋅ Xinyu Xiao ⋅ Ruihan Xu ⋅ Ming Yang ⋅ Shiliang Zhang
|
Exhibit Hall I #435 | |
|
PropVG: End-to-End Proposal-Driven Visual Grounding with Multi-Granularity Discrimination
Poster Session 2 & Exhibit Hall with Coffee Break
Ming Dai ⋅ Wenxuan Cheng ⋅ Jiedong Zhuang ⋅ Jiang-Jiang Liu ⋅ Hongshen Zhao ⋅ Zhenhua Feng ⋅ Wankou Yang
|
Exhibit Hall I #191 | |
|
LazyMAR: Accelerating Masked Autoregressive Models via Feature Caching
Poster Session 4 & Exhibit Hall with Coffee Break
Feihong Yan ⋅ qingyan wei ⋅ Jiayi Tang ⋅ Jiajun Li ⋅ Yulin Wang ⋅ Xuming Hu ⋅ Huiqi Li ⋅ Linfeng Zhang
|
Exhibit Hall I #62 | |
|
Visual Intention Grounding for Egocentric Assistants
Poster Session 1 & Exhibit Hall
Pengzhan Sun ⋅ Junbin Xiao ⋅ Tze Ho Elden Tse ⋅ Yicong Li ⋅ Arjun Akula ⋅ Angela Yao
|
Exhibit Hall I #231 | |
|
Omni-scene Perception-oriented Point Cloud Geometry Enhancement for Coordinate Quantization
Poster Session 6 & Exhibit Hall with Coffee Break
Wang Liu ⋅ Wei Gao
|
Exhibit Hall I #127 | |
|
MVQA: Mamba with Unified Sampling for Efficient Video Quality Assessment
Yachun Mi ⋅ Yu Li ⋅ Weicheng Meng ⋅ Chaofeng Chen ⋅ Chen Hui ⋅ Shaohui Liu
|
Exhibit Hall I #346 | |
|
INS-MMBench: A Comprehensive Benchmark for Evaluating LVLMs' Performance in Insurance
Poster Session 2 & Exhibit Hall with Coffee Break
Chenwei Lin ⋅ Hanjia Lyu ⋅ Xian Xu ⋅ Jiebo Luo
|
Exhibit Hall I #377 | |
|
FreeFlux: Understanding and Exploiting Layer-Specific Roles in RoPE-Based MMDiT for Versatile Image Editing
Poster Session 4 & Exhibit Hall with Coffee Break
Tianyi Wei ⋅ Yifan Zhou ⋅ Dongdong Chen ⋅ Xingang Pan
|
Exhibit Hall I #178 | |
|
PriOr-Flow: Enhancing Primitive Panoramic Optical Flow with Orthogonal View
Longliang Liu ⋅ Miaojie Feng ⋅ Junda Cheng ⋅ Jijun Xiang ⋅ Xuan Zhu ⋅ Xin Yang
|
Exhibit Hall I #29 | |
|
Retinex-MEF: Retinex-based Glare Effects Aware Unsupervised Multi-Exposure Image Fusion
Poster Session 2 & Exhibit Hall with Coffee Break
Haowen Bai ⋅ Jiangshe Zhang ⋅ Zixiang Zhao ⋅ Lilun Deng ⋅ Yukun Cui ⋅ Shuang Xu
|
Exhibit Hall I #209 | |
|
Zero-Shot Composed Image Retrieval via Dual-Stream Instruction-Aware Distillation
Poster Session 5 & Exhibit Hall
Wenliang Zhong ⋅ Rob Barton ⋅ Weizhi An ⋅ Feng Jiang ⋅ Hehuan Ma ⋅ Yuzhi Guo ⋅ Abhishek Dan ⋅ Shioulin Sam ⋅ Karim Bouyarmane ⋅ Junzhou Huang
|
Exhibit Hall I #226 |
Successful Page Load