

Poster

Local Scale Equivariance with Deep Equilibrium Canonicalizer in the Latent Space

Md Ashiqur Rahman · Chiao-An Yang · Michael Cheng · Lim Hao · Jeremiah Jiang · Teck-Yian Lim · Raymond Yeh


Abstract:

Scale variation is a fundamental challenge in computer vision. Objects of the same class can have different sizes, and their perceived size is further affected by their distance from the camera. These variations are local to the objects, i.e., the sizes of different objects may change differently within the same image. To handle scale variations effectively, we present a deep equilibrium canonicalizer (DEC) that improves the local scale equivariance of a model. DEC can be easily incorporated into existing network architectures and can be adapted to a pre-trained model. Notably, we show that on the competitive ImageNet benchmark, DEC improves both model performance and local scale consistency across four popular pre-trained deep-nets: ViT, DeiT, Swin, and BEiT.
