REViT: Roto-reflection Equivariant Convolutional Vision Transformer

arXiv:2606.25318v1 Announce Type: new Abstract: In this paper, we propose a discrete roto-reflection group equivariant vision transformer with convolutional attention. Roto-reflection equivariant networks preserve the rotational, flip and positional symmetry in feature maps, making them useful for tasks where orientation of the inputs is relevant to the model outputs. In image classification and object detection, most of the studies on roto-reflection equivariant models have focused on using con...

arXiv cs.CV ·Sheir A. Zaheer, Alexander C. Holston, Chan Y. Park ·
compartilhar: