arXiv (CS.CV) 2026-06-25 12:00 DOI: arXiv:2606.25318

REViT: Roto-reflection Equivariant Convolutional Vision Transformer

Abstract

We propose a vision transformer with convolutional attention that is equivariant to a discrete roto-reflection group. Roto-reflection equivariant networks preserve rotational, flip, and positional symmetry in their feature maps, which makes them useful for tasks where the orientation of the input is relevant to the model output. In image classification and object detection, most studies of roto-reflection equivariant models have focused on convolutional neural networks rather than vision transformers. We examine the challenges involved in achieving equivariance in vision transformers and propose a simpler way to implement a discretized roto-reflection group equivariant vision transformer. Experimental results show that our approach outperforms existing approaches to building discrete roto-reflection group equivariant neural networks for image classification.
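The equivariance property that the abstract refers to can be made concrete with a small numerical check. The sketch below is not the REViT architecture; it only illustrates, under assumptions of our own, what it means for a layer f to satisfy f(g·x) = g·f(x) for every element g of the discrete roto-reflection group of the square (four rotations by 90 degrees, each with or without a flip). The layer used here is a plain circular convolution, and all function names (`d4_transforms`, `symmetrize`, `circular_conv`, `max_equivariance_error`) are hypothetical helpers introduced for illustration.

```python
import numpy as np

def d4_transforms():
    """Return the 8 roto-reflections of a square array as functions.

    Each element is an optional left-right flip followed by k quarter turns.
    """
    ops = []
    for flip in (False, True):
        for k in range(4):
            def op(x, k=k, flip=flip):
                y = np.fliplr(x) if flip else x
                return np.rot90(y, k)
            ops.append(op)
    return ops

def symmetrize(kernel):
    """Average a square kernel over all 8 transforms so it is invariant to them."""
    return sum(g(kernel) for g in d4_transforms()) / 8.0

def circular_conv(x, kernel):
    """2D cross-correlation with wrap-around padding (odd-sized square kernel)."""
    r = kernel.shape[0] // 2
    out = np.zeros_like(x, dtype=float)
    for i in range(-r, r + 1):
        for j in range(-r, r + 1):
            # np.roll with a negative shift reads the neighbour at offset (i, j).
            out += kernel[i + r, j + r] * np.roll(x, shift=(-i, -j), axis=(0, 1))
    return out

def max_equivariance_error(x, kernel):
    """Largest deviation between f(g.x) and g.f(x) over all 8 group elements."""
    return max(
        np.abs(circular_conv(g(x), kernel) - g(circular_conv(x, kernel))).max()
        for g in d4_transforms()
    )

rng = np.random.default_rng(0)
image = rng.normal(size=(8, 8))          # square input, assumed for simplicity

shift_kernel = np.zeros((3, 3))
shift_kernel[0, 1] = 1.0                 # picks one direction: not symmetric

err_plain = max_equivariance_error(image, shift_kernel)
err_sym = max_equivariance_error(image, symmetrize(shift_kernel))
print(f"asymmetric kernel error:  {err_plain:.3e}")
print(f"symmetrized kernel error: {err_sym:.3e}")
```

With the symmetrized kernel the error sits at floating-point round-off, while the direction-selective kernel breaks equivariance. Constraining filters to be invariant is only the simplest way to obtain the property and is used here purely to make the definition testable; it should not be read as the construction proposed in the paper.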
