BViT: Broad Attention-Based Vision Transformer
BViT: Broad Attention-Based Vision Transformer
Recent works have demonstrated that transformer can achieve promising performance in computer vision, by exploiting the relationship among image patches with self-attention. They only consider the attention in a single feature layer, but ignore the complementarity of attention in different layers. In this article, we propose broad attention to improve …