DirecFormer: A Directed Attention in Transformer Approach to Robust Action Recognition
DirecFormer: A Directed Attention in Transformer Approach to Robust Action Recognition
Human action recognition has recently become one of the popular research topics in the computer vision community. Various 3D-CNN based methods have been presented to tackle both the spatial and temporal dimensions in the task of video action recognition with competitive results. However, these methods have suffered some fundamental limitations …