Vitranspad: Video Transformer Using Convolution And Self-Attention For Face Presentation Attack Detection
Vitranspad: Video Transformer Using Convolution And Self-Attention For Face Presentation Attack Detection
Face Presentation Attack Detection (PAD) is an important measure to prevent spoof attacks for face biometric systems. Many works based on Convolution Neural Networks (CNNs) for face PAD formulate the problem as an image-level binary classification task without considering the context. Alternatively, Vision Transformers (ViT) using self-attention to attend the …