POSTER: A Pyramid Cross-Fusion Transformer Network for Facial Expression Recognition

Type: Article

Publication Date: 2023-10-02

Citations: 46

DOI: https://doi.org/10.1109/iccvw60793.2023.00339

Abstract

Facial expression recognition (FER) is an important task in computer vision, with practical applications in areas such as human-computer interaction, education, healthcare, and online monitoring. Three key issues are especially prevalent in this challenging task: inter-class similarity, intra-class discrepancy, and scale sensitivity. While existing works typically address some of these issues, none has addressed all three in a unified framework. In this paper, we propose a two-stream Pyramid crOss-fuSion TransformER network (POSTER) that aims to solve all three issues holistically. Specifically, we design a transformer-based cross-fusion method that enables effective collaboration between facial landmark features and image features to focus attention on salient facial regions. Furthermore, POSTER employs a pyramid structure to promote scale invariance. Extensive experimental results demonstrate that POSTER achieves new state-of-the-art results on RAF-DB (92.05%), FERPlus (91.62%), and AffectNet 7-class (67.31%) and 8-class (63.34%). Code is available at https://github.com/zczcwh/POSTER.
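The two ideas named in the abstract, cross-fusion between a landmark stream and an image stream and a pyramid over scales, can be illustrated with a minimal, dependency-free sketch. It is not the authors' implementation (see the linked repository for that). Several choices here are assumptions made only for illustration: each stream takes its queries from the other stream, both streams have the same number of tokens, the pyramid is built by averaging groups of neighbouring tokens, and random matrices stand in for learned projection weights. All function names are hypothetical.

```python
import math
import random

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def softmax(row):
    """Numerically stable softmax over one list of scores."""
    m = max(row)
    e = [math.exp(v - m) for v in row]
    s = sum(e)
    return [v / s for v in e]

def cross_attention(query_src, kv_src, w_q, w_k, w_v):
    """Single-head attention: queries from one stream, keys/values from the other."""
    q, k, v = matmul(query_src, w_q), matmul(kv_src, w_k), matmul(kv_src, w_v)
    scale = math.sqrt(len(q[0]))
    k_t = [list(col) for col in zip(*k)]
    weights = [softmax([s / scale for s in row]) for row in matmul(q, k_t)]
    return matmul(weights, v)

def add(a, b):
    """Element-wise sum of two equally shaped matrices (residual connection)."""
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def rand_matrix(rng, d):
    """Random d x d matrix standing in for a learned projection (assumption)."""
    return [[rng.gauss(0.0, 1.0) / math.sqrt(d) for _ in range(d)] for _ in range(d)]

def cross_fuse(img_tokens, lm_tokens, rng):
    """Fuse the two streams by swapping queries (assumed design, for illustration)."""
    d = len(img_tokens[0])
    w_img = [rand_matrix(rng, d) for _ in range(3)]
    w_lm = [rand_matrix(rng, d) for _ in range(3)]
    # Image stream: landmark-derived queries attend over image keys/values.
    img_out = add(img_tokens, cross_attention(lm_tokens, img_tokens, *w_img))
    # Landmark stream: image-derived queries attend over landmark keys/values.
    lm_out = add(lm_tokens, cross_attention(img_tokens, lm_tokens, *w_lm))
    return img_out, lm_out

def pool(tokens, factor):
    """Average each group of `factor` consecutive tokens (token count must divide evenly)."""
    return [[sum(col) / factor for col in zip(*tokens[i:i + factor])]
            for i in range(0, len(tokens), factor)]

def pyramid_cross_fusion(img_tokens, lm_tokens, factors=(1, 2, 4), seed=0):
    """Run cross-fusion at several token resolutions, one pair of outputs per level."""
    rng = random.Random(seed)
    return [cross_fuse(pool(img_tokens, f), pool(lm_tokens, f), rng) for f in factors]
```

Called with two 16-token streams of width 8, `pyramid_cross_fusion` returns three fused pairs with 16, 8, and 4 tokens each. A real model would learn the projections, use multiple heads, and combine the levels before classification.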

Locations

  • arXiv (Cornell University)

Similar Works

  • POSTER: A Pyramid Cross-Fusion Transformer Network for Facial Expression Recognition (2022). Ce Zheng, Matías Mendieta, Chen Chen.
  • A Lightweight Attention-based Deep Network via Multi-Scale Feature Fusion for Multi-View Facial Expression Recognition (2024). Ali Ezati, M Dezyani, Rajib Rana, Roozbeh Rajabi, Ahmad Ayatollahi.
  • Facial Expression Recognition With Visual Transformers and Attentional Selective Fusion (2021). Fuyan Ma, Bin Sun, Shutao Li.
  • Multi Loss-based Feature Fusion and Top Two Voting Ensemble Decision Strategy for Facial Expression Recognition in the Wild (2023). Guangyao Zhou, Yuanlun Xie, Wenhong Tian.
  • Robust Facial Expression Recognition with Convolutional Visual Transformers (2021). Fuyan Ma, Bin Sun, Shutao Li.
  • MVT: Mask Vision Transformer for Facial Expression Recognition in the wild (2021). Hanting Li, Mingzhe Sui, Feng Zhao, Zheng-Jun Zha, Feng Wu.
  • Landmark-Aware and Part-based Ensemble Transfer Learning Network for Facial Expression Recognition from Static images (2021). Rohan Wadhawan, Tapan Kumar Gandhi.
  • Imponderous Net for Facial Expression Recognition in the Wild (2021). Darshan Gera, S. Balasubramanian.
  • FER-former: Multi-modal Transformer for Facial Expression Recognition (2023). Yande Li, Mingjie Wang, Minglun Gong, Yonggang Lu, Li Liu.
  • Facial Expression Recognition with Swin Transformer (2022). Jun-Hwa Kim, Nam‐Ho Kim, Chee Sun Won.
  • Distract Your Attention: Multi-head Cross Attention Network for Facial Expression Recognition (2021). Zhengyao Wen, Wenzhong Lin, Tao Wang, Ge Xu.
  • Cross-Domain Facial Expression Recognition: A Unified Evaluation Benchmark and Adversarial Graph Learning (2021). Tianshui Chen, Tao Pu, Hefeng Wu, Yuan Xie, Lingbo Liu, Liang Lin.
  • Adaptively Lighting up Facial Expression Crucial Regions via Local Non-Local Joint Network (2022). Guanghui Shi, Shasha Mao, Shuiping Gou, Dandan Yan, Licheng Jiao, Lin Xiong.
  • Deep Facial Expression Recognition: A Survey (2020). Shan Li, Weihong Deng.
  • Emotic Masked Autoencoder with Attention Fusion for Facial Expression Recognition (2024). Bach Nguyen-Xuan, Thien Nguyen-Hoang, Nhu Tai-Do.
  • Intensity-Aware Loss for Dynamic Facial Expression Recognition in the Wild (2022). Hanting Li, Hongjing Niu, Zhaoqing Zhu, Feng Zhao.
  • Vision Transformer With Attentive Pooling for Robust Facial Expression Recognition (2022). Fanglei Xue, Qiangchang Wang, Zichang Tan, Zhongsong Ma, Guodong Guo.
  • Learning Vision Transformer with Squeeze and Excitation for Facial Expression Recognition (2021). Mouath Aouayeb, Wassim Hamidouche, Catherine Soladié, Kidiyo Kpalma, Renaud Séguier.
