Rethinking Temporal Fusion for Video-Based Person Re-Identification on Semantic and Time Aspect

Type: Article

Publication Date: 2020-04-03

Citations: 13

DOI: https://doi.org/10.1609/aaai.v34i07.6770

Abstract

Recently, research interest in person re-identification (ReID) has gradually turned to video-based methods, which acquire a person representation by aggregating the frame features of an entire video. However, existing video-based ReID methods do not consider the semantic differences among the outputs of different network stages, which potentially compromises the information richness of the person features. Furthermore, traditional methods ignore important relationships among frames, which causes information redundancy in fusion along the time axis. To address these issues, we propose a novel general temporal fusion framework that aggregates frame features on both the semantic aspect and the time aspect. On the semantic aspect, a multi-stage fusion network is explored to fuse richer frame features at multiple semantic levels, which can effectively reduce the information loss caused by traditional single-stage fusion. On the time aspect, the existing intra-frame attention method is improved by adding a novel inter-frame attention module, which effectively reduces the information redundancy in temporal fusion by taking the relationships among frames into consideration. The experimental results show that our approach can effectively improve video-based re-identification accuracy, achieving state-of-the-art performance.
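
The abstract names two ideas: fusing frame features at several network stages, and weighting frames by both intra-frame and inter-frame attention. The sketch below is only an illustration of that general idea, not the authors' architecture. Every name in it (`fuse_frames`, `multi_stage_fuse`, the linear scorer `w`, `redundancy_weight`) is hypothetical, and the inter-frame term is assumed here to be a simple mean-cosine-similarity penalty, which the abstract does not specify.

```python
import math

def softmax(scores):
    # Numerically stable softmax over a list of floats.
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def cosine(u, v):
    # Cosine similarity; returns 0.0 if either vector is all zeros.
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv) if nu and nv else 0.0

def fuse_frames(frames, w, redundancy_weight=1.0):
    """Fuse T frame vectors into one vector using attention weights.

    Assumption: the intra-frame score is a dot product with a scorer w,
    and the inter-frame term penalizes frames similar to the others.
    """
    n = len(frames)
    # Intra-frame score: depends only on the frame's own feature.
    intra = [sum(a * b for a, b in zip(f, w)) for f in frames]
    # Inter-frame redundancy: mean cosine similarity to all other frames.
    redundancy = [
        sum(cosine(frames[i], frames[j]) for j in range(n) if j != i) / max(n - 1, 1)
        for i in range(n)
    ]
    weights = softmax([s - redundancy_weight * r for s, r in zip(intra, redundancy)])
    dim = len(frames[0])
    fused = [sum(wt * f[d] for wt, f in zip(weights, frames)) for d in range(dim)]
    return fused, weights

def multi_stage_fuse(stage_features, stage_scorers):
    """Fuse each stage's frame features separately, then concatenate."""
    out = []
    for frames, w in zip(stage_features, stage_scorers):
        fused, _ = fuse_frames(frames, w)
        out.extend(fused)
    return out

# Two near-identical frames and one distinct frame; equal intra-frame scores.
frames = [[1.0, 0.0], [1.0, 0.0], [0.0, 1.0]]
fused, weights = fuse_frames(frames, w=[0.0, 0.0])
```

With equal intra-frame scores, the distinct third frame receives a larger weight than either of the two redundant frames, which is the redundancy-reducing behaviour that the inter-frame term is meant to illustrate.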

Locations

  • Proceedings of the AAAI Conference on Artificial Intelligence
  • arXiv (Cornell University)

Similar Works

  • Rethinking Temporal Fusion for Video-based Person Re-identification on Semantic and Time Aspect (2019). Xinyang Jiang, Yifei Gong, Xiaowei Guo, Qize Yang, Feiyue Huang, Wei‐Shi Zheng, Feng Zheng, Xing Sun
  • Convolutional Temporal Attention Model for Video-Based Person Re-Identification (2019). Tanzila Rahman, Mrigank Rochan, Yang Wang
  • Multi-Scale Temporal Cues Learning for Video Person Re-Identification (2020). Jianing Li, Shiliang Zhang, Tiejun Huang
  • Global-Local Temporal Representations for Video Person Re-Identification (2019). Jianing Li, Shiliang Zhang, Jingdong Wang, Wen Gao, Qi Tian
  • Jointly Attentive Spatial-Temporal Pooling Networks for Video-based Person Re-Identification (2017). Shuangjie Xu, Yu Cheng, Kang Gu, Yang Yang, Shiyu Chang, Pan Zhou
  • BiCnet-TKS: Learning Efficient Spatial-Temporal Representation for Video Person Re-Identification (2021). Ruibing Hou, Hong Chang, Bingpeng Ma, Rui Huang, Shiguang Shan
  • Convolutional Temporal Attention Model for Video-based Person Re-identification (2019). Tanzila Rahman, Mrigank Rochan, Yang Wang
  • Jointly Attentive Spatial-Temporal Pooling Networks for Video-Based Person Re-identification (2017). Shuangjie Xu, Yu Cheng, Kang Gu, Yang Yang, Shiyu Chang, Pan Zhou
  • Video-based Person Re-identification Using Spatial-Temporal Attention Networks (2018). Shivansh Rao, Tanzila Rahman, Mrigank Rochan, Yang Wang
  • Spatial and Temporal Mutual Promotion for Video-based Person Re-identification (2018). Yiheng Liu, Zhenxun Yuan, Wengang Zhou, Houqiang Li
  • Spatial and Temporal Mutual Promotion for Video-Based Person Re-Identification (2019). Yiheng Liu, Zhenxun Yuan, Wengang Zhou, Houqiang Li
  • Multi-Stage Spatio-Temporal Aggregation Transformer for Video Person Re-Identification (2022). Ziyi Tang, Ruimao Zhang, Zhanglin Peng, Jinrui Chen, Liang Lin
  • Multi-Stage Spatio-Temporal Aggregation Transformer for Video Person Re-identification (2023). Ziyi Tang, Ruimao Zhang, Zhanglin Peng, Jinrui Chen, Liang Lin
  • Three-Stream Convolutional Networks for Video-based Person Re-Identification (2017). Yu Zeng, Tianrui Li, Ning Yu, Xun Gong, Ke Chen, Yi Pan
  • Video Person Re-Identification by Temporal Residual Learning (2018). Ju Dai, Pingping Zhang, Dong Wang, Huchuan Lu, Hongyu Wang
  • SCAN: Self-and-Collaborative Attention Network for Video Person Re-Identification (2019). Ruimao Zhang, Jingyu Li, Hongbin Sun, Yuying Ge, Ping Luo, Xiaogang Wang, Liang Lin
  • Co-Saliency Spatio-Temporal Interaction Network for Person Re-Identification in Videos (2020). Jiawei Liu, Zheng-Jun Zha, Xierong Zhu, Na Jiang
  • Spatially and Temporally Efficient Non-local Attention Network for Video-based Person Re-Identification (2019). Chih‐Ting Liu, Chih-Wei Wu, Yu-Chiang Frank Wang, Shao‐Yi Chien
  • Three-Stream Convolutional Networks for Video-based Person Re-Identification (2019). Yu Zeng, Tianrui Li, Ning Yu, Xun Gong