Rethinking Temporal Fusion for Video-Based Person Re-Identification on Semantic and Time Aspect

Xinyang Jiang, Yifei Gong, Xiaowei Guo, Qize Yang, Feiyue Huang, Wei‐Shi Zheng, Feng Zheng, Xing Sun

Type: Article

Publication Date: 2020-04-03

Citations: 13

DOI: https://doi.org/10.1609/aaai.v34i07.6770

Abstract

Recently, the research interest of person re-identification (ReID) has gradually turned to video-based methods, which acquire a person representation by aggregating frame features of an entire video. However, existing video-based ReID methods do not consider the semantic difference brought by the outputs of different network stages, which potentially compromises the information richness of the person features. Furthermore, traditional methods ignore important relationship among frames, which causes information redundancy in fusion along the time axis. To address these issues, we propose a novel general temporal fusion framework to aggregate frame features on both semantic aspect and time aspect. As for the semantic aspect, a multi-stage fusion network is explored to fuse richer frame features at multiple semantic levels, which can effectively reduce the information loss caused by the traditional single-stage fusion. While, for the time axis, the existing intra-frame attention method is improved by adding a novel inter-frame attention module, which effectively reduces the information redundancy in temporal fusion by taking the relationship among frames into consideration. The experimental results show that our approach can effectively improve the video-based re-identification accuracy, achieving the state-of-the-art performance.

Locations

Proceedings of the AAAI Conference on Artificial Intelligence - View - PDF
arXiv (Cornell University) - View - PDF

Similar Works

Action	Title	Year	Authors
+	Rethinking Temporal Fusion for Video-based Person Re-identification on Semantic and Time Aspect	2019	Xinyang Jiang Yifei Gong Xiaowei Guo Qize Yang Feiyue Huang Wei‐Shi Zheng Feng Zheng Xing Sun
+ PDF Chat	Convolutional Temporal Attention Model for Video-Based Person Re-Identification	2019	Tanzila Rahman Mrigank Rochan Yang Wang
+ PDF Chat	Multi-Scale Temporal Cues Learning for Video Person Re-Identification	2020	Jianing Li Shiliang Zhang Tiejun Huang
+ PDF Chat	Global-Local Temporal Representations for Video Person Re-Identification	2019	Jianing Li Shiliang Zhang Jingdong Wang Wen Gao Qi Tian
+	Jointly Attentive Spatial-Temporal Pooling Networks for Video-based Person Re-Identification	2017	Shuangjie Xu Yu Cheng Kang Gu Yang Yang Shiyu Chang Pan Zhou
+ PDF Chat	BiCnet-TKS: Learning Efficient Spatial-Temporal Representation for Video Person Re-Identification	2021	Ruibing Hou Hong Chang Bingpeng Ma Rui Huang Shiguang Shan
+	Convolutional Temporal Attention Model for Video-based Person Re-identification	2019	Tanzila Rahman Mrigank Rochan Yang Wang
+ PDF Chat	Jointly Attentive Spatial-Temporal Pooling Networks for Video-Based Person Re-identification	2017	Shuangjie Xu Yu Cheng Kang Gu Yang Yang Shiyu Chang Pan Zhou
+	Video-based Person Re-identification Using Spatial-Temporal Attention Networks	2018	Shivansh Rao Tanzila Rahman Mrigank Rochan Yang Wang
+	Spatial and Temporal Mutual Promotion for Video-based Person Re-identification	2018	Yiheng Liu Zhenxun Yuan Wengang Zhou Houqiang Li
+ PDF Chat	Spatial and Temporal Mutual Promotion for Video-Based Person Re-Identification	2019	Yiheng Liu Zhenxun Yuan Wengang Zhou Houqiang Li
+ PDF Chat	Multi-Stage Spatio-Temporal Aggregation Transformer for Video Person Re-Identification	2022	Ziyi Tang Ruimao Zhang Zhanglin Peng Jinrui Chen Liang Lin
+	Multi-Stage Spatio-Temporal Aggregation Transformer for Video Person Re-identification	2023	Ziyi Tang Ruimao Zhang Zhanglin Peng Jinrui Chen Liang Lin
+	Three-Stream Convolutional Networks for Video-based Person Re-Identification	2017	Yu Zeng Tianrui Li Ning Yu Xun Gong Ke Chen Yi Pan
+ PDF Chat	Video Person Re-Identification by Temporal Residual Learning	2018	Ju Dai Pingping Zhang Dong Wang Huchuan Lu Hongyu Wang
+ PDF Chat	SCAN: Self-and-Collaborative Attention Network for Video Person Re-Identification	2019	Ruimao Zhang Jingyu Li Hongbin Sun Yuying Ge Ping Luo Xiaogang Wang Liang Lin
+	Co-Saliency Spatio-Temporal Interaction Network for Person Re-Identification in Videos	2020	Jiawei Liu Zheng-Jun Zha Xierong Zhu Na Jiang
+	Spatially and Temporally Efficient Non-local Attention Network for Video-based Person Re-Identification	2019	Chih‐Ting Liu Chih-Wei Wu Yu-Chiang Frank Wang Shao‐Yi Chien
+	Spatially and Temporally Efficient Non-local Attention Network for Video-based Person Re-Identification	2019	Chih‐Ting Liu Chih-Wei Wu Yu-Chiang Frank Wang Shao‐Yi Chien
+ PDF Chat	Three-Stream Convolutional Networks for Video-based Person Re-Identification	2019	Yu Zeng Tianrui Li Ning Yu Xun Gong

Works That Cite This (5)

Action	Title	Year	Authors
+ PDF Chat	Multidirection and Multiscale Pyramid in Transformer for Video-Based Pedestrian Retrieval	2022	Xianghao Zang Ge Li Wei Gao
+ PDF Chat	CoNAN: Conditional Neural Aggregation Network For Unconstrained Face Feature Fusion	2023	Bhavin Jawade Deen Dayal Mohan Dennis Fedorishin Srirangaraj Setlur Venu Govindaraju
+ PDF Chat	TIPCB: A simple but effective part-based convolutional baseline for text-based person search	2022	Yuhao Chen Guoqing Zhang Yujiang Lu Zhenxing Wang Yuhui Zheng
+ PDF Chat	Spatial-Temporal Correlation and Topology Learning for Person Re-Identification in Videos	2021	Jiawei Liu Zheng-Jun Zha Wei Wu Kecheng Zheng Qibin Sun
+	Spatial-Temporal Correlation and Topology Learning for Person Re-Identification in Videos	2021	Jiawei Liu Zheng-Jun Zha Wei Wu Kecheng Zheng Qibin Sun

Works Cited by This (20)

Action	Title	Year	Authors
+	Two-Stream Convolutional Networks for Action Recognition in Videos	2014	Karen Simonyan Andrew Zisserman
+ PDF Chat	Performance Measures and a Data Set for Multi-target, Multi-camera Tracking	2016	Ergys Ristani Francesco Solera Roger S. Zou Rita Cucchiara Carlo Tomasi
+	Person Re-identification: Past, Present and Future	2016	Liang Zheng Yi Yang Alexander G. Hauptmann
+ PDF Chat	Person re-identification by unsupervised video matching	2016	Xiaolong Ma Xiatian Zhu Shaogang Gong Xudong Xie Jianming Hu Kin‐Man Lam Yisheng Zhong
+ PDF Chat	Video-Based Person Re-Identification With Accumulative Motion Context	2017	Hao Liu Zequn Jie Karlekar Jayashree Meibin Qi Jianguo Jiang Shuicheng Yan Jiashi Feng
+	In Defense of the Triplet Loss for Person Re-Identification	2017	Alexander Hermans Lucas Beyer Bastian Leibe
+	Person re-identification with fusion of hand-crafted and deep pose-based body region features	2018	Jubin Johnson Shunsuke Yasugi Yoichi Sugino Sugiri Pranata Shengmei Shen
+	Revisiting Temporal Modeling for Video-based Person ReID	2018	Jiyang Gao Ram Nevatia
+ PDF Chat	Video-Based Person Re-identification via 3D Convolutional Networks and Non-local Attention	2019	Xingyu Liao Lingxiao He Zhouwang Yang Chi Zhang
+	Deep Sequential Multi-camera Feature Fusion for Person Re-identification.	2018	K L Navaneet Ravi Kiran Sarvadevabhatla R. Venkatesh Babu Anirban Chakraborty