A discriminative CNN video representation for event detection

Type: Preprint

Publication Date: 2015-06-01

Citations: 438

DOI: https://doi.org/10.1109/cvpr.2015.7298789

Abstract

In this paper, we propose a discriminative video representation for event detection over a large scale video dataset when only limited hardware resources are available. The focus of this paper is to effectively leverage deep Convolutional Neural Networks (CNNs) to advance event detection, where only frame level static descriptors can be extracted by the existing CNN toolkits. This paper makes two contributions to the inference of CNN video representation. First, while average pooling and max pooling have long been the standard approaches to aggregating frame level static features, we show that performance can be significantly improved by taking advantage of an appropriate encoding method. Second, we propose using a set of latent concept descriptors as the frame descriptor, which enriches visual information while keeping it computationally affordable. The integration of the two contributions results in a new state-of-the-art performance in event detection over the largest video datasets. Compared to improved Dense Trajectories, which has been recognized as the best video representation for event detection, our new representation improves the Mean Average Precision (mAP) from 27.6% to 36.8% for the TRECVID MEDTest 14 dataset and from 34.0% to 44.6% for the TRECVID MEDTest 13 dataset.
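
The contrast the abstract draws between pooling and encoding can be illustrated with a minimal sketch. The abstract does not name the encoding method, so the VLAD-style residual encoding below is only one illustrative choice. The function names, the toy frame descriptors, and the two fixed centers are hypothetical; in practice, centers would typically be learned from training descriptors, for example with k-means.

```python
from math import sqrt

def average_pool(frames):
    """Average pooling: mean of each dimension across all frame descriptors."""
    n = len(frames)
    return [sum(col) / n for col in zip(*frames)]

def max_pool(frames):
    """Max pooling: maximum of each dimension across all frame descriptors."""
    return [max(col) for col in zip(*frames)]

def vlad_encode(frames, centers):
    """VLAD-style encoding (illustrative assumption, not confirmed by the abstract):
    accumulate each descriptor's residual to its nearest center,
    concatenate the per-center sums, then L2-normalize."""
    dim = len(centers[0])
    residuals = [[0.0] * dim for _ in centers]
    for f in frames:
        # Index of the nearest center by squared Euclidean distance.
        k = min(
            range(len(centers)),
            key=lambda i: sum((a - b) ** 2 for a, b in zip(f, centers[i])),
        )
        for d in range(dim):
            residuals[k][d] += f[d] - centers[k][d]
    flat = [v for r in residuals for v in r]
    norm = sqrt(sum(v * v for v in flat))
    return [v / norm for v in flat] if norm > 0 else flat

# Toy data: three 2-D "frame descriptors" and two hypothetical centers.
frames = [[0.0, 1.0], [2.0, 2.0], [10.0, 9.0]]
centers = [[1.0, 1.0], [10.0, 10.0]]

video_avg = average_pool(frames)            # one vector of length 2
video_max = max_pool(frames)                # one vector of length 2
video_vlad = vlad_encode(frames, centers)   # one vector of length 2 * 2
```

Both pooling baselines collapse the video into a vector of the same size as a single frame descriptor, whereas the residual encoding yields one sub-vector per center and so retains more information about how the frame descriptors are distributed.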

Locations

  • arXiv (Cornell University)

Similar Works

Title Year Authors
+ A Discriminative CNN Video Representation for Event Detection 2014 Zhongwen Xu
Yi Yang
Alexander G. Hauptmann
+ Exploiting Image-trained CNN Architectures for Unconstrained Video Classification 2015 Shengxin Zha
Florian Luisier
W.D. Andrews
Nitish Srivastava
Ruslan Salakhutdinov
+ EventNet Version 1.1 Technical Report 2016 Dongang Wang
Zheng Shou
Hongyi Liu
Shih‐Fu Chang
+ IOD-CNN: Integrating Object Detection Networks for Event Recognition 2017 Sungmin Eum
Hyungtae Lee
Heesung Kwon
David Doermann
+ Learning Temporal Alignment Uncertainty for Efficient Event Detection 2015 Iman Abbasnejad
Sridha Sridharan
Simon Denman
Clinton Fookes
Simon Lucey
+ Joint Detection and Recounting of Abnormal Events by Learning Deep Generic Knowledge 2017 Ryota Hinami
Tao Mei
Shin’ichi Satoh
+ Detection Bank: An Object Detection Based Video Representation for Multimedia Event Recognition 2014 Tim Althoff
Hyun Oh Song
Trevor Darrell
+ TagBook: A Semantic Video Representation without Supervision for Event Detection 2015 Masoud Mazloom
Xirong Li
Cees G. M. Snoek
+ TagBook: A Semantic Video Representation Without Supervision for Event Detection 2016 Masoud Mazloom
Xirong Li
Cees G. M. Snoek
+ Bag of Attributes for Video Event Retrieval 2018 Leonardo A. Duarte
Otávio A. B. Penatti
Jurandy Almeida
+ Local Compressed Video Stream Learning for Generic Event Boundary Detection 2023 Libo Zhang
Xin Gu
Congcong Li
Tiejian Luo
Heng Fan
+ EventNet: A Large Scale Structured Concept Library for Complex Event Detection in Video 2015 Guangnan Ye
Yitong Li
Hongliang Xu
Dong Liu
Shih‐Fu Chang
+ Structured Context Transformer for Generic Event Boundary Detection 2022 Congcong Li
Xinyao Wang
Dexiang Hong
Yufei Wang
Libo Zhang
Tiejian Luo
Longyin Wen
+ Joint Event Detection and Description in Continuous Video Streams 2019 Huijuan Xu
Boyang Li
Vasili Ramanishka
Leonid Sigal
Kate Saenko