Learning Salient Boundary Feature for Anchor-free Temporal Action Localization

Type: Article

Publication Date: 2021-06-01

Citations: 202

DOI: https://doi.org/10.1109/cvpr46437.2021.00333

Abstract

Temporal action localization is an important yet challenging task in video understanding. Typically, such a task aims at inferring both the action category and localization of the start and end frame for each action instance in a long, untrimmed video. While most current models achieve good results by using pre-defined anchors and numerous actionness, such methods could be bothered with both large number of outputs and heavy tuning of locations and sizes corresponding to different anchors. Instead, anchor-free methods is lighter, getting rid of redundant hyper-parameters, but gains few attention. In this paper, we propose the first purely anchor-free temporal localization method, which is both efficient and effective. Our model includes (i) an end-to-end trainable basic predictor, (ii) a saliency-based refinement module to gather more valuable boundary features for each proposal with a novel boundary pooling, and (iii) several consistency constraints to make sure our model can find the accurate boundary given arbitrary proposals. Extensive experiments show that our method beats all anchor-based and actionness-guided methods with a remarkable margin on THUMOS14, achieving state-of-the-art results, and comparable ones on ActivityNet v1.3. Code is available at https://github.com/TencentYoutuResearch/ActionDetection-AFSD.

Locations

  • arXiv (Cornell University) - View - PDF
  • 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) - View

Similar Works

Action Title Year Authors
+ Learning Salient Boundary Feature for Anchor-free Temporal Action Localization 2021 Chuming Lin
Chengming Xu
Donghao Luo
Yabiao Wang
Ying Tai
Chengjie Wang
Jilin Li
Feiyue Huang
Yanwei Fu
+ PDF Chat Boundary-Aware Proposal Generation Method for Temporal Action Localization 2023 Hao Zhang
Feng Chun-yan
Jiahui Yang
Caili Guo
Zheng Li
Xin Wang
+ Faster Learning of Temporal Action Proposal via Sparse Multilevel Boundary Generator 2023 Qing Song
Yang Zhou
Mengjie Hu
Chun Liu
+ PDF Chat Precise Temporal Action Localization by Evolving Temporal Proposals 2018 Haonan Qiu
Yingbin Zheng
Hao Ye
Yao Lu
Feng Wang
Liang He
+ PDF Chat TriDet: Temporal Action Detection with Relative Boundary Modeling 2023 Dingfeng Shi
Yujie Zhong
Qiong Cao
Lin Ma
Jia Lit
Dacheng Tao
+ Boundary-Aware Proposal Generation Method for Temporal Action Localization 2023 Hao Zhang
Feng Chun-yan
Jiahui Yang
Zheng Li
Caili Guo
+ TriDet: Temporal Action Detection with Relative Boundary Modeling 2023 Dingfeng Shi
Yujie Zhong
Qiong Cao
Lin Ma
Jia Li
Dacheng Tao
+ HTNet: Anchor-free Temporal Action Localization with Hierarchical Transformers 2022 Tae-Kyung Kang
Gunhee Lee
Seong‐Whan Lee
+ PcmNet: Position-Sensitive Context Modeling Network for Temporal Action Localization 2021 Qin Xin
Hanbin Zhao
Guangchen Lin
Hao Zeng
Songcen Xu
Xi Li
+ PDF Chat HTNet: Anchor-free Temporal Action Localization with Hierarchical Transformers 2022 Tae-Kyung Kang
Gunhee Lee
Seong‐Whan Lee
+ PDF Chat RCL: Recurrent Continuous Localization for Temporal Action Detection 2022 Qiang Wang
Yanhao Zhang
Yun Zheng
Pan Pan
+ SF-Net: Single-Frame Supervision for Temporal Action Localization 2020 Fan Ma
Linchao Zhu
Yi Yang
Shengxin Zha
Gourab Kundu
Matt Feiszli
Zheng Shou
+ Adaptive Perception Transformer for Temporal Action Localization 2022 Yizheng Ouyang
Tianjin Zhang
Weibo Gu
Hongfa Wang
Liming Wang
Xiaojie Guo
+ Towards High-Quality Temporal Action Detection with Sparse Proposals 2021 Jiannan Wu
Peize Sun
Shoufa Chen
Jiewen Yang
Zihao Qi
Lan Ma
Ping Luo
+ PDF Chat RCL: Recurrent Continuous Localization for Temporal Action Detection 2022 Qiang Wang
Yanhao Zhang
Yun Zheng
Pan Pan
+ RCL: Recurrent Continuous Localization for Temporal Action Detection 2022 Qiang Wang
Yanhao Zhang
Yun Zheng
Pan Pan
+ Prior-enhanced Temporal Action Localization using Subject-aware Spatial Attention 2022 Yifan Liu
Youbao Tang
Ning Zhang
Ruei-Sung Lin
Haoqian Wang
+ Prior-Enhanced Temporal Action Localization Using Subject-Aware Spatial Attention 2023 Yifan Liu
Youbao Tang
Ning Zhang
Ruei-Sung Lin
Haoqian Wang
+ PDF Chat BSN++: Complementary Boundary Regressor with Scale-Balanced Relation Modeling for Temporal Action Proposal Generation 2021 Haisheng Su
Weihao Gan
Wei Wu
Yu Qiao
Junjie Yan
+ PDF Chat End-to-End Temporal Action Detection With Transformer 2022 Xiaolong Liu
Qimeng Wang
Yao Hu
Xu Tang
Shiwei Zhang
Song Bai
Xiang Bai

Works Cited by This (38)

Action Title Year Authors
+ PDF Chat Learning Spatiotemporal Features with 3D Convolutional Networks 2015 Du Tran
Lubomir Bourdev
Rob Fergus
Lorenzo Torresani
Manohar Paluri
+ Two-Stream Convolutional Networks for Action Recognition in Videos 2014 Karen Simonyan
Andrew Zisserman
+ PDF Chat CDC: Convolutional-De-Convolutional Networks for Precise Temporal Action Localization in Untrimmed Videos 2017 Zheng Shou
Jonathan Chan
Alireza Zareian
Kazuyuki Miyazawa
Shih‐Fu Chang
+ PDF Chat TURN TAP: Temporal Unit Regression Network for Temporal Action Proposals 2017 Jiyang Gao
Zhenheng Yang
Chen Sun
Kan Chen
Ram Nevatia
+ PDF Chat Diagnosing Error in Temporal Action Detectors 2018 Humam Alwassel
Fabian Caba Heilbron
VĂ­ctor Escorcia
Bernard Ghanem
+ PDF Chat Gaussian Temporal Awareness Networks for Action Localization 2019 Fuchen Long
Ting Yao
Zhaofan Qiu
Xinmei Tian
Jiebo Luo
Tao Mei
+ Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks 2015 Shaoqing Ren
Kaiming He
Ross Girshick
Jian Sun
+ PDF Chat BSN: Boundary Sensitive Network for Temporal Action Proposal Generation 2018 Tianwei Lin
Xu Zhao
Haisheng Su
Chongjing Wang
Ming Yang
+ PDF Chat Rethinking the Faster R-CNN Architecture for Temporal Action Localization 2018 Yu-Wei Chao
Sudheendra Vijayanarasimhan
Bryan Seybold
David A. Ross
Jia Deng
Rahul Sukthankar
+ PDF Chat You Only Look Once: Unified, Real-Time Object Detection 2016 Joseph Redmon
Santosh Divvala
Ross Girshick
Ali Farhadi