+
PDF
Chat
|
NTU RGB+D: A Large Scale Dataset for 3D Human Activity Analysis
|
2016
|
Amir Shahroudy
Jun Liu
Tian-Tsong Ng
Gang Wang
|
5
|
+
PDF
Chat
|
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
|
2017
|
João Carreira
Andrew Zisserman
|
4
|
+
PDF
Chat
|
Spatial Temporal Graph Convolutional Networks for Skeleton-Based Action Recognition
|
2018
|
Sijie Yan
Yuanjun Xiong
Dahua Lin
|
4
|
+
|
Distilling the Knowledge in a Neural Network
|
2015
|
Geoffrey E. Hinton
Oriol Vinyals
Jay B. Dean
|
3
|
+
PDF
Chat
|
Self-Supervised Video Representation Learning with Odd-One-Out Networks
|
2017
|
Basura Fernando
Hakan Bilen
Efstratios Gavves
Stephen Jay Gould
|
3
|
+
PDF
Chat
|
Emotion Recognition in Speech using Cross-Modal Transfer in the Wild
|
2018
|
Samuel Albanie
Arsha Nagrani
Andrea Vedaldi
Andrew Zisserman
|
3
|
+
PDF
Chat
|
Spatio-Temporal LSTM with Trust Gates for 3D Human Action Recognition
|
2016
|
Jun Liu
Amir Shahroudy
Dong Xu
Gang Wang
|
3
|
+
PDF
Chat
|
Interpretable 3D Human Action Analysis with Temporal Convolutional Networks
|
2017
|
Tae Soo Kim
Austin Reiter
|
3
|
+
PDF
Chat
|
SpeedNet: Learning the Speediness in Videos
|
2020
|
Sagie Benaim
Ariel Ephrat
Oran Lang
Inbar Mosseri
William T. Freeman
Michael Rubinstein
Michal Irani
Tali Dekel
|
3
|
+
PDF
Chat
|
Cross Modal Distillation for Supervision Transfer
|
2016
|
Saurabh Gupta
Judy Hoffman
Jitendra Malik
|
3
|
+
PDF
Chat
|
Temporal Segment Networks: Towards Good Practices for Deep Action Recognition
|
2016
|
Limin Wang
Yuanjun Xiong
Zhe Wang
Yu Qiao
Dahua Lin
Xiaoou Tang
Luc Van Gool
|
3
|
+
|
Co-occurrence Feature Learning from Skeleton Data for Action Recognition and Detection with Hierarchical Aggregation
|
2018
|
Chao Li
Qiaoyong Zhong
Di Xie
Shiliang Pu
|
3
|
+
PDF
Chat
|
SlowFast Networks for Video Recognition
|
2019
|
Christoph Feichtenhofer
Haoqi Fan
Jitendra Malik
Kaiming He
|
3
|
+
PDF
Chat
|
Modality Distillation with Multiple Stream Networks for Action Recognition
|
2018
|
Nuno Garcia
Pietro Morerio
Vittorio Murino
|
3
|
+
|
Two-Stream Convolutional Networks for Action Recognition in Videos
|
2014
|
Karen Simonyan
Andrew Zisserman
|
3
|
+
PDF
Chat
|
A Closer Look at Spatiotemporal Convolutions for Action Recognition
|
2018
|
Du Tran
Heng Wang
Lorenzo Torresani
Jamie Ray
Yann LeCun
Manohar Paluri
|
3
|
+
PDF
Chat
|
View Adaptive Recurrent Neural Networks for High Performance Human Action Recognition from Skeleton Data
|
2017
|
Pengfei Zhang
Cuiling Lan
Junliang Xing
Wenjun Zeng
Jianru Xue
Nanning Zheng
|
3
|
+
PDF
Chat
|
Graph Distillation for Action Detection with Privileged Modalities
|
2018
|
Zelun Luo
Jun-Ting Hsieh
Lu Jiang
Juan Carlos Niebles
Li Fei-Fei
|
2
|
+
PDF
Chat
|
Spatio-temporal Channel Correlation Networks for Action Classification
|
2018
|
Ali Diba
Mohsen Fayyaz
Vivek Sharma
Mohammad Mahdi Arzani
Rahman Yousefzadeh
Jüergen Gall
Luc Van Gool
|
2
|
+
PDF
Chat
|
Shuffle and Learn: Unsupervised Learning Using Temporal Order Verification
|
2016
|
Ishan Misra
C. Lawrence Zitnick
Martial Hebert
|
2
|
+
PDF
Chat
|
ASCNet: Self-supervised Video Representation Learning with Appearance-Speed Consistency
|
2021
|
Deng Huang
Wenhao Wu
Wei-Wen Hu
Xu Liu
Dongliang He
Zhihua Wu
Xiangmiao Wu
Mingkui Tan
Errui Ding
|
2
|
+
PDF
Chat
|
Removing the Background by Adding the Background: Towards Background Robust Self-supervised Video Representation Learning
|
2021
|
Jinpeng Wang
Yuting Gao
Ke Li
Yiqi Lin
J. Andy
Hao Cheng
Pai Peng
Feiyue Huang
Rongrong Ji
Xing Sun
|
2
|
+
|
A Survey of Model Compression and Acceleration for Deep Neural Networks
|
2017
|
Yu Cheng
Duo Wang
Pan Zhou
Tao Zhang
|
2
|
+
PDF
Chat
|
Self-Supervised Video Representation Learning with Meta-Contrastive Network
|
2021
|
Yuanze Lin
Xun Guo
Yan Lu
|
2
|
+
|
Self-Supervised Learning for Videos: A Survey
|
2022
|
Madeline C. Schiappa
Yogesh Singh Rawat
Mubarak Shah
|
2
|
+
PDF
Chat
|
Self-Supervised Visual Learning by Variable Playback Speeds Prediction of a Video
|
2021
|
Hyeon Cho
Taehoon Kim
Hyung Jin Chang
Wonjun Hwang
|
2
|
+
PDF
Chat
|
Motion-Augmented Self-Training for Video Recognition at Smaller Scale
|
2021
|
Kirill Gavrilyuk
Mihir Jain
Ilia Karmanov
Cees G. M. Snoek
|
2
|
+
PDF
Chat
|
A Large-Scale Study on Unsupervised Spatiotemporal Representation Learning
|
2021
|
Christoph Feichtenhofer
Haoqi Fan
Bo Xiong
Ross Girshick
Kaiming He
|
2
|
+
PDF
Chat
|
Unsupervised Visual Representation Learning by Tracking Patches in Video
|
2021
|
Guangting Wang
Yizhou Zhou
Chong Luo
Wenxuan Xie
Wenjun Zeng
Zhiwei Xiong
|
2
|
+
PDF
Chat
|
Video Representation Learning by Recognizing Temporal Transformations
|
2020
|
Simon Jenni
Givi Meishvili
Paolo Favaro
|
2
|
+
PDF
Chat
|
Spatiotemporal Contrastive Video Representation Learning
|
2021
|
Rui Qian
Tianjian Meng
Boqing Gong
Ming–Hsuan Yang
Huisheng Wang
Serge Belongie
Yin Cui
|
2
|
+
PDF
Chat
|
Audio-Visual Instance Discrimination with Cross-Modal Agreement
|
2021
|
Pedro Morgado
Nuno Vasconcelos
Ishan Misra
|
2
|
+
|
Context-Aware and Scale-Insensitive Temporal Repetition Counting
|
2020
|
Huaidong Zhang
Xuemiao Xu
Guoqiang Han
Shengfeng He
|
2
|
+
PDF
Chat
|
NTU RGB+D 120: A Large-Scale Benchmark for 3D Human Activity Understanding
|
2019
|
Jun Liu
Amir Shahroudy
Mauricio Pérez
Gang Wang
Ling‐Yu Duan
Alex C. Kot
|
2
|
+
PDF
Chat
|
Convolutional Two-Stream Network Fusion for Video Action Recognition
|
2016
|
Christoph Feichtenhofer
Axel Pinz
Andrew Zisserman
|
2
|
+
PDF
Chat
|
Video Playback Rate Perception for Self-Supervised Spatio-Temporal Representation Learning
|
2020
|
Yuan Yao
Chang Liu
Dezhao Luo
Yu Zhou
Qixiang Ye
|
2
|
+
PDF
Chat
|
FineGym: A Hierarchical Video Dataset for Fine-Grained Action Understanding
|
2020
|
Dian Shao
Yue Zhao
Bo Dai
Dahua Lin
|
2
|
+
PDF
Chat
|
Two-Stream Adaptive Graph Convolutional Networks for Skeleton-Based Action Recognition
|
2019
|
Lei Shi
Yifan Zhang
Jian Cheng
Hanqing Lu
|
2
|
+
|
An End-to-End Spatio-Temporal Attention Model for Human Action Recognition from Skeleton Data
|
2016
|
Sijie Song
Cuiling Lan
Junliang Xing
Wenjun Zeng
Jiaying Liu
|
2
|
+
PDF
Chat
|
Self-supervised Video Representation Learning by Pace Prediction
|
2020
|
Jiangliu Wang
Jianbo Jiao
Yunhui Liu
|
2
|
+
PDF
Chat
|
Video Cloze Procedure for Self-Supervised Spatio-Temporal Learning
|
2020
|
Dezhao Luo
Chang Liu
Yu Zhou
Dongbao Yang
Can Ma
Qixiang Ye
Weiping Wang
|
2
|
+
|
TCLR: Temporal contrastive learning for video representation
|
2022
|
Ishan R. Dave
Rohit Gupta
Mamshad Nayeem Rizve
Mubarak Shah
|
2
|
+
|
UCF101: A Dataset of 101 Human Actions Classes From Videos in The Wild
|
2012
|
Khurram Soomro
Amir Zamir
Mubarak Shah
|
2
|
+
|
Hollywood in Homes: Crowdsourcing Data Collection for Activity Understanding
|
2016
|
Gunnar A. Sigurdsson
Gül Varol
Xiaolong Wang
Ali Farhadi
Ivan Laptev
Abhinav Gupta
|
2
|
+
PDF
Chat
|
A New Representation of Skeleton Sequences for 3D Action Recognition
|
2017
|
Qiuhong Ke
Mohammed Bennamoun
Senjian An
Ferdous Sohel
Farid Boussaïd
|
2
|
+
PDF
Chat
|
The “Something Something” Video Database for Learning and Evaluating Visual Common Sense
|
2017
|
Raghav Goyal
Samira Ebrahimi Kahou
Vincent Michalski
Joanna Materzyńska
Susanne Westphal
Heuna Kim
Valentin Haenel
Ingo Fruend
P.N. Yianilos
Moritz Mueller-Freitag
|
2
|
+
PDF
Chat
|
VideoMoCo: Contrastive Video Representation Learning with Temporally Adversarial Examples
|
2021
|
Pan Tian
Yibing Song
Tianyu Yang
Wenhao Jiang
Wei Liu
|
2
|
+
PDF
Chat
|
Rethinking Spatiotemporal Feature Learning: Speed-Accuracy Trade-offs in Video Classification
|
2018
|
Saining Xie
Chen Sun
Jonathan Huang
Zhuowen Tu
Kevin Murphy
|
1
|
+
PDF
Chat
|
T-CNN: Tubelets With Convolutional Neural Networks for Object Detection From Videos
|
2017
|
Kai Kang
Hongsheng Li
Junjie Yan
Xingyu Zeng
Bin Yang
Tong Xiao
Cong Zhang
Zhe Wang
Ruohui Wang
Xiaogang Wang
|
1
|
+
|
Unsupervised Representation Learning by Predicting Image Rotations
|
2018
|
Spyros Gidaris
Praveer Singh
Nikos Komodakis
|
1
|