+
PDF
Chat
|
Deep Residual Learning for Image Recognition
|
2016
|
Kaiming He
Xiangyu Zhang
Shaoqing Ren
Jian Sun
|
14
|
+
PDF
Chat
|
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
|
2017
|
JoĂŁo Carreira
Andrew Zisserman
|
12
|
+
|
The Kinetics Human Action Video Dataset
|
2017
|
Andrew Zisserman
JoĂŁo Carreira
Karen Simonyan
Will Kay
Brian Zhang
Chloe Hillier
Sudheendra Vijayanarasimhan
Fabio Viola
T.C. Green
Trevor Back
|
11
|
+
PDF
Chat
|
Learning Spatiotemporal Features with 3D Convolutional Networks
|
2015
|
Du Tran
Lubomir Bourdev
Rob Fergus
Lorenzo Torresani
Manohar Paluri
|
11
|
+
PDF
Chat
|
Temporal Relational Reasoning in Videos
|
2018
|
Bolei Zhou
Alex Andonian
Aude Oliva
Antonio Torralba
|
10
|
+
PDF
Chat
|
Non-local Neural Networks
|
2018
|
Xiaolong Wang
Ross Girshick
Abhinav Gupta
Kaiming He
|
10
|
+
|
Two-Stream Convolutional Networks for Action Recognition in Videos
|
2014
|
Karen Simonyan
Andrew Zisserman
|
10
|
+
PDF
Chat
|
TSM: Temporal Shift Module for Efficient Video Understanding
|
2019
|
Ji Lin
Chuang Gan
Song Han
|
10
|
+
PDF
Chat
|
The âSomething Somethingâ Video Database for Learning and Evaluating Visual Common Sense
|
2017
|
Raghav Goyal
Samira Ebrahimi Kahou
Vincent Michalski
Joanna MaterzyĆska
Susanne Westphal
Heuna Kim
Valentin Haenel
Ingo Fruend
P.N. Yianilos
Moritz Mueller-Freitag
|
9
|
+
PDF
Chat
|
Going deeper with convolutions
|
2015
|
Christian Szegedy
Wei Liu
Yangqing Jia
Pierre Sermanet
Scott Reed
Dragomir Anguelov
Dumitru Erhan
Vincent Vanhoucke
Andrew Rabinovich
|
8
|
+
PDF
Chat
|
Temporal Segment Networks: Towards Good Practices for Deep Action Recognition
|
2016
|
Limin Wang
Yuanjun Xiong
Zhe Wang
Yu Qiao
Dahua Lin
Xiaoou Tang
Luc Van Gool
|
8
|
+
PDF
Chat
|
Long-term recurrent convolutional networks for visual recognition and description
|
2015
|
Jeff Donahue
Lisa Anne Hendricks
Sergio Guadarrama
Marcus Rohrbach
Subhashini Venugopalan
Trevor Darrell
Kate Saenko
|
7
|
+
PDF
Chat
|
Learning Deep Features for Discriminative Localization
|
2016
|
Bolei Zhou
Aditya Khosla
Ăgata Lapedriza
Aude Oliva
Antonio Torralba
|
7
|
+
|
YouTube-8M: A Large-Scale Video Classification Benchmark
|
2016
|
Sami Abu-El-Haija
Nisarg Kothari
Joonseok Lee
Apostol Natsev
George Toderici
Balakrishnan Varadarajan
Sudheendra Vijayanarasimhan
|
6
|
+
PDF
Chat
|
Moments in Time Dataset: One Million Videos for Event Understanding
|
2019
|
Mathew Monfort
Carl Vondrick
Aude Oliva
Alex Andonian
Bolei Zhou
Kandan Ramakrishnan
Sarah Adel Bargal
Tom Yan
Lisa M. Brown
Quanfu Fan
|
6
|
+
PDF
Chat
|
Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation
|
2014
|
Ross Girshick
Jeff Donahue
Trevor Darrell
Jitendra Malik
|
5
|
+
|
UCF101: A Dataset of 101 Human Actions Classes From Videos in The Wild
|
2012
|
Khurram Soomro
Amir Zamir
Mubarak Shah
|
5
|
+
PDF
Chat
|
Squeeze-and-Excitation Networks
|
2019
|
Jie Hu
Li Shen
Samuel Albanie
Gang Sun
Enhua Wu
|
5
|
+
PDF
Chat
|
Rethinking the Inception Architecture for Computer Vision
|
2016
|
Christian Szegedy
Vincent Vanhoucke
Sergey Ioffe
Jon Shlens
Zbigniew Wojna
|
5
|
+
PDF
Chat
|
Aggregated Residual Transformations for Deep Neural Networks
|
2017
|
Saining Xie
Ross Girshick
Piotr DollĂĄr
Zhuowen Tu
Kaiming He
|
5
|
+
|
Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift
|
2015
|
Sergey Ioffe
Christian Szegedy
|
4
|
+
|
EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks
|
2019
|
Mingxing Tan
Quoc V. Le
|
4
|
+
|
SlowFast Networks for Video Recognition
|
2018
|
Christoph Feichtenhofer
Haoqi Fan
Jitendra Malik
Kaiming He
|
4
|
+
PDF
Chat
|
Beyond short snippets: Deep networks for video classification
|
2015
|
Joe Yue-Hei Ng
Matthew Hausknecht
Sudheendra Vijayanarasimhan
Oriol Vinyals
Rajat Monga
George Toderici
|
4
|
+
|
Obfuscated Gradients Give a False Sense of Security: Circumventing Defenses to Adversarial Examples
|
2018
|
Anish Athalye
Nicholas Carlini
David Wagner
|
4
|
+
PDF
Chat
|
Rethinking Spatiotemporal Feature Learning: Speed-Accuracy Trade-offs in Video Classification
|
2018
|
Saining Xie
Chen Sun
Jonathan Huang
Zhuowen Tu
Kevin Murphy
|
4
|
+
PDF
Chat
|
X3D: Expanding Architectures for Efficient Video Recognition
|
2020
|
Christoph Feichtenhofer
|
4
|
+
PDF
Chat
|
Video Classification With Channel-Separated Convolutional Networks
|
2019
|
Du Tran
Heng Wang
Matt Feiszli
Lorenzo Torresani
|
4
|
+
PDF
Chat
|
Temporal Pyramid Network for Action Recognition
|
2020
|
Ceyuan Yang
Yinghao Xu
Jianping Shi
Bo Dai
Bolei Zhou
|
4
|
+
PDF
Chat
|
Optical Flow Guided Feature: A Fast and Robust Motion Representation for Video Action Recognition
|
2018
|
Shuyang Sun
Zhanghui Kuang
Lu Sheng
Wanli Ouyang
Wei Zhang
|
3
|
+
PDF
Chat
|
Collaborative Spatiotemporal Feature Learning for Video Action Recognition
|
2019
|
Chao Li
Qiaoyong Zhong
Di Xie
Shiliang Pu
|
3
|
+
|
Very Deep Convolutional Networks for Large-Scale Image Recognition
|
2014
|
Karen Simonyan
Andrew Zisserman
|
3
|
+
PDF
Chat
|
Timeception for Complex Action Recognition
|
2019
|
Noureldien Hussein
Efstratios Gavves
A.W.M. Smeulders
|
3
|
+
PDF
Chat
|
STM: SpatioTemporal and Motion Encoding for Action Recognition
|
2019
|
Boyuan Jiang
Mengmeng Wang
Weihao Gan
Wei Wu
Junjie Yan
|
3
|
+
PDF
Chat
|
Densely Connected Convolutional Networks
|
2017
|
Gao Huang
Zhuang Liu
Laurens van der Maaten
Kilian Q. Weinberger
|
3
|
+
|
mixup: Beyond Empirical Risk Minimization
|
2017
|
Hongyi Zhang
Moustapha Cissé
Yann Dauphin
David LĂłpez-Paz
|
3
|
+
PDF
Chat
|
YOLO9000: Better, Faster, Stronger
|
2017
|
Joseph Redmon
Ali Farhadi
|
3
|
+
PDF
Chat
|
Learning Spatio-Temporal Features with 3D Residual Networks for Action Recognition
|
2017
|
Kensho Hara
Hirokatsu Kataoka
Yutaka Satoh
|
3
|
+
PDF
Chat
|
Attention Augmented Convolutional Networks
|
2019
|
Irwan Bello
Barret Zoph
Quoc V. Le
Ashish Vaswani
Jonathon Shlens
|
3
|
+
PDF
Chat
|
Can Spatiotemporal 3D CNNs Retrace the History of 2D CNNs and ImageNet?
|
2018
|
Kensho Hara
Hirokatsu Kataoka
Yutaka Satoh
|
3
|
+
PDF
Chat
|
A Closer Look at Spatiotemporal Convolutions for Action Recognition
|
2018
|
Du Tran
Heng Wang
Lorenzo Torresani
Jamie Ray
Yann LeCun
Manohar Paluri
|
3
|
+
PDF
Chat
|
Motion Feature Network: Fixed Motion Filter for Action Recognition
|
2018
|
Myunggi Lee
Seungeui Lee
Sungjoon Son
Gyutae Park
Nojun Kwak
|
3
|
+
PDF
Chat
|
VQA: Visual Question Answering
|
2015
|
Stanislaw Antol
Aishwarya Agrawal
Jiasen Lu
Margaret Mitchell
Dhruv Batra
C. Lawrence Zitnick
Devi Parikh
|
3
|
+
|
Structured Adversarial Attack: Towards General Implementation and Better Interpretability
|
2018
|
Kaidi Xu
Sijia Liu
Pu Zhao
PinâYu Chen
Huan Zhang
Quanfu Fan
Deniz ErdoÄmuĆ
Yanzhi Wang
Xue Lin
|
3
|
+
PDF
Chat
|
MobileNetV2: Inverted Residuals and Linear Bottlenecks
|
2018
|
Mark Sandler
Andrew Howard
Menglong Zhu
Andrey Zhmoginov
Liang-Chieh Chen
|
3
|
+
PDF
Chat
|
Spatio-temporal Channel Correlation Networks for Action Classification
|
2018
|
Ali Diba
Mohsen Fayyaz
Vivek Sharma
Mohammad Mahdi Arzani
Rahman Yousefzadeh
JĂŒergen Gall
Luc Van Gool
|
3
|
+
PDF
Chat
|
Videos as Space-Time Region Graphs
|
2018
|
Xiaolong Wang
Abhinav Gupta
|
3
|
+
PDF
Chat
|
ImageNet Large Scale Visual Recognition Challenge
|
2015
|
Olga Russakovsky
Jia Deng
Hao Su
Jonathan Krause
Sanjeev Satheesh
Sean Ma
Zhiheng Huang
Andrej Karpathy
Aditya Khosla
Michael S. Bernstein
|
3
|
+
PDF
Chat
|
Squeeze-and-Excitation Networks
|
2018
|
Jie Hu
Li Shen
Gang Sun
|
3
|
+
PDF
Chat
|
Feature Pyramid Networks for Object Detection
|
2017
|
Tsung-Yi Lin
Piotr DollĂĄr
Ross Girshick
Kaiming He
Bharath Hariharan
Serge Belongie
|
3
|