+
|
Adam: A Method for Stochastic Optimization
|
2014
|
Diederik P. Kingma
Jimmy Ba
|
15
|
+
PDF
Chat
|
Deep Residual Learning for Image Recognition
|
2016
|
Kaiming He
Xiangyu Zhang
Shaoqing Ren
Jian Sun
|
12
|
+
|
Very Deep Convolutional Networks for Large-Scale Image Recognition
|
2014
|
Karen Simonyan
Andrew Zisserman
|
9
|
+
PDF
Chat
|
Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation
|
2014
|
Ross Girshick
Jeff Donahue
Trevor Darrell
Jitendra Malik
|
7
|
+
PDF
Chat
|
Temporal Action Localization in Untrimmed Videos via Multi-stage CNNs
|
2016
|
Zheng Shou
Dongang Wang
ShihâFu Chang
|
7
|
+
|
CUHK & ETHZ & SIAT Submission to ActivityNet Challenge 2016.
|
2016
|
Yuanjun Xiong
Limin Wang
Zhe Wang
Bowen Zhang
Hang Song
Wei Li
Dahua Lin
Yu Qiao
Luc Van Gool
Xiaoou Tang
|
7
|
+
PDF
Chat
|
Natural Language Object Retrieval
|
2016
|
Ronghang Hu
Huazhe Xu
Marcus Rohrbach
Jiashi Feng
Kate Saenko
Trevor Darrell
|
7
|
+
PDF
Chat
|
TURN TAP: Temporal Unit Regression Network for Temporal Action Proposals
|
2017
|
Jiyang Gao
Zhenheng Yang
Chen Sun
Kan Chen
Ram Nevatia
|
6
|
+
|
Temporal Action Localization in Untrimmed Videos via Multi-stage CNNs
|
2016
|
Zheng Shou
Dongang Wang
ShihâFu Chang
|
6
|
+
PDF
Chat
|
Deep visual-semantic alignments for generating image descriptions
|
2015
|
Andrej Karpathy
Li Fei-Fei
|
6
|
+
PDF
Chat
|
You Only Look Once: Unified, Real-Time Object Detection
|
2016
|
Joseph Redmon
Santosh Divvala
Ross Girshick
Ali Farhadi
|
5
|
+
PDF
Chat
|
Learning Spatiotemporal Features with 3D Convolutional Networks
|
2015
|
Du Tran
Lubomir Bourdev
Rob Fergus
Lorenzo Torresani
Manohar Paluri
|
5
|
+
|
Distilling the Knowledge in a Neural Network
|
2015
|
Geoffrey E. Hinton
Oriol Vinyals
Jay B. Dean
|
5
|
+
|
SSD: Single Shot MultiBox Detector
|
2016
|
Wei Liu
Dragomir Anguelov
Dumitru Erhan
Christian Szegedy
Scott Reed
Cheng-Yang Fu
Alexander C. Berg
|
5
|
+
PDF
Chat
|
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
|
2016
|
Shaoqing Ren
Kaiming He
Ross Girshick
Jian Sun
|
4
|
+
PDF
Chat
|
TALL: Temporal Activity Localization via Language Query
|
2017
|
Jiyang Gao
Chen Sun
Zhenheng Yang
Ram Nevatia
|
4
|
+
|
Argoverse: 3D Tracking and Forecasting with Rich Maps
|
2019
|
Ming-Fang Chang
John Lambert
Patsorn Sangkloy
Jagjeet Singh
SĹawomir BÄ
k
Andrew T. Hartnett
Wang De
Peter Carr
Simon Lucey
Deva Ramanan
|
4
|
+
PDF
Chat
|
Rules of the Road: Predicting Driving Behavior With a Convolutional Model of Semantic Interactions
|
2019
|
Joey Hong
Benjamin Sapp
James Philbin
|
4
|
+
|
MultiPath: Multiple Probabilistic Anchor Trajectory Hypotheses for Behavior Prediction
|
2019
|
Yuning Chai
Benjamin Sapp
Mayank Bansal
Dragomir Anguelov
|
4
|
+
PDF
Chat
|
Query-Guided Regression Network with Context Policy for Phrase Grounding
|
2017
|
Kan Chen
Rama Kovvuri
Ram Nevatia
|
4
|
+
PDF
Chat
|
Knowledge Aided Consistency for Weakly Supervised Phrase Grounding
|
2018
|
Kan Chen
Jiyang Gao
Ram Nevatia
|
4
|
+
PDF
Chat
|
End-to-End Learning of Action Detection from Frame Glimpses in Videos
|
2016
|
Serena Yeung
Olga Russakovsky
Greg Mori
Li Fei-Fei
|
4
|
+
PDF
Chat
|
Argoverse: 3D Tracking and Forecasting With Rich Maps
|
2019
|
Ming-Fang Chang
Deva Ramanan
James Hays
John Lambert
Patsorn Sangkloy
Jagjeet Singh
SĹawomir BÄ
k
Andrew T. Hartnett
Wang De
Peter Carr
|
4
|
+
PDF
Chat
|
Motion-Appearance Co-memory Networks for Video Question Answering
|
2018
|
Jiyang Gao
Runzhou Ge
Kan Chen
Ram Nevatia
|
4
|
+
PDF
Chat
|
Social GAN: Socially Acceptable Trajectories with Generative Adversarial Networks
|
2018
|
Agrim Gupta
Justin Johnson
Li Fei-Fei
Silvio Savarese
Alexandre Alahi
|
4
|
+
PDF
Chat
|
Long-term recurrent convolutional networks for visual recognition and description
|
2015
|
Jeff Donahue
Lisa Anne Hendricks
Sergio Guadarrama
Marcus Rohrbach
Subhashini Venugopalan
Trevor Darrell
Kate Saenko
|
4
|
+
|
Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift
|
2015
|
Sergey Ioffe
Christian Szegedy
|
4
|
+
|
Fast R-CNN
|
2015
|
Ross Girshick
|
4
|
+
PDF
Chat
|
Graph R-CNN for Scene Graph Generation
|
2018
|
Jianwei Yang
Jiasen Lu
Stefan Lee
Dhruv Batra
Devi Parikh
|
4
|
+
|
ABC-CNN: An Attention Based Convolutional Neural Network for Visual Question Answering
|
2015
|
Kan Chen
Jiang Wang
Liang-Chieh Chen
Haoyuan Gao
Wei Xu
Ram Nevatia
|
4
|
+
PDF
Chat
|
Automatic Concept Discovery from Parallel Text and Visual Corpora
|
2015
|
Chen Sun
Chuang Gan
Ram Nevatia
|
4
|
+
|
Relational inductive biases, deep learning, and graph networks
|
2018
|
Peter Battaglia
Jessica B. Hamrick
Victor Bapst
Ălvaro SĂĄnchezâGonzĂĄlez
VinĂcius Zambaldi
Mateusz Malinowski
Andrea Tacchetti
David Raposo
Adam Santoro
Ryan Faulkner
|
4
|
+
|
Two-Stream Convolutional Networks for Action Recognition in Videos
|
2014
|
Karen Simonyan
Andrew Zisserman
|
4
|
+
PDF
Chat
|
Grounding of Textual Phrases in Images by Reconstruction
|
2016
|
Anna Rohrbach
Marcus Rohrbach
Ronghang Hu
Trevor Darrell
Bernt Schiele
|
4
|
+
|
Very Deep Convolutional Networks for Large-Scale Image Recognition
|
2014
|
Karen Simonyan
Andrew Zisserman
|
4
|
+
PDF
Chat
|
Multimodal Trajectory Predictions for Autonomous Driving using Deep Convolutional Networks
|
2019
|
Henggang Cui
Vladan RadosavljeviÄ
FangâChieh Chou
Tsung-Han Lin
Thi Nguyen
Tzu-Kuo Huang
Jeff Schneider
Nemanja Djuric
|
3
|
+
PDF
Chat
|
PointPillars: Fast Encoders for Object Detection From Point Clouds
|
2019
|
Alex Lang
Sourabh Vora
Holger Caesar
Lubing Zhou
Jiong Yang
Oscar Beijbom
|
3
|
+
|
Cascaded Boundary Regression for Temporal Action Detection
|
2017
|
Jiyang Gao
Zhenheng Yang
Ram Nevatia
|
3
|
+
PDF
Chat
|
CDC: Convolutional-De-Convolutional Networks for Precise Temporal Action Localization in Untrimmed Videos
|
2017
|
Zheng Shou
Jonathan Chan
Alireza Zareian
Kazuyuki Miyazawa
ShihâFu Chang
|
3
|
+
|
R-CNNs for Pose Estimation and Action Detection
|
2014
|
Georgia Gkioxari
Bharath Hariharan
Ross Girshick
Jitendra Malik
|
3
|
+
PDF
Chat
|
DESIRE: Distant Future Prediction in Dynamic Scenes with Interacting Agents
|
2017
|
Namhoon Lee
Wongun Choi
Paul Vernaza
Christopher Choy
Philip H. S. Torr
Manmohan Chandraker
|
3
|
+
PDF
Chat
|
Focal Loss for Dense Object Detection
|
2017
|
Tsung-Yi Lin
Priya Goyal
Ross Girshick
Kaiming He
Piotr DollĂĄr
|
3
|
+
PDF
Chat
|
PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation
|
2017
|
Raffaelli Charles
Hao Su
Kaichun Mo
Leonidas Guibas
|
3
|
+
|
PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space
|
2017
|
Charles R. Qi
Yi Li
Hao Su
Leonidas Guibas
|
3
|
+
|
RED: Reinforced Encoder-Decoder Networks for Action Anticipation
|
2017
|
Jiyang Gao
Zhenheng Yang
Ram Nevatia
|
3
|
+
|
Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations
|
2016
|
Ranjay Krishna
Yuke Zhu
Oliver Groth
Justin Johnson
Kenji Hata
Joshua Kravitz
Stephanie Chen
Yannis Kalantidis
Li-Jia Li
David A. Shamma
|
3
|
+
|
Deep Residual Learning for Image Recognition
|
2015
|
Kaiming He
Xiangyu Zhang
Shaoqing Ren
Jian Sun
|
3
|
+
PDF
Chat
|
The highD Dataset: A Drone Dataset of Naturalistic Vehicle Trajectories on German Highways for Validation of Highly Automated Driving Systems
|
2018
|
Robert Krajewski
Julian Bock
Laurent Kloeker
Lutz Eckstein
|
3
|
+
PDF
Chat
|
Relational Action Forecasting
|
2019
|
Chen Sun
Abhinav Shrivastava
Carl Vondrick
Rahul Sukthankar
Kevin Murphy
Cordelia Schmid
|
3
|
+
|
Two-Stream Convolutional Networks for Action Recognition in Videos
|
2014
|
Karen Simonyan
Andrew Zisserman
|
3
|