| Deep Residual Learning for Image Recognition | 2016 | Kaiming He, Xiangyu Zhang, Shaoqing Ren, Jian Sun | 6 |
| Adam: A Method for Stochastic Optimization | 2014 | Diederik P. Kingma, Jimmy Ba | 4 |
| CIDEr: Consensus-based image description evaluation | 2015 | Ramakrishna Vedantam, C. Lawrence Zitnick, Devi Parikh | 4 |
| Distilling the Knowledge in a Neural Network | 2015 | Geoffrey E. Hinton, Oriol Vinyals, Jeff Dean | 3 |
| Can Spatiotemporal 3D CNNs Retrace the History of 2D CNNs and ImageNet? | 2018 | Kensho Hara, Hirokatsu Kataoka, Yutaka Satoh | 3 |
| BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding | 2018 | Jacob Devlin, Ming‐Wei Chang, Kenton Lee, Kristina Toutanova | 3 |
| Memory-Attended Recurrent Network for Video Captioning | 2019 | Wenjie Pei, Jiyuan Zhang, Xiangrong Wang, Lei Ke, Xiaoyong Shen, Yu‐Wing Tai | 3 |
| Spatio-Temporal Dynamics and Semantic Attribute Enriched Visual Encoding for Video Captioning | 2019 | Nayyer Aafaq, Naveed Akhtar, Wei Liu, Syed Zulqarnain Gilani, Ajmal Mian | 3 |
| DARA: Dynamics-Aware Reward Augmentation in Offline Reinforcement Learning | 2022 | Jinxin Liu, Hongyin Zhang, Donglin Wang | 2 |
| Semi-Supervised Classification with Graph Convolutional Networks | 2016 | Thomas Kipf, Max Welling | 2 |
| Translating Videos to Natural Language Using Deep Recurrent Neural Networks | 2015 | Subhashini Venugopalan, Huijuan Xu, Jeff Donahue, Marcus Rohrbach, Raymond J. Mooney, Kate Saenko | 2 |
| Controllable Video Captioning With POS Sequence Guidance Based on Gated Fusion Network | 2019 | Bairui Wang, Lin Ma, Wei Zhang, Wenhao Jiang, Jingwen Wang, Wei Liu | 2 |
| Machine Learning Testing: Survey, Landscapes and Horizons | 2020 | Jie M. Zhang, Mark Harman, Lei Ma, Yang Liu | 2 |
| Unsupervised Domain Adaptation with Dynamics-Aware Rewards in Reinforcement Learning | 2021 | Jinxin Liu, Hao Shen, Donglin Wang, Yachen Kang, Qiangxing Tian | 2 |
| VaTeX: A Large-Scale, High-Quality Multilingual Dataset for Video-and-Language Research | 2019 | Xin Wang, Jiawei Wu, Junkun Chen, Lei Li, Yuan‐Fang Wang, William Yang Wang | 2 |
| Hierarchical Recurrent Neural Encoder for Video Representation with Application to Captioning | 2016 | Pingbo Pan, Zhongwen Xu, Yi Yang, Fei Wu, Yueting Zhuang | 2 |
| Out of the Box: Reasoning with Graph Convolution Nets for Factual Visual Question Answering | 2018 | Medhini Narasimhan, Svetlana Lazebnik, Alexander G. Schwing | 2 |
| An Analysis of Incorporating an External Language Model into a Sequence-to-Sequence Model | 2018 | Anjuli Kannan, Yonghui Wu, Patrick Nguyen, Tara N. Sainath, ZhiJeng Chen, Rohit Prabhavalkar | 2 |
| Towards Evaluating the Robustness of Neural Networks | 2017 | Nicholas Carlini, David Wagner | 2 |
| Non-local Neural Networks | 2018 | Xiaolong Wang, Ross Girshick, Abhinav Gupta, Kaiming He | 2 |
| One-step and Two-step Classification for Abusive Language Detection on Twitter | 2017 | Ji Ho Park, Pascale Fung | 2 |
| Video Paragraph Captioning Using Hierarchical Recurrent Neural Networks | 2016 | Haonan Yu, Jiang Wang, Zhiheng Huang, Yi Yang, Wei Xu | 2 |
| Aggregated Residual Transformations for Deep Neural Networks | 2017 | Saining Xie, Ross Girshick, Piotr Dollár, Zhuowen Tu, Kaiming He | 2 |
| Very Deep Convolutional Networks for Large-Scale Image Recognition | 2014 | Karen Simonyan, Andrew Zisserman | 2 |
| Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning | 2017 | Christian Szegedy, Sergey Ioffe, Vincent Vanhoucke, Alexander A. Alemi | 2 |
| Sequence to Sequence -- Video to Text | 2015 | Subhashini Venugopalan, Marcus Rohrbach, Jeffrey Donahue, Raymond J. Mooney, Trevor Darrell, Kate Saenko | 2 |
| Jointly Modeling Embedding and Translation to Bridge Video and Language | 2016 | Yingwei Pan, Tao Mei, Ting Yao, Houqiang Li, Yong Rui | 2 |
| Relation-Aware Graph Attention Network for Visual Question Answering | 2019 | Linjie Li, Zhe Gan, Yu Cheng, Jingjing Liu | 2 |
| Controllable Video Captioning with an Exemplar Sentence | 2020 | Yitian Yuan, Lin Ma, Jingwen Wang, Wenwu Zhu | 2 |
| Deep Learning for Hate Speech Detection in Tweets | 2017 | Pinkesh Badjatiya, Shashank Gupta, Manish Gupta, Vasudeva Varma | 2 |
| Efficient Estimation of Word Representations in Vector Space | 2013 | Tomáš Mikolov, Kai Chen, Greg S. Corrado, Jeff Dean | 2 |
| Exploring Visual Relationship for Image Captioning | 2018 | Ting Yao, Yingwei Pan, Yehao Li, Tao Mei | 2 |
| Towards Deep Learning Models Resistant to Adversarial Attacks | 2017 | Aleksander Mądry, Aleksandar Makelov, Ludwig Schmidt, Dimitris Tsipras, Adrian Vladu | 2 |
| Videos as Space-Time Region Graphs | 2018 | Xiaolong Wang, Abhinav Gupta | 2 |
| The Kinetics Human Action Video Dataset | 2017 | Andrew Zisserman, João Carreira, Karen Simonyan, Will Kay, Brian Zhang, Chloe Hillier, Sudheendra Vijayanarasimhan, Fabio Viola, T.C. Green, Trevor Back | 2 |
| DeepXplore | 2017 | Kexin Pei, Yinzhi Cao, Junfeng Yang, Suman Jana | 2 |
| Object-Aware Aggregation With Bidirectional Temporal Graph for Video Captioning | 2019 | Junchao Zhang, Yuxin Peng | 2 |
| MMDetection: Open MMLab Detection Toolbox and Benchmark | 2019 | Kai Chen, Jiaqi Wang, Jiangmiao Pang, Yuhang Cao, Yu Xiong, Xiaoxiao Li, Shuyang Sun, Wansen Feng, Ziwei Liu, Jiarui Xu | 2 |
| Data Sets: Word Embeddings Learned from Tweets and General Data | 2017 | Quanzhi Li, Sameena Shah, Xiaomo Liu, Armineh Nourbakhsh | 2 |
| Cold Fusion: Training Seq2Seq Models Together with Language Models | 2018 | Anuroop Sriram, Heewoo Jun, Sanjeev Satheesh, Adam Coates | 2 |
| DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs | 2017 | Liang-Chieh Chen, George Papandreou, Iasonas Kokkinos, Kevin Murphy, Alan Yuille | 2 |
| Reconstruction Network for Video Captioning | 2018 | Bairui Wang, Lin Ma, Wei Zhang, Wei Liu | 2 |
| Describing Videos by Exploiting Temporal Structure | 2015 | Li Yao, Atousa Torabi, Kyunghyun Cho, Nicolas Ballas, Christopher Pal, Hugo Larochelle, Aaron Courville | 2 |
| Knockoff Nets: Stealing Functionality of Black-Box Models | 2019 | Tribhuvanesh Orekondy, Bernt Schiele, Mario Fritz | 2 |
| Learning Conditioned Graph Structures for Interpretable Visual Question Answering | 2018 | Will Norcliffe-Brown, Stathis Vafeias, Sarah Parisot | 2 |
| Neural Motifs: Scene Graph Parsing with Global Context | 2018 | Rowan Zellers, Mark Yatskar, Sam Thomson, Yejin Choi | 2 |
| Less Is More: Picking Informative Frames for Video Captioning | 2018 | Yangyu Chen, Shuhui Wang, Weigang Zhang, Qingming Huang | 2 |
| ImageNet Large Scale Visual Recognition Challenge | 2015 | Olga Russakovsky, Jia Deng, Hao Su, Jonathan Krause, Sanjeev Satheesh, Sean Ma, Zhiheng Huang, Andrej Karpathy, Aditya Khosla, Michael S. Bernstein | 2 |
| Sequence Level Training with Recurrent Neural Networks | 2015 | Marc’Aurelio Ranzato, Sumit Chopra, Michael Auli, Wojciech Zaremba | 1 |
| Depth Map Prediction from a Single Image using a Multi-Scale Deep Network | 2014 | David Eigen, Christian Puhrsch, Rob Fergus | 1 |