|
Deep Residual Learning for Image Recognition
|
2016
|
Kaiming He
Xiangyu Zhang
Shaoqing Ren
Jian Sun
|
8
|
|
Natural Language Object Retrieval
|
2016
|
Ronghang Hu
Huazhe Xu
Marcus Rohrbach
Jiashi Feng
Kate Saenko
Trevor Darrell
|
6
|
|
Generation and Comprehension of Unambiguous Object Descriptions
|
2016
|
Junhua Mao
Jonathan Huang
Alexander Toshev
Oana Camburu
Alan Yuille
Kevin Murphy
|
6
|
|
Boosting Image Captioning with Attributes
|
2017
|
Ting Yao
Yingwei Pan
Yehao Li
Zhaofan Qiu
Tao Mei
|
5
|
|
Comprehension-Guided Referring Expressions
|
2017
|
Ruotian Luo
Gregory Shakhnarovich
|
5
|
|
Modeling Context Between Objects for Referring Expression Understanding
|
2016
|
Varun Nagaraja
Vlad I. Morariu
Larry S. Davis
|
5
|
|
Learning Deep Structure-Preserving Image-Text Embeddings
|
2016
|
Liwei Wang
Yin Li
Svetlana Lazebnik
|
5
|
|
Image Captioning and Visual Question Answering Based on Attributes and External Knowledge
|
2017
|
Qi Wu
Chunhua Shen
Peng Wang
Anthony Dick
Anton van den Hengel
|
5
|
|
Knowledge Aided Consistency for Weakly Supervised Phrase Grounding
|
2018
|
Kan Chen
Jiyang Gao
Ram Nevatia
|
5
|
|
MAttNet: Modular Attention Network for Referring Expression Comprehension
|
2018
|
Licheng Yu
Zhe Lin
Xiaohui Shen
Shuicheng Yan
Xin Lu
Mohit Bansal
Tamara L. Berg
|
5
|
|
Query-Guided Regression Network with Context Policy for Phrase Grounding
|
2017
|
Kan Chen
Rama Kovvuri
Ram Nevatia
|
5
|
|
Grounding Referring Expressions in Images by Variational Context
|
2018
|
Hanwang Zhang
Yulei Niu
Shih-Fu Chang
|
5
|
|
ImageSpirit
|
2014
|
Ming-Ming Cheng
Shuai Zheng
Wen-Yan Lin
Vibhav Vineet
Paul Sturgess
Nigel Crook
Niloy J. Mitra
Philip H. S. Torr
|
4
|
|
Modeling Context in Referring Expressions
|
2016
|
Licheng Yu
Patrick Poirson
Shan Yang
Alexander C. Berg
Tamara L. Berg
|
4
|
|
Adaptive Reconstruction Network for Weakly Supervised Referring Expression Grounding
|
2019
|
Xuejing Liu
Liang Li
Shuhui Wang
Zheng-Jun Zha
Dechao Meng
Qingming Huang
|
4
|
|
A Joint Speaker-Listener-Reinforcer Model for Referring Expressions
|
2017
|
Licheng Yu
Hao Tan
Mohit Bansal
Tamara L. Berg
|
4
|
|
Long-term recurrent convolutional networks for visual recognition and description
|
2015
|
Jeff Donahue
Lisa Anne Hendricks
Sergio Guadarrama
Marcus Rohrbach
Subhashini Venugopalan
Trevor Darrell
Kate Saenko
|
4
|
|
Vision-and-Language Navigation: Interpreting Visually-Grounded Navigation Instructions in Real Environments
|
2018
|
Peter Anderson
Qi Wu
Damien Teney
Jake Bruce
Mark Johnson
Niko Sünderhauf
Ian Reid
Stephen Gould
Anton van den Hengel
|
4
|
|
Modeling Relationships in Referential Expressions with Compositional Modular Networks
|
2017
|
Ronghang Hu
Marcus Rohrbach
Jacob Andreas
Trevor Darrell
Kate Saenko
|
4
|
|
Grounding of Textual Phrases in Images by Reconstruction
|
2016
|
Anna Rohrbach
Marcus Rohrbach
Ronghang Hu
Trevor Darrell
Bernt Schiele
|
4
|
|
Embodied Question Answering
|
2018
|
Abhishek Das
Samyak Datta
Georgia Gkioxari
Stefan Lee
Devi Parikh
Dhruv Batra
|
3
|
|
Knowledge-guided Pairwise Reconstruction Network for Weakly Supervised Referring Expression Grounding
|
2019
|
Xuejing Liu
Liang Li
Shuhui Wang
Zheng-Jun Zha
Li Su
Qingming Huang
|
3
|
|
IQA: Visual Question Answering in Interactive Environments
|
2018
|
Daniel Gordon
Aniruddha Kembhavi
Mohammad Rastegari
Joseph Redmon
Dieter Fox
Ali Farhadi
|
3
|
|
Adam: A Method for Stochastic Optimization
|
2014
|
Diederik P. Kingma
Jimmy Ba
|
3
|
|
Weakly-Supervised Visual Grounding of Phrases with Linguistic Structures
|
2017
|
Fanyi Xiao
Leonid Sigal
Yong Jae Lee
|
3
|
|
Show and tell: A neural image caption generator
|
2015
|
Oriol Vinyals
Alexander Toshev
Samy Bengio
Dumitru Erhan
|
3
|
|
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
|
2016
|
Shaoqing Ren
Kaiming He
Ross Girshick
Jian Sun
|
3
|
|
Real-Time Referring Expression Comprehension by Single-Stage Grounding Network
|
2018
|
Xinpeng Chen
Lin Ma
Jingyuan Chen
Zequn Jie
Wei Liu
Jiebo Luo
|
2
|
|
Hard-Aware Deeply Cascaded Embedding
|
2017
|
Yuhui Yuan
Kuiyuan Yang
Chao Zhang
|
2
|
|
Learning Compact Appearance Representation for Video-Based Person Re-Identification
|
2018
|
Wei Zhang
Shengnan Hu
Kan Liu
Zheng-Jun Zha
|
2
|
|
RAM: A Region-Aware Deep Model for Vehicle Re-Identification
|
2018
|
Xiaobin Liu
Shiliang Zhang
Qingming Huang
Wen Gao
|
2
|
|
A Dual-Path Model With Adaptive Attention for Vehicle Re-Identification
|
2019
|
Pirazh Khorramshahi
Amit Kumar
Neehar Peri
Sai Saketh Rambhatla
Jun-Cheng Chen
Rama Chellappa
|
2
|
|
Pyramidal Person Re-IDentification via Multi-Loss Dynamic Training
|
2019
|
Feng Zheng
Cheng Deng
Xing Sun
Xinyang Jiang
Xiaowei Guo
Zongqiao Yu
Feiyue Huang
Rongrong Ji
|
2
|
|
Bag of Tricks and a Strong Baseline for Deep Person Re-Identification
|
2019
|
Hao Luo
Youzhi Gu
Xingyu Liao
Shenqi Lai
Wei Jiang
|
2
|
|
Harmonious Attention Network for Person Re-identification
|
2018
|
Wei Li
Xiatian Zhu
Shaogang Gong
|
2
|
|
VL-BERT: Pre-training of Generic Visual-Linguistic Representations
|
2019
|
Weijie Su
Xizhou Zhu
Yue Cao
Bin Li
Lewei Lu
Furu Wei
Jifeng Dai
|
2
|
|
Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks
|
2017
|
Jun-Yan Zhu
Taesung Park
Phillip Isola
Alexei A. Efros
|
2
|
|
A large-scale car dataset for fine-grained categorization and verification
|
2015
|
Linjie Yang
Ping Luo
Chen Change Loy
Xiaoou Tang
|
2
|
|
Learning Deep Neural Networks for Vehicle Re-ID with Visual-spatio-Temporal Path Proposals
|
2017
|
Yantao Shen
Tong Xiao
Hongsheng Li
Shuai Yi
Xiaogang Wang
|
2
|
|
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
|
2015
|
Shaoqing Ren
Kaiming He
Ross Girshick
Jian Sun
|
2
|
|
YOLO9000: Better, Faster, Stronger
|
2017
|
Joseph Redmon
Ali Farhadi
|
2
|
|
On the Number of Linear Regions of Deep Neural Networks
|
2014
|
Guido Montúfar
Razvan Pascanu
Kyunghyun Cho
Yoshua Bengio
|
2
|
|
Flickr30k Entities: Collecting Region-to-Phrase Correspondences for Richer Image-to-Sentence Models
|
2016
|
Bryan A. Plummer
Liwei Wang
Chris M. Cervantes
Juan C. Caicedo
Julia Hockenmaier
Svetlana Lazebnik
|
2
|
|
ImageNet Large Scale Visual Recognition Challenge
|
2015
|
Olga Russakovsky
Jia Deng
Hao Su
Jonathan Krause
Sanjeev Satheesh
Sean Ma
Zhiheng Huang
Andrej Karpathy
Aditya Khosla
Michael S. Bernstein
|
2
|
|
Human Semantic Parsing for Person Re-identification
|
2018
|
Mahdi M. Kalayeh
Emrah Başaran
Muhittin Gökmen
Mustafa E. Kamaşak
Mubarak Shah
|
2
|
|
Person re-identification by Local Maximal Occurrence representation and metric learning
|
2015
|
Shengcai Liao
Yang Hu
Xiangyu Zhu
Stan Z. Li
|
2
|
|
Attributes Guided Feature Learning for Vehicle Re-identification
|
2019
|
Aihua Zheng
Xianmin Lin
Chenglong Li
Ran He
Jin Tang
|
2
|
|
Going deeper with convolutions
|
2015
|
Christian Szegedy
Wei Liu
Yangqing Jia
Pierre Sermanet
Scott Reed
Dragomir Anguelov
Dumitru Erhan
Vincent Vanhoucke
Andrew Rabinovich
|
2
|
|
Beyond Part Models: Person Retrieval with Refined Part Pooling (and A Strong Convolutional Baseline)
|
2018
|
Yifan Sun
Liang Zheng
Yi Yang
Qi Tian
Shengjin Wang
|
2
|