+
PDF
Chat
|
Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization
|
2017
|
Ramprasaath R. Selvaraju
Michael Cogswell
Abhishek Das
Ramakrishna Vedantam
Devi Parikh
Dhruv Batra
|
10
|
+
PDF
Chat
|
VQA: Visual Question Answering
|
2015
|
Stanislaw Antol
Aishwarya Agrawal
Jiasen Lu
Margaret Mitchell
Dhruv Batra
C. Lawrence Zitnick
Devi Parikh
|
10
|
+
PDF
Chat
|
Taking a HINT: Leveraging Explanations to Make Vision and Language Models More Grounded
|
2019
|
Ramprasaath R. Selvaraju
Stefan Lee
Yilin Shen
Hongxia Jin
Shalini Ghosh
Larry Heck
Dhruv Batra
Devi Parikh
|
8
|
+
PDF
Chat
|
Fully convolutional networks for semantic segmentation
|
2015
|
Jonathan Long
Evan Shelhamer
Trevor Darrell
|
6
|
+
PDF
Chat
|
ImageNet Large Scale Visual Recognition Challenge
|
2015
|
Olga Russakovsky
Jia Deng
Hao Su
Jonathan Krause
Sanjeev Satheesh
Sean Ma
Zhiheng Huang
Andrej Karpathy
Aditya Khosla
Michael S. Bernstein
|
6
|
+
|
VQA: Visual Question Answering
|
2015
|
Aishwarya Agrawal
Jiasen Lu
Stanislaw Antol
Margaret Mitchell
C. Lawrence Zitnick
Dhruv Batra
Devi Parikh
|
5
|
+
PDF
Chat
|
Human Attention in Visual Question Answering: Do Humans and Deep Networks look at the same regions?
|
2016
|
Abhishek Das
Harsh Agrawal
Larry Zitnick
Devi Parikh
Dhruv Batra
|
5
|
+
|
Analyzing the Behavior of Visual Question Answering Models
|
2016
|
Aishwarya Agrawal
Dhruv Batra
Devi Parikh
|
5
|
+
|
Human Attention in Visual Question Answering: Do Humans and Deep Networks Look at the Same Regions?
|
2017
|
Abhishek Das
Harsh Agrawal
Larry Zitnick
Devi Parikh
Dhruv Batra
|
5
|
+
|
Grad-CAM: Why did you say that? Visual Explanations from Deep Networks via Gradient-based Localization
|
2016
|
Ramprasaath R. Selvaraju
Abhishek Das
Ramakrishna Vedantam
Michael Cogswell
Devi Parikh
Dhruv Batra
|
5
|
+
|
Very Deep Convolutional Networks for Large-Scale Image Recognition
|
2014
|
Karen Simonyan
Andrew Zisserman
|
5
|
+
|
Exploring Human-like Attention Supervision in Visual Question Answering
|
2017
|
Tingting Qiao
Jianfeng Dong
D. S. Xu
|
5
|
+
PDF
Chat
|
Momentum Contrast for Unsupervised Visual Representation Learning
|
2020
|
Kaiming He
Haoqi Fan
Yuxin Wu
Saining Xie
Ross Girshick
|
5
|
+
|
Exploring models and data for image question answering
|
2015
|
Mengye Ren
Ryan Kiros
Richard S. Zemel
|
5
|
+
PDF
Chat
|
Exploring Human-Like Attention Supervision in Visual Question Answering
|
2018
|
Tingting Qiao
Jianfeng Dong
Duanqing Xu
|
5
|
+
PDF
Chat
|
Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation
|
2014
|
Ross Girshick
Jeff Donahue
Trevor Darrell
Jitendra Malik
|
4
|
+
PDF
Chat
|
Contrastive Multiview Coding
|
2020
|
Yonglong Tian
Dilip Krishnan
Phillip Isola
|
4
|
+
PDF
Chat
|
Self-Supervised Learning of Pretext-Invariant Representations
|
2020
|
Ishan Misra
Laurens van der Maaten
|
4
|
+
|
Improved Baselines with Momentum Contrastive Learning
|
2020
|
Xinlei Chen
Haoqi Fan
Ross Girshick
Kaiming He
|
4
|
+
PDF
Chat
|
Making the V in VQA Matter: Elevating the Role of Image Understanding in Visual Question Answering
|
2017
|
Yash Goyal
Tejas Khot
Douglas Summers-Stay
Dhruv Batra
Devi Parikh
|
4
|
+
|
Hierarchical Question-Image Co-Attention for Visual Question Answering
|
2016
|
Jiasen Lu
Jianwei Yang
Dhruv Batra
Devi Parikh
|
4
|
+
PDF
Chat
|
Deep Residual Learning for Image Recognition
|
2016
|
Kaiming He
Xiangyu Zhang
Shaoqing Ren
Jian Sun
|
4
|
+
|
Representation Learning with Contrastive Predictive Coding
|
2018
|
Aäron van den Oord
Yazhe Li
Oriol Vinyals
|
4
|
+
PDF
Chat
|
Deep visual-semantic alignments for generating image descriptions
|
2015
|
Andrej Karpathy
Li Fei-Fei
|
4
|
+
PDF
Chat
|
Unsupervised Embedding Learning via Invariant and Spreading Instance Feature
|
2019
|
Mang Ye
Xu Zhang
Pong C. Yuen
Shih‐Fu Chang
|
4
|
+
|
Striving for Simplicity: The All Convolutional Net
|
2014
|
Jost Tobias Springenberg
Alexey Dosovitskiy
Thomas Brox
Martin Riedmiller
|
4
|
+
|
Object Detectors Emerge in Deep Scene CNNs
|
2014
|
Bolei Zhou
Aditya Khosla
Àgata Lapedriza
Aude Oliva
Antonio Torralba
|
3
|
+
|
What Makes for Good Views for Contrastive Learning
|
2020
|
Yonglong Tian
Chen Sun
Ben Poole
Dilip Krishnan
Cordelia Schmid
Phillip Isola
|
3
|
+
PDF
Chat
|
CASTing Your Model: Learning to Localize Improves Self-Supervised Representations
|
2021
|
Ramprasaath R. Selvaraju
Karan Desai
Justin Johnson
Nikhil Naik
|
3
|
+
PDF
Chat
|
VirTex: Learning Visual Representations from Textual Annotations
|
2021
|
Karan Desai
Justin Johnson
|
3
|
+
|
SQuINTing at VQA Models: Introspecting VQA Models with Sub-Questions
|
2020
|
Ramprasaath R. Selvaraju
Purva Tendulkar
Devi Parikh
Eric Horvitz
Marco Ribeiro
Besmira Nushi
Ece Kamar
|
3
|
+
PDF
Chat
|
YFCC100M
|
2016
|
Bart Thomée
David A. Shamma
Gerald Friedland
Benjamin Elizalde
Karl Ni
Douglas N. Poland
Damian Borth
Li-Jia Li
|
3
|
+
|
Big Self-Supervised Models are Strong Semi-Supervised Learners
|
2020
|
Ting Chen
Simon Kornblith
Kevin Swersky
Mohammad Norouzi
Geoffrey E. Hinton
|
3
|
+
|
Object-aware Contrastive Learning for Debiased Scene Representation
|
2021
|
Sangwoo Mo
Hyunwoo Kang
Kihyuk Sohn
Chun‐Liang Li
Jinwoo Shin
|
3
|
+
PDF
Chat
|
DenseCap: Fully Convolutional Localization Networks for Dense Captioning
|
2016
|
Justin Johnson
Andrej Karpathy
Li Fei-Fei
|
3
|
+
PDF
Chat
|
Yin and Yang: Balancing and Answering Binary Visual Questions
|
2016
|
Peng Zhang
Yash Goyal
Douglas Summers-Stay
Dhruv Batra
Devi Parikh
|
3
|
+
PDF
Chat
|
Attention Correctness in Neural Image Captioning
|
2017
|
Chenxi Liu
Junhua Mao
Fei Sha
Alan Yuille
|
3
|
+
PDF
Chat
|
Top-Down Neural Attention by Excitation Backprop
|
2016
|
Jianming Zhang
Zhe Lin
Jonathan Brandt
Xiaohui Shen
Stan Sclaroff
|
3
|
+
PDF
Chat
|
Women Also Snowboard: Overcoming Bias in Captioning Models
|
2018
|
Lisa Anne Hendricks
Kaylee Burns
Kate Saenko
Trevor Darrell
Anna Rohrbach
|
3
|
+
PDF
Chat
|
Don't Just Assume; Look and Answer: Overcoming Priors for Visual Question Answering
|
2018
|
Aishwarya Agrawal
Dhruv Batra
Devi Parikh
Aniruddha Kembhavi
|
3
|
+
|
Striving for Simplicity: The All Convolutional Net
|
2014
|
Jost Tobias Springenberg
Alexey Dosovitskiy
Thomas Brox
Martin Riedmiller
|
3
|
+
PDF
Chat
|
Multimodal Explanations: Justifying Decisions and Pointing to the Evidence
|
2018
|
Dong Huk Park
Lisa Anne Hendricks
Zeynep Akata
Anna Rohrbach
Bernt Schiele
Trevor Darrell
Marcus Rohrbach
|
3
|
+
PDF
Chat
|
Cycle-Consistency for Robust Visual Question Answering
|
2019
|
Meet Shah
Xinlei Chen
Marcus Rohrbach
Devi Parikh
|
3
|
+
|
Very Deep Convolutional Networks for Large-Scale Image Recognition
|
2014
|
Karen Simonyan
Andrew Zisserman
|
3
|
+
PDF
Chat
|
From captions to visual concepts and back
|
2015
|
Hao Fang
Saurabh Gupta
Forrest Iandola
Rupesh K. Srivastava
Li Deng
Piotr Dollár
Jianfeng Gao
Xiaodong He
Margaret Mitchell
John Platt
|
3
|
+
PDF
Chat
|
Learning Deep Features for Discriminative Localization
|
2016
|
Bolei Zhou
Aditya Khosla
Àgata Lapedriza
Aude Oliva
Antonio Torralba
|
3
|
+
PDF
Chat
|
Ask Your Neurons: A Neural-Based Approach to Answering Questions about Images
|
2015
|
Mateusz Malinowski
Marcus Rohrbach
Mario Fritz
|
3
|
+
|
Yin and Yang: Balancing and Answering Binary Visual Questions
|
2015
|
Peng Zhang
Yash Goyal
Douglas Summers-Stay
Dhruv Batra
Devi Parikh
|
3
|
+
|
Pythia v0.1: the Winning Entry to the VQA Challenge 2018
|
2018
|
Yu Jiang
Vivek Natarajan
Xinlei Chen
Marcus Rohrbach
Dhruv Batra
Devi Parikh
|
3
|
+
|
Inverting Convolutional Networks with Convolutional Networks.
|
2015
|
Alexey Dosovitskiy
Thomas Brox
|
2
|