Projects
Reading
People
Chat
SU\G
(𝔸)
/K·U
Projects
Reading
People
Chat
Sign Up
Light
Dark
System
CLEVR_HYP: A Challenge Dataset and Baselines for Visual Question Answering with Hypothetical Actions over Images
Shailaja Keyur Sampat
,
Akshay Kumar
,
Yezhou Yang
,
Chitta Baral
Type:
Preprint
Publication Date:
2021-04-13
Citations:
0
View Publication
Share
Locations
arXiv (Cornell University) -
View
Similar Works
Action
Title
Year
Authors
+
CLEVR_HYP: A Challenge Dataset and Baselines for Visual Question Answering with Hypothetical Actions over Images
2021
Shailaja Keyur Sampat
Akshay Kumar
Yezhou Yang
Chitta Baral
+
A-OKVQA: A Benchmark for Visual Question Answering using World Knowledge
2022
Dustin Schwenk
Apoorv Khandelwal
Christopher M. Clark
Kenneth Marino
Roozbeh Mottaghi
+
PDF
Chat
CLEVR_HYP: A Challenge Dataset and Baselines for Visual Question Answering with Hypothetical Actions over Images
2021
Shailaja Keyur Sampat
Akshay Kumar
Yezhou Yang
Chitta Baral
+
PDF
Chat
Natural Language Understanding and Inference with MLLM in Visual Question Answering: A Survey
2024
Jiayi Kuang
Jingyou Xie
Haohao Luo
Ronghao Li
Zhe Xu
Xianfeng Cheng
Yinghui Li
Xika Lin
Ying Shen
+
Visuo-Linguistic Question Answering (VLQA) Challenge
2020
Shailaja Keyur Sampat
Yezhou Yang
Chitta Baral
+
Visual Question Answering: A Survey of Methods and Datasets
2016
Qi Wu
Damien Teney
Peng Wang
Chunhua Shen
Anthony Dick
Anton van den Hengel
+
Visuo-Linguistic Question Answering (VLQA) Challenge
2020
Shailaja Keyur Sampat
Yezhou Yang
Chitta Baral
+
Visuo-Linguistic Question Answering (VLQA) Challenge
2020
Shailaja Keyur Sampat
Yezhou Yang
Chitta Baral
+
Language Guided Visual Question Answering: Elevate Your Multimodal Language Model Using Knowledge-Enriched Prompts
2023
Deepanway Ghosal
Navonil Majumder
Roy Ka-Wei Lee
Rada Mihalcea
Soujanya Poria
+
Language Guided Visual Question Answering: Elevate Your Multimodal Language Model Using Knowledge-Enriched Prompts
2023
Deepanway Ghosal
Navonil Majumder
Roy Lee
Rada Mihalcea
Soujanya Poria
+
iVQA: Inverse Visual Question Answering
2017
Feng Liu
Tao Xiang
Timothy M. Hospedales
Wankou Yang
Changyin Sun
+
PDF
Chat
iVQA: Inverse Visual Question Answering
2018
Feng Liu
Tao Xiang
Timothy M. Hospedales
Wankou Yang
Changyin Sun
+
GQA: A New Dataset for Real-World Visual Reasoning and Compositional Question Answering
2019
Drew A. Hudson
Christopher D. Manning
+
GQA: A New Dataset for Real-World Visual Reasoning and Compositional Question Answering
2019
Drew A. Hudson
Christopher D. Manning
+
VQA: Visual Question Answering
2015
Aishwarya Agrawal
Jiasen Lu
Stanislaw Antol
Margaret Mitchell
C. Lawrence Zitnick
Dhruv Batra
Devi Parikh
+
VQA: Visual Question Answering
2015
Aishwarya Agrawal
Jiasen Lu
Stanislaw Antol
Margaret Mitchell
C. Lawrence Zitnick
Dhruv Batra
Devi Parikh
+
PDF
Chat
VQA: Visual Question Answering
2015
Stanislaw Antol
Aishwarya Agrawal
Jiasen Lu
Margaret Mitchell
Dhruv Batra
C. Lawrence Zitnick
Devi Parikh
+
PDF
Chat
Socratic Questioning: Learn to Self-guide Multimodal Reasoning in the Wild
2025
Wenjie Hu
Haodi Liu
Chen Lin
Feng Zhou
Changming Xiao
Huazhe Yang
Changshui Zhang
+
OK-VQA: A Visual Question Answering Benchmark Requiring External Knowledge
2019
Kenneth Marino
Mohammad Rastegari
Ali Farhadi
Roozbeh Mottaghi
+
Making the V in VQA Matter: Elevating the Role of Image Understanding in Visual Question Answering
2016
Yash Goyal
Tejas Khot
Douglas Summers-Stay
Dhruv Batra
Devi Parikh
Works That Cite This (0)
Action
Title
Year
Authors
Works Cited by This (33)
Action
Title
Year
Authors
+
PDF
Chat
Generative Adversarial Networks
2022
Ian J. Goodfellow
Jean Pouget-Abadie
Mehdi Mirza
Bing Xu
David Warde-Farley
Sherjil Ozair
Aaron Courville
Yoshua Bengio
+
PDF
Chat
VQA: Visual Question Answering
2015
Stanislaw Antol
Aishwarya Agrawal
Jiasen Lu
Margaret Mitchell
Dhruv Batra
C. Lawrence Zitnick
Devi Parikh
+
PDF
Chat
Deep Residual Learning for Image Recognition
2016
Kaiming He
Xiangyu Zhang
Shaoqing Ren
Jian Sun
+
PDF
Chat
CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning
2017
Justin Johnson
Bharath Hariharan
Laurens van der Maaten
Li Fei-Fei
C. Lawrence Zitnick
Ross Girshick
+
FigureQA: An Annotated Figure Dataset for Visual Reasoning
2017
Samira Ebrahimi Kahou
Vincent Michalski
Adam Atkinson
Ákos Kádár
Adam Trischler
Yoshua Bengio
+
A Dataset and Architecture for Visual Reasoning with a Working Memory
2018
Guangyu Robert Yang
Igor Ganichev
Xiao‐Jing Wang
Jonathon Shlens
David Sussillo
+
PDF
Chat
Composing Text and Image for Image Retrieval - an Empirical Odyssey
2019
Nam Vo
Lu Jiang
Chen Sun
Kevin Murphy
Li-Jia Li
Li Fei-Fei
James Hays
+
PDF
Chat
fairseq: A Fast, Extensible Toolkit for Sequence Modeling
2019
Myle Ott
Sergey Edunov
Alexei Baevski
Angela Fan
Sam Gross
Nathan Ng
David Grangier
Michael Auli
+
Exploring Models and Data for Image Question Answering
2015
Mengye Ren
Ryan Kiros
Richard S. Zemel
+
Answering Visual What-If Questions: From Actions to Predicted Scene Descriptions
2018
Misha Wagner
Hector Basevi
Rakshith Shetty
W. Li
Mateusz Malinowski
Mario Fritz
Aleš Leonardis