CLEVR_HYP: A Challenge Dataset and Baselines for Visual Question Answering with Hypothetical Actions over Images

Type: Preprint

Publication Date: 2021-04-13

Citations: 0

Locations

  • arXiv (Cornell University) - View

Similar Works

Action Title Year Authors
+ CLEVR_HYP: A Challenge Dataset and Baselines for Visual Question Answering with Hypothetical Actions over Images 2021 Shailaja Keyur Sampat
Akshay Kumar
Yezhou Yang
Chitta Baral
+ A-OKVQA: A Benchmark for Visual Question Answering using World Knowledge 2022 Dustin Schwenk
Apoorv Khandelwal
Christopher M. Clark
Kenneth Marino
Roozbeh Mottaghi
+ PDF Chat CLEVR_HYP: A Challenge Dataset and Baselines for Visual Question Answering with Hypothetical Actions over Images 2021 Shailaja Keyur Sampat
Akshay Kumar
Yezhou Yang
Chitta Baral
+ PDF Chat Natural Language Understanding and Inference with MLLM in Visual Question Answering: A Survey 2024 Jiayi Kuang
Jingyou Xie
Haohao Luo
Ronghao Li
Zhe Xu
Xianfeng Cheng
Yinghui Li
Xika Lin
Ying Shen
+ Visuo-Linguistic Question Answering (VLQA) Challenge 2020 Shailaja Keyur Sampat
Yezhou Yang
Chitta Baral
+ Visual Question Answering: A Survey of Methods and Datasets 2016 Qi Wu
Damien Teney
Peng Wang
Chunhua Shen
Anthony Dick
Anton van den Hengel
+ Visuo-Linguistic Question Answering (VLQA) Challenge 2020 Shailaja Keyur Sampat
Yezhou Yang
Chitta Baral
+ Visuo-Linguistic Question Answering (VLQA) Challenge 2020 Shailaja Keyur Sampat
Yezhou Yang
Chitta Baral
+ Language Guided Visual Question Answering: Elevate Your Multimodal Language Model Using Knowledge-Enriched Prompts 2023 Deepanway Ghosal
Navonil Majumder
Roy Ka-Wei Lee
Rada Mihalcea
Soujanya Poria
+ Language Guided Visual Question Answering: Elevate Your Multimodal Language Model Using Knowledge-Enriched Prompts 2023 Deepanway Ghosal
Navonil Majumder
Roy Lee
Rada Mihalcea
Soujanya Poria
+ iVQA: Inverse Visual Question Answering 2017 Feng Liu
Tao Xiang
Timothy M. Hospedales
Wankou Yang
Changyin Sun
+ PDF Chat iVQA: Inverse Visual Question Answering 2018 Feng Liu
Tao Xiang
Timothy M. Hospedales
Wankou Yang
Changyin Sun
+ GQA: A New Dataset for Real-World Visual Reasoning and Compositional Question Answering 2019 Drew A. Hudson
Christopher D. Manning
+ GQA: A New Dataset for Real-World Visual Reasoning and Compositional Question Answering 2019 Drew A. Hudson
Christopher D. Manning
+ VQA: Visual Question Answering 2015 Aishwarya Agrawal
Jiasen Lu
Stanislaw Antol
Margaret Mitchell
C. Lawrence Zitnick
Dhruv Batra
Devi Parikh
+ VQA: Visual Question Answering 2015 Aishwarya Agrawal
Jiasen Lu
Stanislaw Antol
Margaret Mitchell
C. Lawrence Zitnick
Dhruv Batra
Devi Parikh
+ PDF Chat VQA: Visual Question Answering 2015 Stanislaw Antol
Aishwarya Agrawal
Jiasen Lu
Margaret Mitchell
Dhruv Batra
C. Lawrence Zitnick
Devi Parikh
+ PDF Chat Socratic Questioning: Learn to Self-guide Multimodal Reasoning in the Wild 2025 Wenjie Hu
Haodi Liu
Chen Lin
Feng Zhou
Changming Xiao
Huazhe Yang
Changshui Zhang
+ OK-VQA: A Visual Question Answering Benchmark Requiring External Knowledge 2019 Kenneth Marino
Mohammad Rastegari
Ali Farhadi
Roozbeh Mottaghi
+ Making the V in VQA Matter: Elevating the Role of Image Understanding in Visual Question Answering 2016 Yash Goyal
Tejas Khot
Douglas Summers-Stay
Dhruv Batra
Devi Parikh

Works That Cite This (0)

Action Title Year Authors

Works Cited by This (33)

Action Title Year Authors
+ PDF Chat Generative Adversarial Networks 2022 Ian J. Goodfellow
Jean Pouget-Abadie
Mehdi Mirza
Bing Xu
David Warde-Farley
Sherjil Ozair
Aaron Courville
Yoshua Bengio
+ PDF Chat VQA: Visual Question Answering 2015 Stanislaw Antol
Aishwarya Agrawal
Jiasen Lu
Margaret Mitchell
Dhruv Batra
C. Lawrence Zitnick
Devi Parikh
+ PDF Chat Deep Residual Learning for Image Recognition 2016 Kaiming He
Xiangyu Zhang
Shaoqing Ren
Jian Sun
+ PDF Chat CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning 2017 Justin Johnson
Bharath Hariharan
Laurens van der Maaten
Li Fei-Fei
C. Lawrence Zitnick
Ross Girshick
+ FigureQA: An Annotated Figure Dataset for Visual Reasoning 2017 Samira Ebrahimi Kahou
Vincent Michalski
Adam Atkinson
Ákos Kádár
Adam Trischler
Yoshua Bengio
+ A Dataset and Architecture for Visual Reasoning with a Working Memory 2018 Guangyu Robert Yang
Igor Ganichev
Xiao‐Jing Wang
Jonathon Shlens
David Sussillo
+ PDF Chat Composing Text and Image for Image Retrieval - an Empirical Odyssey 2019 Nam Vo
Lu Jiang
Chen Sun
Kevin Murphy
Li-Jia Li
Li Fei-Fei
James Hays
+ PDF Chat fairseq: A Fast, Extensible Toolkit for Sequence Modeling 2019 Myle Ott
Sergey Edunov
Alexei Baevski
Angela Fan
Sam Gross
Nathan Ng
David Grangier
Michael Auli
+ Exploring Models and Data for Image Question Answering 2015 Mengye Ren
Ryan Kiros
Richard S. Zemel
+ Answering Visual What-If Questions: From Actions to Predicted Scene Descriptions 2018 Misha Wagner
Hector Basevi
Rakshith Shetty
W. Li
Mateusz Malinowski
Mario Fritz
Aleš Leonardis