Understanding in Artificial Intelligence.

Stefan Maetschke, David Martínez, Pieter Barnard, Elaheh ShafieiBavani, Peter Zhong, Ying Xu, Antonio Jimeno Yepes

Type: Preprint

Publication Date: 2021-01-17

Citations: 0

View

Locations

arXiv (Cornell University) - View

Similar Works

Action	Title	Year	Authors
+	Understanding in Artificial Intelligence	2021	Stefan Maetschke David Martínez Pieter Barnard Elaheh ShafieiBavani Peter Zhong Ying Xu Antonio Jimeno Yepes
+ PDF Chat	Measuring Machine Intelligence Through Visual Question Answering	2016	C. Lawrence Zitnick Aishwarya Agrawal Stanislaw Antol Margaret Mitchell Dhruv Batra Devi Parikh
+	Measuring Machine Intelligence Through Visual Question Answering	2016	C. Lawrence Zitnick Aishwarya Agrawal Stanislaw Antol Margaret Mitchell Dhruv Batra Devi Parikh
+	Measuring Machine Intelligence Through Visual Question Answering	2016	C. Lawrence Zitnick Aishwarya Agrawal Stanislaw Antol Margaret Mitchell Dhruv Batra Devi Parikh
+ PDF Chat	A Comprehensive Survey on Visual Question Answering Datasets and Algorithms	2024	Raihan Kabir N Haque Md. Saiful Islam Marium-E-Jannat
+	Hard to Cheat: A Turing Test based on Answering Questions about Images	2015	Mateusz Malinowski Mario Fritz
+	Hard to Cheat: A Turing Test based on Answering Questions about Images	2015	Mateusz Malinowski Mario Fritz
+	OK-VQA: A Visual Question Answering Benchmark Requiring External Knowledge	2019	Kenneth Marino Mohammad Rastegari Ali Farhadi Roozbeh Mottaghi
+	Integrating Knowledge and Reasoning in Image Understanding	2019	Somak Aditya Yezhou Yang Chitta Baral
+	Integrating Knowledge and Reasoning in Image Understanding	2019	Somak Aditya Yezhou Yang Chitta Baral
+	Integrating Knowledge and Reasoning in Image Understanding	2019	Somak Aditya Yezhou Yang Chitta Baral
+	Visual Question Answering: A Survey of Methods and Datasets	2016	Qi Wu Damien Teney Peng Wang Chunhua Shen Anthony Dick Anton van den Hengel
+	Seeing in Words: Learning to Classify through Language Bottlenecks	2023	Khalid Saifullah Yuxin Wen Jonas Geiping Micah Goldblum Tom Goldstein
+	OK-VQA: A Visual Question Answering Benchmark Requiring External Knowledge	2019	Kenneth Marino Mohammad Rastegari Ali Farhadi Roozbeh Mottaghi
+ PDF Chat	Multi: Multimodal Understanding Leaderboard with Text and Images	2024	Zichen Zhu Yang Xu Lu Chen Jingkai Yang Yichuan Ma Yiming Sun Hailin Wen Jiaqi Liu Jinyu Cai Yingzi Ma
+ PDF Chat	LangXAI: Integrating Large Vision Models for Generating Textual Explanations to Enhance Explainability in Visual Perception Tasks	2024	Hung Q. Nguyen Tobias Clement Loc Nguyen Nils Kemmerzell Binh Quang Truong Khang Nguyen Mohamed Abdelaal Hung Cao
+ PDF Chat	Challenges and Prospects in Vision and Language Research	2019	Kushal Kafle Robik Shrestha Christopher Kanan
+	WenLan 2.0: Make AI Imagine via a Multimodal Foundation Model.	2021	Nanyi Fei Zhiwu Lu Yizhao Gao Guoxing Yang Yuqi Huo Jingyuan Wen Haoyu Lu Ruihua Song Xin Gao Tao Xiang
+ PDF Chat	MMMU-Pro: A More Robust Multi-discipline Multimodal Understanding Benchmark	2024	Xiang Yue Tianyu Zheng Yuansheng Ni Yubo Wang Kai Zhang Shengbang Tong Yuxuan Sun Botao Yu Ge ZHANG Huan Sun
+ PDF Chat	LangXAI: Integrating Large Vision Models for Generating Textual Explanations to Enhance Explainability in Visual Perception Tasks	2024	Truong Thanh Hung Nguyen Tobias Clement Phuc Truong Loc Nguyen Nils Kemmerzell Van Binh Truong Vo Thanh Khang Nguyen Mohamed Abdelaal Hung Quang Cao

Cited by (0)

Action	Title	Year	Authors

Citing (151)

Action	Title	Year	Authors
+	A Review of Relational Machine Learning for Knowledge Graphs	2015	Maximilian Nickel Kevin Murphy Volker Tresp Evgeniy Gabrilovich
+	Embedding Entities and Relations for Learning and Inference in Knowledge Bases	2014	Bishan Yang Wen-tau Yih Xiaodong He Jianfeng Gao Li Deng
+	Teaching Machines to Read and Comprehend	2015	Karl Moritz Hermann Tomáš Kočiský Edward Grefenstette Lasse Espeholt Will Kay Mustafa Suleyman Phil Blunsom
+	Video (language) modeling: a baseline for generative models of natural videos.	2014	Marc’Aurelio Ranzato Arthur Szlam Joan Bruna Michaël Mathieu Ronan Collobert Sumit Chopra
+ PDF Chat	Recurrent Network Models for Human Dynamics	2015	Katerina Fragkiadaki Sergey Levine Panna Felsen Jitendra Malik
+ PDF Chat	Flickr30k Entities: Collecting Region-to-Phrase Correspondences for Richer Image-to-Sentence Models	2015	Bryan A. Plummer Liwei Wang Chris M. Cervantes Juan C. Caicedo Julia Hockenmaier Svetlana Lazebnik
+	DRAW: A Recurrent Neural Network For Image Generation	2015	Karol Gregor Ivo Danihelka Alex Graves Danilo Jimenez Rezende Daan Wierstra
+	Microsoft COCO Captions: Data Collection and Evaluation Server	2015	Xinlei Chen Hao Fang Tsung-Yi Lin Ramakrishna Vedantam Saurabh Gupta Piotr Dollár C. Lawrence Zitnick
+ PDF Chat	Show and tell: A neural image caption generator	2015	Oriol Vinyals Alexander Toshev Samy Bengio Dumitru Erhan
+ PDF Chat	Deep neural networks are easily fooled: High confidence predictions for unrecognizable images	2015	Anh‐Tu Nguyen Jason Yosinski Jeff Clune
+ PDF Chat	VQA: Visual Question Answering	2015	Stanislaw Antol Aishwarya Agrawal Jiasen Lu Margaret Mitchell Dhruv Batra C. Lawrence Zitnick Devi Parikh
+ PDF Chat	Deep learning in neural networks: An overview	2014	Jürgen Schmidhuber
+	Neural Programmer-Interpreters	2015	Scott Reed Nando de Freitas
+	Gated Graph Sequence Neural Networks	2016	Yujia Li Daniel Tarlow Marc Brockschmidt Richard S. Zemel
+	Harnessing Deep Neural Networks with Logic Rules	2016	Zhiting Hu Xuezhe Ma Zhengzhong Liu Eduard Hovy Eric P. Xing
+	Attend, Infer, Repeat: Fast Scene Understanding with Generative Models	2016	S. M. Ali Eslami Nicolas Heess Théophane Weber Yuval Tassa David Szepesvari Koray Kavukcuoglu Geoffrey E. Hinton
+	A Corpus and Evaluation Framework for Deeper Understanding of Commonsense Stories	2016	Nasrin Mostafazadeh Nathanael Chambers Xiaodong He Devi Parikh Dhruv Batra Lucy Vanderwende Pushmeet Kohli James F. Allen
+ PDF Chat	Anticipating Visual Representations from Unlabeled Video	2016	Carl Vondrick Hamed Pirsiavash Antonio Torralba
+	Logic Tensor Networks: Deep Learning and Logical Reasoning from Data and Knowledge	2016	Luciano Serafini Artur S. d’Avila Garcez
+	SQuAD: 100,000+ Questions for Machine Comprehension of Text	2016	Pranav Rajpurkar Jian Zhang Konstantin Lopyrev Percy Liang
+	Hierarchical Question-Image Co-Attention for Visual Question Answering	2016	Jiasen Lu Jianwei Yang Dhruv Batra Devi Parikh
+	Tutorial on Variational Autoencoders	2016	Carl Doersch
+	The LAMBADA dataset: Word prediction requiring a broad discourse context	2016	Denis Paperno Germán Kruszewski Angeliki Lazaridou Quan Ngoc Pham Raffaella Bernardi Sandro Pezzelle Marco Baroni Gemma Boleda Raquel Fernández
+	TerpreT: A Probabilistic Programming Language for Program Induction	2016	Alexander L. Gaunt Marc Brockschmidt Rishabh Singh Nate Kushman Pushmeet Kohli Jonathan M. Taylor Daniel Tarlow
+	Generating Videos with Scene Dynamics	2016	Carl Vondrick Hamed Pirsiavash Antonio Torralba
+ PDF Chat	Visual question answering: Datasets, algorithms, and future challenges	2017	Kushal Kafle Christopher Kanan
+ PDF Chat	Modeling Relationships in Referential Expressions with Compositional Modular Networks	2017	Ronghang Hu Marcus Rohrbach Jacob Andreas Trevor Darrell Kate Saenko
+ PDF Chat	Making the V in VQA Matter: Elevating the Role of Image Understanding in Visual Question Answering	2017	Yash Goyal Tejas Khot Douglas Summers-Stay Dhruv Batra Devi Parikh
+	AMR-to-text Generation with Synchronous Node Replacement Grammar	2017	Linfeng Song Xiaochang Peng Yue Zhang Zhiguo Wang Daniel Gildea
+ PDF Chat	An Analysis of Visual Question Answering Algorithms	2017	Kushal Kafle Christopher Kanan
+ PDF Chat	Forecasting Human Dynamics from Static Images	2017	Yu-Wei Chao Shuicheng Yan Brian Price Scott Cohen Jia Deng
+	C-VQA: A Compositional Split of the Visual Question Answering (VQA) v1.0 Dataset	2017	Aishwarya Agrawal Aniruddha Kembhavi Dhruv Batra Devi Parikh
+	TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension	2017	Mandar Joshi Eunsol Choi Daniel S. Weld Luke Zettlemoyer
+	Neural Embeddings of Graphs in Hyperbolic Space	2017	Benjamin Paul Chamberlain James R. Clough Marc Peter Deisenroth
+	Seq2SQL: Generating Structured Queries from Natural Language using Reinforcement Learning	2017	Victor W. Zhong Caiming Xiong Richard Socher
+	Simple and Effective Multi-Paragraph Reading Comprehension	2017	Christopher Clark Matt Gardner
+	Dynamic Integration of Background Knowledge in Neural NLU Systems	2017	Dirk Weissenborn Tomáš Kočiský Chris Dyer
+	KBGAN: Adversarial Learning for Knowledge Graph Embeddings	2017	Liwei Cai William Yang Wang
+	Stochastic Answer Networks for Machine Reading Comprehension	2017	Xiaodong Liu Yelong Shen Kevin Duh Jianfeng Gao
+ PDF Chat	First-Person Activity Forecasting with Online Inverse Reinforcement Learning	2017	Nicholas Rhinehart Kris Kitani