Understanding in Artificial Intelligence
Stefan Maetschke, David Martínez, Pieter Barnard, Elaheh ShafieiBavani, Peter Zhong, Ying Xu, Antonio Jimeno Yepes
Type: Preprint
Publication Date: 2021-01-17
Citations: 0
Locations: arXiv (Cornell University)
Similar Works
Title | Year | Authors
Understanding in Artificial Intelligence | 2021 | Stefan Maetschke, David Martínez, Pieter Barnard, Elaheh ShafieiBavani, Peter Zhong, Ying Xu, Antonio Jimeno Yepes
Measuring Machine Intelligence Through Visual Question Answering | 2016 | C. Lawrence Zitnick, Aishwarya Agrawal, Stanislaw Antol, Margaret Mitchell, Dhruv Batra, Devi Parikh
A Comprehensive Survey on Visual Question Answering Datasets and Algorithms | 2024 | Raihan Kabir, N Haque, Md. Saiful Islam, Marium-E-Jannat
Hard to Cheat: A Turing Test based on Answering Questions about Images | 2015 | Mateusz Malinowski, Mario Fritz
OK-VQA: A Visual Question Answering Benchmark Requiring External Knowledge | 2019 | Kenneth Marino, Mohammad Rastegari, Ali Farhadi, Roozbeh Mottaghi
Integrating Knowledge and Reasoning in Image Understanding | 2019 | Somak Aditya, Yezhou Yang, Chitta Baral
Visual Question Answering: A Survey of Methods and Datasets | 2016 | Qi Wu, Damien Teney, Peng Wang, Chunhua Shen, Anthony Dick, Anton van den Hengel
Seeing in Words: Learning to Classify through Language Bottlenecks | 2023 | Khalid Saifullah, Yuxin Wen, Jonas Geiping, Micah Goldblum, Tom Goldstein
Multi: Multimodal Understanding Leaderboard with Text and Images | 2024 | Zichen Zhu, Yang Xu, Lu Chen, Jingkai Yang, Yichuan Ma, Yiming Sun, Hailin Wen, Jiaqi Liu, Jinyu Cai, Yingzi Ma
LangXAI: Integrating Large Vision Models for Generating Textual Explanations to Enhance Explainability in Visual Perception Tasks | 2024 | Hung Q. Nguyen, Tobias Clement, Loc Nguyen, Nils Kemmerzell, Binh Quang Truong, Khang Nguyen, Mohamed Abdelaal, Hung Cao
Challenges and Prospects in Vision and Language Research | 2019 | Kushal Kafle, Robik Shrestha, Christopher Kanan
WenLan 2.0: Make AI Imagine via a Multimodal Foundation Model | 2021 | Nanyi Fei, Zhiwu Lu, Yizhao Gao, Guoxing Yang, Yuqi Huo, Jingyuan Wen, Haoyu Lu, Ruihua Song, Xin Gao, Tao Xiang
MMMU-Pro: A More Robust Multi-discipline Multimodal Understanding Benchmark | 2024 | Xiang Yue, Tianyu Zheng, Yuansheng Ni, Yubo Wang, Kai Zhang, Shengbang Tong, Yuxuan Sun, Botao Yu, Ge ZHANG, Huan Sun
LangXAI: Integrating Large Vision Models for Generating Textual Explanations to Enhance Explainability in Visual Perception Tasks | 2024 | Truong Thanh Hung Nguyen, Tobias Clement, Phuc Truong Loc Nguyen, Nils Kemmerzell, Van Binh Truong, Vo Thanh Khang Nguyen, Mohamed Abdelaal, Hung Quang Cao
Cited by (0)
Citing (151)
Title | Year | Authors
A Review of Relational Machine Learning for Knowledge Graphs | 2015 | Maximilian Nickel, Kevin Murphy, Volker Tresp, Evgeniy Gabrilovich
Embedding Entities and Relations for Learning and Inference in Knowledge Bases | 2014 | Bishan Yang, Wen-tau Yih, Xiaodong He, Jianfeng Gao, Li Deng
Teaching Machines to Read and Comprehend | 2015 | Karl Moritz Hermann, Tomáš Kočiský, Edward Grefenstette, Lasse Espeholt, Will Kay, Mustafa Suleyman, Phil Blunsom
Video (language) modeling: a baseline for generative models of natural videos | 2014 | Marc’Aurelio Ranzato, Arthur Szlam, Joan Bruna, Michaël Mathieu, Ronan Collobert, Sumit Chopra
Recurrent Network Models for Human Dynamics | 2015 | Katerina Fragkiadaki, Sergey Levine, Panna Felsen, Jitendra Malik
Flickr30k Entities: Collecting Region-to-Phrase Correspondences for Richer Image-to-Sentence Models | 2015 | Bryan A. Plummer, Liwei Wang, Chris M. Cervantes, Juan C. Caicedo, Julia Hockenmaier, Svetlana Lazebnik
DRAW: A Recurrent Neural Network For Image Generation | 2015 | Karol Gregor, Ivo Danihelka, Alex Graves, Danilo Jimenez Rezende, Daan Wierstra
Microsoft COCO Captions: Data Collection and Evaluation Server | 2015 | Xinlei Chen, Hao Fang, Tsung-Yi Lin, Ramakrishna Vedantam, Saurabh Gupta, Piotr Dollár, C. Lawrence Zitnick
Show and tell: A neural image caption generator | 2015 | Oriol Vinyals, Alexander Toshev, Samy Bengio, Dumitru Erhan
Deep neural networks are easily fooled: High confidence predictions for unrecognizable images | 2015 | Anh‐Tu Nguyen, Jason Yosinski, Jeff Clune
VQA: Visual Question Answering | 2015 | Stanislaw Antol, Aishwarya Agrawal, Jiasen Lu, Margaret Mitchell, Dhruv Batra, C. Lawrence Zitnick, Devi Parikh
Deep learning in neural networks: An overview | 2014 | Jürgen Schmidhuber
Neural Programmer-Interpreters | 2015 | Scott Reed, Nando de Freitas
Gated Graph Sequence Neural Networks | 2016 | Yujia Li, Daniel Tarlow, Marc Brockschmidt, Richard S. Zemel
Harnessing Deep Neural Networks with Logic Rules | 2016 | Zhiting Hu, Xuezhe Ma, Zhengzhong Liu, Eduard Hovy, Eric P. Xing
Attend, Infer, Repeat: Fast Scene Understanding with Generative Models | 2016 | S. M. Ali Eslami, Nicolas Heess, Théophane Weber, Yuval Tassa, David Szepesvari, Koray Kavukcuoglu, Geoffrey E. Hinton
A Corpus and Evaluation Framework for Deeper Understanding of Commonsense Stories | 2016 | Nasrin Mostafazadeh, Nathanael Chambers, Xiaodong He, Devi Parikh, Dhruv Batra, Lucy Vanderwende, Pushmeet Kohli, James F. Allen
Anticipating Visual Representations from Unlabeled Video | 2016 | Carl Vondrick, Hamed Pirsiavash, Antonio Torralba
Logic Tensor Networks: Deep Learning and Logical Reasoning from Data and Knowledge | 2016 | Luciano Serafini, Artur S. d’Avila Garcez
SQuAD: 100,000+ Questions for Machine Comprehension of Text | 2016 | Pranav Rajpurkar, Jian Zhang, Konstantin Lopyrev, Percy Liang
Hierarchical Question-Image Co-Attention for Visual Question Answering | 2016 | Jiasen Lu, Jianwei Yang, Dhruv Batra, Devi Parikh
Tutorial on Variational Autoencoders | 2016 | Carl Doersch
The LAMBADA dataset: Word prediction requiring a broad discourse context | 2016 | Denis Paperno, Germán Kruszewski, Angeliki Lazaridou, Quan Ngoc Pham, Raffaella Bernardi, Sandro Pezzelle, Marco Baroni, Gemma Boleda, Raquel Fernández
TerpreT: A Probabilistic Programming Language for Program Induction | 2016 | Alexander L. Gaunt, Marc Brockschmidt, Rishabh Singh, Nate Kushman, Pushmeet Kohli, Jonathan M. Taylor, Daniel Tarlow
Generating Videos with Scene Dynamics | 2016 | Carl Vondrick, Hamed Pirsiavash, Antonio Torralba
Visual question answering: Datasets, algorithms, and future challenges | 2017 | Kushal Kafle, Christopher Kanan
Modeling Relationships in Referential Expressions with Compositional Modular Networks | 2017 | Ronghang Hu, Marcus Rohrbach, Jacob Andreas, Trevor Darrell, Kate Saenko
Making the V in VQA Matter: Elevating the Role of Image Understanding in Visual Question Answering | 2017 | Yash Goyal, Tejas Khot, Douglas Summers-Stay, Dhruv Batra, Devi Parikh
AMR-to-text Generation with Synchronous Node Replacement Grammar | 2017 | Linfeng Song, Xiaochang Peng, Yue Zhang, Zhiguo Wang, Daniel Gildea
An Analysis of Visual Question Answering Algorithms | 2017 | Kushal Kafle, Christopher Kanan
Forecasting Human Dynamics from Static Images | 2017 | Yu-Wei Chao, Shuicheng Yan, Brian Price, Scott Cohen, Jia Deng
C-VQA: A Compositional Split of the Visual Question Answering (VQA) v1.0 Dataset | 2017 | Aishwarya Agrawal, Aniruddha Kembhavi, Dhruv Batra, Devi Parikh
TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension | 2017 | Mandar Joshi, Eunsol Choi, Daniel S. Weld, Luke Zettlemoyer
Neural Embeddings of Graphs in Hyperbolic Space | 2017 | Benjamin Paul Chamberlain, James R. Clough, Marc Peter Deisenroth
Seq2SQL: Generating Structured Queries from Natural Language using Reinforcement Learning | 2017 | Victor W. Zhong, Caiming Xiong, Richard Socher
Simple and Effective Multi-Paragraph Reading Comprehension | 2017 | Christopher Clark, Matt Gardner
Dynamic Integration of Background Knowledge in Neural NLU Systems | 2017 | Dirk Weissenborn, Tomáš Kočiský, Chris Dyer
KBGAN: Adversarial Learning for Knowledge Graph Embeddings | 2017 | Liwei Cai, William Yang Wang
Stochastic Answer Networks for Machine Reading Comprehension | 2017 | Xiaodong Liu, Yelong Shen, Kevin Duh, Jianfeng Gao
First-Person Activity Forecasting with Online Inverse Reinforcement Learning | 2017 | Nicholas Rhinehart, Kris Kitani