Understanding in Artificial Intelligence

Type: Preprint

Publication Date: 2021-01-17

Citations: 0

Locations

  • arXiv (Cornell University)

Similar Works

Action Title Year Authors
+ Understanding in Artificial Intelligence 2021 Stefan Maetschke
David Martínez
Pieter Barnard
Elaheh ShafieiBavani
Peter Zhong
Ying Xu
Antonio Jimeno Yepes
+ Measuring Machine Intelligence Through Visual Question Answering 2016 C. Lawrence Zitnick
Aishwarya Agrawal
Stanislaw Antol
Margaret Mitchell
Dhruv Batra
Devi Parikh
+ A Comprehensive Survey on Visual Question Answering Datasets and Algorithms 2024 Raihan Kabir
N Haque
Md. Saiful Islam
Marium-E-Jannat
+ Hard to Cheat: A Turing Test based on Answering Questions about Images 2015 Mateusz Malinowski
Mario Fritz
+ OK-VQA: A Visual Question Answering Benchmark Requiring External Knowledge 2019 Kenneth Marino
Mohammad Rastegari
Ali Farhadi
Roozbeh Mottaghi
+ Integrating Knowledge and Reasoning in Image Understanding 2019 Somak Aditya
Yezhou Yang
Chitta Baral
+ Visual Question Answering: A Survey of Methods and Datasets 2016 Qi Wu
Damien Teney
Peng Wang
Chunhua Shen
Anthony Dick
Anton van den Hengel
+ Seeing in Words: Learning to Classify through Language Bottlenecks 2023 Khalid Saifullah
Yuxin Wen
Jonas Geiping
Micah Goldblum
Tom Goldstein
+ Multi: Multimodal Understanding Leaderboard with Text and Images 2024 Zichen Zhu
Yang Xu
Lu Chen
Jingkai Yang
Yichuan Ma
Yiming Sun
Hailin Wen
Jiaqi Liu
Jinyu Cai
Yingzi Ma
+ Challenges and Prospects in Vision and Language Research 2019 Kushal Kafle
Robik Shrestha
Christopher Kanan
+ WenLan 2.0: Make AI Imagine via a Multimodal Foundation Model 2021 Nanyi Fei
Zhiwu Lu
Yizhao Gao
Guoxing Yang
Yuqi Huo
Jingyuan Wen
Haoyu Lu
Ruihua Song
Xin Gao
Tao Xiang
+ MMMU-Pro: A More Robust Multi-discipline Multimodal Understanding Benchmark 2024 Xiang Yue
Tianyu Zheng
Yuansheng Ni
Yubo Wang
Kai Zhang
Shengbang Tong
Yuxuan Sun
Botao Yu
Ge ZHANG
Huan Sun
+ LangXAI: Integrating Large Vision Models for Generating Textual Explanations to Enhance Explainability in Visual Perception Tasks 2024 Truong Thanh Hung Nguyen
Tobias Clement
Phuc Truong Loc Nguyen
Nils Kemmerzell
Van Binh Truong
Vo Thanh Khang Nguyen
Mohamed Abdelaal
Hung Quang Cao

Cited by (0)

Citing (151)

Action Title Year Authors
+ A Review of Relational Machine Learning for Knowledge Graphs 2015 Maximilian Nickel
Kevin Murphy
Volker Tresp
Evgeniy Gabrilovich
+ Embedding Entities and Relations for Learning and Inference in Knowledge Bases 2014 Bishan Yang
Wen-tau Yih
Xiaodong He
Jianfeng Gao
Li Deng
+ Teaching Machines to Read and Comprehend 2015 Karl Moritz Hermann
Tomáš Kočiský
Edward Grefenstette
Lasse Espeholt
Will Kay
Mustafa Suleyman
Phil Blunsom
+ Video (language) modeling: a baseline for generative models of natural videos 2014 Marc’Aurelio Ranzato
Arthur Szlam
Joan Bruna
Michaël Mathieu
Ronan Collobert
Sumit Chopra
+ Recurrent Network Models for Human Dynamics 2015 Katerina Fragkiadaki
Sergey Levine
Panna Felsen
Jitendra Malik
+ Flickr30k Entities: Collecting Region-to-Phrase Correspondences for Richer Image-to-Sentence Models 2015 Bryan A. Plummer
Liwei Wang
Chris M. Cervantes
Juan C. Caicedo
Julia Hockenmaier
Svetlana Lazebnik
+ DRAW: A Recurrent Neural Network For Image Generation 2015 Karol Gregor
Ivo Danihelka
Alex Graves
Danilo Jimenez Rezende
Daan Wierstra
+ Microsoft COCO Captions: Data Collection and Evaluation Server 2015 Xinlei Chen
Hao Fang
Tsung-Yi Lin
Ramakrishna Vedantam
Saurabh Gupta
Piotr Dollár
C. Lawrence Zitnick
+ Show and tell: A neural image caption generator 2015 Oriol Vinyals
Alexander Toshev
Samy Bengio
Dumitru Erhan
+ Deep neural networks are easily fooled: High confidence predictions for unrecognizable images 2015 Anh‐Tu Nguyen
Jason Yosinski
Jeff Clune
+ VQA: Visual Question Answering 2015 Stanislaw Antol
Aishwarya Agrawal
Jiasen Lu
Margaret Mitchell
Dhruv Batra
C. Lawrence Zitnick
Devi Parikh
+ Deep learning in neural networks: An overview 2014 Jürgen Schmidhuber
+ Neural Programmer-Interpreters 2015 Scott Reed
Nando de Freitas
+ Gated Graph Sequence Neural Networks 2016 Yujia Li
Daniel Tarlow
Marc Brockschmidt
Richard S. Zemel
+ Harnessing Deep Neural Networks with Logic Rules 2016 Zhiting Hu
Xuezhe Ma
Zhengzhong Liu
Eduard Hovy
Eric P. Xing
+ Attend, Infer, Repeat: Fast Scene Understanding with Generative Models 2016 S. M. Ali Eslami
Nicolas Heess
Théophane Weber
Yuval Tassa
David Szepesvari
Koray Kavukcuoglu
Geoffrey E. Hinton
+ A Corpus and Evaluation Framework for Deeper Understanding of Commonsense Stories 2016 Nasrin Mostafazadeh
Nathanael Chambers
Xiaodong He
Devi Parikh
Dhruv Batra
Lucy Vanderwende
Pushmeet Kohli
James F. Allen
+ Anticipating Visual Representations from Unlabeled Video 2016 Carl Vondrick
Hamed Pirsiavash
Antonio Torralba
+ Logic Tensor Networks: Deep Learning and Logical Reasoning from Data and Knowledge 2016 Luciano Serafini
Artur S. d’Avila Garcez
+ SQuAD: 100,000+ Questions for Machine Comprehension of Text 2016 Pranav Rajpurkar
Jian Zhang
Konstantin Lopyrev
Percy Liang
+ Hierarchical Question-Image Co-Attention for Visual Question Answering 2016 Jiasen Lu
Jianwei Yang
Dhruv Batra
Devi Parikh
+ Tutorial on Variational Autoencoders 2016 Carl Doersch
+ The LAMBADA dataset: Word prediction requiring a broad discourse context 2016 Denis Paperno
Germán Kruszewski
Angeliki Lazaridou
Quan Ngoc Pham
Raffaella Bernardi
Sandro Pezzelle
Marco Baroni
Gemma Boleda
Raquel Fernández
+ TerpreT: A Probabilistic Programming Language for Program Induction 2016 Alexander L. Gaunt
Marc Brockschmidt
Rishabh Singh
Nate Kushman
Pushmeet Kohli
Jonathan M. Taylor
Daniel Tarlow
+ Generating Videos with Scene Dynamics 2016 Carl Vondrick
Hamed Pirsiavash
Antonio Torralba
+ Visual question answering: Datasets, algorithms, and future challenges 2017 Kushal Kafle
Christopher Kanan
+ Modeling Relationships in Referential Expressions with Compositional Modular Networks 2017 Ronghang Hu
Marcus Rohrbach
Jacob Andreas
Trevor Darrell
Kate Saenko
+ Making the V in VQA Matter: Elevating the Role of Image Understanding in Visual Question Answering 2017 Yash Goyal
Tejas Khot
Douglas Summers-Stay
Dhruv Batra
Devi Parikh
+ AMR-to-text Generation with Synchronous Node Replacement Grammar 2017 Linfeng Song
Xiaochang Peng
Yue Zhang
Zhiguo Wang
Daniel Gildea
+ An Analysis of Visual Question Answering Algorithms 2017 Kushal Kafle
Christopher Kanan
+ Forecasting Human Dynamics from Static Images 2017 Yu-Wei Chao
Shuicheng Yan
Brian Price
Scott Cohen
Jia Deng
+ C-VQA: A Compositional Split of the Visual Question Answering (VQA) v1.0 Dataset 2017 Aishwarya Agrawal
Aniruddha Kembhavi
Dhruv Batra
Devi Parikh
+ TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension 2017 Mandar Joshi
Eunsol Choi
Daniel S. Weld
Luke Zettlemoyer
+ Neural Embeddings of Graphs in Hyperbolic Space 2017 Benjamin Paul Chamberlain
James R. Clough
Marc Peter Deisenroth
+ Seq2SQL: Generating Structured Queries from Natural Language using Reinforcement Learning 2017 Victor W. Zhong
Caiming Xiong
Richard Socher
+ Simple and Effective Multi-Paragraph Reading Comprehension 2017 Christopher Clark
Matt Gardner
+ Dynamic Integration of Background Knowledge in Neural NLU Systems 2017 Dirk Weissenborn
Tomáš Kočiský
Chris Dyer
+ KBGAN: Adversarial Learning for Knowledge Graph Embeddings 2017 Liwei Cai
William Yang Wang
+ Stochastic Answer Networks for Machine Reading Comprehension 2017 Xiaodong Liu
Yelong Shen
Kevin Duh
Jianfeng Gao
+ First-Person Activity Forecasting with Online Inverse Reinforcement Learning 2017 Nicholas Rhinehart
Kris Kitani