Explicit Reasoning over End-to-End Neural Architectures for Visual Question Answering

Many vision and language tasks require commonsense reasoning beyond data-driven image and natural language processing. Here we adopt Visual Question Answering (VQA) as an example task, where a system is expected to answer a question in natural language about an image. Current state-of-the-art systems attempted to solve the task using …