Ask a Question

Prefer a chat interface with context about you and your work?

GQA: A New Dataset for Real-World Visual Reasoning and Compositional Question Answering

GQA: A New Dataset for Real-World Visual Reasoning and Compositional Question Answering

We introduce GQA, a new dataset for real-world visual reasoning and compositional question answering, seeking to address key shortcomings of previous VQA datasets. We have developed a strong and robust question engine that leverages Visual Genome scene graph structures to create 22M diverse reasoning questions, which all come with functional …