Quantifying and Alleviating the Language Prior Problem in Visual Question Answering
Benefiting from advances in computer vision, natural language processing, and information retrieval, visual question answering (VQA), which aims to answer questions about an image or a video, has received considerable attention over the past few years. Although some progress has been achieved so far, several studies have …