Prefer a chat interface with context about you and your work?
Can you even tell left from right? Presenting a new challenge for VQA
Visual Question Answering (VQA) needs a means of evaluating the strengths and weaknesses of models. One aspect of such an evaluation is the measurement of compositional generalisation. This relates to the ability of a model to answer well on scenes whose compositions are different from those of scenes in the …