Semantic-Aware Modular Capsule Routing for Visual Question Answering
Semantic-Aware Modular Capsule Routing for Visual Question Answering
Visual Question Answering (VQA) is fundamentally compositional in nature, and many questions are simply answered by decomposing them into modular sub-problems. The recent proposed Neural Module Network (NMN) employ this strategy to question answering, whereas heavily rest with off-the-shelf layout parser or additional expert policy regarding the network architecture design …