Knowledge Aided Consistency for Weakly Supervised Phrase Grounding
Given a natural language query, a phrase grounding system aims to localize the mentioned objects in an image. In the weakly supervised scenario, the mapping between image regions (i.e., proposals) and language is not available in the training set. Previous methods address this deficiency by training a grounding system via learning to reconstruct …