Things not Written in Text: Exploring Spatial Commonsense from Visual Signals
Spatial commonsense, the knowledge about the spatial positions of objects and the relationships between them (like the relative size of a lion and a girl, and the position of a boy relative to a bicycle when cycling), is an important part of commonsense knowledge. Although pretrained language models (PLMs) succeed in many NLP tasks, …