LCTR: On Awakening the Local Continuity of Transformer for Weakly Supervised Object Localization
Weakly supervised object localization (WSOL) aims to learn an object localizer using only image-level labels. Convolutional neural network (CNN) based techniques often highlight only the most discriminative part of an object while ignoring its full extent. Recently, the transformer architecture has been deployed to WSOL to capture the …