LCTR: On Awakening the Local Continuity of Transformer for Weakly Supervised Object Localization

Type: Article

Publication Date: 2022-06-28

Citations: 28

DOI: https://doi.org/10.1609/aaai.v36i1.19918

Abstract

Weakly supervised object localization (WSOL) aims to learn object localizer solely by using image-level labels. The convolution neural network (CNN) based techniques often result in highlighting the most discriminative part of objects while ignoring the entire object extent. Recently, the transformer architecture has been deployed to WSOL to capture the long-range feature dependencies with self-attention mechanism and multilayer perceptron structure. Nevertheless, transformers lack the locality inductive bias inherent to CNNs and therefore may deteriorate local feature details in WSOL. In this paper, we propose a novel framework built upon the transformer, termed LCTR (Local Continuity TRansformer), which targets at enhancing the local perception capability of global features among long-range feature dependencies. To this end, we propose a relational patch-attention module (RPAM), which considers cross-patch information on a global basis. We further design a cue digging module (CDM), which utilizes local features to guide the learning trend of the model for highlighting the weak local responses. Finally, comprehensive experiments are carried out on two widely used datasets, ie, CUB-200-2011 and ILSVRC, to verify the effectiveness of our method.

Locations

  • Proceedings of the AAAI Conference on Artificial Intelligence - View - PDF
  • arXiv (Cornell University) - View - PDF

Works That Cite This (9)

Action Title Year Authors
+ PDF Chat TransCAM: Transformer attention-based CAM refinement for Weakly supervised semantic segmentation 2023 Ruiwen Li
Zheda Mai
Zhibo Zhang
Jongseong Jang
Scott Sanner
+ PDF Chat Dynamic Prototype Mask for Occluded Person Re-Identification 2022 Lei Tan
Pingyang Dai
Rongrong Ji
Yongjian Wu
+ PDF Chat Mixed-UNet: Refined class activation mapping for weakly-supervised semantic segmentation with multi-scale inference 2022 Yang Liu
Lijin Lian
Ersi Zhang
Lulu Xu
Chufan Xiao
Xiaoyun Zhong
Fang Li
Bin Jiang
Yuhan Dong
Lan Ma
+ PDF Chat Weakly Supervised Object Localization via Transformer with Implicit Spatial Calibration 2022 Haotian Bai
Ruimao Zhang
Jiong Wang
Xiang Wan
+ PDF Chat Spatial-Aware Token for Weakly Supervised Object Localization 2023 Pingyu Wu
Wei Zhai
Yang Cao
Jiebo Luo
Zheng-Jun Zha
+ PDF Chat Unsupervised Object Localization with Representer Point Selection 2023 Yeonghwan Song
Seok-Woo Jang
Dina Katabi
Jeany Son
+ PDF Chat Discriminative Sampling of Proposals in Self-Supervised Transformers for Weakly Supervised Object Localization 2023 Shakeeb Murtaza
Soufiane Belharbi
Marco Pedersoli
Aydin Sarraf
Éric Granger
+ PDF Chat Background-Aware Classification Activation Map for Weakly Supervised Object Localization 2023 Lei Zhu
Qi She
Qian Chen
Xiangxi Meng
Mufeng Geng
Lujia Jin
Yibao Zhang
Qiushi Ren
Yanye Lu
+ PDF Chat Generative Prompt Model for Weakly Supervised Object Localization 2023 Yuzhong Zhao
Qixiang Ye
Weijia Wu
Chunhua Shen
Fang Wan

Works Cited by This (32)

Action Title Year Authors
+ PDF Chat Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification 2015 Kaiming He
Xiangyu Zhang
Shaoqing Ren
Jian Sun
+ PDF Chat ImageNet Large Scale Visual Recognition Challenge 2015 Olga Russakovsky
Jia Deng
Hao Su
Jonathan Krause
Sanjeev Satheesh
Sean Ma
Zhiheng Huang
Andrej Karpathy
Aditya Khosla
Michael S. Bernstein
+ PDF Chat Learning Deep Features for Discriminative Localization 2016 Bolei Zhou
Aditya Khosla
Àgata Lapedriza
Aude Oliva
Antonio Torralba
+ PDF Chat Grad-CAM++: Generalized Gradient-Based Visual Explanations for Deep Convolutional Networks 2018 Aditya Chattopadhay
Anirban Sarkar
Prantik Howlader
Vineeth N Balasubramanian
+ PDF Chat Self-produced Guidance for Weakly-Supervised Object Localization 2018 Xiaolin Zhang
Yunchao Wei
Guoliang Kang
Yi Yang
Thomas Huang
+ Decoupled Weight Decay Regularization 2017 Ilya Loshchilov
Frank Hutter
+ PDF Chat Attention-Based Dropout Layer for Weakly Supervised Object Localization 2019 Junsuk Choe
Hyunjung Shim
+ PDF Chat Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization 2017 Ramprasaath R. Selvaraju
Michael Cogswell
Abhishek Das
Ramakrishna Vedantam
Devi Parikh
Dhruv Batra
+ PDF Chat Hide-and-Seek: Forcing a Network to be Meticulous for Weakly-Supervised Object and Action Localization 2017 Krishna Kumar Singh
Yong Jae Lee
+ PDF Chat Soft Proposal Networks for Weakly Supervised Object Localization 2017 Yi Zhu
Yanzhao Zhou
Qixiang Ye
Qiang Qiu
Jianbin Jiao