SDTP: Semantic-Aware Decoupled Transformer Pyramid for Dense Image Prediction
SDTP: Semantic-Aware Decoupled Transformer Pyramid for Dense Image Prediction
Although transformer has achieved great progress on computer vision tasks, the scale variation in dense image prediction is still the key challenge. Few effective multi-scale techniques are applied in transformer and there are two main limitations in the current methods. On the one hand, self-attention module in vanilla transformer fails …