AutoFormer: Searching Transformers for Visual Recognition

Recently, pure transformer-based models have shown great potential for vision tasks such as image classification and detection. However, designing transformer networks is challenging. It has been observed that the depth, embedding dimension, and number of heads can strongly affect the performance of vision transformers. Previous models configure these …