A Survey on Vision Transformer

The transformer, first applied in natural language processing, is a type of deep neural network based mainly on the self-attention mechanism. Because of its strong representation capabilities, researchers are exploring ways to apply it to computer vision tasks. In a variety of visual benchmarks, transformer-based models perform …