Ask a Question

Prefer a chat interface with context about you and your work?

Modeling Localness for Self-Attention Networks

Modeling Localness for Self-Attention Networks

Self-attention networks have proven to be of profound value for its strength of capturing global dependencies. In this work, we propose to model localness for self-attention networks, which enhances the ability of capturing useful local context. We cast localness modeling as a learnable Gaussian bias, which indicates the central and …