Ask a Question

Prefer a chat interface with context about you and your work?

A Proximal Operator for Inducing 2:4-Sparsity

A Proximal Operator for Inducing 2:4-Sparsity

Recent hardware advancements in AI Accelerators and GPUs allow to efficiently compute sparse matrix multiplications, especially when 2 out of 4 consecutive weights are set to zero. However, this so-called 2:4 sparsity usually comes at a decreased accuracy of the model. We derive a regularizer that exploits the local correlation …