HAWQ: Hessian AWare Quantization of Neural Networks With Mixed-Precision

Model size and inference speed/power have become major challenges in deploying neural networks for many applications. Quantization is a promising approach to these problems. However, uniformly quantizing a model to ultra-low precision leads to significant accuracy degradation. A novel solution for this is to use mixed-precision …