
Related Paper

HAWQ: Hessian AWare Quantization of Neural Networks With Mixed-Precision

Model size and inference speed/power have become major challenges in deploying neural networks for many applications. A promising approach to address these problems is quantization. However, uniformly quantizing a model to ultra-low precision leads to significant accuracy degradation. A novel solution for this is to use mixed-precision …
