Compact and Computationally Efficient Representation of Deep Neural Networks
Compact and Computationally Efficient Representation of Deep Neural Networks
At the core of any inference procedure, deep neural networks are dot product operations, which are the component that requires the highest computational resources. For instance, deep neural networks, such as VGG-16, require up to 15-G operations in order to perform the dot products present in a single forward pass, …