Pruning and quantization for deep neural network acceleration: A survey