DNNFusion: accelerating deep neural networks execution with advanced operator fusion

Deep Neural Networks (DNNs) have emerged as the core enabler of many major applications on mobile devices. To achieve high accuracy, DNN models have become increasingly deep with hundreds or even thousands of operator layers, leading to high memory and computational requirements for inference. Operator fusion (or kernel/layer fusion) is …