PatDNN: Achieving Real-Time DNN Execution on Mobile Devices with Pattern-based Weight Pruning

Type: Preprint

Publication Date: 2020-03-09

Citations: 220

DOI: https://doi.org/10.1145/3373376.3378534

Locations

  • arXiv (Cornell University) - View - PDF
  • DataCite API - View

Similar Works

Action Title Year Authors
+ An Image Enhancing Pattern-based Sparsity for Real-time Inference on Mobile Devices 2020 Xiaolong Ma
Wei Niu
Tianyun Zhang
Sijia Liu
Sheng Lin
Hongjia Li
Xiang Chen
Jian Tang
Kaisheng Ma
Bin Ren
+ PDF Chat Automatic Mapping of the Best-Suited DNN Pruning Schemes for Real-Time Mobile Acceleration 2022 Yifan Gong
Geng Yuan
Zheng Zhan
Wei Niu
Zhengang Li
Pu Zhao
Yuxuan Cai
Sijia Liu
Bin Ren
Xue Lin
+ PCONV: The Missing but Desirable Sparsity in DNN Weight Pruning for Real-time Execution on Mobile Devices 2019 Xiaolong Ma
Fu-Ming Guo
Wei Niu
Xue Lin
Jian Tang
Kaisheng Ma
Bin Ren
Yanzhi Wang
+ PDF Chat PCONV: The Missing but Desirable Sparsity in DNN Weight Pruning for Real-Time Execution on Mobile Devices 2020 Xiaolong Ma
Fu-Ming Guo
Wei Niu
Xue Lin
Jian Tang
Kaisheng Ma
Bin Ren
Yanzhi Wang
+ Automatic Mapping of the Best-Suited DNN Pruning Schemes for Real-Time Mobile Acceleration 2021 Yifan Gong
Geng Yuan
Zheng Zhan
Wei Niu
Zhengang Li
Pu Zhao
Yuxuan Cai
Sijia Liu
Bin Ren
Xue Lin
+ A Privacy-Preserving-Oriented DNN Pruning and Mobile Acceleration Framework 2020 Yifan Gong
Zheng Zhan
Zhengang Li
Wei Niu
Xiaolong Ma
Wenhao Wang
Bin Ren
Caiwen Ding
Xue Lin
Xiaolin Xu
+ A Privacy-Preserving DNN Pruning and Mobile Acceleration Framework. 2020 Zheng Zhan
Yifan Gong
Zhengang Li
Pu Zhao
Xiaolong Ma
Wei Niu
Xiaolin Xu
Bin Ren
Yanzhi Wang
Xue Lin
+ PDF Chat A Privacy-Preserving-Oriented DNN Pruning and Mobile Acceleration Framework 2020 Yifan Gong
Zheng Zhan
Zhengang Li
Wei Niu
Xiaolong Ma
Wenhao Wang
Bin Ren
Caiwen Ding
Xue Lin
Xiaolin Xu
+ PDF Chat Enable Deep Learning on Mobile Devices: Methods, Systems, and Applications 2022 Han Cai
Ji Lin
Yujun Lin
Zhijian Liu
Haotian Tang
Hanrui Wang
Ligeng Zhu
Song Han
+ PDF Chat Dnnshifter: An Efficient Dnn Pruning System for Edge Computing 2023 Bailey J. Eccles
Philip Rodgers
Peter Kilpatrick
Ivor Spence
Blesson Varghese
+ DNNShifter: An Efficient DNN Pruning System for Edge Computing 2023 Bailey J. Eccles
Philip Rodgers
Peter Kilpatrick
Ivor Spence
Blesson Varghese
+ GRIM: A General, Real-Time Deep Learning Inference Framework for Mobile Devices Based on Fine-Grained Structured Weight Sparsity 2021 Wei Niu
Zhengang Li
Xiaolong Ma
Peiyan Dong
Gang Zhou
Xuehai Qian
Xue Lin
Yanzhi Wang
Bin Ren
+ DNNShifter: An efficient DNN pruning system for edge computing 2023 Bailey J. Eccles
Philip Rodgers
Peter Kilpatrick
Ivor Spence
Blesson Varghese
+ GRIM: A General, Real-Time Deep Learning Inference Framework for Mobile Devices based on Fine-Grained Structured Weight Sparsity 2021 Wei Niu
Zhengang Li
Xiaolong Ma
Peiyan Dong
Gang Zhou
Xuehai Qian
Xue Lin
Yanzhi Wang
Bin Ren
+ PDF Chat GRIM: A General, Real-Time Deep Learning Inference Framework for Mobile Devices based on Fine-Grained Structured Weight Sparsity 2021 Wei Niu
Zhengang Li
Xiaolong Ma
Peiyan Dong
Gang Zhou
Xuehai Qian
Xue Lin
Yanzhi Wang
Bin Ren
+ Towards Real-Time DNN Inference on Mobile Platforms with Model Pruning and Compiler Optimization 2020 Wei Niu
Pu Zhao
Zheng Zhan
Xue Lin
Yanzhi Wang
Bin Ren
+ Towards Real-Time DNN Inference on Mobile Platforms with Model Pruning and Compiler Optimization 2020 Wei Niu
Pu Zhao
Zheng Zhan
Xue Lin
Yanzhi Wang
Bin Ren
+ Class-Aware Pruning for Efficient Neural Networks 2023 Mengnan Jiang
Jingcun Wang
Amro Eldebiky
Xunzhao Yin
Cheng Zhuo
Ing-Chao Lin
Grace Li Zhang
+ AdaDeep: A Usage-Driven, Automated Deep Model Compression Framework for Enabling Ubiquitous Intelligent Mobiles 2020 Sicong Liu
Junzhao Du
Kaiming Nan
Zimu Zhou
Hui Liu
Zhangyang Wang
Yingyan Lin
+ PDF Chat Threshold Neuron: A Brain-inspired Artificial Neuron for Efficient On-device Inference 2024 Zihao Zheng
Yuanchun Li
Jiayu Chen
Peng Zhou
Xiang Chen
Yunxin Liu

Works That Cite This (74)

Action Title Year Authors
+ PDF Chat An Image Enhancing Pattern-Based Sparsity for Real-Time Inference on Mobile Devices 2020 Xiaolong Ma
Wei Niu
Tianyun Zhang
Sijia Liu
Sheng Lin
Hongjia Li
Wujie Wen
Xiang Chen
Jian Tang
Kaisheng Ma
+ PDF Chat EdgeFM: Leveraging Foundation Model for Open-set Learning on the Edge 2023 Bufang Yang
Lixing He
Neiwen Ling
Zhenyu Yan
Guoliang Xing
Xian Shuai
Xiaozhe Ren
Xin Jiang
+ PDF Chat R-TOSS: A Framework for Real-Time Object Detection using Semi-Structured Pruning 2023 Abhishek Balasubramaniam
Febin Sunny
Sudeep Pasricha
+ PDF Chat Data-Model-Circuit Tri-Design for Ultra-Light Video Intelligence on Edge Devices 2023 Yimeng Zhang
Akshay Karkal Kamath
Qiucheng Wu
Zhiwen Fan
Wuyang Chen
Zhangyang Wang
Shiyu Chang
Sijia Liu
Cong Hao
+ Adversarial Attacks on Brain-Inspired Hyperdimensional Computing-Based Classifiers 2020 Fangfang Yang
Shaolei Ren
+ PDF Chat DNNFusion: accelerating deep neural networks execution with advanced operator fusion 2021 Wei Niu
Jiexiong Guan
Yanzhi Wang
Gagan Agrawal
Bin Ren
+ PDF Chat Automatic Mapping of the Best-Suited DNN Pruning Schemes for Real-Time Mobile Acceleration 2022 Yifan Gong
Geng Yuan
Zheng Zhan
Wei Niu
Zhengang Li
Pu Zhao
Yuxuan Cai
Sijia Liu
Bin Ren
Xue Lin
+ HighLight: Efficient and Flexible DNN Acceleration with Hierarchical Structured Sparsity 2023 Yannan Nellie Wu
Po-An Tsai
Saurav Muralidharan
Angshuman Parashar
Vivienne Sze
Joel Emer
+ An Efficient End-to-End Deep Learning Training Framework via Fine-Grained Pattern-Based Pruning. 2020 Chengming Zhang
Geng Yuan
Wei Niu
Jiannan Tian
Sian Jin
Donglin Zhuang
Zhe Jiang
Yanzhi Wang
Bin Ren
Shuaiwen Leon Song
+ PDF Chat Methods for Pruning Deep Neural Networks 2022 Sunil Vadera
Salem Ameen