Reusing Deep Learning Models: Challenges and Directions in Software Engineering

Type: Preprint

Publication Date: 2024-04-25

Citations: 0

DOI: https://doi.org/10.48550/arxiv.2404.16688

Abstract

Deep neural networks (DNNs) achieve state-of-the-art performance in many areas, including computer vision, system configuration, and question-answering. However, DNNs are expensive to develop, both in intellectual effort (e.g., devising new architectures) and computational costs (e.g., training). Reusing DNNs is a promising direction to amortize costs within a company and across the computing industry. As with any new technology, however, there are many challenges in reusing DNNs. These challenges include both missing technical capabilities and missing engineering practices. This vision paper describes challenges in current approaches to DNN re-use. We summarize studies of re-use failures across the spectrum of re-use techniques, including conceptual (e.g., reusing based on a research paper), adaptation (e.g., re-using by building on an existing implementation), and deployment (e.g., direct re-use on a new device). We outline possible advances that would improve each kind of re-use.

Locations

  • arXiv (Cornell University) - View - PDF

Similar Works

Action Title Year Authors
+ PDF Chat Reusing Deep Learning Models: Challenges and Directions in Software Engineering 2023 James C. Davis
Purvish Jajal
Wenxin Jiang
Taylor R. Schorlemmer
Nicholas Synovic
George K. Thiruvathukal
+ PDF Chat Repairing DNN Architecture: Are We There Yet? 2023 Jin-Han Kim
Nargiz Humbatova
Gunel Jahangirova
Paolo Tonella
Shin Yoo
+ Repairing DNN Architecture: Are We There Yet? 2023 Jin-Han Kim
Nargiz Humbatova
Gunel Jahangirova
Paolo Tonella
Shin Yoo
+ Challenges and practices of deep learning model reengineering: A case study on computer vision 2024 Wenxin Jiang
Vishnu Banna
Naveen Vivek
Abhinav Goel
Nicholas Synovic
George K. Thiruvathukal
James C. Davis
+ PDF Chat An Empirical Study of Pre-Trained Model Reuse in the Hugging Face Deep Learning Model Registry 2023 Wenxin Jiang
Nicholas Synovic
Matt Hyatt
Taylor R. Schorlemmer
Rohan Sethi
Yung‐Hsiang Lu
George K. Thiruvathukal
James C. Davis
+ An Empirical Study of Pre-Trained Model Reuse in the Hugging Face Deep Learning Model Registry 2023 Wenxin Jiang
Nicholas Synovic
Matt Hyatt
Taylor R. Schorlemmer
Rohan Sethi
Yung‐Hsiang Lu
George K. Thiruvathukal
James C. Davis
+ Challenges and Practices of Deep Learning Model Reengineering: A Case Study on Computer Vision 2023 Wenxin Jiang
Vishnu Banna
Naveen Vivek
Abhinav Goel
Nicholas Synovic
George K. Thiruvathukal
James C. Davis
+ Reusing Deep Neural Network Models through Model Re-engineering 2023 Binhang Qi
Hailong Sun
Xiang Gao
Hongyu Zhang
Zhaotian Li
Xudong Liu
+ PDF Chat Reusing Deep Neural Network Models through Model Re-engineering 2023 Binhang Qi
Hailong Sun
Xiang Gao
Hongyu Zhang
Zhaotian Li
Xudong Liu
+ PDF Chat Deep Learning-based Software Engineering: Progress, Challenges, and Opportunities 2024 Xiangping Chen
Xing Hu
Yu‐An Huang
He Jiang
Weixing Ji
Yanjie Jiang
Yanyan Jiang
Bo Liu
Hui Liu
Xiaochen Li
+ Modularizing while Training: A New Paradigm for Modularizing DNN Models 2024 Binhang Qi
Hailong Sun
Hongyu Zhang
Ruobing Zhao
Xiang Gao
+ Modularizing while Training: A New Paradigm for Modularizing DNN Models 2023 Binhang Qi
Hailong Sun
Hongyu Zhang
Ruobing Zhao
Xiang Gao
+ Pitfalls in Experiments with DNN4SE: An Analysis of the State of the Practice 2023 Sira Vegas
Sebastian Elbaum
+ Exploring Naming Conventions (and Defects) of Pre-trained Deep Learning Models in Hugging Face and Other Model Hubs 2023 Wenxin Jiang
Chingwo Cheung
George K. Thiruvathukal
James C. Davis
+ PeaTMOSS: Mining Pre-Trained Models in Open-Source Software 2023 Wenxin Jiang
Jason Jones
Jerin Yasmin
Nicholas Synovic
Rajeev Sashti
Sophie Chen
George K. Thiruvathukal
Yuan Tian
James C. Davis
+ Analysis of Failures and Risks in Deep Learning Model Converters: A Case Study in the ONNX Ecosystem 2023 Purvish Jajal
Wenxin Jiang
Arav Tewari
Joseph Woo
Yung‐Hsiang Lu
George K. Thiruvathukal
James C. Davis
+ Reusing Convolutional Neural Network Models through Modularization and Composition 2023 Binhang Qi
Hailong Sun
Hongyu Zhang
Xiang Gao
+ PDF Chat Design Smells in Deep Learning Programs: An Empirical Study 2021 Amin Nikanjam
Foutse Khomh
+ PDF Chat Design Smells in Deep Learning Programs: An Empirical Study 2021 Amin Nikanjam
Foutse Khomh
+ Design Smells in Deep Learning Programs: An Empirical Study 2021 Amin Nikanjam
Foutse Khomh

Works That Cite This (0)

Action Title Year Authors

Works Cited by This (0)

Action Title Year Authors