Distributed Deep Learning Inference Acceleration using Seamless Collaboration in Edge Computing
Distributed Deep Learning Inference Acceleration using Seamless Collaboration in Edge Computing
This paper studies inference acceleration using distributed convolutional neural networks (CNNs) in collaborative edge computing. To ensure inference accuracy in inference task partitioning, we consider the receptive-field when performing segment-based partitioning. To maximize the parallelization between the communication and computing processes, thereby minimizing the total inference time of an inference …