CDMPP: A Device-Model Agnostic Framework for Latency Prediction of Tensor Programs
CDMPP: A Device-Model Agnostic Framework for Latency Prediction of Tensor Programs
Deep Neural Networks (DNNs) have shown excellent performance in a wide range of machine learning applications. Knowing the latency of running a DNN model or tensor program on a specific device is useful in various tasks, such as DNN graph- or tensor-level optimization and device selection. Considering the large space …