FlexTensor

Type: Article

Publication Date: 2020-03-09

Citations: 117

DOI: https://doi.org/10.1145/3373376.3378508

Similar Works

Action Title Year Authors
+ PDF Chat Decoupling the optimization space of tensor computation for a better understanding of performance on Intel CPU 2022 Nicolas Tollenaere
+ PDF Chat Tucker Tensor Decomposition on FPGA 2019 Kaiqi Zhang
Xiyuan Zhang
Zheng Zhang
+ Tucker Tensor Decomposition on FPGA 2019 Kaiqi Zhang
Xiyuan Zhang
Zheng Zhang
+ POPA: Expressing High and Portable Performance across Spatial and Vector Architectures for Tensor Computations 2024 X. Q. Hao
Hongbo Rong
Mingzhe Zhang
C. Sun
Hong Jiang
Yun Liang
+ PDF Chat Tensor processing primitives 2021 Evangelos Georganas
Dhiraj Kalamkar
Sasikanth Avancha
Menachem Adelman
Cristina Anderson
Alexander Breuer
Jeremy Bruestle
Narendra Chaudhary
Abhisek Kundu
Denise Kutnick
+ PDF Chat Tensor processing primitives 2021 Evangelos Georganas
Dhiraj Kalamkar
Sasikanth Avancha
Menachem Adelman
Cristina Anderson
Alexander Breuer
Jeremy Bruestle
Narendra Chaudhary
Abhisek Kundu
Denise Kutnick
+ Tensaurus: A Versatile Accelerator for Mixed Sparse-Dense Tensor Computations 2020 Srivastava Nitish
Hanchen Jin
Shaden Smith
Rong Hongbo
A Rosenblueth David
Zhiru Zhang
+ PDF Chat Dissecting Tensor Cores via Microbenchmarks: Latency, Throughput and Numeric Behaviors 2022 Wei Sun
Ang Li
Tong Geng
Sander Stuijk
Henk Corporaal
+ Dissecting Tensor Cores via Microbenchmarks: Latency, Throughput and Numeric Behaviors 2022 Wei Sun
Ang Li
Tong Geng
Sander Stuijk
Henk Corporaal
+ HASCO: Towards Agile HArdware and Software CO-design for Tensor Computation 2021 Qingcheng Xiao
Size Zheng
Bingzhe Wu
Pengcheng Xu
Xuehai Qian
Yun Liang
+ PDF Chat HASCO: Towards Agile HArdware and Software CO-design for Tensor Computation 2021 Qingcheng Xiao
Size Zheng
Bingzhe Wu
Pengcheng Xu
Xuehai Qian
Yun Liang
+ FEASTA: A Flexible and Efficient Accelerator for Sparse Tensor Algebra in Machine Learning 2024 Kai Zhong
Zhenhua Zhu
Guohao Dai
Hongyi Wang
Xinhao Yang
Haoyu Zhang
Jin Si
Qiuli Mao
Shulin Zeng
Ke Hong
+ PDF Chat Morphling: A Reconfigurable Architecture for Tensor Computation 2021 Liqiang Lu
Yun Liang
+ Tensaurus: A Versatile Accelerator for Mixed Sparse-Dense Tensor Computations 2020 Nitish Srivastava
Hanchen Jin
Shaden Smith
Hongbo Rong
David H. Albonesi
Zhiru Zhang
+ Tensor Computations: Applications and Optimization (Dagstuhl Seminar 20111) 2020 Paolo Bientinesi
David A. Ham
Furong Huang
Paul H. J. Kelly
Christian Lengauer
Saday Sadayappan
+ ConvStencil: Transform Stencil Computation to Matrix Multiplication on Tensor Cores 2024 Yuetao Chen
Kun Li
Yuhao Wang
Donglin Bai
Lei Wang
Lingxiao Ma
Liang Yuan
Yunquan Zhang
Ting Cao
Mao Yang
+ LoRAStencil: Low-Rank Adaptation of Stencil Computation on Tensor Cores 2024 Yiwei Zhang
Kun Li
Liang Yuan
Jiawen Cheng
Yunquan Zhang
Ting Cao
Mao Yang
+ CUTE: A scalable CPU-centric and Ultra-utilized Tensor Engine for convolutions 2024 Wenqing Li
Jinpeng Ye
Fuxin Zhang
Tianyi Liu
Tingting Zhang
Jian Wang
+ GPTPU: Accelerating Applications using Edge Tensor Processing Units 2021 Kuan-Chieh Hsu
Hung‐Wei Tseng
+ GPTPU: Accelerating Applications using Edge Tensor Processing Units 2021 Kuan-Chieh Hsu
Hung‐Wei Tseng

Works That Cite This (47)

Action Title Year Authors
+ PDF Chat Optimal Kernel Orchestration for Tensor Programs with Korch 2024 Muyan Hu
Ashwin Venkatram
S. Biswas
Balamurugan Marimuthu
Bohan Hou
G. Oliaro
Haojie Wang
Liyan Zheng
Xupeng Miao
Jidong Zhai
+ Exploiting Subgraph Similarities for Efficient Auto-tuning of Tensor Programs 2023 Mingzhen Li
Hailong Yang
Zhang Shan-jun
Fengwei Yu
Ruihao Gong
Yi Liu
Zhongzhi Luan
Depei Qian
+ PDF Chat ANT: Exploiting Adaptive Numerical Data Type for Low-bit Deep Neural Network Quantization 2022 Cong Guo
Chen Zhang
Jingwen Leng
Zihan Liu
Fan Yang
Yunxin Liu
Minyi Guo
Yuhao Zhu
+ TensorIR: An Abstraction for Automatic Tensorized Program Optimization 2023 Siyuan Feng
Bohan Hou
Hongyi Jin
Wuwei Lin
Junru Shao
Ruihang Lai
Zihao Ye
Lianmin Zheng
Cody Hao Yu
Yong Yu
+ POPA: Expressing High and Portable Performance across Spatial and Vector Architectures for Tensor Computations 2024 X. Q. Hao
Hongbo Rong
Mingzhe Zhang
C. Sun
Hong Jiang
Yun Liang
+ Fasor: A Fast Tensor Program Optimization Framework for Efficient DNN Deployment 2024 Hanxian Huang
Xin Chen
Jishen Zhao
+ PDF Chat Autotuning Convolutions Is Easier Than You Think 2022 Nicolas Tollenaere
Guillaume Iooss
StĂŠphane Pouget
Hugo Brunie
Christophe Guillon
Albert Cohen
P. Sadayappan
Fabrice Rastello
+ TENET: A Framework for Modeling Tensor Dataflow Based on Relation-centric Notation 2021 Liqiang Lu
Naiqing Guan
Yuyue Wang
Liancheng Jia
Zizhang Luo
Jieming Yin
Jason Cong
Yun Liang
+ ECBC: Efficient Convolution via Blocked Columnizing 2021 Tianli Zhao
Qinghao Hu
Xiangyu He
Weixiang Xu
Jiaxing Wang
Cong Leng
Jian Cheng
+ PDF Chat Automatic Generation of Spatial Accelerator for Tensor Algebra 2022 Liancheng Jia
Zizhang Luo
Liqiang Lu
Yun Liang