Zhihao Jia


All published works
Action Title Year Authors
+ Communication Bounds for the Distributed Experts Problem 2025 Zhihao Jia
Qi Pang
Trung Thai Tran
David P. Woodruff
Zhihao Zhang
Wenting Zheng
+ Atlas: Hierarchical Partitioning for Quantum Circuit Simulation on GPUs (Extended Version) 2024 Mingkuan Xu
Shiyi Cao
Xupeng Miao
Umut A. Acar
Zhihao Jia
+ Quanto: Optimizing Quantum Circuits with Automatic Generation of Circuit Identities 2024 Jessica Pointing
Oded Padon
Zhihao Jia
Henry Ma
Auguste Hirth
Jens Palsberg
Alex Aiken
+ GraphPipe: Improving Performance and Scalability of DNN Training with Graph Pipeline Parallelism 2024 Byungsoo Jeon
Mengdi Wu
Shiyi Cao
Sunghyun Kim
Sunghyun Park
Neeraj Aggarwal
Colin Unger
Daiyaan Arfeen
Peiyuan Liao
Xupeng Miao
+ SpecExec: Massively Parallel Speculative Decoding for Interactive LLM Inference on Consumer Devices 2024 Ruslan Svirschevski
Avner May
Zhuoming Chen
Beidi Chen
Zhihao Jia
Max Ryabinin
+ Quarl: A Learning-Based Quantum Circuit Optimizer 2024 Zikun Li
Jinjun Peng
Yixuan Mei
Sina Lin
Yi Wu
Oded Padon
Zhihao Jia
+ Optimal Kernel Orchestration for Tensor Programs with Korch 2024 Muyan Hu
Ashwin Venkatram
S. Biswas
Balamurugan Marimuthu
Bohan Hou
G. Oliaro
Haojie Wang
Liyan Zheng
Xupeng Miao
Jidong Zhai
+ SpecInfer: Accelerating Large Language Model Serving with Tree-based Speculative Inference and Verification 2024 Xupeng Miao
G. Oliaro
Zhihao Zhang
X Cheng
Zeyu Wang
Zhengxin Zhang
Rae Ying Yee Wong
Alan Zhu
Lijie Yang
Xiaoxiang Shi
+ SpotServe: Serving Generative Large Language Models on Preemptible Instances 2024 Xupeng Miao
Chunan Shi
Jiangfei Duan
Xiaoli Xi
Dahua Lin
Bin Cui
Zhihao Jia
+ Sequoia: Scalable, Robust, and Hardware-aware Speculative Decoding 2024 Zhuoming Chen
Avner May
Ruslan Svirschevski
Yuhsun Huang
Max Ryabinin
Zhihao Jia
Beidi Chen
+ Drone-NeRF: Efficient NeRF based 3D scene reconstruction for large-scale drone survey 2024 Zhihao Jia
Bing Wang
Changhao Chen
+ Accelerating Retrieval-Augmented Language Model Serving with Speculation 2024 Zhihao Zhang
Alan Zhu
Lijie Yang
Yihua Xu
Lanting Li
Phitchaya Mangpo Phothilimthana
Zhihao Jia
+ Optimizing DNNs With Partially Equivalent Transformations and Automated Corrections 2023 Haojie Wang
Jidong Zhai
Mingyu Gao
Feng Zhang
Tuowei Wang
Zixuan Ma
Shizhi Tang
Liyan Zheng
Wen Wang
Kaiyuan Rong
+ Quarl: A Learning-Based Quantum Circuit Optimizer 2023 Zikun Li
Jinjun Peng
Yixuan Mei
Sina Lin
Yi Chang Wu
Oded Padon
Zhihao Jia
+ Drone-NeRF: Efficient NeRF Based 3D Scene Reconstruction for Large-Scale Drone Survey 2023 Zhihao Jia
Bing Wang
Changhao Chen
+ SpotServe: Serving Generative Large Language Models on Preemptible Instances 2023 Xupeng Miao
Chunan Shi
Jiangfei Duan
Xiaoli Xi
Dahua Lin
Bin Cui
Zhihao Jia
+ Towards Efficient Generative Large Language Model Serving: A Survey from Algorithms to Systems 2023 Xupeng Miao
G. Oliaro
Zhihao Zhang
X. G. Cheng
Hongyi Jin
Tianqi Chen
Zhihao Jia
+ Collage 2022 Byungsoo Jeon
Sunghyun Park
Peiyuan Liao
Sheng Xu
Tianqi Chen
Zhihao Jia
+ Software-hardware co-design for fast and scalable training of deep learning recommendation models 2022 Dheevatsa Mudigere
Yuchen Hao
Jianyu Huang
Zhihao Jia
Andrew Tulloch
Srinivas Sridharan
Xing Liu
Mustafa Özdal
Jade Nie
Jongsoo Park
+ Collage: Automated Integration of Deep Learning Backends 2021 Byungsoo Jeon
Sunghyun Park
Peiyuan Liao
Sheng Xu
Tianqi Chen
Zhihao Jia
+ TOD: Tensor-based Outlier Detection 2021 Yue Zhao
George H. Chen
Zhihao Jia
+ Dorylus: Affordable, Scalable, and Accurate GNN Training over Billion-Edge Graphs 2021 John Thorpe
Yifan Qiao
Jonathan Eyolfson
Shen Teng
Guanzhou Hu
Zhihao Jia
Jinliang Wei
Keval Vora
Ravi Netravali
Miryung Kim
+ PET: Optimizing Tensor Programs with Partially Equivalent Transformations and Automated Corrections 2021 Haojie Wang
Jidong Zhai
Mingyu Gao
Zixuan Ma
Shizhi Tang
Liyan Zheng
Yuanzhi Li
Kaiyuan Rong
Yuanyong Chen
Zhihao Jia
+ GradSign: Model Performance Inference with Theoretical Insights 2021 Zhihao Zhang
Zhihao Jia
+ IOS: Inter-Operator Scheduler for CNN Acceleration 2020 Yaoyao Ding
Ligeng Zhu
Zhihao Jia
Gennady Pekhimenko
Song Han
+ Redundancy-Free Computation Graphs for Graph Neural Networks 2019 Zhihao Jia
Sina Lin
Rex Ying
Jiaxuan You
Jure Leskovec
Alex Aiken
+ Beyond Data and Model Parallelism for Deep Neural Networks 2018 Zhihao Jia
Matei Zaharia
Alex Aiken
+ Exploring Hidden Dimensions in Parallelizing Convolutional Neural Networks 2018 Zhihao Jia
Sina Lin
Charles R. Qi
Alex Aiken
Commonly Cited References
Action Title Year Authors # of times referenced
+ Deep Residual Learning for Image Recognition 2016 Kaiming He
Xiangyu Zhang
Shaoqing Ren
Jian Sun
7
+ Rethinking the Inception Architecture for Computer Vision 2016 Christian Szegedy
Vincent Vanhoucke
Sergey Ioffe
Jon Shlens
Zbigniew Wojna
4
+ DNNFusion: accelerating deep neural networks execution with advanced operator fusion 2021 Wei Niu
Jiexiong Guan
Yanzhi Wang
Gagan Agrawal
Bin Ren
3
+ Ansor: Generating High-Performance Tensor Programs for Deep Learning 2020 Lianmin Zheng
Chengfan Jia
Minmin Sun
Zhao Wu
Cody Hao Yu
Ameer Haj-Ali
Yida Wang
Jun Yang
Danyang Zhuo
Koushik Sen
3
+ Going deeper with convolutions 2015 Christian Szegedy
Wei Liu
Yangqing Jia
Pierre Sermanet
Scott Reed
Dragomir Anguelov
Dumitru Erhan
Vincent Vanhoucke
Andrew Rabinovich
3
+ Learning Transferable Architectures for Scalable Image Recognition 2018 Barret Zoph
Vijay Vasudevan
Jonathon Shlens
Quoc V. Le
3
+ Device Placement Optimization with Reinforcement Learning 2017 Azalia Mirhoseini
Hieu Pham
Quoc V. Le
Benoit Steiner
Rasmus Larsen
Yuefeng Zhou
Naveen Kumar
Mohammad Norouzi
Samy Bengio
Jeff Dean
3
+ Aggregated Residual Transformations for Deep Neural Networks 2017 Saining Xie
Ross Girshick
Piotr Dollár
Zhuowen Tu
Kaiming He
2
+ Fast Algorithms for Convolutional Neural Networks 2016 Andrew Lavin
Scott Gray
2
+ Probabilistic algorithms for sparse polynomials 1979 Richard Zippel
2
+ Language Models are Few-Shot Learners 2020 T. B. Brown
Benjamin F. Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
Prafulla Dhariwal
Arvind Neelakantan
Pranav Shyam
Girish Sastry
Amanda Askell
2
+ Can Spatiotemporal 3D CNNs Retrace the History of 2D CNNs and ImageNet? 2018 Kensho Hara
Hirokatsu Kataoka
Yutaka Satoh
2
+ Learning to Optimize Tensor Programs 2018 Tianqi Chen
Lianmin Zheng
Eddie Yan
Ziheng Jiang
Thierry Moreau
Luís Ceze
Carlos Guestrin
Arvind Krishnamurthy
2
+ Introduction to Algorithms, third edition 2009 Thomas H. Cormen
Charles E. Leiserson
Ronald L. Rivest
Clifford Stein
2
+ CSRNet: Dilated Convolutional Neural Networks for Understanding the Highly Congested Scenes 2018 Yuhong Li
Xiaofan Zhang
Deming Chen
2
+ Generative adversarial networks 2020 Ian Goodfellow
Jean Pouget-Abadie
Mehdi Mirza
Bing Xu
David Warde-Farley
Sherjil Ozair
Aaron Courville
Yoshua Bengio
2
+ MobileNetV2: Inverted Residuals and Linear Bottlenecks 2018 Mark Sandler
Andrew Howard
Menglong Zhu
Andrey Zhmoginov
Liang-Chieh Chen
2
+ Going Deeper with Convolutions 2014 Christian Szegedy
Wei Liu
Yangqing Jia
Pierre Sermanet
Scott Reed
Dragomir Anguelov
Dumitru Erhan
Vincent Vanhoucke
Andrew Rabinovich
2
+ Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour 2017 Priya Goyal
Piotr Dollár
Ross Girshick
Pieter Noordhuis
Lukasz Wesolowski
Aapo Kyrola
Andrew Tulloch
Yangqing Jia
Kaiming He
2
+ FlexTensor 2020 Size Zheng
Yun Liang
Shuo Wang
Renze Chen
Kaiwen Sheng
2
+ One weird trick for parallelizing convolutional neural networks 2014 Alex Krizhevsky
2
+ Tensor Comprehensions: Framework-Agnostic High-Performance Machine Learning Abstractions 2018 Nicolas Vasilache
Oleksandr Zinenko
Theodoros Theodoridis
Priya Goyal
Zachary DeVito
William S. Moses
Sven Verdoolaege
Andrew Adams
Albert Cohen
2
+ Tiramisu: a polyhedral compiler for expressing fast and portable code 2019 Riyadh Baghdadi
Jessica Ray
Malek Ben Romdhane
Emanuele Del Sozzo
Abdurrahman Akkas
Yunming Zhang
Patricia Suriana
Shoaib Kamil
Saman Amarasinghe
2
+ Very Deep Convolutional Networks for Large-Scale Image Recognition 2014 Karen Simonyan
Andrew Zisserman
2
+ Deep Learning using Rectified Linear Units (ReLU) 2018 Abien Fred Agarap
2
+ Deep Residual Learning for Image Recognition 2015 Kaiming He
Xiangyu Zhang
Shaoqing Ren
Jian Sun
2
+ cuDNN: Efficient Primitives for Deep Learning 2014 Sharan Chetlur
Cliff Woolley
Philippe Vandermersch
Jonathan Cohen
John Tran
Bryan Catanzaro
Evan Shelhamer
2
+ Adam: A Method for Stochastic Optimization 2014 Diederik P. Kingma
Jimmy Ba
2
+ Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation 2016 Yonghui Wu
Mike Schuster
Zhifeng Chen
Quoc V. Le
Mohammad Norouzi
Wolfgang Macherey
Maxim Krikun
Yuan Cao
Qin Gao
Klaus Macherey
1
+ Semi-Supervised Classification with Graph Convolutional Networks 2016 Thomas Kipf
Max Welling
1
+ ImageNet Large Scale Visual Recognition Challenge 2015 Olga Russakovsky
Jia Deng
Hao Su
Jonathan Krause
Sanjeev Satheesh
Sean Ma
Zhiheng Huang
Andrej Karpathy
Aditya Khosla
Michael S. Bernstein
1
+ Densely Connected Convolutional Networks 2016 Gao Huang
Zhuang Liu
Laurens van der Maaten
Kilian Q. Weinberger
1
+ Perceptual Losses for Real-Time Style Transfer and Super-Resolution 2016 Justin Johnson
Alexandre Alahi
Li Fei-Fei
1
+ Elementary gates for quantum computation 1995 Adriano Barenco
Charles H. Bennett
Richard Cleve
David P. DiVincenzo
Norman Margolus
Peter W. Shor
Tycho Sleator
John A. Smolin
Harald Weinfurter
1
+ Improved Classical Simulation of Quantum Circuits Dominated by Clifford Gates 2016 Sergey Bravyi
David Gosset
1
+ Isolation Forest 2008 Fei Tony Liu
Kai Ming Ting
Zhi‐Hua Zhou
1
+ SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <0.5MB model size 2016 Forrest Iandola
Song Han
Matthew W. Moskewicz
Khalid Ashraf
William J. Dally
Kurt Keutzer
1
+ TVM: End-to-End Optimization Stack for Deep Learning 2018 Tianqi Chen
Thierry Moreau
Ziheng Jiang
Haichen Shen
Eddie Yan
Leyuan Wang
Yuwei Hu
Luís Ceze
Carlos Guestrin
Arvind Krishnamurthy
1
+ Polynomial-Time T-Depth Optimization of Clifford+T Circuits Via Matroid Partitioning 2014 Matthew Amy
Dmitri Maslov
Michele Mosca
1
+ Quantum Circuit Identities 2003 Chris Lomont
1
+ Recurrent Neural Network Regularization 2014 Wojciech Zaremba
Ilya Sutskever
Oriol Vinyals
1
+ Exploring Hidden Dimensions in Parallelizing Convolutional Neural Networks 2018 Zhihao Jia
Sina Lin
Charles R. Qi
Alex Aiken
1
+ Deep Speech 2: End-to-End Speech Recognition in English and Mandarin 2015 Dario Amodei
Rishita Anubhai
Eric Battenberg
Carl Case
Jared Casper
Bryan Catanzaro
Jingdong Chen
Mike Chrzanowski
Adam Coates
Greg Diamos
1
+ Intel nGraph: An Intermediate Representation, Compiler, and Executor for Deep Learning 2018 Scott Cyphers
Arjun K. Bansal
Anahita Bhiwandiwalla
Jayaram Bobba
Matthew Brookhart
Avijit Chakraborty
Will Constable
Christian Convey
Leona Cook
Omar Kanawi
1
+ Faster gaze prediction with dense networks and Fisher pruning 2018 Lucas Theis
Iryna Korshunova
Alykhan Tejani
Ferenc Huszár
1
+ MXNet: A Flexible and Efficient Machine Learning Library for Heterogeneous Distributed Systems 2015 Tianqi Chen
Mu Li
Yutian Li
Min Lin
Naiyan Wang
Minjie Wang
Tianjun Xiao
Bing Xu
Chiyuan Zhang
Zheng Zhang
1
+ A Decomposition Theorem for Partially Ordered Sets 2009 R. P. Dilworth
1
+ Horovod: fast and easy distributed deep learning in TensorFlow 2018 Alexander Sergeev
Mike Del Balso
1
+ 3LC: Lightweight and Effective Traffic Compression for Distributed Machine Learning 2018 Hyeontaek Lim
David G. Andersen
Michael Kaminsky
1
+ Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm 2017 David Silver
Thomas Hubert
Julian Schrittwieser
Ioannis Antonoglou
Matthew Lai
Arthur Guez
Marc Lanctot
Laurent Sifre
Dharshan Kumaran
Thore Graepel
1