Amir Gholami

Follow

Generating author description...

All published works
Action Title Year Authors
+ PDF Chat Squeezed Attention: Accelerating Long Context Length LLM Inference 2024 Coleman Hooper
Sehoon Kim
Hiva Mohammadzadeh
Muthucumaru Maheswaran
Joonki Paik
Michael W. Mahoney
Kurt Keutzer
Amir Gholami
+ PDF Chat Efficient and Scalable Estimation of Tool Representations in Vector Space 2024 Suhong Moon
Siddharth Jha
Lutfi Eren Erdogan
Sehoon Kim
Woosang Lim
Kurt Keutzer
Amir Gholami
+ PDF Chat TinyAgent: Function Calling at the Edge 2024 Lutfi Eren Erdogan
Nick Lee
Siddharth Jha
Se Hoon Kim
Ryan Tabrizi
Suhong Moon
Coleman Hooper
Gopala K. Anumanchipalli
Kurt Keutzer
Amir Gholami
+ PDF Chat Characterizing Prompt Compression Methods for Long Context Inference 2024 Siddharth Jha
Lutfi Eren Erdogan
Sehoon Kim
Kurt Keutzer
Amir Gholami
+ PDF Chat Reliable edge machine learning hardware for scientific applications 2024 Tommaso Lisini Baldi
Javier Campos
Benjamin Hawks
J. Ngadiuba
Nhan Viet Tran
Daniel DĂ­az
J. Duarte
Ryan Kastner
Andres Meza
M. Quinnan
+ PDF Chat Reliable edge machine learning hardware for scientific applications 2024 Tommaso Lisini Baldi
Javier Campos
Benjamin Hawks
J. Ngadiuba
Nhan Viet Tran
Daniel DĂ­az
J. Duarte
Ryan Kastner
Andres Meza
M. Quinnan
+ PDF Chat LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement 2024 Nick Lee
Thanakul Wattanawong
Sehoon Kim
Karttikeya Mangalam
Sheng Shen
Gopala Anumanchipali
Michael W. Mahoney
Kurt Keutzer
Amir Gholami
+ PDF Chat KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization 2024 Coleman Hooper
Sehoon Kim
Hiva Mohammadzadeh
Michael W. Mahoney
Yakun Sophia Shao
Kurt Keutzer
Amir Gholami
+ Speculative Decoding with Big Little Decoder 2023 Sehoon Kim
Karttikeya Mangalam
Jitendra Malik
Michael W. Mahoney
Amir Gholami
Kurt Keutzer
+ Towards Foundation Models for Scientific Machine Learning: Characterizing Scaling and Transfer Behavior 2023 Shashank Subramanian
Peter Harrington
Kurt Keutzer
W. Bhimji
Dmitriy Morozov
Michael W. Mahoney
Amir Gholami
+ SqueezeLLM: Dense-and-Sparse Quantization 2023 Sehoon Kim
Coleman Hooper
Amir Gholami
Zhen Dong
Xiuyu Li
Sheng Shen
Michael W. Mahoney
Kurt Keutzer
+ SPEED: Speculative Pipelined Execution for Efficient Decoding 2023 Coleman Hooper
Sehoon Kim
Hiva Mohammadzadeh
Hasan Genç
Kurt Keutzer
Amir Gholami
Sophia Shao
+ An LLM Compiler for Parallel Function Calling 2023 Sehoon Kim
Suhong Moon
Ryan Tabrizi
Nick Lee
Michael W. Mahoney
Kurt Keutzer
Amir Gholami
+ PDF Chat Learned Token Pruning for Transformers 2022 Sehoon Kim
Sheng Shen
David Thorsley
Amir Gholami
Woosuk Kwon
Joseph Hassoun
Kurt Keutzer
+ PDF Chat Integer-Only Zero-Shot Quantization for Efficient Speech Recognition 2022 Sehoon Kim
Amir Gholami
Zhewei Yao
Nick Lee
Patrick Wang
Aniruddha Nrusimha
Bohan Zhai
Tianren Gao
Michael W. Mahoney
Kurt Keutzer
+ PDF Chat A Survey of Quantization Methods for Efficient Neural Network Inference 2022 Amir Gholami
Sehoon Kim
Zhen Dong
Zhewei Yao
Michael W. Mahoney
Kurt Keutzer
+ PDF Chat Hessian-Aware Pruning and Optimal Neural Implant 2022 Shixing Yu
Zhewei Yao
Amir Gholami
Zhen Dong
Sehoon Kim
Michael W. Mahoney
Kurt Keutzer
+ A Fast Post-Training Pruning Framework for Transformers 2022 Woosuk Kwon
Sehoon Kim
Michael W. Mahoney
Joseph Hassoun
Kurt Keutzer
Amir Gholami
+ Squeezeformer: An Efficient Transformer for Automatic Speech Recognition 2022 Sehoon Kim
Amir Gholami
Albert C. Shaw
Nicholas Lee
Karttikeya Mangalam
Jitendra Malik
Michael W. Mahoney
Kurt Keutzer
+ PDF Chat ADAHESSIAN: An Adaptive Second Order Optimizer for Machine Learning 2021 Zhewei Yao
Amir Gholami
Sheng Shen
Mustafa Mustafa
Kurt Keutzer
Michael W. Mahoney
+ Q-ASR: Integer-only Zero-shot Quantization for Efficient Speech Recognition. 2021 Sehoon Kim
Amir Gholami
Zhewei Yao
Aniruddha Nrusimha
Bohan Zhai
Tianren Gao
Michael W. Mahoney
Kurt Keutzer
+ I-BERT: Integer-only BERT Quantization 2021 Sehoon Kim
Amir Gholami
Zhewei Yao
Michael W. Mahoney
Kurt Keutzer
+ Hessian-Aware Pruning and Optimal Neural Implant 2021 Shixing Yu
Zhewei Yao
Amir Gholami
Zhen Dong
Michael W. Mahoney
Kurt Keutzer
+ Integer-only Zero-shot Quantization for Efficient Speech Recognition 2021 Sehoon Kim
Amir Gholami
Zhewei Yao
Nick Lee
Patrick Wang
Aniruddha Nrusimha
Bohan Zhai
Tianren Gao
Michael W. Mahoney
Kurt Keutzer
+ A Survey of Quantization Methods for Efficient Neural Network Inference 2021 Amir Gholami
Sehoon Kim
Zhen Dong
Zhewei Yao
Michael W. Mahoney
Kurt Keutzer
+ Learned Token Pruning for Transformers 2021 Sehoon Kim
Sheng Shen
David Thorsley
Amir Gholami
Woosuk Kwon
Joseph Hassoun
Kurt Keutzer
+ PDF Chat PyHessian: Neural Networks Through the Lens of the Hessian 2020 Zhewei Yao
Amir Gholami
Kurt Keutzer
Michael W. Mahoney
+ PDF Chat HAWQV3: Dyadic Neural Network Quantization 2020 Zhewei Yao
Zhen Dong
Zhangcheng Zheng
Amir Gholami
Jiali Yu
Eric Tan
Leyuan Wang
Qijing Huang
Yida Wang
Michael W. Mahoney
+ Boundary thickness and robustness in learning models 2020 Yaoqing Yang
Rajiv Khanna
Yaodong Yu
Amir Gholami
Kurt Keutzer
Joseph E. Gonzalez
Kannan Ramchandran
Michael W. Mahoney
+ PDF Chat ZeroQ: A Novel Zero Shot Quantization Framework 2020 Yaohui Cai
Zhewei Yao
Zhen Dong
Amir Gholami
Michael W. Mahoney
Kurt Keutzer
+ PDF Chat Inefficiency of K-FAC for Large Batch Size Training 2020 Linjian Ma
Gabe Montague
Jiayu Ye
Zhewei Yao
Amir Gholami
Kurt Keutzer
Michael W. Mahoney
+ PDF Chat Q-BERT: Hessian Based Ultra Low Precision Quantization of BERT 2020 Sheng Shen
Zhen Dong
Jiayu Ye
Linjian Ma
Zhewei Yao
Amir Gholami
Michael W. Mahoney
Kurt Keutzer
+ Rethinking Batch Normalization in Transformers 2020 Sheng Shen
Zhewei Yao
Amir Gholami
Michael W. Mahoney
Kurt Keutzer
+ PowerNorm: Rethinking Batch Normalization in Transformers 2020 Sheng Shen
Zhewei Yao
Amir Gholami
Michael W. Mahoney
Kurt Keutzer
+ ZeroQ: A Novel Zero Shot Quantization Framework 2020 Yaohui Cai
Zhewei Yao
Zhen Dong
Amir Gholami
Michael W. Mahoney
Kurt Keutzer
+ Boundary thickness and robustness in learning models 2020 Yaoqing Yang
Rajiv Khanna
Yaodong Yu
Amir Gholami
Kurt Keutzer
Joseph E. Gonzalez
Kannan Ramchandran
Michael W. Mahoney
+ PowerNorm: Rethinking Batch Normalization in Transformers 2020 Sheng Shen
Zhewei Yao
Amir Gholami
Michael W. Mahoney
Kurt Keutzer
+ HAWQV3: Dyadic Neural Network Quantization 2020 Zhewei Yao
Zhen Dong
Zhangcheng Zheng
Amir Gholami
Jiali Yu
Eric Tan
Leyuan Wang
Qijing Huang
Yida Wang
Michael W. Mahoney
+ ADAHESSIAN: An Adaptive Second Order Optimizer for Machine Learning 2020 Zhewei Yao
Amir Gholami
Sheng Shen
Mustafa Mustafa
Kurt Keutzer
Michael W. Mahoney
+ HAWQ-V2: Hessian Aware trace-Weighted Quantization of Neural Networks 2019 Zhen Dong
Zhewei Yao
Yaohui Cai
Daiyaan Arfeen
Amir Gholami
Michael W. Mahoney
Kurt Keutzer
+ Checkmate: Breaking the Memory Wall with Optimal Tensor Rematerialization 2019 Paras Jain
Ajay N. Jain
Aniruddha Nrusimha
Amir Gholami
Pieter Abbeel
Kurt Keutzer
Ion Stoica
Joseph E. Gonzalez
+ PDF Chat HAWQ: Hessian AWare Quantization of Neural Networks With Mixed-Precision 2019 Zhen Dong
Zhewei Yao
Amir Gholami
Michael W. Mahoney
Kurt Keutzer
+ PDF Chat Trust Region Based Adversarial Attack on Neural Networks 2019 Zhewei Yao
Amir Gholami
Peng Xu
Kurt Keutzer
Michael W. Mahoney
+ PDF Chat Simulation of glioblastoma growth using a 3D multispecies tumor model with mass effect 2019 Shashank Subramanian
Amir Gholami
George Biros
+ Inefficiency of K-FAC for Large Batch Size Training 2019 Linjian Ma
Gabe Montague
Jiayu Ye
Zhewei Yao
Amir Gholami
Kurt Keutzer
Michael W. Mahoney
+ HAWQ: Hessian AWare Quantization of Neural Networks with Mixed-Precision 2019 Zhen Dong
Zhewei Yao
Amir Gholami
Michael W. Mahoney
Kurt Keutzer
+ Q-BERT: Hessian Based Ultra Low Precision Quantization of BERT 2019 Sheng Shen
Zhen Dong
Jiayu Ye
Linjian Ma
Zhewei Yao
Amir Gholami
Michael W. Mahoney
Kurt Keutzer
+ PyHessian: Neural Networks Through the Lens of the Hessian 2019 Zhewei Yao
Amir Gholami
Kurt Keutzer
Michael W. Mahoney
+ Checkmate: Breaking the Memory Wall with Optimal Tensor Rematerialization 2019 Paras Jain
Ajay N. Jain
Aniruddha Nrusimha
Amir Gholami
Pieter Abbeel
Joseph E. Gonzalez
Kurt Keutzer
Ion Stoica
+ PDF Chat CLAIRE: A Distributed-Memory Solver for Constrained Large Deformation Diffeomorphic Image Registration 2019 Andreas Mang
Amir Gholami
Christos Davatzikos
George Biros
+ HAWQ-V2: Hessian Aware trace-Weighted Quantization of Neural Networks 2019 Zhen Dong
Zhewei Yao
Yaohui Cai
Daiyaan Arfeen
Amir Gholami
Michael W. Mahoney
Kurt Keutzer
+ PDF Chat Integrated Model, Batch, and Domain Parallelism in Training Neural Networks 2018 Amir Gholami
Ariful Azad
Peter Jin
Kurt Keutzer
Aydın Buluç
+ PDF Chat Co-design of deep neural nets and neural net accelerators for embedded vision applications 2018 Kiseok Kwon
Alon Amid
Amir Gholami
Bichen Wu
Krste Asanović
Kurt Keutzer
+ PDF Chat SqueezeNext: Hardware-Aware Neural Network Design 2018 Amir Gholami
Kiseok Kwon
Bichen Wu
Zizheng Tai
Xiangyu Yue
Peter Jin
Sicheng Zhao
Kurt Keutzer
+ Co-Design of Deep Neural Nets and Neural Net Accelerators for Embedded Vision Applications 2018 Kiseok Kwon
Alon Amid
Amir Gholami
Bichen Wu
Krste Asanović
Kurt Keutzer
+ SqueezeNext: Hardware-Aware Neural Network Design 2018 Amir Gholami
Kiseok Kwon
Bichen Wu
Zizheng Tai
Xiangyu Yue
Peter Jin
Sicheng Zhao
Kurt Keutzer
+ Hessian-based Analysis of Large Batch Training and Robustness to Adversaries 2018 Zhewei Yao
Amir Gholami
Qi Lei
Kurt Keutzer
Michael W. Mahoney
+ Large batch size training of neural networks with adversarial training and second-order information 2018 Zhewei Yao
Amir Gholami
Daiyaan Arfeen
Richard Liaw
Joseph E. Gonzalez
Kurt Keutzer
Michael W. Mahoney
+ On the Computational Inefficiency of Large Batch Sizes for Stochastic Gradient Descent 2018 Noah Golmant
Nikita Vemuri
Zhewei Yao
Vladimir Feinberg
Amir Gholami
Kai Rothauge
Michael W. Mahoney
Joseph E. Gonzalez
+ Parameter Re-Initialization through Cyclical Batch Size Schedules 2018 Norman Mu
Zhewei Yao
Amir Gholami
Kurt Keutzer
Michael W. Mahoney
+ Trust Region Based Adversarial Attack on Neural Networks 2018 Zhewei Yao
Amir Gholami
Peng Xu
Kurt Keutzer
Michael W. Mahoney
+ SqueezeNext: Hardware-Aware Neural Network Design 2018 Amir Gholami
Kiseok Kwon
Bichen Wu
Zizheng Tai
Xiangyu Yue
Peter J. Jin
Sicheng Zhao
Kurt Keutzer
+ Integrated Model and Data Parallelism in Training Neural Networks. 2017 Amir Gholami
Ariful Azad
Kurt Keutzer
Aydın Buluç
+ Integrated Model, Batch and Domain Parallelism in Training Neural Networks 2017 Amir Gholami
Ariful Azad
Peter Jin
Kurt Keutzer
Aydın Buluç
+ Integrated Model, Batch and Domain Parallelism in Training Neural Networks 2017 Amir Gholami
Ariful Azad
Peter J. Jin
Kurt Keutzer
Aydın Buluç
+ PDF Chat Distributed-Memory Large Deformation Diffeomorphic 3D Image Registration 2016 Andreas Mang
Amir Gholami
George Biros
+ PDF Chat FFT, FMM, or Multigrid? A comparative Study of State-Of-the-Art Poisson Solvers for Uniform and Nonuniform Grids in the Unit Cube 2016 Amir Gholami
Dhairya Malhotra
Hari Sundar
George Biros
+ AccFFT: A library for distributed-memory FFT on CPU and GPU architectures 2015 Amir Gholami
Judith Hill
Dhairya Malhotra
George Biros
+ AccFFT: A library for distributed-memory FFT on CPU and GPU architectures 2015 Amir Gholami
Judith Hill
Dhairya Malhotra
George Biros
Common Coauthors
Commonly Cited References
Action Title Year Authors # of times referenced
+ PDF Chat Deep Residual Learning for Image Recognition 2016 Kaiming He
Xiangyu Zhang
Shaoqing Ren
Jian Sun
24
+ SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <0.5MB model size 2016 Forrest Iandola
Song Han
Matthew W. Moskewicz
Khalid Ashraf
William J. Dally
Kurt Keutzer
14
+ PDF Chat SqueezeNext: Hardware-Aware Neural Network Design 2018 Amir Gholami
Kiseok Kwon
Bichen Wu
Zizheng Tai
Xiangyu Yue
Peter Jin
Sicheng Zhao
Kurt Keutzer
14
+ Distilling the Knowledge in a Neural Network 2015 Geoffrey E. Hinton
Oriol Vinyals
Jay B. Dean
12
+ PDF Chat HAWQ: Hessian AWare Quantization of Neural Networks With Mixed-Precision 2019 Zhen Dong
Zhewei Yao
Amir Gholami
Michael W. Mahoney
Kurt Keutzer
12
+ PACT: Parameterized Clipping Activation for Quantized Neural Networks 2018 Jungwook Choi
Zhuo Wang
Swagath Venkataramani
Pierce Chuang
Vijayalakshmi Srinivasan
Kailash Gopalakrishnan
11
+ Quantizing deep convolutional networks for efficient inference: A whitepaper 2018 Raghuraman Krishnamoorthi
11
+ Hessian-based Analysis of Large Batch Training and Robustness to Adversaries 2018 Zhewei Yao
Amir Gholami
Qi Lei
Kurt Keutzer
Michael W. Mahoney
11
+ PDF Chat XNOR-Net: ImageNet Classification Using Binary Convolutional Neural Networks 2016 Mohammad Rastegari
Vicente Ordóñez
Joseph Redmon
Ali Farhadi
11
+ PDF Chat LQ-Nets: Learned Quantization for Highly Accurate and Compact Deep Neural Networks 2018 Dongqing Zhang
Jiaolong Yang
Dongqiangzi Ye
Gang Hua
10
+ MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications 2017 Andrew Howard
Menglong Zhu
Bo Chen
Dmitry Kalenichenko
Weijun Wang
Tobias Weyand
Marco Andreetto
Hartwig Adam
10
+ Mixed Precision Quantization of ConvNets via Differentiable Neural Architecture Search 2018 Bichen Wu
Yanghan Wang
Peizhao Zhang
Yuandong Tian
PĂ©ter Vajda
Kurt Keutzer
10
+ Large batch size training of neural networks with adversarial training and second-order information 2018 Zhewei Yao
Amir Gholami
Daiyaan Arfeen
Richard Liaw
Joseph E. Gonzalez
Kurt Keutzer
Michael W. Mahoney
9
+ PDF Chat MobileNetV2: Inverted Residuals and Linear Bottlenecks 2018 Mark Sandler
Andrew Howard
Menglong Zhu
Andrey Zhmoginov
Liang-Chieh Chen
9
+ RoBERTa: A Robustly Optimized BERT Pretraining Approach 2019 Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
Mike Lewis
Luke Zettlemoyer
Veselin Stoyanov
9
+ DoReFa-Net: Training Low Bitwidth Convolutional Neural Networks with Low Bitwidth Gradients 2016 Shuchang Zhou
Yuxin Wu
Zekun Ni
Xinyu Zhou
He Wen
Yuheng Zou
9
+ PDF Chat Integrated Model, Batch, and Domain Parallelism in Training Neural Networks 2018 Amir Gholami
Ariful Azad
Peter Jin
Kurt Keutzer
Aydın Buluç
8
+ Very Deep Convolutional Networks for Large-Scale Image Recognition 2014 Karen Simonyan
Andrew Zisserman
8
+ PDF Chat Value-Aware Quantization for Training and Inference of Neural Networks 2018 Eunhyeok Park
Sungjoo Yoo
PĂ©ter Vajda
8
+ Incremental Network Quantization: Towards Lossless CNNs with Low-Precision Weights 2017 Aojun Zhou
Anbang Yao
Yiwen Guo
Lin Xu
Yurong Chen
8
+ HAQ: Hardware-Aware Automated Quantization 2018 Kuan Wang
Zhijian Liu
Yujun Lin
Ji Lin
Song Han
8
+ PDF Chat Quantized Convolutional Neural Networks for Mobile Devices 2016 Jiaxiang Wu
Cong Leng
Yuhang Wang
Qinghao Hu
Jian Cheng
8
+ Exploring the Regularity of Sparse Structure in Convolutional Neural Networks 2017 Huizi Mao
Song Han
Jeff Pool
Wenshuo Li
Xingyu Liu
Yu Wang
William J. Dally
8
+ PDF Chat Q-BERT: Hessian Based Ultra Low Precision Quantization of BERT 2020 Sheng Shen
Zhen Dong
Jiayu Ye
Linjian Ma
Zhewei Yao
Amir Gholami
Michael W. Mahoney
Kurt Keutzer
7
+ Attention is All you Need 2017 Ashish Vaswani
Noam Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan N. Gomez
Ɓukasz Kaiser
Illia Polosukhin
7
+ PDF Chat Adaptive Quantization for Deep Neural Network 2018 Yiren Zhou
Seyed-Mohsen Moosavi-Dezfooli
Ngai‐Man Cheung
Pascal Frossard
7
+ PDF Chat Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference 2018 Benoit Jacob
Skirmantas Kligys
Bo Chen
Menglong Zhu
Matthew F. Tang
Andrew Howard
Hartwig Adam
Dmitry Kalenichenko
7
+ Quantized Neural Networks: Training Neural Networks with Low Precision Weights and Activations 2016 Itay Hubara
Matthieu Courbariaux
Daniel Soudry
Ran El‐Yaniv
Yoshua Bengio
7
+ FitNets: Hints for Thin Deep Nets 2014 Adriana Romero
Nicolas Ballas
Samira Ebrahimi Kahou
Antoine Chassang
Carlo Gatta
Yoshua Bengio
6
+ PyHessian: Neural Networks Through the Lens of the Hessian 2019 Zhewei Yao
Amir Gholami
Kurt Keutzer
Michael W. Mahoney
6
+ Pruning Filters for Efficient ConvNets 2016 Hao Li
Asim Kadav
Igor Đurđanović
Hanan Samet
Hans Peter Graf
6
+ Pruning Convolutional Neural Networks for Resource Efficient Inference 2016 Pavlo Molchanov
Stephen Tyree
Tero Karras
Timo Aila
Jan Kautz
6
+ PDF Chat Densely Connected Convolutional Networks 2017 Gao Huang
Zhuang Liu
Laurens van der Maaten
Kilian Q. Weinberger
6
+ PDF Chat Rethinking the Inception Architecture for Computer Vision 2016 Christian Szegedy
Vincent Vanhoucke
Sergey Ioffe
Jon Shlens
Zbigniew Wojna
6
+ Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour 2017 Priya Goyal
Piotr DollĂĄr
Ross Girshick
Pieter Noordhuis
Lukasz Wesolowski
Aapo Kyrola
Andrew Tulloch
Yangqing Jia
Kaiming He
6
+ Trained Ternary Quantization 2016 Chenzhuo Zhu
Song Han
Huizi Mao
William J. Dally
6
+ PDF Chat ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices 2018 Xiangyu Zhang
Xinyu Zhou
Mengxiao Lin
Jian Sun
6
+ MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications 2017 Andrew Howard
Menglong Zhu
Bo Chen
Dmitry Kalenichenko
Weijun Wang
Tobias Weyand
Marco Andreetto
Hartwig Adam
6
+ Ternary Weight Networks 2016 Fengfu Li
Bin Liu
6
+ PDF Chat Dreaming to Distill: Data-Free Knowledge Transfer via DeepInversion 2020 Hongxu Yin
Pavlo Molchanov
Jose M. Álvarez
Zhizhong Li
Arun Mallya
Derek Hoiem
Niraj K. Jha
Jan Kautz
5
+ Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift 2015 Sergey Ioffe
Christian Szegedy
5
+ AdaBatch: Adaptive Batch Sizes for Training Deep Neural Networks 2017 Aditya Devarakonda
Maxim Naumov
Michael Garland
5
+ PDF Chat SqueezeDet: Unified, Small, Low Power Fully Convolutional Neural Networks for Real-Time Object Detection for Autonomous Driving 2017 Bichen Wu
Alvin Wan
Forrest Iandola
Peter H. Jin
Kurt Keutzer
5
+ PDF Chat The Knowledge Within: Methods for Data-Free Model Compression 2020 Matan Haroush
Itay Hubara
Elad Hoffer
Daniel Soudry
5
+ Estimating or Propagating Gradients Through Stochastic Neurons for Conditional Computation 2013 Yoshua Bengio
Nicholas LĂ©onard
Aaron Courville
5
+ Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift 2015 Sergey Ioffe
Christian Szegedy
5
+ Some large-scale matrix computation problems 1996 Zhaojun Bai
Gark Fahey
Gene H. Golub
5
+ PDF Chat ShuffleNet V2: Practical Guidelines for Efficient CNN Architecture Design 2018 Ningning Ma
Xiangyu Zhang
Hai-Tao Zheng
Jian Sun
5
+ PDF Chat Randomized algorithms for estimating the trace of an implicit symmetric positive semi-definite matrix 2011 Haim Avron
Sivan Toledo
5
+ Don't Decay the Learning Rate, Increase the Batch Size 2017 Samuel Smith
Pieter-Jan Kindermans
Chris Ying
Quoc V. Le
5