|
Going deeper with Image Transformers
|
2021
|
Hugo Touvron
Matthieu Cord
Alexandre Sablayrolles
Gabriel Synnaeve
Hervé Jégou
|
2
|
|
ShuffleNet V2: Practical Guidelines for Efficient CNN Architecture Design
|
2018
|
Ningning Ma
Xiangyu Zhang
Hai-Tao Zheng
Jian Sun
|
2
|
|
RepVGG: Making VGG-style ConvNets Great Again
|
2021
|
Xiaohan Ding
Xiangyu Zhang
Ningning Ma
Jungong Han
Guiguang Ding
Jian Sun
|
2
|
|
Deep High-Resolution Representation Learning for Human Pose Estimation
|
2019
|
Ke Sun
Bin Xiao
Dong Liu
Jingdong Wang
|
2
|
|
Learning Robust Global Representations by Penalizing Local Predictive Power
|
2019
|
Haohan Wang
Songwei Ge
Eric P. Xing
Zachary C. Lipton
|
2
|
|
Diverse Branch Block: Building a Convolution as an Inception-like Unit
|
2021
|
Xiaohan Ding
Xiangyu Zhang
Jungong Han
Guiguang Ding
|
2
|
|
Deep Residual Learning for Image Recognition
|
2016
|
Kaiming He
Xiangyu Zhang
Shaoqing Ren
Jian Sun
|
2
|
|
ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices
|
2018
|
Xiangyu Zhang
Xinyu Zhou
Mengxiao Lin
Jian Sun
|
2
|
|
The Many Faces of Robustness: A Critical Analysis of Out-of-Distribution Generalization
|
2021
|
Dan Hendrycks
Steven Basart
Norman Mu
Saurav Kadavath
Fengqiu Wang
Evan Dorundo
Rahul Desai
Tyler Zhu
Samyak Parajuli
Mike Guo
|
2
|
|
Early Convolutions Help Transformers See Better
|
2021
|
Tete Xiao
Mannat Singh
Eric Mintun
Trevor Darrell
Piotr Dollár
Ross Girshick
|
2
|
|
Natural Adversarial Examples
|
2021
|
Dan Hendrycks
Kevin Zhao
Steven Basart
Jacob Steinhardt
Dawn Song
|
2
|
|
ACNet: Strengthening the Kernel Skeletons for Powerful CNN via Asymmetric Convolution Blocks
|
2019
|
Xiaohan Ding
Yuchen Guo
Guiguang Ding
Jungong Han
|
2
|
|
MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications
|
2017
|
Andrew Howard
Menglong Zhu
Bo Chen
Dmitry Kalenichenko
Weijun Wang
Tobias Weyand
Marco Andreetto
Hartwig Adam
|
2
|
|
ConViT: improving vision transformers with soft convolutional inductive biases*
|
2022
|
Stéphane d'Ascoli
Hugo Touvron
Matthew L. Leavitt
Ari S. Morcos
Giulio Biroli
Levent Sagun
|
2
|
|
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
|
2021
|
Ze Liu
Yutong Lin
Yue Cao
Han Hu
Yixuan Wei
Zheng Zhang
Stephen Lin
Baining Guo
|
2
|
|
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
|
2020
|
Alexey Dosovitskiy
Lucas Beyer
Alexander Kolesnikov
Dirk Weissenborn
Xiaohua Zhai
Thomas Unterthiner
Mostafa Dehghani
Matthias Minderer
Georg Heigold
Sylvain Gelly
|
2
|
|
MobileNetV2: Inverted Residuals and Linear Bottlenecks
|
2018
|
Mark Sandler
Andrew Howard
Menglong Zhu
Andrey Zhmoginov
Liang-Chieh Chen
|
2
|
|
Towards Robust Vision Transformer
|
2022
|
Xiaofeng Mao
Gege Qi
Yuefeng Chen
Xiaodan Li
Ranjie Duan
Shaokai Ye
Yuan He
Hui Xue
|
2
|
|
CvT: Introducing Convolutions to Vision Transformers
|
2021
|
Haiping Wu
Bin Xiao
Noel Codella
Mengchen Liu
Xiyang Dai
Lu Yuan
Lei Zhang
|
2
|
|
Learning Joint Reconstruction of Hands and Manipulated Objects
|
2019
|
Yana Hasson
Gül Varol
Dimitrios Tzionas
Igor Kalevatykh
Michael J. Black
Ivan Laptev
Cordelia Schmid
|
1
|
|
FreiHAND: A Dataset for Markerless Capture of Hand Pose and Shape From Single RGB Images
|
2019
|
Christian Zimmermann
Duygu Ceylan
Jimei Yang
Bryan Russell
Max Argus
Thomas Brox
|
1
|
|
ExpandNets: Linear Over-parameterization to Train Compact Convolutional Networks
|
2018
|
Shuxuan Guo
Jose M. Álvarez
Mathieu Salzmann
|
1
|
|
ResNeSt: Split-Attention Networks
|
2022
|
Hang Zhang
Chongruo Wu
Zhongyue Zhang
Yi Zhu
Haibin Lin
Zhi Zhang
Yue Sun
Tong He
Jonas Mueller
R. Manmatha
|
1
|
|
Linformer: Self-Attention with Linear Complexity
|
2020
|
Sinong Wang
Belinda Z. Li
Madian Khabsa
Fang Han
Hao Ma
|
1
|
|
Designing Network Design Spaces
|
2020
|
Ilija Radosavovic
Raj Prateek Kosaraju
Ross Girshick
Kaiming He
Piotr Dollár
|
1
|
|
Weakly-Supervised Mesh-Convolutional Hand Reconstruction in the Wild
|
2020
|
Dominik Kulon
Rıza Alp Güler
Iasonas Kokkinos
Michael M. Bronstein
Stefanos Zafeiriou
|
1
|
|
GhostNet: More Features From Cheap Operations
|
2020
|
Kai Han
Yunhe Wang
Qi Tian
Jianyuan Guo
Chunjing Xu
Chang Xu
|
1
|
|
MEAL V2: Boosting Vanilla ResNet-50 to 80%+ Top-1 Accuracy on ImageNet without Tricks
|
2020
|
Zhiqiang Shen
Marios Savvides
|
1
|
|
Model Rubik's Cube: Twisting Resolution, Depth and Width for TinyNets
|
2020
|
Kai Han
Yunhe Wang
Qiulin Zhang
Wei Zhang
Chunjing Xu
Tong Zhang
|
1
|
|
SSD: Single Shot MultiBox Detector
|
2016
|
Wei Liu
Dragomir Anguelov
Dumitru Erhan
Christian Szegedy
Scott Reed
Cheng-Yang Fu
Alexander C. Berg
|
1
|
|
I2L-MeshNet: Image-to-Lixel Prediction Network for Accurate 3D Human Pose and Mesh Estimation from a Single RGB Image
|
2020
|
Gyeongsik Moon
Kyoung Mu Lee
|
1
|
|
Pose2Mesh: Graph Convolutional Network for 3D Human Pose and Mesh Recovery from a 2D Human Pose
|
2020
|
Hongsuk Choi
Gyeongsik Moon
Kyoung Mu Lee
|
1
|
|
Rethinking Bottleneck Structure for Efficient Mobile Network Design
|
2020
|
Daquan Zhou
Qibin Hou
Yunpeng Chen
Jiashi Feng
Shuicheng Yan
|
1
|
|
An Attention Free Transformer
|
2021
|
Shuangfei Zhai
Walter Talbott
Nitish Srivastava
Chen Huang
Hanlin Goh
Joshua M. Susskind
|
1
|
|
High-Performance Large-Scale Image Recognition Without Normalization
|
2021
|
Andrew Brock
Soham De
Samuel Smith
Karen Simonyan
|
1
|
|
Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions
|
2021
|
Wenhai Wang
Enze Xie
Xiang Li
Deng-Ping Fan
Kaitao Song
Ding Liang
Tong Lü
Ping Luo
Ling Shao
|
1
|
|
Conditional Positional Encodings for Vision Transformers
|
2021
|
Xiangxiang Chu
Zhi Tian
Bo Zhang
Xinlong Wang
Xiaolin Wei
Huaxia Xia
Chunhua Shen
|
1
|
|
EfficientNetV2: Smaller Models and Faster Training
|
2021
|
Mingxing Tan
Quoc V. Le
|
1
|
|
Twins: Revisiting the Design of Spatial Attention in Vision Transformers
|
2021
|
Xiangxiang Chu
Zhi Tian
Yuqing Wang
Bo Zhang
Haibing Ren
Xiaolin Wei
Huaxia Xia
Chunhua Shen
|
1
|
|
Less Is More: Pay Less Attention in Vision Transformers
|
2022
|
Zizheng Pan
Bohan Zhuang
Haoyu He
Jing Liu
Jianfei Cai
|
1
|
|
Camera-Space Hand Mesh Recovery via Semantic Aggregation and Adaptive 2D-1D Registration
|
2021
|
Xingyu Chen
Yufeng Liu
Chongyang Ma
Jianlong Chang
Huayan Wang
Chen Tian
Xiaoyan Guo
Pengfei Wan
Wen Zheng
|
1
|
|
CoAtNet: Marrying Convolution and Attention for All Data Sizes
|
2021
|
Zihang Dai
Hanxiao Liu
Quoc V. Le
Mingxing Tan
|
1
|
|
Bottleneck Transformers for Visual Recognition
|
2021
|
Aravind Srinivas
Tsung-Yi Lin
Niki Parmar
Jonathon Shlens
Pieter Abbeel
Ashish Vaswani
|
1
|
|
End-to-End Human Pose and Mesh Reconstruction with Transformers
|
2021
|
Kevin Lin
Lijuan Wang
Zicheng Liu
|
1
|
|
I2UV-HandNet: Image-to-UV Prediction Network for Accurate and High-fidelity 3D Hand Mesh Modeling
|
2021
|
Ping Chen
Yujin Chen
Dong Yang
Fangyin Wu
Qin Li
Qingpei Xia
Yong Tan
|
1
|
|
Mobile-Former: Bridging MobileNet and Transformer
|
2022
|
Yinpeng Chen
Xiyang Dai
Dongdong Chen
Mengchen Liu
Xiaoyi Dong
Lu Yuan
Zicheng Liu
|
1
|
|
Towards Accurate Alignment in Real-time 3D Hand-Mesh Reconstruction
|
2021
|
Xiao Tang
Tianyu Wang
Chi-Wing Fu
|
1
|
|
MicroNet: Improving Image Recognition with Extremely Low FLOPs
|
2021
|
Yunsheng Li
Yinpeng Chen
Xiyang Dai
Dongdong Chen
Mengchen Liu
Lu Yuan
Zicheng Liu
Lei Zhang
Nuno Vasconcelos
|
1
|
|
Hand Image Understanding via Deep Multi-Task Learning
|
2021
|
Xiong Zhang
Hongsheng Huang
Jianchao Tan
Hongmin Xu
Cheng Yang
Guozhu Peng
Lei Wang
Ji Liu
|
1
|
|
Rethinking Spatial Dimensions of Vision Transformers
|
2021
|
Byeongho Heo
Sangdoo Yun
Dongyoon Han
Sanghyuk Chun
Junsuk Choe
Seong Joon Oh
|
1
|