+
PDF
Chat
|
Deep Residual Learning for Image Recognition
|
2016
|
Kaiming He
Xiangyu Zhang
Shaoqing Ren
Jian Sun
|
8
|
+
PDF
Chat
|
Focal Loss for Dense Object Detection
|
2017
|
Tsung-Yi Lin
Priya Goyal
Ross Girshick
Kaiming He
Piotr Dollár
|
5
|
+
PDF
Chat
|
Aggregated Residual Transformations for Deep Neural Networks
|
2017
|
Saining Xie
Ross Girshick
Piotr Dollár
Zhuowen Tu
Kaiming He
|
5
|
+
PDF
Chat
|
Cascade R-CNN: Delving Into High Quality Object Detection
|
2018
|
Zhaowei Cai
Nuno Vasconcelos
|
5
|
+
|
Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift
|
2015
|
Sergey Ioffe
Christian Szegedy
|
4
|
+
|
Decoupled Weight Decay Regularization
|
2017
|
Ilya Loshchilov
Frank Hutter
|
4
|
+
PDF
Chat
|
Squeeze-and-Excitation Networks
|
2018
|
Jie Hu
Li Shen
Gang Sun
|
4
|
+
PDF
Chat
|
Random Erasing Data Augmentation
|
2020
|
Zhun Zhong
Liang Zheng
Guoliang Kang
Shaozi Li
Yi Yang
|
4
|
+
PDF
Chat
|
Deep Networks with Stochastic Depth
|
2016
|
Gao Huang
Yu Sun
Zhuang Liu
Daniel Sedra
Kilian Q. Weinberger
|
4
|
+
PDF
Chat
|
Rethinking the Inception Architecture for Computer Vision
|
2016
|
Christian Szegedy
Vincent Vanhoucke
Sergey Ioffe
Jon Shlens
Zbigniew Wojna
|
4
|
+
PDF
Chat
|
Randaugment: Practical automated data augmentation with a reduced search space
|
2020
|
Ekin D. Cubuk
Barret Zoph
Jonathon Shlens
Quoc V. Le
|
4
|
+
PDF
Chat
|
CutMix: Regularization Strategy to Train Strong Classifiers With Localizable Features
|
2019
|
Sangdoo Yun
Dongyoon Han
Sanghyuk Chun
Seong Joon Oh
Youngjoon Yoo
Junsuk Choe
|
4
|
+
PDF
Chat
|
ResMLP: Feedforward Networks for Image Classification With Data-Efficient Training
|
2022
|
Hugo Touvron
Piotr Bojanowski
Mathilde Caron
Matthieu Cord
Alaaeldin El-Nouby
Édouard Grave
Gautier Izacard
Armand Joulin
Gabriel Synnaeve
Jakob Verbeek
|
4
|
+
|
Gaussian Error Linear Units (GELUs)
|
2016
|
Dan Hendrycks
Kevin Gimpel
|
4
|
+
PDF
Chat
|
FNet: Mixing Tokens with Fourier Transforms
|
2022
|
James Lee-Thorp
Joshua Ainslie
Ilya Eckstein
Santiago Ontañón
|
4
|
+
|
Attention Is All You Need
|
2017
|
Ashish Vaswani
Noam Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan N. Gomez
Łukasz Kaiser
Illia Polosukhin
|
3
|
+
|
Squeeze-and-Excitation Networks
|
2017
|
Jie Hu
Li Shen
Samuel Albanie
Gang Sun
Enhua Wu
|
3
|
+
PDF
Chat
|
You Only Look Once: Unified, Real-Time Object Detection
|
2016
|
Joseph Redmon
Santosh Divvala
Ross Girshick
Ali Farhadi
|
3
|
+
|
CycleMLP: A MLP-like Architecture for Dense Prediction
|
2021
|
Shoufa Chen
Enze Xie
Chongjian Ge
Runjian Chen
Ding Liang
Ping Luo
|
3
|
+
PDF
Chat
|
Training data-efficient image transformers & distillation through attention
|
2021
|
Hugo Touvron
Matthieu Cord
Matthijs Douze
Francisco Massa
Alexandre Sablayrolles
Hervé Jeǵou
|
3
|
+
PDF
Chat
|
Vision Permutator: A Permutable MLP-Like Architecture for Visual Recognition
|
2022
|
Qibin Hou
Zihang Jiang
Li Yuan
Ming‐Ming Cheng
Shuicheng Yan
Jiashi Feng
|
3
|
+
PDF
Chat
|
Feature Pyramid Networks for Object Detection
|
2017
|
Tsung-Yi Lin
Piotr Dollár
Ross Girshick
Kaiming He
Bharath Hariharan
Serge Belongie
|
3
|
+
PDF
Chat
|
Squeeze-and-Excitation Networks
|
2019
|
Jie Hu
Li Shen
Samuel Albanie
Gang Sun
Enhua Wu
|
3
|
+
|
Transformer in Transformer
|
2021
|
Kai Han
An Xiao
Enhua Wu
Jianyuan Guo
Chunjing Xu
Yunhe Wang
|
3
|
+
PDF
Chat
|
Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation
|
2014
|
Ross Girshick
Jeff Donahue
Trevor Darrell
Jitendra Malik
|
3
|
+
|
Refiner: Refining Self-attention for Vision Transformers
|
2021
|
Daquan Zhou
Yujun Shi
Bingyi Kang
Weihao Yu
Zihang Jiang
Yuan Li
Xiaojie Jin
Qibin Hou
Jiashi Feng
|
3
|
+
PDF
Chat
|
ImageNet Large Scale Visual Recognition Challenge
|
2015
|
Olga Russakovsky
Jia Deng
Hao Su
Jonathan Krause
Sanjeev Satheesh
Sean Ma
Zhiheng Huang
Andrej Karpathy
Aditya Khosla
Michael S. Bernstein
|
3
|
+
|
ResNet strikes back: An improved training procedure in timm
|
2021
|
Ross Wightman
Hugo Touvron
Hervé Jeǵou
|
3
|
+
|
SSD: Single Shot MultiBox Detector
|
2016
|
Wei Liu
Dragomir Anguelov
Dumitru Erhan
Christian Szegedy
Scott Reed
Cheng-Yang Fu
Alexander C. Berg
|
3
|
+
PDF
Chat
|
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
|
2021
|
Ze Liu
Yutong Lin
Yue Cao
Han Hu
Yixuan Wei
Zheng Zhang
Stephen Lin
Baining Guo
|
3
|
+
PDF
Chat
|
Cycle-SUM: Cycle-Consistent Adversarial LSTM Networks for Unsupervised Video Summarization
|
2019
|
Li Yuan
Francis EH Tay
Ping Li
L. P. Zhou
Jiashi Feng
|
3
|
+
PDF
Chat
|
Distilling Object Detectors With Fine-Grained Feature Imitation
|
2019
|
Tao Wang
Li Yuan
Xiaopeng Zhang
Jiashi Feng
|
3
|
+
PDF
Chat
|
End-to-End Object Detection with Transformers
|
2020
|
Nicolas Carion
Francisco Massa
Gabriel Synnaeve
Nicolas Usunier
Alexander Kirillov
Sergey Zagoruyko
|
3
|
+
|
MLP-Mixer: An all-MLP Architecture for Vision
|
2021
|
Ilya Tolstikhin
Neil Houlsby
Alexander Kolesnikov
Lucas Beyer
Xiaohua Zhai
Thomas Unterthiner
Jessica Yung
Andreas Steiner
Daniel Keysers
Jakob Uszkoreit
|
3
|
+
|
Object Relation Detection Based on One-shot Learning
|
2018
|
Li Zhou
Jian Zhao
Jianshu Li
Yuan Li
Jiashi Feng
|
3
|
+
|
Human in Events: A Large-Scale Benchmark for Human-centric Video Analysis in Complex Events
|
2020
|
Weiyao Lin
Huabin Liu
Shizhan Liu
Yuxi Li
Guo-Jun Qi
Rui Qian
Tao Wang
Nicu Sebe
Ning Xu
Hongkai Xiong
|
3
|
+
|
CrowdHuman: A Benchmark for Detecting Human in a Crowd
|
2018
|
Shuai Shao
Zijian Zhao
Boxun Li
Tete Xiao
Gang Yu
Xiangyu Zhang
Jian Sun
|
3
|
+
PDF
Chat
|
Central Similarity Quantization for Efficient Image and Video Retrieval
|
2020
|
Li Yuan
Tao Wang
Xiaopeng Zhang
Francis EH Tay
Zequn Jie
Wei Liu
Jiashi Feng
|
3
|
+
|
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
|
2018
|
Jacob Devlin
Ming‐Wei Chang
Kenton Lee
Kristina Toutanova
|
3
|
+
PDF
Chat
|
CityPersons: A Diverse Dataset for Pedestrian Detection
|
2017
|
Shanshan Zhang
Rodrigo Benenson
Bernt Schiele
|
3
|
+
PDF
Chat
|
Few-Shot Adaptive Faster R-CNN
|
2019
|
Tao Wang
Xiaopeng Zhang
Li Yuan
Jiashi Feng
|
3
|
+
PDF
Chat
|
Detection in Crowded Scenes: One Proposal, Multiple Predictions
|
2020
|
Xuangeng Chu
Anlin Zheng
Xiangyu Zhang
Jian Sun
|
3
|
+
PDF
Chat
|
Xception: Deep Learning with Depthwise Separable Convolutions
|
2017
|
François Chollet
|
3
|
+
|
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
|
2015
|
Shaoqing Ren
Kaiming He
Ross Girshick
Jian Sun
|
3
|
+
PDF
Chat
|
Designing Network Design Spaces
|
2020
|
Ilija Radosavovic
Raj Prateek Kosaraju
Ross Girshick
Kaiming He
Piotr Dollár
|
3
|
+
PDF
Chat
|
Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet
|
2021
|
Li Yuan
Yunpeng Chen
Tao Wang
Weihao Yu
Yujun Shi
Zihang Jiang
Francis E. H. Tay
Jiashi Feng
Shuicheng Yan
|
3
|
+
|
Matching pursuits with time-frequency dictionaries
|
1993
|
Stéphane Mallat
Zhifeng Zhang
|
2
|
+
|
RoBERTa: A Robustly Optimized BERT Pretraining Approach
|
2019
|
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
Mike Lewis
Luke Zettlemoyer
Veselin Stoyanov
|
2
|
+
|
$A^2$-Nets: Double Attention Networks
|
2018
|
Yunpeng Chen
Yannis Kalantidis
Jianshu Li
Shuicheng Yan
Jiashi Feng
|
2
|
+
|
Adam: A Method for Stochastic Optimization
|
2014
|
Diederik P. Kingma
Jimmy Ba
|
2
|