+
|
Very Deep Convolutional Networks for Large-Scale Image Recognition
|
2014
|
Karen Simonyan
Andrew Zisserman
|
1
|
+
|
Implementation of on-site velocity boundary conditions for D3Q19 lattice Boltzmann simulations
|
2010
|
Martin Hecht
Jens Harting
|
1
|
+
|
Deep Unsupervised Learning using Nonequilibrium Thermodynamics
|
2015
|
Jascha Sohl‐Dickstein
Eric A. Weiss
Niru Maheswaranathan
Surya Ganguli
|
1
|
+
|
Rethinking the Inception Architecture for Computer Vision
|
2016
|
Christian Szegedy
Vincent Vanhoucke
Sergey Ioffe
Jon Shlens
Zbigniew Wojna
|
1
|
+
|
Deep Residual Learning for Image Recognition
|
2016
|
Kaiming He
Xiangyu Zhang
Shaoqing Ren
Jian Sun
|
1
|
+
|
Generative Adversarial Text to Image Synthesis
|
2016
|
Scott Reed
Zeynep Akata
Xinchen Yan
Lajanugen Logeswaran
Bernt Schiele
Honglak Lee
|
1
|
+
|
Aggregated Residual Transformations for Deep Neural Networks
|
2017
|
Saining Xie
Ross Girshick
Piotr Dollár
Zhuowen Tu
Kaiming He
|
1
|
+
|
Proximal Policy Optimization Algorithms
|
2017
|
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
|
1
|
+
|
Squeeze-and-Excitation Networks
|
2018
|
Jie Hu
Li Shen
Gang Sun
|
1
|
+
|
CBAM: Convolutional Block Attention Module
|
2018
|
Sanghyun Woo
Jongchan Park
Joon‐Young Lee
In So Kweon
|
1
|
+
|
Unified Perceptual Parsing for Scene Understanding
|
2018
|
Tete Xiao
Yingcheng Liu
Bolei Zhou
Yuning Jiang
Jian Sun
|
1
|
+
|
Res2Net: A New Multi-Scale Backbone Architecture
|
2019
|
Shanghua Gao
Ming‐Ming Cheng
Kai Zhao
Xinyu Zhang
Ming-Hsuan Yang
Philip H. S. Torr
|
1
|
+
|
Microsoft COCO: Common Objects in Context
|
2014
|
Tsung-Yi Lin
Michael Maire
Serge Belongie
Lubomir Bourdev
Ross Girshick
James Hays
Pietro Perona
Deva Ramanan
C. Lawrence Zitnick
Piotr Dollár
|
1
|
+
|
Non-local Neural Networks
|
2018
|
Xiaolong Wang
Ross Girshick
Abhinav Gupta
Kaiming He
|
1
|
+
|
ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices
|
2018
|
Xiangyu Zhang
Xinyu Zhou
Mengxiao Lin
Jian Sun
|
1
|
+
|
MobileNetV2: Inverted Residuals and Linear Bottlenecks
|
2018
|
Mark Sandler
Andrew Howard
Menglong Zhu
Andrey Zhmoginov
Liang-Chieh Chen
|
1
|
+
|
Focal Loss for Dense Object Detection
|
2017
|
Tsung-Yi Lin
Priya Goyal
Ross Girshick
Kaiming He
Piotr Dollár
|
1
|
+
|
Cascade R-CNN: Delving Into High Quality Object Detection
|
2018
|
Zhaowei Cai
Nuno Vasconcelos
|
1
|
+
|
Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning
|
2017
|
Christian Szegedy
Sergey Ioffe
Vincent Vanhoucke
Alexander A. Alemi
|
1
|
+
|
Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks
|
2019
|
Nils Reimers
Iryna Gurevych
|
1
|
+
|
Fine-Tuning Language Models from Human Preferences
|
2019
|
Daniel M. Ziegler
Nisan Stiennon
Jeffrey Wu
T. B. Brown
Alec Radford
Dario Amodei
Paul F. Christiano
Geoffrey Irving
|
1
|
+
|
Designing Network Design Spaces
|
2020
|
Ilija Radosavovic
Raj Prateek Kosaraju
Ross Girshick
Kaiming He
Piotr Dollár
|
1
|
+
|
Bridging the Gap Between Anchor-Based and Anchor-Free Detection via Adaptive Training Sample Selection
|
2020
|
Shifeng Zhang
Cheng Chi
Yongqiang Yao
Zhen Lei
Stan Z. Li
|
1
|
+
|
Denoising Diffusion Probabilistic Models
|
2020
|
Jonathan Ho
Ajay N. Jain
Pieter Abbeel
|
1
|
+
|
Zero-Shot Text-to-Image Generation
|
2021
|
Aditya Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
|
1
|
+
|
Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions
|
2021
|
Wenhai Wang
Enze Xie
Xiang Li
Deng-Ping Fan
Kaitao Song
Ding Liang
Tong Lü
Ping Luo
Ling Shao
|
1
|
+
|
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
|
2021
|
Ze Liu
Yutong Lin
Yue Cao
Han Hu
Yixuan Wei
Zheng Zhang
Stephen Lin
Baining Guo
|
1
|
+
|
Multi-Scale Vision Longformer: A New Vision Transformer for High-Resolution Image Encoding
|
2021
|
Pengchuan Zhang
Xiyang Dai
Jianwei Yang
Bin Xiao
Lu Yuan
Lei Zhang
Jianfeng Gao
|
1
|
+
|
CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification
|
2021
|
Chun-Fu Richard Chen
Quanfu Fan
Rameswar Panda
|
1
|
+
|
Twins: Revisiting the Design of Spatial Attention in Vision Transformers
|
2021
|
Xiangxiang Chu
Zhi Tian
Yuqing Wang
Bo Zhang
Haibing Ren
Xiaolin Wei
Huaxia Xia
Chunhua Shen
|
1
|
+
|
Learning Transferable Visual Models From Natural Language Supervision
|
2021
|
Alec Radford
Jong Wook Kim
Chris Hallacy
Aditya Ramesh
Gabriel Goh
Sandhini Agarwal
Girish Sastry
Amanda Askell
Pamela Mishkin
Jack Clark
|
1
|
+
|
Shuffle Transformer: Rethinking Spatial Shuffle for Vision Transformer
|
2021
|
Zilong Huang
Youcheng Ben
Guozhong Luo
Pei Cheng
Gang Yu
Bin Fu
|
1
|
+
|
DynamicViT: Efficient Vision Transformers with Dynamic Token Sparsification
|
2021
|
Yongming Rao
Wenliang Zhao
Benlin Liu
Jiwen Lu
Jie Zhou
Cho‐Jui Hsieh
|
1
|
+
|
LoRA: Low-Rank Adaptation of Large Language Models
|
2021
|
J. Edward Hu
Yelong Shen
Phillip Wallis
Zeyuan Allen-Zhu
Yuanzhi Li
Shean Wang
Weizhu Chen
|
1
|
+
|
CoAtNet: Marrying Convolution and Attention for All Data Sizes
|
2021
|
Zihang Dai
Hanxiao Liu
Quoc V. Le
Mingxing Tan
|
1
|
+
|
Bottleneck Transformers for Visual Recognition
|
2021
|
Aravind Srinivas
Tsung-Yi Lin
Niki Parmar
Jonathon Shlens
Pieter Abbeel
Ashish Vaswani
|
1
|
+
|
Sparse R-CNN: End-to-End Object Detection with Learnable Proposals
|
2021
|
Peize Sun
Rufeng Zhang
Yi Jiang
Tao Kong
Chenfeng Xu
Wei Zhan
Masayoshi Tomizuka
Lei Li
Zehuan Yuan
Changhu Wang
|
1
|
+
|
PVT v2: Improved baselines with Pyramid Vision Transformer
|
2022
|
Wenhai Wang
Enze Xie
Xiang Li
Deng-Ping Fan
Kaitao Song
Ding Liang
Tong Lü
Ping Luo
Ling Shao
|
1
|
+
|
Focal Self-attention for Local-Global Interactions in Vision Transformers
|
2021
|
Jianwei Yang
Chunyuan Li
Pengchuan Zhang
Xiyang Dai
Bin Xiao
Lu Yuan
Jianfeng Gao
|
1
|
+
|
Mobile-Former: Bridging MobileNet and Transformer
|
2022
|
Yinpeng Chen
Xiyang Dai
Dongdong Chen
Mengchen Liu
Xiaoyi Dong
Lu Yuan
Zicheng Liu
|
1
|
+
|
CvT: Introducing Convolutions to Vision Transformers
|
2021
|
Haiping Wu
Bin Xiao
Noel Codella
Mengchen Liu
Xiyang Dai
Lu Yuan
Lei Zhang
|
1
|
+
|
Co-Scale Conv-Attentional Image Transformers
|
2021
|
Weijian Xu
Yifan Xu
Tyler A. Chang
Zhuowen Tu
|
1
|
+
|
Hierarchical Text-Conditional Image Generation with CLIP Latents
|
2022
|
Aditya Ramesh
Prafulla Dhariwal
Alex Nichol
Casey Chu
Mark Chen
|
1
|
+
|
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
|
2022
|
Junnan Li
Dongxu Li
Caiming Xiong
Steven C. H. Hoi
|
1
|
+
|
Focal Modulation Networks
|
2022
|
Jianwei Yang
Chunyuan Li
Jianfeng Gao
|
1
|
+
|
Training language models to follow instructions with human feedback
|
2022
|
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
Pamela Mishkin
Chong Zhang
Sandhini Agarwal
Katarina Slama
Alex Ray
|
1
|
+
|
Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding
|
2022
|
Chitwan Saharia
William Chan
Saurabh Saxena
Lala Li
Jay Whang
Emily Denton
Seyed Kamyar Seyed Ghasemipour
Burcu Karagol Ayan
S. Sara Mahdavi
Rapha Gontijo Lopes
|
1
|
+
|
EfficientFormer: Vision Transformers at MobileNet Speed
|
2022
|
Yanyu Li
Geng Yuan
Yang Wen
Eric Hu
Georgios Evangelidis
Sergey Tulyakov
Yanzhi Wang
Jian Ren
|
1
|
+
|
Next-ViT: Next Generation Vision Transformer for Efficient Deployment in Realistic Industrial Scenarios
|
2022
|
Jiashi Li
Xin Xia
Wei Li
Huixia Li
Xing Wang
Xuefeng Xiao
Rui Wang
Min Zheng
Xin Pan
|
1
|
+
|
HorNet: Efficient High-Order Spatial Interactions with Recursive Gated Convolutions
|
2022
|
Yongming Rao
Wenliang Zhao
Yansong Tang
Jie Zhou
Ser-Nam Lim
Jiwen Lu
|
1
|