+
|
Attention Is All You Need
|
2017
|
Ashish Vaswani
Noam Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan N. Gomez
Łukasz Kaiser
Illia Polosukhin
|
3
|
+
|
Adafactor: Adaptive Learning Rates with Sublinear Memory Cost
|
2018
|
Noam Shazeer
Mitchell Stern
|
2
|
+
|
Learning Transferable Visual Models From Natural Language Supervision
|
2021
|
Alec Radford
Jong Wook Kim
Chris Hallacy
Aditya Ramesh
Gabriel Goh
Sandhini Agarwal
Girish Sastry
Amanda Askell
Pamela Mishkin
Jack Clark
|
2
|
+
|
Scaling Vision Transformers
|
2022
|
Xiaohua Zhai
Alexander Kolesnikov
Neil Houlsby
Lucas Beyer
|
2
|
+
|
The Power of Scale for Parameter-Efficient Prompt Tuning
|
2021
|
Brian Lester
Rami Al-Rfou
Noah Constant
|
2
|
+
|
Unifying Visual-Semantic Embeddings with Multimodal Neural Language Models
|
2014
|
Ryan Kiros
Ruslan Salakhutdinov
Richard S. Zemel
|
2
|
+
|
CIDEr: Consensus-based image description evaluation
|
2015
|
Ramakrishna Vedantam
C. Lawrence Zitnick
Devi Parikh
|
1
|
+
|
Deep visual-semantic alignments for generating image descriptions
|
2015
|
Andrej Karpathy
Li Fei-Fei
|
1
|
+
|
Sequence Transduction with Recurrent Neural Networks
|
2012
|
Alex Graves
|
1
|
+
|
Zero-Shot Learning Through Cross-Modal Transfer
|
2013
|
Richard Socher
Milind Ganjoo
Christopher D. Manning
Andrew Y. Ng
|
1
|
+
|
Label-Embedding for Image Classification
|
2015
|
Zeynep Akata
Florent Perronnin
Zaïd Harchaoui
Cordelia Schmid
|
1
|
+
|
Deep Residual Learning for Image Recognition
|
2016
|
Kaiming He
Xiangyu Zhang
Shaoqing Ren
Jian Sun
|
1
|
+
|
TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems
|
2016
|
Martín Abadi
Ashish Agarwal
Paul Barham
Eugene Brevdo
Zhifeng Chen
Craig Citro
Gregory S. Corrado
Andy Davis
Jay B. Dean
Matthieu Devin
|
1
|
+
|
Personalized speech recognition on mobile devices
|
2016
|
Ian McGraw
Rohit Prabhavalkar
Raziel Álvarez
Montse Gonzalez Arenas
Kanishka Rao
David Rybach
Ouais Alsharif
Haşim Sak
Alexander Gruenstein
Françoise Beaufays
|
1
|
+
|
Learning Visual Features from Large Weakly Supervised Data
|
2016
|
Armand Joulin
Laurens van der Maaten
Allan Jabri
Nicolas Vasilache
|
1
|
+
|
Show and tell: A neural image caption generator
|
2015
|
Oriol Vinyals
Alexander Toshev
Samy Bengio
Dumitru Erhan
|
1
|
+
|
UCF101: A Dataset of 101 Human Actions Classes From Videos in The Wild
|
2012
|
Khurram Soomro
Amir Zamir
Mubarak Shah
|
1
|
+
|
Training Deep Nets with Sublinear Memory Cost
|
2016
|
Tianqi Chen
Bing Xu
Chiyuan Zhang
Carlos Guestrin
|
1
|
+
|
Photo Aesthetics Ranking Network with Attributes and Content Adaptation
|
2016
|
Shu Kong
Xiaohui Shen
Zhe Lin
Radomír Měch
Charless C. Fowlkes
|
1
|
+
|
WaveNet: A Generative Model for Raw Audio
|
2016
|
Aäron van den Oord
Sander Dieleman
Heiga Zen
Karen Simonyan
Oriol Vinyals
Alexander Graves
Nal Kalchbrenner
Andrew Senior
Koray Kavukcuoglu
|
1
|
+
|
Joint CTC-attention based end-to-end speech recognition using multi-task learning
|
2017
|
Suyoun Kim
Takaaki Hori
Shinji Watanabe
|
1
|
+
|
Dual Attention Networks for Multimodal Reasoning and Matching
|
2017
|
Hyeonseob Nam
Jung-Woo Ha
Jeonghee Kim
|
1
|
+
|
Learning a Deep Embedding Model for Zero-Shot Learning
|
2017
|
Li Zhang
Tao Xiang
Shaogang Gong
|
1
|
+
|
Overcoming catastrophic forgetting in neural networks
|
2017
|
James Kirkpatrick
Razvan Pascanu
Neil C. Rabinowitz
Joel Veness
Guillaume Desjardins
Andrei A. Rusu
Kieran Milan
John Quan
Tiago Ramalho
Agnieszka Grabska-Barwińska
|
1
|
+
|
Learning Visual N-Grams from Web Data
|
2017
|
Ang Li
Allan Jabri
Armand Joulin
Laurens van der Maaten
|
1
|
+
|
Towards Better Decoding and Language Model Integration in Sequence to Sequence Models
|
2017
|
Jan Chorowski
Navdeep Jaitly
|
1
|
+
|
Zero-Shot Learning - The Good, the Bad and the Ugly
|
2017
|
Yongqin Xian
Bernt Schiele
Zeynep Akata
|
1
|
+
|
In-Datacenter Performance Analysis of a Tensor Processing Unit
|
2017
|
Norman P. Jouppi
Cliff Young
Nishant Patil
David A. Patterson
Gaurav Agrawal
Raminder Bajwa
S. C. Bates
Suresh Bhatia
Nan Boden
Al Borchers
|
1
|
+
|
Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour
|
2017
|
Priya Goyal
Piotr Dollár
Ross Girshick
Pieter Noordhuis
Lukasz Wesolowski
Aapo Kyrola
Andrew Tulloch
Yangqing Jia
Kaiming He
|
1
|
+
|
Recurrent neural networks with specialized word embeddings for health-domain named-entity recognition
|
2017
|
Iñigo Jauregi Unanue
Ehsan Zare Borzeshi
Massimo Piccardi
|
1
|
+
|
Describing Textures in the Wild
|
2014
|
Mircea Cimpoi
Subhransu Maji
Iasonas Kokkinos
Sammy Mohamed
Andrea Vedaldi
|
1
|
+
|
Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis
|
2018
|
Yuxuan Wang
Daisy Stanton
Yu Zhang
RJ Skerry-Ryan
Eric Battenberg
Joel Shor
Ying Xiao
Fei Ren
Jia Ye
Rif A. Saurous
|
1
|
+
|
VSE++: Improving Visual-Semantic Embeddings with Hard Negatives
|
2017
|
Fartash Faghri
David J. Fleet
Jamie Kiros
Sanja Fidler
|
1
|
+
|
Latent Embeddings for Zero-Shot Classification
|
2016
|
Yongqin Xian
Zeynep Akata
Gaurav Sharma
Quynh L. Nguyen
Matthias Hein
Bernt Schiele
|
1
|
+
|
Adafactor: Adaptive Learning Rates with Sublinear Memory Cost
|
2018
|
Noam Shazeer
Mitchell Stern
|
1
|
+
|
Rotation Equivariant CNNs for Digital Pathology
|
2018
|
Bastiaan S. Veeling
Jasper Linmans
Jim Winkens
Taco Cohen
Max Welling
|
1
|
+
|
Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis
|
2018
|
Jia Ye
Yu Zhang
Ron J. Weiss
Quan Wang
Jonathan Shen
Fei Ren
Zhifeng Chen
Patrick Nguyen
Ruoming Pang
Ignacio López Moreno
|
1
|
+
|
Representation Learning with Contrastive Predictive Coding
|
2018
|
Aäron van den Oord
Yazhe Li
Oriol Vinyals
|
1
|
+
|
Deep Context: End-to-end Contextual Speech Recognition
|
2018
|
Golan Pundak
Tara N. Sainath
Rohit Prabhavalkar
Anjuli Kannan
Ding Zhao
|
1
|
+
|
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
|
2018
|
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
|
1
|
+
|
Learning Latent Representations for Style Control and Transfer in End-to-end Speech Synthesis
|
2019
|
Yajie Zhang
Shifeng Pan
Lei He
Zhen-Hua Ling
|
1
|
+
|
Decoupled Weight Decay Regularization
|
2017
|
Ilya Loshchilov
Frank Hutter
|
1
|
+
|
Effective Aesthetics Prediction With Multi-Level Spatially Pooled Features
|
2019
|
Vlad Hosu
Bastian Goldlücke
Dietmar Saupe
|
1
|
+
|
Entity-Relation Extraction as Multi-Turn Question Answering
|
2019
|
Xiaoya Li
Fan Yin
Zijun Sun
Xiayu Li
Arianna Yuan
Duo Chai
Mingxin Zhou
Jiwei Li
|
1
|
+
|
Set Transformer: A Framework for Attention-based Permutation-Invariant Neural Networks
|
2018
|
Juho Lee
Yoonho Lee
Jungtaek Kim
Adam R. Kosiorek
Seungjin Choi
Yee Whye Teh
|
1
|
+
|
EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks
|
2019
|
Mingxing Tan
Quoc V. Le
|
1
|
+
|
Streaming End-to-end Speech Recognition for Mobile Devices
|
2019
|
Yanzhang He
Tara N. Sainath
Rohit Prabhavalkar
Ian McGraw
Raziel Álvarez
Ding Zhao
David Rybach
Anjuli Kannan
Yonghui Wu
Ruoming Pang
|
1
|
+
|
State-of-the-Art Speech Recognition with Sequence-to-Sequence Models
|
2018
|
Chung-Cheng Chiu
Tara N. Sainath
Yonghui Wu
Rohit Prabhavalkar
Patrick Nguyen
Zhifeng Chen
Anjuli Kannan
Ron J. Weiss
Kanishka Rao
Ekaterina Gonina
|
1
|
+
|
Revisiting Unreasonable Effectiveness of Data in Deep Learning Era
|
2017
|
Chen Sun
Abhinav Shrivastava
Saurabh Singh
Abhinav Gupta
|
1
|
+
|
Squeeze-and-Excitation Networks
|
2018
|
Jie Hu
Li Shen
Gang Sun
|
1
|