Jakob Uszkoreit

Follow

Generating author description...

All published works
Action Title Year Authors
+ PDF Chat Scene Representation Transformer: Geometry-Free Novel View Synthesis Through Set-Latent Scene Representations 2022 Mehdi S. M. Sajjadi
Henning Meyer
Etienne Pot
Urs Bergmann
Klaus Greff
Noha Radwan
Suhani Vora
Mario Lučić
Daniel Duckworth
Alexey Dosovitskiy
+ MLP-Mixer: An all-MLP Architecture for Vision 2021 Ilya Tolstikhin
Neil Houlsby
Alexander Kolesnikov
Lucas Beyer
Xiaohua Zhai
Thomas Unterthiner
Jessica Yung
Andreas Steiner
Daniel Keysers
Jakob Uszkoreit
+ How to train your ViT? Data, Augmentation, and Regularization in Vision Transformers. 2021 Andreas Steiner
Alexander Kolesnikov
Xiaohua Zhai
Ross Wightman
Jakob Uszkoreit
Lucas Beyer
+ PDF Chat Differentiable Patch Selection for Image Recognition 2021 Jean-Baptiste Cordonnier
Aravindh Mahendran
Alexey Dosovitskiy
Dirk Weissenborn
Jakob Uszkoreit
Thomas Unterthiner
+ Differentiable Patch Selection for Image Recognition 2021 Jean-Baptiste Cordonnier
Aravindh Mahendran
Alexey Dosovitskiy
Dirk Weissenborn
Jakob Uszkoreit
Thomas Unterthiner
+ MLP-Mixer: An all-MLP Architecture for Vision 2021 Ilya Tolstikhin
Neil Houlsby
Alexander Kolesnikov
Lucas Beyer
Xiaohua Zhai
Thomas Unterthiner
Jessica Yung
Andreas Steiner
Daniel Keysers
Jakob Uszkoreit
+ Scene Representation Transformer: Geometry-Free Novel View Synthesis Through Set-Latent Scene Representations 2021 Mehdi S. M. Sajjadi
Henning Meyer
Etienne Pot
Urs Bergmann
Klaus Greff
Noha Radwan
Suhani Vora
Mario Lučić
Daniel Duckworth
Alexey Dosovitskiy
+ How to train your ViT? Data, Augmentation, and Regularization in Vision Transformers 2021 Andreas Steiner
А. И. Колесников
Xiaohua Zhai
Ross Wightman
Jakob Uszkoreit
Lucas Beyer
+ Differentiable Patch Selection for Image Recognition 2021 Jean-Baptiste Cordonnier
Aravindh Mahendran
Alexey Dosovitskiy
Dirk Weissenborn
Jakob Uszkoreit
Thomas Unterthiner
+ Towards End-to-End In-Image Neural Machine Translation 2020 Elman Mansimov
Mitchell Stern
Mia Chen
Orhan Fırat
Jakob Uszkoreit
Puneet Jain
+ Object-Centric Learning with Slot Attention 2020 Francesco Locatello
Dirk Weissenborn
Thomas Unterthiner
Aravindh Mahendran
Georg Heigold
Jakob Uszkoreit
Alexey Dosovitskiy
Thomas Kipf
+ An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale 2020 Alexey Dosovitskiy
Lucas Beyer
Alexander Kolesnikov
Dirk Weissenborn
Xiaohua Zhai
Thomas Unterthiner
Mostafa Dehghani
Matthias Minderer
Georg Heigold
Sylvain Gelly
+ An Empirical Study of Generation Order for Machine Translation 2020 William Chan
Mitchell Stern
Jamie Kiros
Jakob Uszkoreit
+ Towards End-to-End In-Image Neural Machine Translation 2020 Elman Mansimov
Mitchell Stern
Mia Chen
Orhan Fırat
Jakob Uszkoreit
Puneet Jain
+ Towards End-to-End In-Image Neural Machine Translation 2020 Elman Mansimov
Mitchell Stern
Mia Chen
Orhan Fırat
Jakob Uszkoreit
Puneet Jain
+ Scaling Autoregressive Video Models. 2019 Dirk Weissenborn
Oscar Täckström
Jakob Uszkoreit
+ KERMIT: Generative Insertion-Based Modeling for Sequences 2019 William Chan
Nikita Kitaev
Kelvin Guu
Mitchell Stern
Jakob Uszkoreit
+ Insertion Transformer: Flexible Sequence Generation via Insertion Operations 2019 Mitchell Stern
William Chan
Jamie Kiros
Jakob Uszkoreit
+ An Empirical Study of Generation Order for Machine Translation 2019 William Chan
Mitchell Stern
Jamie Kiros
Jakob Uszkoreit
+ Scaling Autoregressive Video Models 2019 Dirk Weissenborn
Oscar Täckström
Jakob Uszkoreit
+ Blockwise Parallel Decoding for Deep Autoregressive Models 2018 Mitchell Stern
Noam Shazeer
Jakob Uszkoreit
+ Blockwise Parallel Decoding for Deep Autoregressive Models 2018 Mitchell Stern
Noam Shazeer
Jakob Uszkoreit
+ An Improved Relative Self-Attention Mechanism for Transformer with Application to Music Generation 2018 Cheng-Zhi Anna Huang
Ashish Vaswani
Jakob Uszkoreit
Noam Shazeer
Curtis Hawthorne
Andrew M. Dai
Matthew D. Hoffman
Douglas Eck
+ Music Transformer 2018 Cheng-Zhi Anna Huang
Ashish Vaswani
Jakob Uszkoreit
Noam Shazeer
Ian Simon
Curtis Hawthorne
Andrew M. Dai
Matthew D. Hoffman
Monica Dinculescu
Douglas Eck
+ Tensor2Tensor for Neural Machine Translation 2018 Ashish Vaswani
Samy Bengio
Eugene Brevdo
François Chollet
Aidan N. Gomez
Stephan Gouws
Llion Jones
Łukasz Kaiser
Nal Kalchbrenner
Niki Parmar
+ Self-Attention with Relative Position Representations 2018 Peter Shaw
Jakob Uszkoreit
Ashish Vaswani
+ Image Transformer 2018 Niki Parmar
Ashish Vaswani
Jakob Uszkoreit
Łukasz Kaiser
Noam Shazeer
Alexander Ku
Dustin Tran
+ Image Tranformer 2018 Niki Parmar
Ashish Vaswani
Jakob Uszkoreit
Łukasz Kaiser
Noam Shazeer
Alexander Ku
+ Fast Decoding in Sequence Models using Discrete Latent Variables 2018 Łukasz Kaiser
Aurko Roy
Ashish Vaswani
Niki Parmar
Samy Bengio
Jakob Uszkoreit
Noam Shazeer
+ Universal Transformers 2018 Mostafa Dehghani
Stephan Gouws
Oriol Vinyals
Jakob Uszkoreit
Łukasz Kaiser
+ Self-Attention with Relative Position Representations 2018 Peter Shaw
Jakob Uszkoreit
Ashish Vaswani
+ PDF Chat The Best of Both Worlds: Combining Recent Advances in Neural Machine Translation 2018 Mia Xu Chen
Orhan Fırat
Ankur Bapna
Melvin Johnson
Wolfgang Macherey
George Foster
Llion Jones
Mike Schuster
Noam Shazeer
Niki Parmar
+ Blockwise Parallel Decoding for Deep Autoregressive Models 2018 Mitchell Stern
Noam Shazeer
Jakob Uszkoreit
+ Image Transformer 2018 Niki Parmar
Ashish Vaswani
Jakob Uszkoreit
Łukasz Kaiser
Noam Shazeer
Alexander Ku
Dustin Tran
+ Tensor2Tensor for Neural Machine Translation 2018 Ashish Vaswani
Samy Bengio
Eugene Brevdo
François Chollet
Aidan N. Gomez
Stephan Gouws
Llion Jones
Łukasz Kaiser
Nal Kalchbrenner
Niki Parmar
+ Music Transformer 2018 Cheng-Zhi Anna Huang
Ashish Vaswani
Jakob Uszkoreit
Noam Shazeer
Ian Simon
Curtis Hawthorne
Andrew M. Dai
Matthew D. Hoffman
Monica Dinculescu
Douglas Eck
+ Self-Attention with Relative Position Representations 2018 Peter J. Shaw
Jakob Uszkoreit
Ashish Vaswani
+ Attention is All you Need 2017 Ashish Vaswani
Noam Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan N. Gomez
Łukasz Kaiser
Illia Polosukhin
+ Neural Paraphrase Identification of Questions with Noisy Pretraining 2017 Gaurav Singh Tomar
Thyago S.P.C. Duque
Oscar Täckström
Jakob Uszkoreit
Dipanjan Das
+ Neural Paraphrase Identification of Questions with Noisy Pretraining 2017 Gaurav Singh Tomar
Thyago S.P.C. Duque
Oscar Täckström
Jakob Uszkoreit
Dipanjan Das
+ One Model To Learn Them All 2017 Łukasz Kaiser
Aidan N. Gomez
Noam Shazeer
Ashish Vaswani
Niki Parmar
Llion Jones
Jakob Uszkoreit
+ Neural Paraphrase Identification of Questions with Noisy Pretraining 2017 Gaurav Singh Tomar
Thyago S.P.C. Duque
Oscar Täckström
Jakob Uszkoreit
Dipanjan Das
+ Attention Is All You Need 2017 Ashish Vaswani
Noam Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan N. Gomez
Łukasz Kaiser
Illia Polosukhin
+ Hierarchical Question Answering for Long Documents 2016 Eunsol Choi
Daniel Hewlett
Alexandre Lacoste
Illia Polosukhin
Jakob Uszkoreit
Jonathan Berant
+ A Decomposable Attention Model for Natural Language Inference 2016 Ankur P. Parikh
Oscar Täckström
Dipanjan Das
Jakob Uszkoreit
+ PDF Chat A Decomposable Attention Model for Natural Language Inference 2016 Ankur P. Parikh
Oscar Täckström
Dipanjan Das
Jakob Uszkoreit
+ Hierarchical Question Answering for Long Documents 2016 Eunsol Choi
Daniel Hewlett
Alexandre Lacoste
Illia Polosukhin
Jakob Uszkoreit
Jonathan Berant
+ A Decomposable Attention Model for Natural Language Inference 2016 Ankur P. Parikh
Oscar Täckström
Dipanjan Das
Jakob Uszkoreit
Common Coauthors
Commonly Cited References
Action Title Year Authors # of times referenced
+ Attention is All you Need 2017 Ashish Vaswani
Noam Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan N. Gomez
Łukasz Kaiser
Illia Polosukhin
13
+ Neural Machine Translation by Jointly Learning to Align and Translate 2015 Dzmitry Bahdanau
Kyunghyun Cho
Yoshua Bengio
12
+ Adam: A Method for Stochastic Optimization 2014 Diederik P. Kingma
Jimmy Ba
10
+ Sequence to Sequence Learning with Neural Networks 2014 Ilya Sutskever
Oriol Vinyals
Quoc V. Le
8
+ PDF Chat A Decomposable Attention Model for Natural Language Inference 2016 Ankur P. Parikh
Oscar Täckström
Dipanjan Das
Jakob Uszkoreit
7
+ PDF Chat Deep Residual Learning for Image Recognition 2016 Kaiming He
Xiangyu Zhang
Shaoqing Ren
Jian Sun
7
+ Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation 2014 Kyunghyun Cho
Bart van Merriënboer
Çaǧlar Gülçehre
Dzmitry Bahdanau
Fethi Bougares
Holger Schwenk
Yoshua Bengio
6
+ Neural Machine Translation in Linear Time 2016 Nal Kalchbrenner
Lasse Espeholt
Karen Simonyan
Aäron van den Oord
Alexander Graves
Koray Kavukcuoglu
6
+ Neural Machine Translation by Jointly Learning to Align and Translate 2014 Dzmitry Bahdanau
Kyunghyun Cho
Yoshua Bengio
6
+ Attention Is All You Need 2017 Ashish Vaswani
Noam Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan N. Gomez
Łukasz Kaiser
Illia Polosukhin
6
+ PDF Chat Neural Machine Translation of Rare Words with Subword Units 2016 Rico Sennrich
Barry Haddow
Alexandra Birch
5
+ Convolutional Sequence to Sequence Learning 2017 Jonas Gehring
Michael Auli
David Grangier
Denis Yarats
Yann Dauphin
5
+ PDF Chat Rethinking the Inception Architecture for Computer Vision 2016 Christian Szegedy
Vincent Vanhoucke
Sergey Ioffe
Jon Shlens
Zbigniew Wojna
4
+ An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale 2020 Alexey Dosovitskiy
Lucas Beyer
Alexander Kolesnikov
Dirk Weissenborn
Xiaohua Zhai
Thomas Unterthiner
Mostafa Dehghani
Matthias Minderer
Georg Heigold
Sylvain Gelly
4
+ PDF Chat Effective Approaches to Attention-based Neural Machine Translation 2015 Thang Luong
Hieu Pham
Christopher D. Manning
4
+ Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation 2016 Yonghui Wu
Mike Schuster
Zhifeng Chen
Quoc V. Le
Mohammad Norouzi
Wolfgang Macherey
Maxim Krikun
Yuan Cao
Qin Gao
Klaus Macherey
3
+ Recurrent Models of Visual Attention 2014 Volodymyr Mnih
Nicolas Heess
Alex Graves
Koray Kavukcuoglu
3
+ Sequence to Sequence Learning with Neural Networks 2014 Ilya Sutskever
Oriol Vinyals
Quoc V. Le
3
+ Distilling the Knowledge in a Neural Network 2015 Geoffrey E. Hinton
Oriol Vinyals
Jay B. Dean
3
+ Generating Long Sequences with Sparse Transformers. 2019 Rewon Child
Scott Gray
Alec Radford
Ilya Sutskever
3
+ A Large-scale Study of Representation Learning with the Visual Task Adaptation Benchmark 2019 Xiaohua Zhai
Joan Puigcerver
Alexander Kolesnikov
Pierre Ruyssen
Carlos Riquelme
Mario Lučić
Josip Djolonga
André Susano Pinto
Maxim Neumann
Alexey Dosovitskiy
3
+ Decoupled Weight Decay Regularization 2017 Ilya Loshchilov
Frank Hutter
3
+ PDF Chat Big Transfer (BiT): General Visual Representation Learning 2020 Alexander Kolesnikov
Lucas Beyer
Xiaohua Zhai
Joan Puigcerver
Jessica Yung
Sylvain Gelly
Neil Houlsby
3
+ Fixing the train-test resolution discrepancy 2019 Hugo Touvron
Andrea Vedaldi
Matthijs Douze
Hervé Jeǵou
3
+ mixup: Beyond Empirical Risk Minimization 2017 Hongyi Zhang
Moustapha Cissé
Yann Dauphin
David López-Paz
3
+ PDF Chat Revisiting Unreasonable Effectiveness of Data in Deep Learning Era 2017 Chen Sun
Abhinav Shrivastava
Saurabh Singh
Abhinav Gupta
3
+ PDF Chat Squeeze-and-Excitation Networks 2019 Jie Hu
Li Shen
Samuel Albanie
Gang Sun
Enhua Wu
3
+ Image Transformer 2018 Niki Parmar
Ashish Vaswani
Jakob Uszkoreit
Łukasz Kaiser
Noam Shazeer
Alexander Ku
Dustin Tran
3
+ Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation 2014 Kyunghyun Cho
Bart van Merriënboer
Çaǧlar Gülçehre
Dzmitry Bahdanau
Fethi Bougares
Holger Schwenk
Yoshua Bengio
3
+ Depthwise Separable Convolutions for Neural Machine Translation 2017 Łukasz Kaiser
Aidan N. Gomez
François Chollet
3
+ Action Recognition using Visual Attention 2015 Shikhar Sharma
Ryan Kiros
Ruslan Salakhutdinov
2
+ Convolutional Neural Network Architectures for Matching Natural Language Sentences 2015 Baotian Hu
Zhengdong Lu
Hang Li
Qingcai Chen
2
+ PDF Chat Finding Function in Form: Compositional Character Models for Open Vocabulary Word Representation 2015 Ling Wang
Chris Dyer
Alan W. Black
Isabel Trancoso
Ramon Fermandez
Silvio Amir
Luís Marujo
Tiago Luís
2
+ WaveNet: A Generative Model for Raw Audio 2016 Aäron van den Oord
Sander Dieleman
Heiga Zen
Karen Simonyan
Oriol Vinyals
Alex Graves
Nal Kalchbrenner
Andrew Senior
Koray Kavukcuoglu
2
+ Insertion Transformer: Flexible Sequence Generation via Insertion Operations 2019 Mitchell Stern
William Chan
Jamie Kiros
Jakob Uszkoreit
2
+ Encoding Source Language with Convolutional Neural Network for Machine Translation 2015 Fandong Meng
Zhengdong Lu
Mingxuan Wang
Hang Li
Wenbin Jiang
Qun Liu
2
+ A large annotated corpus for learning natural language inference 2015 Samuel R. Bowman
Gabor Angeli
Christopher Potts
Christopher D. Manning
2
+ Insertion-based Decoding with automatically Inferred Generation Order 2019 Jiatao Gu
Qi Liu
Kyunghyun Cho
2
+ MIST: Multiple Instance Spatial Transformer Networks 2018 Baptiste Angles
Simon Kornblith
Shahram Izadi
Andrea Tagliasacchi
Kwang Moo Yi
2
+ PDF Chat Deep Networks with Stochastic Depth 2016 Gao Huang
Yu Sun
Zhuang Liu
Daniel Sedra
Kilian Q. Weinberger
2
+ End-to-End Speech Translation with Knowledge Distillation 2019 Yuchen Liu
Hao Xiong
Zhongjun He
Jiajun Zhang
Hua Wu
Haifeng Wang
Chengqing Zong
2
+ PDF Chat ImageNet Large Scale Visual Recognition Challenge 2015 Olga Russakovsky
Jia Deng
Hao Su
Jonathan Krause
Sanjeev Satheesh
Sean Ma
Zhiheng Huang
Andrej Karpathy
Aditya Khosla
Michael S. Bernstein
2
+ The Unreasonable Effectiveness of Deep Features as a Perceptual Metric 2018 Richard Zhang
Phillip Isola
Alexei A. Efros
Eli Shechtman
Oliver Wang
2
+ Fully Convolutional Attention Localization Networks: Efficient Attention Localization for Fine-Grained Recognition 2016 Xiao Liu
Tian Xia
Jiang Wang
Yuanqing Lin
2
+ Weight Normalization: A Simple Reparameterization to Accelerate Training of Deep Neural Networks 2016 Tim Salimans
Diederik P. Kingma
2
+ Non-Autoregressive Neural Machine Translation 2017 Jiatao Gu
James Bradbury
Caiming Xiong
Victor O. K. Li
Richard Socher
2
+ Exploring the Limits of Weakly Supervised Pretraining 2018 Dhruv Mahajan
Ross Girshick
Vignesh Ramanathan
Kaiming He
Manohar Paluri
Yixuan Li
Ashwin Bharambe
Laurens van der Maaten
2
+ Sentence Similarity Learning by Lexical Decomposition and Composition 2016 Zhiguo Wang
Haitao Mi
Abraham Ittycheriah
2
+ PDF Chat Going deeper with convolutions 2015 Christian Szegedy
Wei Liu
Yangqing Jia
Pierre Sermanet
Scott Reed
Dragomir Anguelov
Dumitru Erhan
Vincent Vanhoucke
Andrew Rabinovich
2
+ PDF Chat Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering 2018 Peter Anderson
Xiaodong He
Chris Buehler
Damien Teney
Mark Johnson
Stephen Jay Gould
Lei Zhang
2