Ron J. Weiss

Follow

Generating author description...

All published works
Action Title Year Authors
+ PDF Chat G-Augment: Searching for the Meta-Structure of Data Augmentation Policies for ASR 2023 Gary Wang
Ekin D. Cubuk
Andrew Rosenberg
Shuyang Cheng
Ron J. Weiss
Bhuvana Ramabhadran
Pedro J. Moreno
Quoc V. Le
Daniel Park
+ G-Augment: Searching for the Meta-Structure of Data Augmentation Policies for ASR 2022 Gary Wang
Ekin D. Cubuk
Andrew E. Rosenberg
Shuyang Cheng
Ron J. Weiss
Bhuvana Ramabhadran
Pedro J. Moreno
Quoc V. Le
Daniel Park
+ PDF Chat Sparse, Efficient, and Semantic Mixture Invariant Training: Taming In-the-Wild Unsupervised Sound Separation 2021 Scott Wisdom
Aren Jansen
Ron J. Weiss
Hakan Erdoğan
John R. Hershey
+ PDF Chat Multitask Training with Text Data for End-to-End Speech Recognition 2021 Peidong Wang
Tara N. Sainath
Ron J. Weiss
+ PDF Chat WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis 2021 Nanxin Chen
Yu Zhang
Heiga Zen
Ron J. Weiss
Mohammad Norouzi
Najim Dehak
William Chan
+ PDF Chat WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis 2021 Nanxin Chen
Yu Zhang
Heiga Zen
Ron J. Weiss
Mohammad Norouzi
Najim Dehak
William Chan
+ PDF Chat Parallel Tacotron: Non-Autoregressive and Controllable TTS 2021 Isaac Elias
Heiga Zen
Jonathan Shen
Yu Zhang
Jia Ye
Ron J. Weiss
Yonghui Wu
+ PDF Chat Wave-Tacotron: Spectrogram-Free End-to-End Text-to-Speech Synthesis 2021 Ron J. Weiss
RJ Skerry-Ryan
Eric Battenberg
Soroosh Mariooryad
Diederik P. Kingma
+ WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis 2021 Nanxin Chen
Yu Zhang
Heiga Zen
Ron J. Weiss
Mohammad Norouzi
Najim Dehak
William Chan
+ Sparse, Efficient, and Semantic Mixture Invariant Training: Taming In-the-Wild Unsupervised Sound Separation 2021 Scott Wisdom
Aren Jansen
Ron J. Weiss
Hakan Erdoğan
John R. Hershey
+ Wave-Tacotron: Spectrogram-free end-to-end text-to-speech synthesis 2020 Ron J. Weiss
RJ Skerry-Ryan
Eric Battenberg
Soroosh Mariooryad
Diederik P. Kingma
+ PDF Chat Multitask Training with Text Data for End-to-End Speech Recognition 2020 Peidong Wang
Tara N. Sainath
Ron J. Weiss
+ PDF Chat Generating Diverse and Natural Text-to-Speech Samples Using a Quantized Fine-Grained VAE and Autoregressive Prosody Prior 2020 Guangzhi Sun
Zhang Yu
Ron J. Weiss
Yuan Cao
Heiga Zen
Andrew Rosenberg
Bhuvana Ramabhadran
Yonghui Wu
+ PDF Chat Fully-Hierarchical Fine-Grained Prosody Modeling For Interpretable Speech Synthesis 2020 Guangzhi Sun
Yu Zhang
Ron J. Weiss
Yuan Cao
Heiga Zen
Yonghui Wu
+ Generating diverse and natural text-to-speech samples using a quantized fine-grained VAE and auto-regressive prosody prior 2020 Guangzhi Sun
Yu Zhang
Ron J. Weiss
Yuan Cao
Heiga Zen
Andrew Rosenberg
Bhuvana Ramabhadran
Yonghui Wu
+ PDF Chat Generating diverse and natural text-to-speech samples using a quantized fine-grained VAE and auto-regressive prosody prior 2020 Guangzhi Sun
Yu Zhang
Ron J. Weiss
Yuan Cao
Heiga Zen
Andrew E. Rosenberg
Bhuvana Ramabhadran
Yonghui Wu
+ PDF Chat Fully-hierarchical fine-grained prosody modeling for interpretable speech synthesis 2020 Guangzhi Sun
Yu Zhang
Ron J. Weiss
Yuan Cao
Heiga Zen
Yonghui Wu
+ Fully-hierarchical fine-grained prosody modeling for interpretable speech synthesis 2020 Guangzhi Sun
Yu Zhang
Ron J. Weiss
Yuan Cao
Heiga Zen
Yonghui Wu
+ Parallel Tacotron: Non-Autoregressive and Controllable TTS 2020 Isaac Elias
Heiga Zen
Jonathan Shen
Yu Zhang
Jia Ye
Ron J. Weiss
Yonghui Wu
+ Unsupervised Sound Separation Using Mixture Invariant Training 2020 Scott Wisdom
Efthymios Tzinis
Hakan Erdoğan
Ron J. Weiss
Kevin Wilson
John R. Hershey
+ Multitask Training with Text Data for End-to-End Speech Recognition 2020 Peidong Wang
Tara N. Sainath
Ron J. Weiss
+ WaveGrad: Estimating Gradients for Waveform Generation 2020 Nanxin Chen
Yu Zhang
Heiga Zen
Ron J. Weiss
Mohammad Norouzi
William Chan
+ Generating diverse and natural text-to-speech samples using a quantized fine-grained VAE and auto-regressive prosody prior 2020 Guangzhi Sun
Zhang Yu
Ron J. Weiss
Yuan Cao
Heiga Zen
Andrew Rosenberg
Bhuvana Ramabhadran
Yonghui Wu
+ Wave-Tacotron: Spectrogram-free end-to-end text-to-speech synthesis 2020 Ron J. Weiss
RJ Skerry-Ryan
Eric Battenberg
Soroosh Mariooryad
Diederik P. Kingma
+ PDF Chat LibriTTS: A Corpus Derived from LibriSpeech for Text-to-Speech 2019 Heiga Zen
Viet Chau Dang
Rob Clark
Zhang Yu
Ron J. Weiss
Jia Ye
Zhifeng Chen
Yonghui Wu
+ PDF Chat Learning to Speak Fluently in a Foreign Language: Multilingual Speech Synthesis and Cross-Language Voice Cloning 2019 Yu Zhang
Ron J. Weiss
Heiga Zen
Yonghui Wu
Zhifeng Chen
RJ Skerry-Ryan
Jia Ye
Andrew Rosenberg
Bhuvana Ramabhadran
+ PDF Chat Direct Speech-to-Speech Translation with a Sequence-to-Sequence Model 2019 Jia Ye
Ron J. Weiss
Fadi Biadsy
Wolfgang Macherey
Melvin Johnson
Zhifeng Chen
Yonghui Wu
+ PDF Chat Parrotron: An End-to-End Speech-to-Speech Conversion Model and its Applications to Hearing-Impaired Speech and Speech Separation 2019 Fadi Biadsy
Ron J. Weiss
Pedro J. Moreno
D. Kanvesky
Jia Ye
+ PDF Chat VoiceFilter: Targeted Voice Separation by Speaker-Conditioned Spectrogram Masking 2019 Quan Wang
Hannah Muckenhirn
Kevin Wilson
Prashant Sridhar
Zelin Wu
John R. Hershey
Rif A. Saurous
Ron J. Weiss
Jia Ye
Ignacio López Moreno
+ PDF Chat Unsupervised Speech Representation Learning Using WaveNet Autoencoders 2019 Jan Chorowski
Ron J. Weiss
Samy Bengio
Aäron van den Oord
+ PDF Chat A Spelling Correction Model for End-to-end Speech Recognition 2019 Jinxi Guo
Tara N. Sainath
Ron J. Weiss
+ PDF Chat Leveraging Weakly Supervised Data to Improve End-to-end Speech-to-text Translation 2019 Jia Ye
Melvin Johnson
Wolfgang Macherey
Ron J. Weiss
Yuan Cao
Chung‐Cheng Chiu
Naveen Ari
Stella Laurenzo
Yonghui Wu
+ Parrotron: An End-to-End Speech-to-Speech Conversion Model and its Applications to Hearing-Impaired Speech and Speech Separation 2019 Fadi Biadsy
Ron J. Weiss
Pedro J. Moreno
Dimitri Kanevsky
Jia Ye
+ Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling 2019 Jonathan Shen
Patrick Nguyen
Yonghui Wu
Zhifeng Chen
Mia Xu Chen
Jia Ye
Anjuli Kannan
Tara N. Sainath
Yuan Cao
Chung‐Cheng Chiu
+ Direct speech-to-speech translation with a sequence-to-sequence model 2019 Jia Ye
Ron J. Weiss
Fadi Biadsy
Wolfgang Macherey
Melvin Johnson
Zhifeng Chen
Yonghui Wu
+ LibriTTS: A Corpus Derived from LibriSpeech for Text-to-Speech 2019 Heiga Zen
Viet Chau Dang
Rob Clark
Zhang Yu
Ron J. Weiss
Jia Ye
Zhifeng Chen
Yonghui Wu
+ A spelling correction model for end-to-end speech recognition 2019 Jinxi Guo
Tara N. Sainath
Ron J. Weiss
+ Learning to Speak Fluently in a Foreign Language: Multilingual Speech Synthesis and Cross-Language Voice Cloning 2019 Yu Zhang
Ron J. Weiss
Heiga Zen
Yonghui Wu
Zhifeng Chen
RJ Skerry-Ryan
Jia Ye
Andrew Rosenberg
Bhuvana Ramabhadran
+ Hierarchical Generative Modeling for Controllable Speech Synthesis 2018 Wei-Ning Hsu
Yu Zhang
Ron J. Weiss
Heiga Zen
Yonghui Wu
Yuxuan Wang
Yuan Cao
Jia Ye
Zhifeng Chen
Jonathan Shen
+ Metrics for Signal Temporal Logic Formulae 2018 Curtis Madsen
Prashant Vaidyanathan
Sadra Sadraddini
Cristian-Ioan Vasile
Nicholas A. DeLateur
Ron J. Weiss
Douglas Densmore
Călin Belta
+ Synthesizing Diverse, High-Quality Audio Textures. 2018 Joseph M. Antognini
Matt Hoffman
Ron J. Weiss
+ PDF Chat State-of-the-Art Speech Recognition with Sequence-to-Sequence Models 2018 Chung‐Cheng Chiu
Tara N. Sainath
Yonghui Wu
Rohit Prabhavalkar
Patrick Nguyen
Zhifeng Chen
Anjuli Kannan
Ron J. Weiss
Kanishka Rao
Ekaterina Gonina
+ PDF Chat On Using Backpropagation for Speech Texture Generation and Voice Conversion 2018 Jan Chorowski
Ron J. Weiss
Rif A. Saurous
Samy Bengio
+ PDF Chat Natural TTS Synthesis by Conditioning Wavenet on MEL Spectrogram Predictions 2018 Jonathan Shen
Ruoming Pang
Ron J. Weiss
Mike Schuster
Navdeep Jaitly
Zongheng Yang
Zhifeng Chen
Yu Zhang
Yuxuan Wang
Rj Skerrv-Ryan
+ PDF Chat Multilingual Speech Recognition with a Single End-to-End Model 2018 Shubham Toshniwal
Tara N. Sainath
Ron J. Weiss
Bo Li
Pedro J. Moreno
Eugene Weinstein
Kanishka Rao
+ Towards End-to-End Prosody Transfer for Expressive Speech Synthesis with Tacotron 2018 RJ Skerry-Ryan
Eric Battenberg
Ying Xiao
Yuxuan Wang
Daisy Stanton
Joel Shor
Ron J. Weiss
Rob Clark
Rif A. Saurous
+ Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis 2018 Jia Ye
Yu Zhang
Ron J. Weiss
Quan Wang
Jonathan Shen
Fei Ren
Zhifeng Chen
Patrick Nguyen
Ruoming Pang
Ignacio López Moreno
+ VoiceFilter: Targeted Voice Separation by Speaker-Conditioned Spectrogram Masking 2018 Quan Wang
Hannah Muckenhirn
Kevin Wilson
Prashant Sridhar
Zelin Wu
John R. Hershey
Rif A. Saurous
Ron J. Weiss
Jia Ye
Ignacio López Moreno
+ Leveraging Weakly Supervised Data to Improve End-to-End Speech-to-Text Translation 2018 Jia Ye
Melvin Johnson
Wolfgang Macherey
Ron J. Weiss
Yuan Cao
Chung‐Cheng Chiu
Naveen Ari
Stella Laurenzo
Yonghui Wu
+ Hierarchical Generative Modeling for Controllable Speech Synthesis 2018 Wei-Ning Hsu
Yu Zhang
Ron J. Weiss
Heiga Zen
Yonghui Wu
Yuxuan Wang
Yuan Cao
Jia Ye
Zhifeng Chen
Jonathan Shen
+ Synthesizing Diverse, High-Quality Audio Textures 2018 Joseph F. Antognini
Matt Hoffman
Ron J. Weiss
+ Towards End-to-End Prosody Transfer for Expressive Speech Synthesis with Tacotron 2018 RJ Skerry-Ryan
Eric Battenberg
Ying Xiao
Yuxuan Wang
Daisy Stanton
Joel Shor
Ron J. Weiss
Rob Clark
Rif A. Saurous
+ On Using Backpropagation for Speech Texture Generation and Voice Conversion 2017 Jan Chorowski
Ron J. Weiss
Rif A. Saurous
Samy Bengio
+ State-of-the-art Speech Recognition With Sequence-to-Sequence Models 2017 Chung‐Cheng Chiu
Tara N. Sainath
Yonghui Wu
Rohit Prabhavalkar
Patrick Nguyen
Zhifeng Chen
Anjuli Kannan
Ron J. Weiss
Kanishka Rao
Ekaterina Gonina
+ PDF Chat Sequence-to-Sequence Models Can Directly Translate Foreign Speech 2017 Ron J. Weiss
Jan Chorowski
Navdeep Jaitly
Yonghui Wu
Zhifeng Chen
+ PDF Chat Tacotron: Towards End-to-End Speech Synthesis 2017 Yuxuan Wang
RJ Skerry-Ryan
Daisy Stanton
Yonghui Wu
Ron J. Weiss
Navdeep Jaitly
Zongheng Yang
Ying Xiao
Zhifeng Chen
Samy Bengio
+ Online and Linear-Time Attention by Enforcing Monotonic Alignments 2017 Colin Raffel
Minh-Thang Luong
Peter J. Liu
Ron J. Weiss
Douglas Eck
+ Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model. 2017 Yuxuan Wang
RJ Skerry-Ryan
Daisy Stanton
Yonghui Wu
Ron J. Weiss
Navdeep Jaitly
Zongheng Yang
Ying Xiao
Zhifeng Chen
Samy Bengio
+ PDF Chat CNN architectures for large-scale audio classification 2017 Shawn Hershey
Sourish Chaudhuri
Daniel P. W. Ellis
Jort F. Gemmeke
Aren Jansen
Robert C. Moore
Manoj Plakal
Devin Platt
Rif A. Saurous
Bryan Seybold
+ Online and Linear-Time Attention by Enforcing Monotonic Alignments 2017 Colin Raffel
Minh-Thang Luong
Peter J. Liu
Ron J. Weiss
Douglas Eck
+ Tacotron: Towards End-to-End Speech Synthesis 2017 Yuxuan Wang
RJ Skerry-Ryan
Daisy Stanton
Yonghui Wu
Ron J. Weiss
Navdeep Jaitly
Zongheng Yang
Ying Xiao
Zhifeng Chen
Samy Bengio
+ Multilingual Speech Recognition With A Single End-To-End Model 2017 Shubham Toshniwal
Tara N. Sainath
Ron J. Weiss
Bo Li
Pedro J. Moreno
Eugene Weinstein
Kanishka Rao
+ Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions 2017 Jonathan Shen
Ruoming Pang
Ron J. Weiss
Mike Schuster
Navdeep Jaitly
Zongheng Yang
Zhifeng Chen
Yu Zhang
Yuxuan Wang
RJ Skerry-Ryan
+ Sequence-to-Sequence Models Can Directly Translate Foreign Speech 2017 Ron J. Weiss
Jan Chorowski
Navdeep Jaitly
Yonghui Wu
Zhifeng Chen
+ On Using Backpropagation for Speech Texture Generation and Voice Conversion 2017 Jan Chorowski
Ron J. Weiss
Rif A. Saurous
Samy Bengio
+ CNN Architectures for Large-Scale Audio Classification 2016 Shawn Hershey
Sourish Chaudhuri
Daniel P. W. Ellis
Jort F. Gemmeke
Aren Jansen
Robert C. Moore
Manoj Plakal
Devin Platt
Rif A. Saurous
Bryan Seybold
+ CNN Architectures for Large-Scale Audio Classification 2016 Shawn Hershey
Sourish Chaudhuri
Daniel P. W. Ellis
Jort F. Gemmeke
Aren Jansen
Robert C. Moore
Manoj Plakal
Devin Platt
Rif A. Saurous
Bryan Seybold
+ Affinity Weighted Embedding 2013 Jason Weston
Ron J. Weiss
Hector Yee
+ Scikit-learn: Machine Learning in Python 2012 Fabián Pedregosa
Gaël Varoquaux
Alexandre Gramfort
Vincent Michel
Bertrand Thirion
Olivier Grisel
Mathieu Blondel
Peter Prettenhofer
Ron J. Weiss
Vincent Dubourg
+ Latent Collaborative Retrieval 2012 Jason Weston
Chong Wang
Ron J. Weiss
Adam Berenzweig
+ PDF Chat Scikit-learn: Machine Learning in Python 2011 Fabián Pedregosa
Gaël Varoquaux
Alexandre Gramfort
Vincent Michel
Bertrand Thirion
Olivier Grisel
Mathieu Blondel
Andreas Müller
Joel Nothman
Gilles Louppe
Common Coauthors
Commonly Cited References
Action Title Year Authors # of times referenced
+ PDF Chat Natural TTS Synthesis by Conditioning Wavenet on MEL Spectrogram Predictions 2018 Jonathan Shen
Ruoming Pang
Ron J. Weiss
Mike Schuster
Navdeep Jaitly
Zongheng Yang
Zhifeng Chen
Yu Zhang
Yuxuan Wang
Rj Skerrv-Ryan
21
+ PDF Chat Tacotron: Towards End-to-End Speech Synthesis 2017 Yuxuan Wang
RJ Skerry-Ryan
Daisy Stanton
Yonghui Wu
Ron J. Weiss
Navdeep Jaitly
Zongheng Yang
Ying Xiao
Zhifeng Chen
Samy Bengio
20
+ Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation 2016 Yonghui Wu
Mike Schuster
Zhifeng Chen
Quoc V. Le
Mohammad Norouzi
Wolfgang Macherey
Maxim Krikun
Yuan Cao
Qin Gao
Klaus Macherey
16
+ Neural Machine Translation by Jointly Learning to Align and Translate 2015 Dzmitry Bahdanau
Kyunghyun Cho
Yoshua Bengio
16
+ PDF Chat State-of-the-Art Speech Recognition with Sequence-to-Sequence Models 2018 Chung‐Cheng Chiu
Tara N. Sainath
Yonghui Wu
Rohit Prabhavalkar
Patrick Nguyen
Zhifeng Chen
Anjuli Kannan
Ron J. Weiss
Kanishka Rao
Ekaterina Gonina
12
+ Sequence to Sequence Learning with Neural Networks 2014 Ilya Sutskever
Oriol Vinyals
Quoc V. Le
11
+ Adam: A Method for Stochastic Optimization 2014 Diederik P. Kingma
Jimmy Ba
11
+ Deep Voice 3: 2000-Speaker Neural Text-to-Speech 2017 Wei Ping
Kainan Peng
Andrew Gibiansky
Sercan Ö. Arık
Ajay Kannan
Sharan Narang
Jonathan Raiman
J. J. Miller
11
+ Attention-Based Models for Speech Recognition 2015 Jan Chorowski
Dzmitry Bahdanau
Dmitriy Serdyuk
Kyunghyun Cho
Yoshua Bengio
11
+ WaveNet: A Generative Model for Raw Audio 2016 Aäron van den Oord
Sander Dieleman
Heiga Zen
Karen Simonyan
Oriol Vinyals
Alexander Graves
Nal Kalchbrenner
Andrew Senior
Koray Kavukcuoglu
10
+ Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis 2018 Yuxuan Wang
Daisy Stanton
Yu Zhang
RJ Skerry-Ryan
Eric Battenberg
Joel Shor
Ying Xiao
Fei Ren
Jia Ye
Rif A. Saurous
10
+ WaveNet: A Generative Model for Raw Audio 2016 Aäron van den Oord
Sander Dieleman
Heiga Zen
Karen Simonyan
Oriol Vinyals
Alex Graves
Nal Kalchbrenner
Andrew Senior
Koray Kavukcuoglu
10
+ Neural Machine Translation by Jointly Learning to Align and Translate 2014 Dzmitry Bahdanau
Kyunghyun Cho
Yoshua Bengio
10
+ Attention is All you Need 2017 Ashish Vaswani
Noam Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan N. Gomez
Łukasz Kaiser
Illia Polosukhin
10
+ Hierarchical Generative Modeling for Controllable Speech Synthesis 2018 Wei-Ning Hsu
Yu Zhang
Ron J. Weiss
Heiga Zen
Yonghui Wu
Yuxuan Wang
Yuan Cao
Jia Ye
Zhifeng Chen
Jonathan Shen
8
+ Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift 2015 Sergey Ioffe
Christian Szegedy
8
+ Efficient Neural Audio Synthesis 2018 Nal Kalchbrenner
Erich Elsen
Karen Simonyan
Seb Noury
Norman Casagrande
Edward Lockhart
Florian Stimberg
Aäron van den Oord
Sander Dieleman
Koray Kavukcuoglu
7
+ Attention Is All You Need 2017 Ashish Vaswani
Noam Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan N. Gomez
Łukasz Kaiser
Illia Polosukhin
7
+ Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis 2018 Jia Ye
Yu Zhang
Ron J. Weiss
Quan Wang
Jonathan Shen
Fei Ren
Zhifeng Chen
Patrick Nguyen
Ruoming Pang
Ignacio López Moreno
7
+ Sequence to Sequence Learning with Neural Networks 2014 Ilya Sutskever
Oriol Vinyals
Quoc V. Le
6
+ PDF Chat Towards Better Decoding and Language Model Integration in Sequence to Sequence Models 2017 Jan Chorowski
Navdeep Jaitly
6
+ Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift 2015 Sergey Ioffe
Christian Szegedy
6
+ PDF Chat Robust and Fine-grained Prosody Control of End-to-end Speech Synthesis 2019 Younggun Lee
Taesu Kim
6
+ Sequence Transduction with Recurrent Neural Networks 2012 Alex Graves
6
+ Listen and Translate: A Proof of Concept for End-to-End Speech-to-Text Translation 2016 Alexandre Bérard
Olivier Pietquin
Laurent Besacier
Christophe Servan
6
+ TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems 2016 Martı́n Abadi
Ashish Agarwal
Paul Barham
Eugene Brevdo
Zhifeng Chen
Craig Citro
Gregory S. Corrado
Andy Davis
Jay B. Dean
Matthieu Devin
6
+ PDF Chat An Analysis of Incorporating an External Language Model into a Sequence-to-Sequence Model 2018 Anjuli Kannan
Yonghui Wu
Patrick Nguyen
Tara N. Sainath
ZhiJeng Chen
Rohit Prabhavalkar
6
+ PDF Chat Deep clustering: Discriminative embeddings for segmentation and separation 2016 John R. Hershey
Zhuo Chen
Jonathan Le Roux
Shinji Watanabe
6
+ PDF Chat Improving Neural Machine Translation Models with Monolingual Data 2016 Rico Sennrich
Barry Haddow
Alexandra Birch
6
+ PDF Chat Very deep convolutional networks for end-to-end speech recognition 2017 Zhang Yu
William Chan
Navdeep Jaitly
6
+ Parallel WaveNet: Fast High-Fidelity Speech Synthesis 2017 Aäron van den Oord
Yazhe Li
I. Babuschkin
Karen Simonyan
Oriol Vinyals
Koray Kavukcuoglu
George van den Driessche
Edward Lockhart
Luis C. Cobo
Florian Stimberg
6
+ Deep Voice: Real-time Neural Text-to-Speech 2017 Sercan Ö. Arık
Mike Chrzanowski
Adam Coates
Gregory Diamos
Andrew Gibiansky
Yongguo Kang
Xian Li
J. J. Miller
Andrew Y. Ng
Jonathan Raiman
5
+ PDF Chat Location-Relative Attention Mechanisms for Robust Long-Form Speech Synthesis 2020 Eric Battenberg
RJ Skerry-Ryan
Soroosh Mariooryad
Daisy Stanton
David Kao
Matt Shannon
Tom Bagby
5
+ End-to-End Adversarial Text-to-Speech. 2020 Jeff Donahue
Sander Dieleman
Mikołaj Bińkowski
Erich Elsen
Karen Simonyan
5
+ DurIAN: Duration Informed Attention Network For Multimodal Synthesis 2019 Chengzhu Yu
Heng Lu
Na Hu
Meng Yu
Chao Weng
Kun Xu
Peng Liu
Deyi Tuo
Shiyin Kang
Guangzhi Lei
5
+ Scheduled Sampling for Sequence Prediction with Recurrent Neural Networks 2015 Samy Bengio
Oriol Vinyals
Navdeep Jaitly
Noam Shazeer
5
+ Towards End-to-End Prosody Transfer for Expressive Speech Synthesis with Tacotron 2018 RJ Skerry-Ryan
Eric Battenberg
Ying Xiao
Yuxuan Wang
Daisy Stanton
Joel Shor
Ron J. Weiss
Rob Clark
Rif A. Saurous
5
+ Deep Encoder-Decoder Models for Unsupervised Learning of Controllable Speech Synthesis 2018 Gustav Eje Henter
Jaime Lorenzo-Trueba
Xin Wang
Junichi Yamagishi
5
+ Neural Voice Cloning with a Few Samples 2018 Sercan Ö. Arık
Jitong Chen
Kainan Peng
Wei Ping
Yanqi Zhou
5
+ PDF Chat Deep Residual Learning for Image Recognition 2016 Kaiming He
Xiangyu Zhang
Shaoqing Ren
Jian Sun
5
+ PDF Chat Sequence-to-Sequence Models Can Directly Translate Foreign Speech 2017 Ron J. Weiss
Jan Chorowski
Navdeep Jaitly
Yonghui Wu
Zhifeng Chen
5
+ PDF Chat Expressive Speech Synthesis via Modeling Expressions with Variational Autoencoder 2018 Kei Akuzawa
Yusuke Iwasawa
Yutaka Matsuo
4
+ High Fidelity Speech Synthesis with Adversarial Networks 2019 Mikołaj Bińkowski
Jeff Donahue
Sander Dieleman
Aidan Clark
Erich Elsen
Norman Casagrande
Luis C. Cobo
Karen Simonyan
4
+ Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model. 2017 Yuxuan Wang
RJ Skerry-Ryan
Daisy Stanton
Yonghui Wu
Ron J. Weiss
Navdeep Jaitly
Zongheng Yang
Ying Xiao
Zhifeng Chen
Samy Bengio
4
+ PDF Chat End-to-end attention-based large vocabulary speech recognition 2016 Dzmitry Bahdanau
Jan Chorowski
Dmitriy Serdyuk
Philémon Brakel
Yoshua Bengio
4
+ MelGAN: Generative Adversarial Networks for Conditional Waveform Synthesis 2019 Kundan Kumar
Rithesh Kumar
T. de Boissière
Lucas Gestin
Wei Zhen Teoh
Jose Sotelo
Alexandre de Brébisson
Yoshua Bengio
Aaron Courville
4
+ PDF Chat Conditional End-to-End Audio Transforms 2018 Albert Haque
Michelle Guo
Prateek Verma
4
+ PDF Chat LibriTTS: A Corpus Derived from LibriSpeech for Text-to-Speech 2019 Heiga Zen
Viet Chau Dang
Rob Clark
Zhang Yu
Ron J. Weiss
Jia Ye
Zhifeng Chen
Yonghui Wu
4
+ PDF Chat SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition 2019 Daniel Park
William Chan
Yu Zhang
Chung‐Cheng Chiu
Barret Zoph
Ekin D. Cubuk
Quoc V. Le
4
+ Sample Efficient Adaptive Text-to-Speech 2018 Yutian Chen
Yannis Assael
Brendan Shillingford
David Budden
Scott Reed
Heiga Zen
Quan Wang
Luis C. Cobo
Andrew Trask
Ben Laurie
4