Yu-An Chung

Follow

Generating author description...

All published works
Action Title Year Authors
+ PDF Chat DC-Spin: A Speaker-invariant Speech Tokenizer for Spoken Language Models 2024 Heng-Jui Chang
Hongyu Gong
Changhan Wang
James Glass
Yu-An Chung
+ COLLD: Contrastive Layer-to-Layer Distillation for Compressing Multilingual Pre-Trained Speech Encoders 2024 Heng-Jui Chang
Ning Dong
Ruslan Mavlyutov
Sravya Popuri
Yu-An Chung
+ UnitY: Two-pass Direct Speech-to-speech Translation with Discrete Units 2023 Hirofumi Inaguma
Sravya Popuri
Ilia Kulikov
Peng‐Jen Chen
Changhan Wang
Yu-An Chung
Yun Tang
Ann Lee
Shinji Watanabe
Juan Pino
+ PDF Chat Speech-to-Speech Translation for a Real-world Unwritten Language 2023 Peng‐Jen Chen
Kevin Tran
Yilin Yang
Jingfei Du
Justine Kao
Yu-An Chung
Paden Tomasello
Paul-Ambroise Duquenne
Holger Schwenk
Hongyu Gong
+ SeamlessM4T: Massively Multilingual & Multimodal Machine Translation 2023 Seamless Communication
LoĂŻc Barrault
Yu-An Chung
Mariano Cora Meglioli
David C. Dale
Ning Dong
Paul-Ambroise Duquenne
Hady Elsahar
Hongyu Gong
Kevin S. Heffernan
+ CoLLD: Contrastive Layer-to-layer Distillation for Compressing Multilingual Pre-trained Speech Encoders 2023 Heng-Jui Chang
Ning Dong
Ruslan Mavlyutov
Sravya Popuri
Yu-An Chung
+ Seamless: Multilingual Expressive and Streaming Speech Translation 2023 Seamless Communication
LoĂŻc Barrault
Yu-An Chung
Mariano Coria Meglioli
David C. Dale
Ning Dong
Mark Duppenthaler
Paul-Ambroise Duquenne
Brian E. Ellis
Hady Elsahar
+ PDF Chat SSAST: Self-Supervised Audio Spectrogram Transformer 2022 Yuan Gong
Cheng-I Lai
Yu-An Chung
James Glass
+ Speech-to-Speech Translation For A Real-world Unwritten Language 2022 Peng‐Jen Chen
Kevin Tran
Yilin Yang
Jingfei Du
Justine Kao
Yu-An Chung
Paden Tomasello
Paul-Ambroise Duquenne
Holger Schwenk
Hongyu Gong
+ UnitY: Two-pass Direct Speech-to-speech Translation with Discrete Units 2022 Hirofumi Inaguma
Sravya Popuri
Ilia Kulikov
Peng‐Jen Chen
Changhan Wang
Yu-An Chung
Yun Tang
Ann Lee
Shinji Watanabe
Juan Pino
+ PDF Chat w2v-BERT: Combining Contrastive Learning and Masked Language Modeling for Self-Supervised Speech Pre-Training 2021 Yu-An Chung
Yu Zhang
Wei Han
Chung‐Cheng Chiu
James Qin
Ruoming Pang
Yonghui Wu
+ PDF Chat AST: Audio Spectrogram Transformer 2021 Yuan Gong
Yu-An Chung
James Glass
+ PDF Chat Non-Autoregressive Predictive Coding for Learning Speech Representations from Local Dependencies 2021 Alexander H. Liu
Yu-An Chung
James Glass
+ PDF Chat W2v-BERT: Combining Contrastive Learning and Masked Language Modeling for Self-Supervised Speech Pre-Training 2021 Yu-An Chung
Yu Zhang
Wei Han
Chung‐Cheng Chiu
James Qin
Ruoming Pang
Yonghui Wu
+ PDF Chat Similarity Analysis of Self-Supervised Speech Representations 2021 Yu-An Chung
Yonatan Belinkov
James Glass
+ PDF Chat AST: Audio Spectrogram Transformer 2021 Yuan Gong
Yu-An Chung
James Glass
+ PSLA: Improving Audio Event Classification with Pretraining, Sampling, Labeling, and Aggregation. 2021 Yuan Gong
Yu-An Chung
James Glass
+ PSLA: Improving Audio Tagging with Pretraining, Sampling, Labeling, and Aggregation 2021 Yuan Gong
Yu-An Chung
James Glass
+ AST: Audio Spectrogram Transformer 2021 Yuan Gong
Yu-An Chung
James Glass
+ PDF Chat SPLAT: Speech-Language Joint Pre-Training for Spoken Language Understanding 2021 Yu-An Chung
Chenguang Zhu
Michael Zeng
+ W2v-BERT: Combining Contrastive Learning and Masked Language Modeling for Self-Supervised Speech Pre-Training 2021 Yu-An Chung
Yu Zhang
Wei Han
Chung‐Cheng Chiu
James Qin
Ruoming Pang
Yonghui Wu
+ PDF Chat PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation 2021 Yuan Gong
Yu-An Chung
James Glass
+ SLAM: A Unified Encoder for Speech and Language Modeling via Speech-Text Joint Pre-Training 2021 Ankur Bapna
Yu-An Chung
Nan Wu
Anmol Gulati
Jia Ye
Jonathan H. Clark
Melvin Johnson
Jason Riesa
Alexis Conneau
Zhang Yu
+ SSAST: Self-Supervised Audio Spectrogram Transformer 2021 Yuan Gong
Cheng-I Lai
Yu-An Chung
James Glass
+ PDF Chat Cost-Sensitive Deep Learning with Layer-Wise Cost Estimation 2020 Yu-An Chung
Shao‐Wen Yang
Hsuan-Tien Lin
+ Non-Autoregressive Predictive Coding for Learning Speech Representations from Local Dependencies 2020 Alexander H. Liu
Yu-An Chung
James Glass
+ PDF Chat Vector-Quantized Autoregressive Predictive Coding 2020 Yu-An Chung
Hao Tang
James Glass
+ PDF Chat Generative Pre-Training for Speech with Autoregressive Predictive Coding 2020 Yu-An Chung
James Glass
+ Clinical Text Summarization with Syntax-Based Negation and Semantic Concept Identification 2020 Wei‐Hung Weng
Yu-An Chung
Schrasing Tong
+ Improved Speech Representations with Multi-Target Autoregressive Predictive Coding 2020 Yu-An Chung
James Glass
+ Vector-Quantized Autoregressive Predictive Coding 2020 Yu-An Chung
Hao Tang
James Glass
+ Improved Speech Representations with Multi-Target Autoregressive Predictive Coding 2020 Yu-An Chung
James Glass
+ Similarity Analysis of Self-Supervised Speech Representations 2020 Yu-An Chung
Yonatan Belinkov
James Glass
+ SPLAT: Speech-Language Joint Pre-Training for Spoken Language Understanding 2020 Yu-An Chung
Chenguang Zhu
Michael Zeng
+ Non-Autoregressive Predictive Coding for Learning Speech Representations from Local Dependencies 2020 Alexander H. Liu
Yu-An Chung
James Glass
+ PDF Chat An Unsupervised Autoregressive Model for Speech Representation Learning 2019 Yu-An Chung
Wei-Ning Hsu
Hao Tang
James Glass
+ Unsupervised Clinical Language Translation 2019 Wei‐Hung Weng
Yu-An Chung
Peter Szolovits
+ PDF Chat Semi-supervised Training for Improving Data Efficiency in End-to-end Speech Synthesis 2019 Yu-An Chung
Yuxuan Wang
Wei-Ning Hsu
Yu Zhang
RJ Skerry-Ryan
+ PDF Chat Towards Unsupervised Speech-to-text Translation 2019 Yu-An Chung
Wei‐Hung Weng
Schrasing Tong
James Glass
+ Unsupervised Clinical Language Translation 2019 Wei‐Hung Weng
Yu-An Chung
Peter Szolovits
+ An Unsupervised Autoregressive Model for Speech Representation Learning 2019 Yu-An Chung
Wei-Ning Hsu
Hao Tang
James Glass
+ Towards Transfer Learning for End-to-End Speech Synthesis from Deep Pre-Trained Language Models 2019 Wei Fang
Yu-An Chung
James Glass
+ SummAE: Zero-Shot Abstractive Text Summarization using Length-Agnostic Auto-Encoders 2019 Peter J. Liu
Yu-An Chung
Jie Ren
+ Generative Pre-Training for Speech with Autoregressive Predictive Coding 2019 Yu-An Chung
James Glass
+ PDF Chat Speech2Vec: A Sequence-to-Sequence Framework for Learning Word Embeddings from Speech 2018 Yu-An Chung
James Glass
+ Speech2Vec: A Sequence-to-Sequence Framework for Learning Word Embeddings from Speech 2018 Yu-An Chung
James Glass
+ Unsupervised Cross-Modal Alignment of Speech and Text Embedding Spaces 2018 Yu-An Chung
Wei‐Hung Weng
Schrasing Tong
James Glass
+ Towards Unsupervised Speech-to-Text Translation 2018 Yu-An Chung
Wei‐Hung Weng
Schrasing Tong
James Glass
+ Semi-Supervised Training for Improving Data Efficiency in End-to-End Speech Synthesis 2018 Yu-An Chung
Yuxuan Wang
Wei-Ning Hsu
Yu Zhang
RJ Skerry-Ryan
+ Supervised and Unsupervised Transfer Learning for Question Answering 2018 Yu-An Chung
Hung-yi Lee
James Glass
+ libact: Pool-based Active Learning in Python 2017 Yao-Yuan Yang
Shao-Chuan Lee
Yu-An Chung
Tung-En Wu
Sian Chen
Hsuan-Tien Lin
+ Learning Word Embeddings from Speech 2017 Yu-An Chung
James Glass
+ Learning Deep Representations of Medical Images using Siamese CNNs with Application to Content-Based Image Retrieval 2017 Yu-An Chung
Wei‐Hung Weng
+ Supervised and Unsupervised Transfer Learning for Question Answering 2017 Yu-An Chung
Hung-yi Lee
James Glass
+ PDF Chat Audio Word2Vec: Unsupervised Learning of Audio Segment Representations Using Sequence-to-Sequence Autoencoder 2016 Yu-An Chung
Chao-Chung Wu
Chia-Hao Shen
Hung-yi Lee
Lin-Shan Lee
+ Cost-aware pre-training for multiclass cost-sensitive deep learning 2016 Yu-An Chung
Hsuan-Tien Lin
Shao‐Wen Yang
+ Unsupervised Learning of Audio Segment Representations using Sequence-to-sequence Recurrent Neural Networks 2016 Yu-An Chung
Chao-Chung Wu
Chia-Hao Shen
Hung-yi Lee
+ Audio Word2Vec: Unsupervised Learning of Audio Segment Representations using Sequence-to-sequence Autoencoder 2016 Yu-An Chung
Chao-Chung Wu
Chia-Hao Shen
Hung-yi Lee
Lin-Shan Lee
+ Cost-Sensitive Deep Learning with Layer-Wise Cost Estimation 2016 Yu-An Chung
Shao‐Wen Yang
Hsuan-Tien Lin
+ Cost-aware Pre-training for Multiclass Cost-sensitive Deep Learning 2015 Yu-An Chung
Hsuan-Tien Lin
Shao‐Wen Yang
Common Coauthors
Commonly Cited References
Action Title Year Authors # of times referenced
+ PDF Chat An Unsupervised Autoregressive Model for Speech Representation Learning 2019 Yu-An Chung
Wei-Ning Hsu
Hao Tang
James Glass
16
+ PDF Chat wav2vec: Unsupervised Pre-Training for Speech Recognition 2019 Steffen Schneider
Alexei Baevski
Ronan Collobert
Michael Auli
15
+ Attention is All you Need 2017 Ashish Vaswani
Noam Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan N. Gomez
Łukasz Kaiser
Illia Polosukhin
14
+ Representation Learning with Contrastive Predictive Coding 2018 Aäron van den Oord
Yazhe Li
Oriol Vinyals
14
+ Deep Contextualized Word Representations 2018 Matthew E. Peters
Mark E Neumann
Mohit Iyyer
Matt Gardner
Christopher Clark
Kenton Lee
Luke Zettlemoyer
13
+ Adam: A Method for Stochastic Optimization 2014 Diederik P. Kingma
Jimmy Ba
13
+ PDF Chat Mockingjay: Unsupervised Speech Representation Learning with Deep Bidirectional Transformer Encoders 2020 Andy T. Liu
Shu-Wen Yang
Po-Han Chi
Po‐Chun Hsu
Hung-yi Lee
12
+ PDF Chat SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition 2019 Daniel Park
William Chan
Yu Zhang
Chung‐Cheng Chiu
Barret Zoph
Ekin D. Cubuk
Quoc V. Le
11
+ PDF Chat Deep Residual Learning for Image Recognition 2016 Kaiming He
Xiangyu Zhang
Shaoqing Ren
Jian Sun
11
+ PDF Chat Learning Problem-Agnostic Speech Representations from Multiple Self-Supervised Tasks 2019 Santiago Pascual
Mirco Ravanelli
Joan SerrĂ 
Antonio Bonafonte
Yoshua Bengio
10
+ PDF Chat Unsupervised Speech Representation Learning Using WaveNet Autoencoders 2019 Jan Chorowski
Ron J. Weiss
Samy Bengio
Aäron van den Oord
9
+ Attention Is All You Need 2017 Ashish Vaswani
Noam Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan N. Gomez
Łukasz Kaiser
Illia Polosukhin
9
+ Representation Learning with Contrastive Predictive Coding 2018 Aäron van den Oord
Yazhe Li
Oriol Vinyals
9
+ PDF Chat Deep Contextualized Acoustic Representations for Semi-Supervised Speech Recognition 2020 Shaoshi Ling
Yuzong Liu
JuliĂĄn Salazar
Katrin Kirchhoff
9
+ Improved Speech Representations with Multi-Target Autoregressive Predictive Coding 2020 Yu-An Chung
James Glass
9
+ PDF Chat Generative Pre-Training for Speech with Autoregressive Predictive Coding 2020 Yu-An Chung
James Glass
9
+ PDF Chat Speech2Vec: A Sequence-to-Sequence Framework for Learning Word Embeddings from Speech 2018 Yu-An Chung
James Glass
8
+ vq-wav2vec: Self-Supervised Learning of Discrete Speech Representations 2020 Alexei Baevski
Steffen Schneider
Michael Auli
8
+ Sequence to Sequence Learning with Neural Networks 2014 Ilya Sutskever
Oriol Vinyals
Quoc V. Le
8
+ PDF Chat Unsupervised Pre-Training of Bidirectional Speech Encoders via Masked Reconstruction 2020 Weiran Wang
Qingming Tang
Karen Livescu
8
+ PDF Chat Conformer: Convolution-augmented Transformer for Speech Recognition 2020 Anmol Gulati
James Qin
Chung‐Cheng Chiu
Niki Parmar
Yu Zhang
Jiahui Yu
Wei Han
Shibo Wang
Zhengdong Zhang
Yonghui Wu
8
+ PDF Chat Audio Word2Vec: Unsupervised Learning of Audio Segment Representations Using Sequence-to-Sequence Autoencoder 2016 Yu-An Chung
Chao-Chung Wu
Chia-Hao Shen
Hung-yi Lee
Lin-Shan Lee
7
+ PDF Chat Libri-Light: A Benchmark for ASR with Limited or No Supervision 2020 Jacob Kahn
Maude Rivière
Wenlong Zheng
Eugene Kharitonov
Qinmei Xu
Pierre-Emmanuel MazarĂŠ
Julien Karadayi
Vitaliy Liptchinsky
Ronan Collobert
Christian Fuegen
7
+ PDF Chat Unspeech: Unsupervised Speech Context Embeddings 2018 Benjamin Milde
Chris Biemann
7
+ Learning Word Embeddings from Speech 2017 Yu-An Chung
James Glass
7
+ BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding 2018 Jacob Devlin
Ming‐Wei Chang
Kenton Lee
Kristina Toutanova
7
+ PDF Chat Deep convolutional acoustic word embeddings using word-pair side information 2016 Herman Kamper
Weiran Wang
Karen Livescu
7
+ Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation 2014 Kyunghyun Cho
Bart van MerriĂŤnboer
Çaǧlar Gülçehre
Dzmitry Bahdanau
Fethi Bougares
Holger Schwenk
Yoshua Bengio
7
+ Generating Wikipedia by Summarizing Long Sequences 2018 Peter J. Liu
Mohammad Saleh
Etienne Pot
Ben Goodrich
Ryan Sepassi
Łukasz Kaiser
Noam Shazeer
6
+ PDF Chat TERA: Self-Supervised Learning of Transformer Encoder Representation for Speech 2021 Andy T. Liu
Shang-Wen Li
Hung-yi Lee
6
+ PDF Chat Speech-XLNet: Unsupervised Acoustic Model Pretraining for Self-Attention Networks 2020 Xingchen Song
Guangsen Wang
Yiheng Huang
Zhiyong Wu
Dan Su
Helen Meng
6
+ Sequence to Sequence Learning with Neural Networks 2014 Ilya Sutskever
Oriol Vinyals
Quoc V. Le
6
+ Audio Word2Vec: Unsupervised Learning of Audio Segment Representations using Sequence-to-sequence Autoencoder 2016 Yu-An Chung
Chao-Chung Wu
Chia-Hao Shen
Hung-yi Lee
Lin-Shan Lee
6
+ PDF Chat Enriching Word Vectors with Subword Information 2017 Piotr Bojanowski
Édouard Grave
Armand Joulin
TomĂĄĹĄ Mikolov
6
+ PDF Chat Universal Language Model Fine-tuning for Text Classification 2018 Jeremy Howard
Sebastian Ruder
6
+ wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations 2020 Alexei Baevski
Henry Zhou
Abdelrahman Mohamed
Michael Auli
6
+ Effectiveness of self-supervised pre-training for speech recognition 2019 Alexei Baevski
Michael Auli
Abdelrahman Mohamed
5
+ Attention-Based Models for Speech Recognition 2015 Jan Chorowski
Dzmitry Bahdanau
Dmitriy Serdyuk
Kyunghyun Cho
Yoshua Bengio
5
+ Distributed Representations of Words and Phrases and their Compositionality 2013 TomĂĄĹĄ Mikolov
Ilya Sutskever
Kai Chen
Greg S. Corrado
Jeffrey Dean
5
+ Generating Sequences With Recurrent Neural Networks 2013 Alex Graves
5
+ Discriminative Acoustic Word Embeddings: Recurrent Neural Network-Based Approaches 2016 Shane Settle
Karen Livescu
5
+ Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation 2016 Yonghui Wu
Mike Schuster
Zhifeng Chen
Quoc V. Le
Mohammad Norouzi
Wolfgang Macherey
Maxim Krikun
Yuan Cao
Qin Gao
Klaus Macherey
5
+ RoBERTa: A Robustly Optimized BERT Pretraining Approach 2019 Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
Mike Lewis
Luke Zettlemoyer
Veselin Stoyanov
5
+ PDF Chat Vector-Quantized Autoregressive Predictive Coding 2020 Yu-An Chung
Hao Tang
James Glass
5
+ PDF Chat Semi-supervised Training for Improving Data Efficiency in End-to-end Speech Synthesis 2019 Yu-An Chung
Yuxuan Wang
Wei-Ning Hsu
Yu Zhang
RJ Skerry-Ryan
5
+ PDF Chat Unsupervised Pretraining Transfers Well Across Languages 2020 Morgane Rivière
Armand Joulin
Pierre-Emmanuel MazarĂŠ
Emmanuel Dupoux
5
+ Neural Machine Translation by Jointly Learning to Align and Translate 2014 Dzmitry Bahdanau
Kyunghyun Cho
Yoshua Bengio
5
+ Neural Discrete Representation Learning 2017 Aäron van den Oord
Oriol Vinyals
Koray Kavukcuoglu
5
+ Augmenting Librispeech with French Translations: A Multimodal Corpus for Direct Speech Translation Evaluation 2018 Ali Can Kocabiyikoglu
Laurent Besacier
Olivier Kraif
4
+ Improving Transformer-based Speech Recognition Using Unsupervised Pre-training 2019 Dongwei Jiang
Xiaoning Lei
Wubo Li
Ne Luo
Yuxuan Hu
Wei Zou
Xiangang Li
4