Learning Robust and Multilingual Speech Representations

Type: Article

Publication Date: 2020-01-01

Citations: 77

DOI: https://doi.org/10.18653/v1/2020.findings-emnlp.106

Locations

  • arXiv (Cornell University) - View - PDF

Similar Works

Action Title Year Authors
+ Learning Robust and Multilingual Speech Representations 2020 Kazuya Kawakami
Luyu Wang
Chris Dyer
Phil Blunsom
Aäron van den Oord
+ PDF Chat Word-Level Embeddings for Cross-Task Transfer Learning in Speech Processing 2021 Pierre Beckmann
Mikolaj Kegler
Miloš Cerňak
+ PDF Chat Adapting multilingual speech representation model for a new, underresourced language through multilingual fine-tuning and continued pretraining 2022 Karol Nowakowski
Michał Ptaszyński
Kyoko Murasaki
Jagna Nieuważny
+ PDF Chat XTREME-S: Evaluating Cross-lingual Speech Representations 2022 Alexis Conneau
Ankur Bapna
Yu Zhang
Min Ma
Patrick von Platen
Anton Lozhkov
Colin Cherry
Jia Ye
Clara Rivera
Mihir Kale
+ XTREME-S: Evaluating Cross-lingual Speech Representations 2022 Alexis Conneau
Ankur Bapna
Yu Zhang
Min Ma
Patrick von Platen
Anton Lozhkov
Colin Cherry
Jia Ye
Clara Rivera
Mihir Kale
+ PDF Chat Self-Supervised Speech Representation Learning: A Review 2022 Abdelrahman Mohamed
Hung-yi Lee
Lasse Borgholt
Jakob D. Havtorn
Joakim Edin
Christian Igel
Katrin Kirchhoff
Shang-Wen Li
Karen Livescu
Lars Maaløe
+ Self-Supervised Speech Representation Learning: A Review 2022 Abdelrahman Mohamed
Hung-yi Lee
Lasse Borgholt
Jakob D. Havtorn
Joakim Edin
Christian Igel
Katrin Kirchhoff
Shang-Wen Li
Karen Livescu
Lars Maaløe
+ PDF Chat Learning Invariant Representation and Risk Minimized for Unsupervised Accent Domain Adaptation 2023 Chendong Zhao
Jianzong Wang
Xiaoyang Qu
Haoqian Wang
Jing Xiao
+ Learning Invariant Representation and Risk Minimized for Unsupervised Accent Domain Adaptation 2022 Chendong Zhao
Jianzong Wang
Xiaoyang Qu
Haoqian Wang
Jing Xiao
+ Multilingual Speech Recognition using Knowledge Transfer across Learning Processes 2021 Rimita Lahiri
Kenichi Kumatani
Eric Sun
Yao Qian
+ Unsupervised Cross-lingual Representation Learning for Speech Recognition 2020 Alexis Conneau
Alexei Baevski
Ronan Collobert
Abdelrahman Mohamed
Michael Auli
+ A Survey of Multilingual Models for Automatic Speech Recognition 2022 Hemant Kumar Yadav
Sunayana Sitaram
+ PDF Chat XLS-R: Self-supervised Cross-lingual Speech Representation Learning at Scale 2021 Arun Babu
Changhan Wang
Andros Tjandra
Kushal Lakhotia
Qiantong Xu
Naman Goyal
Kritika Singh
Patrick von Platen
Yatharth Saraf
Juan Pino
+ Transfer Learning for Speech and Language Processing 2015 Dong Wang
Thomas Fang Zheng
+ MAESTRO: Matched Speech Text Representations through Modality Matching 2022 Zhehuai Chen
Yu Zhang
Andrew E. Rosenberg
Bhuvana Ramabhadran
Pedro Moreno
Ankur Bapna
Heiga Zen
+ PDF Chat MAESTRO: Matched Speech Text Representations through Modality Matching 2022 Zhehuai Chen
Zhang Yu
Andrew E. Rosenberg
Bhuvana Ramabhadran
Pedro J. Moreno
Ankur Bapna
Heiga Zen
+ PDF Chat Learning Noise-Invariant Representations for Robust Speech Recognition 2018 Davis Liang
Zhiheng Huang
Zachary C. Lipton
+ Improved acoustic word embeddings for zero-resource languages using multilingual transfer 2020 Herman Kamper
Yevgen Matusevych
Sharon Goldwater
+ Improved acoustic word embeddings for zero-resource languages using multilingual transfer 2020 Herman Kamper
Yevgen Matusevych
Sharon Goldwater
+ VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation Learning, Semi-Supervised Learning and Interpretation 2021 Changhan Wang
Morgane Rivière
Ann Lee
Anne Wu
Chaitanya Talnikar
Daniel Haziza
Mary Williamson
Juan Pino
Emmanuel Dupoux

Works That Cite This (56)

Action Title Year Authors
+ PDF Chat LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech 2021 Solène Evain
Ha-Thanh Nguyen
Hang Le
Marcely Zanon Boito
Salima Mdhaffar
Sina Alisamir
Ziyi Tong
Natalia Tomashenko
Marco Dinarelli
Titouan Parcollet
+ PDF Chat An Evaluation of Self-supervised Pre-training for Skin-Lesion Analysis 2023 Levy Chaves
Alceu Bissoto
Eduardo Valle
Sandra Avila
+ PDF Chat Contrastive Multiview Coding 2020 Yonglong Tian
Dilip Krishnan
Phillip Isola
+ PDF Chat TERA: Self-Supervised Learning of Transformer Encoder Representation for Speech 2021 Andy T. Liu
Shang-Wen Li
Hung-yi Lee
+ Cocktail Hubert: Generalized Self-Supervised Pre-Training for Mixture and Single-Source Speech 2023 Maryam Fazel-Zarandi
Wei-Ning Hsu
+ Exploration of Language Dependency for Japanese Self-Supervised Speech Representation Models 2023 Takanori Ashihara
Takafumi Moriya
Kohei Matsuura
Tomohiro Tanaka
+ Multilingual Speech Translation from Efficient Finetuning of Pretrained Models 2021 Xian Li
Changhan Wang
Yun Tang
Chau Tran
Yuqing Tang
Juan Pino
Alexei Baevski
Alexis Conneau
Michael Auli
+ PDF Chat BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition 2020 Shaoshi Ling
Julián Salazar
Yuzong Liu
Katrin Kirchhoff
+ Multilingual Speech Translation with Efficient Finetuning of Pretrained Models 2020 Xian Li
Changhan Wang
Yun Tang
Chau Tran
Yuqing Tang
Juan Pino
Alexei Baevski
Alexis Conneau
Michael Auli
+ PDF Chat Self-Supervised Representations Improve End-to-End Speech Translation 2020 Anne Wu
Changhan Wang
Juan Pino
Jiatao Gu

Works Cited by This (29)

Action Title Year Authors
+ Deep Speech 2: End-to-End Speech Recognition in English and Mandarin 2015 Dario Amodei
Rishita Anubhai
Eric Battenberg
Carl Case
Jared Casper
Bryan Catanzaro
Jingdong Chen
Mike Chrzanowski
Adam Coates
Greg Diamos
+ Wav2Letter: an End-to-End ConvNet-based Speech Recognition System 2016 Ronan Collobert
Christian Puhrsch
Gabriel Synnaeve
+ Representation Learning with Contrastive Predictive Coding 2018 Aäron van den Oord
Yazhe Li
Oriol Vinyals
+ BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding 2018 Jacob Devlin
Ming‐Wei Chang
Kenton Lee
Kristina Toutanova
+ Mixed-Precision Training for NLP and Speech Recognition with OpenSeq2Seq 2018 Oleksii Kuchaiev
Boris Ginsburg
Igor Gitman
Vitaly Lavrukhin
Jason Li
Huyen Nguyen
Carl Case
Paulius Micikevicius
+ PDF Chat SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition 2019 Daniel Park
William Chan
Yu Zhang
Chung‐Cheng Chiu
Barret Zoph
Ekin D. Cubuk
Quoc V. Le
+ Data-Efficient Image Recognition with Contrastive Predictive Coding 2019 Olivier J. Hénaff
Aravind Srinivas
Jeffrey De Fauw
Ali Razavi
Carl Doersch
S. M. Ali Eslami
Aäron van den Oord
+ Learning Representations by Maximizing Mutual Information Across Views 2019 Philip Bachman
R Devon Hjelm
William Buchwalter
+ On Variational Bounds of Mutual Information 2019 Ben Poole
Sherjil Ozair
Aäron van den Oord
Alexander A. Alemi
George Tucker
+ Large Scale Adversarial Representation Learning 2019 Jeff Donahue
Karen Simonyan