Paden Tomasello

Follow

Generating author description...

All published works
Action Title Year Authors
+ PDF Chat SSR: Alignment-Aware Modality Connector for Speech Language Models 2024 Weiting Tan
Hirofumi Inaguma
Ning Dong
Paden Tomasello
Xutai Ma
+ Continual Learning for On-Device Speech Recognition Using Disentangled Conformers 2023 Anuj Diwan
Ching-Feng Yeh
Wei-Ning Hsu
Paden Tomasello
Eunsol Choi
David Harwath
Abdelrahman Mohamed
+ PDF Chat Stop: A Dataset for Spoken Task Oriented Semantic Parsing 2023 Paden Tomasello
Akshat Shrivastava
Daniel A. Lazar
Po‐Chun Hsu
Duc Van Le
Adithya Sagar
Ali Elkahky
Jade Copet
Wei-Ning Hsu
Yossi Adi
+ PDF Chat Generative Spoken Dialogue Language Modeling 2023 Tu Anh Nguyen
Eugene Kharitonov
Jade Copet
Yossi Adi
Wei-Ning Hsu
Ali Elkahky
Paden Tomasello
Robin Algayres
Benoît Sagot
Abdelrahman Mohamed
+ Efficient Speech Representation Learning with Low-Bit Quantization 2023 Ching-Feng Yeh
Wei-Ning Hsu
Paden Tomasello
Abdelrahman Mohamed
+ Hybrid Transducer and Attention based Encoder-Decoder Modeling for Speech-to-Text Tasks 2023 Yun Tang
Anna Y. Sun
Hirofumi Inaguma
Xinyue Chen
Ning Dong
Xutai Ma
Paden Tomasello
Juan Pino
+ Scaling Speech Technology to 1,000+ Languages 2023 Vineel Pratap
Andros Tjandra
Bowen Shi
Paden Tomasello
Arun Babu
Sayani Kundu
Ali Elkahky
Zhaoheng Ni
Apoorv Vyas
Maryam Fazel-Zarandi
+ PDF Chat Speech-to-Speech Translation for a Real-world Unwritten Language 2023 Peng‐Jen Chen
Kevin Tran
Yilin Yang
Jingfei Du
Justine Kao
Yu-An Chung
Paden Tomasello
Paul-Ambroise Duquenne
Holger Schwenk
Hongyu Gong
+ Hybrid Transducer and Attention based Encoder-Decoder Modeling for Speech-to-Text Tasks 2023 Yun Tang
Anna Sun
Hirofumi Inaguma
Xinyue Chen
Ning Dong
Xutai Ma
Paden Tomasello
Juan Pino
+ SeamlessM4T: Massively Multilingual & Multimodal Machine Translation 2023 Seamless Communication
Loïc Barrault
Yu-An Chung
Mariano Cora Meglioli
David C. Dale
Ning Dong
Paul-Ambroise Duquenne
Hady Elsahar
Hongyu Gong
Kevin S. Heffernan
+ Efficient Monotonic Multihead Attention 2023 Xutai Ma
Anna Sun
Siqi Ouyang
Hirofumi Inaguma
Paden Tomasello
+ Seamless: Multilingual Expressive and Streaming Speech Translation 2023 Seamless Communication
Loïc Barrault
Yu-An Chung
Mariano Coria Meglioli
David C. Dale
Ning Dong
Mark Duppenthaler
Paul-Ambroise Duquenne
Brian E. Ellis
Hady Elsahar
+ PDF Chat Deliberation Model for On-Device Spoken Language Understanding 2022 Manh Duc Le
Akshat Shrivastava
Paden Tomasello
Suyoun Kim
Aleksandr Livshits
Ozlem Kalinli
Michael L. Seltzer
+ Flashlight: Enabling Innovation in Tools for Machine Learning 2022 Jacob Kahn
Vineel Pratap
Tatiana Likhomanenko
Qiantong Xu
Awni Hannun
Jeff Cai
Paden Tomasello
Ann Lee
Édouard Grave
Gilad Avidov
+ textless-lib: a Library for Textless Spoken Language Processing 2022 Eugene Kharitonov
Jade Copet
Kushal Lakhotia
Tu Anh Nguyen
Paden Tomasello
Ann Lee
Ali Elkahky
Wei-Ning Hsu
Abdelrahman Mohamed
Emmanuel Dupoux
+ Generative Spoken Dialogue Language Modeling 2022 Tu Anh Nguyen
Eugene Kharitonov
Jade Copet
Yossi Adi
Wei-Ning Hsu
Ali Elkahky
Paden Tomasello
Robin Algayres
Benoît Sagot
Abdelrahman Mohamed
+ STOP: A dataset for Spoken Task Oriented Semantic Parsing 2022 Paden Tomasello
Po‐Chun Hsu
Akshat Shrivastava
Daniel A. Lazar
Manh Duc Le
Adithya Sagar
Ali Elkahky
Jade Copet
Wei-Ning Hsu
Yossef Mordechay
+ textless-lib: a Library for Textless Spoken Language Processing 2022 Eugene Kharitonov
Jade Copet
Kushal Lakhotia
Tu Anh Nguyen
Paden Tomasello
Ann Lee
Ali Elkahky
Wei-Ning Hsu
Abdelrahman Mohamed
Emmanuel Dupoux
+ Deliberation Model for On-Device Spoken Language Understanding 2022 Manh Duc Le
Akshat Shrivastava
Paden Tomasello
Suyoun Kim
Aleksandr Livshits
Ozlem Kalinli
Michael L. Seltzer
+ Speech-to-Speech Translation For A Real-world Unwritten Language 2022 Peng‐Jen Chen
Kevin Tran
Yilin Yang
Jingfei Du
Justine Kao
Yu-An Chung
Paden Tomasello
Paul-Ambroise Duquenne
Holger Schwenk
Hongyu Gong
+ Continual Learning for On-Device Speech Recognition using Disentangled Conformers 2022 Anuj Diwan
Ching-Feng Yeh
Wei-Ning Hsu
Paden Tomasello
Eunsol Choi
David Harwath
Abdelrahman Mohamed
+ PDF Chat Rethinking Evaluation in ASR: Are Our Models Robust Enough? 2021 Tatiana Likhomanenko
Qiantong Xu
Vineel Pratap
Paden Tomasello
Jacob Kahn
Gilad Avidov
Ronan Collobert
Gabriel Synnaeve
+ PDF Chat Self-Training and Pre-Training are Complementary for Speech Recognition 2021 Qiantong Xu
Alexei Baevski
Tatiana Likhomanenko
Paden Tomasello
Alexis Conneau
Ronan Collobert
Gabriel Synnaeve
Michael Auli
+ PDF Chat Massively Multilingual ASR: 50 Languages, 1 Model, 1 Billion Parameters 2020 Vineel Pratap
Anuroop Sriram
Paden Tomasello
Awni Hannun
Vitaliy Liptchinsky
Gabriel Synnaeve
Ronan Collobert
+ Rethinking Evaluation in ASR: Are Our Models Robust Enough? 2020 Tatiana Likhomanenko
Qiantong Xu
Vineel Pratap
Paden Tomasello
Jacob Kahn
Gilad Avidov
Ronan Collobert
Gabriel Synnaeve
+ Massively Multilingual ASR: 50 Languages, 1 Model, 1 Billion Parameters 2020 Vineel Pratap
Anuroop Sriram
Paden Tomasello
Awni Hannun
Vitaliy Liptchinsky
Gabriel Synnaeve
Ronan Collobert
+ Self-training and Pre-training are Complementary for Speech Recognition 2020 Qiantong Xu
Alexei Baevski
Tatiana Likhomanenko
Paden Tomasello
Alexis Conneau
Ronan Collobert
Gabriel Synnaeve
Michael Auli
+ Rethinking Evaluation in ASR: Are Our Models Robust Enough? 2020 Tatiana Likhomanenko
Qiantong Xu
Vineel Pratap
Paden Tomasello
Jacob Kahn
Gilad Avidov
Ronan Collobert
Gabriel Synnaeve
+ Massively Multilingual ASR: 50 Languages, 1 Model, 1 Billion Parameters 2020 Vineel Pratap
Anuroop Sriram
Paden Tomasello
Awni Hannun
Vitaliy Liptchinsky
Gabriel Synnaeve
Ronan Collobert
+ PDF Chat DSCnet: Replicating Lidar Point Clouds With Deep Sensor Cloning 2019 Paden Tomasello
Sammy Sidhu
Anting Shen
Matthew W. Moskewicz
Nobie Redmon
Gayatri Joshi
Romi Phadte
Paras Jain
Forrest Iandola
+ DSCnet: Replicating Lidar Point Clouds with Deep Sensor Cloning. 2018 Paden Tomasello
Sammy Sidhu
Anting Shen
Matthew W. Moskewicz
Nobie Redmon
Gayatri Joshi
Romi Phadte
Paras Jain
Forrest Iandola
+ DSCnet: Replicating Lidar Point Clouds with Deep Sensor Cloning 2018 Paden Tomasello
Sammy Sidhu
Anting Shen
Matthew W. Moskewicz
Nobie Redmon
Gayatri Joshi
Romi Phadte
Paras Jain
Forrest Iandola
Common Coauthors
Commonly Cited References
Action Title Year Authors # of times referenced
+ PDF Chat SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition 2019 Daniel Park
William Chan
Yu Zhang
Chung‐Cheng Chiu
Barret Zoph
Ekin D. Cubuk
Quoc V. Le
9
+ wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations 2020 Alexei Baevski
Henry Zhou
Abdelrahman Mohamed
Michael Auli
6
+ End-to-end ASR: from Supervised to Semi-Supervised Learning with Modern Architectures 2019 Gabriel Synnaeve
Qiantong Xu
Jacob Kahn
Édouard Grave
Tatiana Likhomanenko
Vineel Pratap
Anuroop Sriram
Vitaliy Liptchinsky
Ronan Collobert
6
+ PDF Chat Libri-Light: A Benchmark for ASR with Limited or No Supervision 2020 Jacob Kahn
Maude Rivière
Wenlong Zheng
Eugene Kharitonov
Qinmei Xu
Pierre-Emmanuel Mazaré
Julien Karadayi
Vitaliy Liptchinsky
Ronan Collobert
Christian Fuegen
5
+ PDF Chat Wav2Letter++: A Fast Open-source Speech Recognition System 2019 Vineel Pratap
Awni Hannun
Qiantong Xu
Jeff Cai
Jacob Kahn
Gabriel Synnaeve
Vitaliy Liptchinsky
Ronan Collobert
5
+ SentencePiece: A simple and language independent subword tokenizer and detokenizer for Neural Text Processing 2018 Taku Kudo
John T. E. Richardson
5
+ PDF Chat Conformer: Convolution-augmented Transformer for Speech Recognition 2020 Anmol Gulati
James Qin
Chung‐Cheng Chiu
Niki Parmar
Yu Zhang
Jiahui Yu
Wei Han
Shibo Wang
Zhengdong Zhang
Yonghui Wu
5
+ Attention Is All You Need 2017 Ashish Vaswani
Noam Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan N. Gomez
Łukasz Kaiser
Illia Polosukhin
4
+ Adaptive Input Representations for Neural Language Modeling 2018 Alexei Baevski
Michael Auli
4
+ Sequence Transduction with Recurrent Neural Networks 2012 Alex Graves
4
+ PDF Chat fairseq: A Fast, Extensible Toolkit for Sequence Modeling 2019 Myle Ott
Sergey Edunov
Alexei Baevski
Angela Fan
Sam Gross
Nathan Ng
David Grangier
Michael Auli
4
+ PDF Chat wav2vec: Unsupervised Pre-Training for Speech Recognition 2019 Steffen Schneider
Alexei Baevski
Ronan Collobert
Michael Auli
3
+ HuBERT: Self-Supervised Speech Representation Learning by Masked Prediction of Hidden Units 2021 Wei-Ning Hsu
Benjamin Bolte
Yao-Hung Hubert Tsai
Kushal Lakhotia
Ruslan Salakhutdinov
Abdelrahman Mohamed
3
+ Attention is All you Need 2017 Ashish Vaswani
Noam Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan N. Gomez
Łukasz Kaiser
Illia Polosukhin
3
+ BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding 2018 Jacob Devlin
Ming‐Wei Chang
Kenton Lee
Kristina Toutanova
3
+ Representation Learning with Contrastive Predictive Coding 2018 Aäron van den Oord
Yazhe Li
Oriol Vinyals
3
+ PDF Chat TED-LIUM 3: Twice as Much Data and Corpus Repartition for Experiments on Speaker Adaptation 2018 François Hernandez
Vincent Nguyen
Sahar Ghannay
Natalia Tomashenko
Yannick Estève
3
+ Conformer: Convolution-augmented Transformer for Speech Recognition 2020 Anmol Gulati
James Qin
Chung‐Cheng Chiu
Niki Parmar
Yu Zhang
Jiahui Yu
Wei Han
Shibo Wang
Zhengdong Zhang
Yonghui Wu
3
+ Common Voice: A Massively-Multilingual Speech Corpus 2019 Rosana Ardila
Megan Branson
Kelly Davis
Michael Henretty
Michael Köhler
Josh Meyer
Reuben Morais
Lindsay Saunders
Francis M. Tyers
Gregor Weber
3
+ PDF Chat Speech Resynthesis from Discrete Disentangled Self-Supervised Representations 2021 Adam Polyak
Yossi Adi
Jade Copet
Eugene Kharitonov
Kushal Lakhotia
Wei-Ning Hsu
Abdelrahman Mohamed
Emmanuel Dupoux
3
+ PDF Chat Subword Regularization: Improving Neural Network Translation Models with Multiple Subword Candidates 2018 Taku Kudo
3
+ vq-wav2vec: Self-Supervised Learning of Discrete Speech Representations 2020 Alexei Baevski
Steffen Schneider
Michael Auli
2
+ Effectiveness of self-supervised pre-training for speech recognition 2019 Alexei Baevski
Michael Auli
Abdelrahman Mohamed
2
+ Semi-Supervised Speech Recognition via Local Prior Matching 2020 Wei-Ning Hsu
Ann Lee
Gabriel Synnaeve
Awni Hannun
2
+ ENet: A Deep Neural Network Architecture for Real-Time Semantic Segmentation 2016 Adam Paszke
Abhishek Chaurasia
Sangpil Kim
Eugenio Culurciello
2
+ V-Net: Fully Convolutional Neural Networks for Volumetric Medical Image Segmentation 2016 Fausto Milletarì
Nassir Navab
Seyed‐Ahmad Ahmadi
2
+ Reducing Transformer Depth on Demand with Structured Dropout 2019 Angela Fan
Édouard Grave
Armand Joulin
2
+ PDF Chat An Unsupervised Autoregressive Model for Speech Representation Learning 2019 Yu-An Chung
Wei-Ning Hsu
Hao Tang
James Glass
2
+ Representation Learning with Contrastive Predictive Coding 2018 Aäron van den Oord
Yazhe Li
Oriol Vinyals
2
+ PDF Chat The Fifth 'CHiME' Speech Separation and Recognition Challenge: Dataset, Task and Baselines 2018 Jon Barker
Shinji Watanabe
Emmanuel Vincent
Jan Trmal
2
+ VoxelNet: End-to-End Learning for Point Cloud Based 3D Object Detection 2017 Yin Zhou
Oncel Tuzel
2
+ Improving Transformer-based Speech Recognition Using Unsupervised Pre-training 2019 Dongwei Jiang
Xiaoning Lei
Wubo Li
Ne Luo
Yuxuan Hu
Wei Zou
Xiangang Li
2
+ PDF Chat Unsupervised Pre-Training of Bidirectional Speech Encoders via Masked Reconstruction 2020 Weiran Wang
Qingming Tang
Karen Livescu
2
+ PDF Chat Bytes Are All You Need: End-to-end Multilingual Speech Recognition and Synthesis with Bytes 2019 Bo Li
Yu Zhang
Tara N. Sainath
Yonghui Wu
William Chan
2
+ PDF Chat Fully convolutional networks for semantic segmentation 2015 Jonathan Long
Evan Shelhamer
Trevor Darrell
2
+ PDF Chat Multilingual Speech Recognition with a Single End-to-End Model 2018 Shubham Toshniwal
Tara N. Sainath
Ron J. Weiss
Bo Li
Pedro J. Moreno
Eugene Weinstein
Kanishka Rao
2
+ PDF Chat Massively Multilingual Adversarial Speech Recognition 2019 Oliver Adams
Matthew Wiesner
Shinji Watanabe
David Yarowsky
2
+ SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <0.5MB model size 2016 Forrest Iandola
Song Han
Matthew W. Moskewicz
Khalid Ashraf
William J. Dally
Kurt Keutzer
2
+ PDF Chat Adversarial Training of End-to-end Speech Recognition Using a Criticizing Language Model 2019 Alexander H. Liu
Hung-yi Lee
Lin-Shan Lee
2
+ A Call for Clarity in Reporting BLEU Scores 2018 Matt Post
2
+ RoBERTa: A Robustly Optimized BERT Pretraining Approach 2019 Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
Mike Lewis
Luke Zettlemoyer
Veselin Stoyanov
2
+ Cross-lingual Language Model Pretraining 2019 Guillaume Lample
Alexis Conneau
2
+ Deep Speech 2: End-to-End Speech Recognition in English and Mandarin 2015 Dario Amodei
Rishita Anubhai
Eric Battenberg
Carl Case
Jared Casper
Bryan Catanzaro
Jingdong Chen
Mike Chrzanowski
Adam Coates
Greg Diamos
2
+ You Only Look Once: Unified, Real-Time Object Detection 2015 Joseph Redmon
Santosh Divvala
Ross Girshick
Ali Farhadi
2
+ Depth Map Prediction from a Single Image using a Multi-Scale Deep Network 2014 David Eigen
Christian Puhrsch
Rob Fergus
2
+ MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications 2017 Andrew Howard
Menglong Zhu
Bo Chen
Dmitry Kalenichenko
Weijun Wang
Tobias Weyand
Marco Andreetto
Hartwig Adam
2
+ PDF Chat Building and evaluation of a real room impulse response dataset 2019 Igor Szöke
Miroslav Skácel
Ladislav Mošner
Jakub Paliesek
Jaň Černocký
2
+ SqueezeDet: Unified, Small, Low Power Fully Convolutional Neural Networks for Real-Time Object Detection for Autonomous Driving 2016 Bichen Wu
Alvin Wan
Forrest Iandola
Peter Jin
Kurt Keutzer
2
+ The CAPIO 2017 Conversational Speech Recognition System 2018 Kyu J. Han
Akshay Chandrashekaran
Jungsuk Kim
Ian Lane
2
+ PDF Chat Towards End-to-end Spoken Language Understanding 2018 Dmitriy Serdyuk
Yongqiang Wang
Christian Fuegen
Anuj Kumar
Baiyang Liu
Yoshua Bengio
2