Robin Algayres

All published works
Action Title Year Authors
+ LLM360 K2: Building a 65B 360-Open-Source Large Language Model from Scratch 2025 Zhengzhong Liu
Bowen Tan
Hongyi Wang
Willie Neiswanger
Tianhua Tao
Haonan Li
Fajri Koto
Yuqi Wang
Suqi Sun
Omkar Pangarkar
+ SpiRit-LM: Interleaved Spoken and Written Language Model 2024 Tu Anh Nguyen
Benjamin Müller
Bokai Yu
Marta R. Costa‐jussà
Maha Elbayad
Sravya Popuri
Paul-Ambroise Duquenne
Robin Algayres
Ruslan Mavlyutov
Itai Gat
+ Fine-Tuning Strategies for Faster Inference Using Speech Self-Supervised Models: A Comparative Study 2023 Salah Zaiem
Robin Algayres
Titouan Parcollet
Slim Essid
Mirco Ravanelli
+ STOP: A Dataset for Spoken Task Oriented Semantic Parsing 2023 Paden Tomasello
Akshat Shrivastava
Daniel A. Lazar
Po‐Chun Hsu
Duc Van Le
Adithya Sagar
Ali Elkahky
Jade Copet
Wei-Ning Hsu
Yossi Adi
+ Generative Spoken Dialogue Language Modeling 2023 Tu Anh Nguyen
Eugene Kharitonov
Jade Copet
Yossi Adi
Wei-Ning Hsu
Ali Elkahky
Paden Tomasello
Robin Algayres
Benoît Sagot
Abdelrahman Mohamed
+ Big model only for hard audios: Sample dependent Whisper model selection for efficient inferences 2023 Hugo Malard
Salah Zaiem
Robin Algayres
+ XLS-R fine-tuning on noisy word boundaries for unsupervised speech segmentation into words 2023 Robin Algayres
Pablo Diego-Simón
Benoît Sagot
Emmanuel Dupoux
+ Generative Spoken Language Model based on continuous word-sized audio tokens 2023 Robin Algayres
Yossi Adi
Tu Anh Nguyen
Jade Copet
Gabriel Synnaeve
Benoît Sagot
Emmanuel Dupoux
+ Speech Sequence Embeddings using Nearest Neighbors Contrastive Learning 2022 Robin Algayres
Adel Nabli
Benoît Sagot
Emmanuel Dupoux
+ Generative Spoken Dialogue Language Modeling 2022 Tu Anh Nguyen
Eugene Kharitonov
Jade Copet
Yossi Adi
Wei-Ning Hsu
Ali Elkahky
Paden Tomasello
Robin Algayres
Benoît Sagot
Abdelrahman Mohamed
+ DP-Parse: Finding Word Boundaries from Raw Speech with an Instance Lexicon 2022 Robin Algayres
Tristan Ricoul
Julien Karadayi
Hugo Laurençon
Salah Zaiem
Abdelrahman Mohamed
Benoît Sagot
Emmanuel Dupoux
+ STOP: A dataset for Spoken Task Oriented Semantic Parsing 2022 Paden Tomasello
Po‐Chun Hsu
Akshat Shrivastava
Daniel A. Lazar
Manh Duc Le
Adithya Sagar
Ali Elkahky
Jade Copet
Wei-Ning Hsu
Yossef Mordechay
+ Are word boundaries useful for unsupervised language learning? 2022 Tu Anh Nguyen
Maureen de Seyssel
Robin Algayres
Patricia Roze
Ewan Dunbar
Emmanuel Dupoux
+ Evaluating the Reliability of Acoustic Speech Embeddings 2020 Robin Algayres
Mohamed Salah Zaïem
Benoît Sagot
Emmanuel Dupoux
+ The Zero Resource Speech Challenge 2020: Discovering Discrete Subword and Word Units 2020 Ewan Dunbar
Julien Karadayi
Mathieu Bernard
Xuan-Nga Cao
Robin Algayres
Lucas Ondel
Laurent Besacier
Sakriani Sakti
Emmanuel Dupoux
+ The Zero Resource Speech Challenge 2019: TTS Without T 2019 Ewan Dunbar
Robin Algayres
Julien Karadayi
Mathieu Bernard
Juan Benjumea
Xuan-Nga Cao
Lucie Miskic
Charlotte Dugrain
Lucas Ondel
Alan W. Black
Commonly Cited References
Action Title Year Authors # of times referenced
+ wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations 2020 Alexei Baevski
Henry Zhou
Abdelrahman Mohamed
Michael Auli
8
+ Truly Unsupervised Acoustic Word Embeddings Using Weak Top-down Constraints in Encoder-decoder Models 2019 Herman Kamper
5
+ Self-Expressing Autoencoders for Unsupervised Spoken Term Discovery 2020 Saurabhchand Bhati
Jesús Villalba
Piotr Żelasko
Najim Dehak
4
+ Acoustic Word Embeddings for Zero-Resource Languages Using Self-Supervised Contrastive Learning and Multilingual Adaptation 2021 Christiaan Jacobs
Yevgen Matusevych
Herman Kamper
4
+ Representation Learning with Contrastive Predictive Coding 2018 Aäron van den Oord
Yazhe Li
Oriol Vinyals
4
+ WaveNet: A Generative Model for Raw Audio 2016 Aäron van den Oord
Sander Dieleman
Heiga Zen
Karen Simonyan
Oriol Vinyals
Alexander Graves
Nal Kalchbrenner
Andrew Senior
Koray Kavukcuoglu
4
+ HuBERT: Self-Supervised Speech Representation Learning by Masked Prediction of Hidden Units 2021 Wei-Ning Hsu
Benjamin Bolte
Yao-Hung Hubert Tsai
Kushal Lakhotia
Ruslan Salakhutdinov
Abdelrahman Mohamed
3
+ Discriminative Acoustic Word Embeddings: Recurrent Neural Network-Based Approaches 2016 Shane Settle
Karen Livescu
3
+ Speech Sequence Embeddings using Nearest Neighbors Contrastive Learning 2022 Robin Algayres
Adel Nabli
Benoît Sagot
Emmanuel Dupoux
3
+ SUPERB: Speech Processing Universal PERformance Benchmark 2021 Shu-Wen Yang
Po-Han Chi
Yung-Sung Chuang
Cheng-I Lai
Kushal Lakhotia
Yist Y. Lin
Andy T. Liu
Jiatong Shi
Xuankai Chang
Guan-Ting Lin
3
+ An embedded segmental K-means model for unsupervised segmentation and clustering of speech 2017 Herman Kamper
Karen Livescu
Sharon Goldwater
3
+ Word Segmentation on Discovered Phone Units With Dynamic Programming and Self-Supervised Scoring 2022 Herman Kamper
3
+ Segmental Contrastive Predictive Coding for Unsupervised Word Segmentation 2021 Saurabhchand Bhati
Jesús Villalba
Piotr Żelasko
Laureano Moro‐Velazquez
Najim Dehak
3
+ Efficient Estimation of Word Representations in Vector Space 2013 Tomáš Mikolov
Kai Chen
Greg S. Corrado
Jay B. Dean
3
+ The Zero Resource Speech Challenge 2017 2017 Ewan Dunbar
Xuan-Nga Cao
Juan Benjumea
Julien Karadayi
Mathieu Bernard
Laurent Besacier
Xavier Anguera
Emmanuel Dupoux
3
+ Generative Spoken Language Modeling from Raw Audio 2021 Kushal Lakhotia
E. V. Kharitonov
Wei-Ning Hsu
Yossi Adi
Adam Polyak
Benjamin Bolte
Tu-Anh Hoang Nguyen
Jade Copet
Alexei Baevski
Abdelrahman Mohamed
3
+ Pyannote.Audio: Neural Building Blocks for Speaker Diarization 2020 Hervé Bredin
Ruiqing Yin
Juan Manuel Coria
Grégory Gelly
Pavel Korshunov
Marvin Lavechin
Diego Fustes
Hadrien Titeux
Wassim Bouaziz
Marie-Philippe Gill
3
+ Deep Voice 3: 2000-Speaker Neural Text-to-Speech 2017 Wei Ping
Kainan Peng
Andrew Gibiansky
Sercan Ö. Arık
Ajay Kannan
Sharan Narang
Jonathan Raiman
J. J. Miller
2
+ The Zero Resource Speech Challenge 2017 2017 Ewan Dunbar
Xuan-Nga Cao
Juan Benjumea
Julien Karadayi
Mathieu Bernard
Laurent Besacier
Xavier Anguera
Emmanuel Dupoux
2
+ Billion-scale similarity search with GPUs 2017 Jeff Johnson
Matthijs Douze
Hervé Jégou
2
+ Self-Supervised Language Learning From Raw Audio: Lessons From the Zero Resource Speech Challenge 2022 Ewan Dunbar
Nicolas Hamilakis
Emmanuel Dupoux
2
+ Word Discovery in Visually Grounded, Self-Supervised Speech Models 2022 Puyuan Peng
David Harwath
2
+ Neural Discrete Representation Learning 2017 Aäron van den Oord
Oriol Vinyals
Koray Kavukcuoglu
2
+ Textless Speech Emotion Conversion using Decomposed and Discrete Representations 2021 Felix Kreuk
Adam Polyak
Jade Copet
Eugene Kharitonov
Tu Anh Nguyen
Morgane Rivière
Wei-Ning Hsu
Abdelrahman Mohamed
Emmanuel Dupoux
Yossi Adi
2
+ BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding 2018 Jacob Devlin
Ming‐Wei Chang
Kenton Lee
Kristina Toutanova
2
+ The Zero Resource Speech Benchmark 2021: Metrics and baselines for unsupervised spoken language modeling 2020 Tu Anh Nguyen
Maureen de Seyssel
Patricia Rozé
Morgane Rivière
Evgeny Kharitonov
Alexei Baevski
Ewan Dunbar
Emmanuel Dupoux
2
+ WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing 2022 Sanyuan Chen
Chengyi Wang
Zhengyang Chen
Yu Wu
Shujie Liu
Zhuo Chen
Jinyu Li
Naoyuki Kanda
Takuya Yoshioka
Xiong Xiao
2
+ Speech Resynthesis from Discrete Disentangled Self-Supervised Representations 2021 Adam Polyak
Yossi Adi
Jade Copet
Eugene Kharitonov
Kushal Lakhotia
Wei-Ning Hsu
Abdelrahman Mohamed
Emmanuel Dupoux
2
+ AudioLM: A Language Modeling Approach to Audio Generation 2023 Zalán Borsos
Raphaël Marinier
Damien Vincent
Eugene Kharitonov
Olivier Pietquin
Matt Sharifi
Dominik Roblek
Olivier Teboul
David Grangier
Marco Tagliasacchi
2
+ Unsupervised Feature Learning for Speech Using Correspondence and Siamese Networks 2020 Petri-Johan Last
Herman A. Engelbrecht
Herman Kamper
2
+ Multilingual Jointly Trained Acoustic and Written Word Embeddings 2020 Yushi Hu
Shane Settle
Karen Livescu
2
+ Unsupervised Discovery of Recurring Speech Patterns Using Probabilistic Adaptive Metrics 2020 Okko Räsänen
María Andrea Cruz Blandón
2
+ A Correspondence Variational Autoencoder for Unsupervised Acoustic Word Embeddings 2020 Puyuan Peng
Herman Kamper
Karen Livescu
2
+ Listening while speaking: Speech chain by deep learning 2017 Andros Tjandra
Sakriani Sakti
Satoshi Nakamura
2
+ BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension 2020 Mike Lewis
Yinhan Liu
Naman Goyal
Marjan Ghazvininejad
Abdelrahman Mohamed
Omer Levy
Veselin Stoyanov
Luke Zettlemoyer
2
+ Bayesian Models for Unit Discovery on a Very Low Resource Language 2018 Lucas Ondel
Pierre Godard
Laurent Besacier
Elin Larsen
Mark Hasegawa–Johnson
Odette Scharenborg
Emmanuel Dupoux
Lukáš Burget
François Yvon
Sanjeev Khudanpur
2
+ vq-wav2vec: Self-Supervised Learning of Discrete Speech Representations 2019 Alexei Baevski
Steffen Schneider
Michael Auli
2
+ Libri-Light: A Benchmark for ASR with Limited or No Supervision 2020 Jacob Kahn
Maude Rivière
Wenlong Zheng
Eugene Kharitonov
Qinmei Xu
Pierre-Emmanuel Mazaré
Julien Karadayi
Vitaliy Liptchinsky
Ronan Collobert
Christian Fuegen
2
+ Learning to Discover, Ground and Use Words with Segmental Neural Language Models 2019 Kazuya Kawakami
Chris Dyer
Phil Blunsom
2
+ Unsupervised Acoustic Unit Representation Learning for Voice Conversion Using WaveNet Auto-Encoders 2020 Mingjie Chen
Thomas Hain
2
+ Speech2Vec: A Sequence-to-Sequence Framework for Learning Word Embeddings from Speech 2018 Yu-An Chung
James Glass
2
+ Combining Adversarial Training and Disentangled Speech Representation for Robust Zero-Resource Subword Modeling 2019 Siyuan Feng
Tan Lee
Zhiyuan Peng
2
+ Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions 2018 Jonathan Shen
Ruoming Pang
Ron J. Weiss
Mike Schuster
Navdeep Jaitly
Zongheng Yang
Zhifeng Chen
Yu Zhang
Yuxuan Wang
RJ Skerry-Ryan
2
+ Unsupervised End-to-End Learning of Discrete Linguistic Units for Voice Conversion 2019 Andy T. Liu
Po‐Chun Hsu
Hung-yi Lee
2
+ VQVAE Unsupervised Unit Discovery and Multi-Scale Code2Spec Inverter for Zerospeech Challenge 2019 2019 Andros Tjandra
Berrak Şişman
Mingyang Zhang
Sakriani Sakti
Haizhou Li
Satoshi Nakamura
2
+ Evaluating the Reliability of Acoustic Speech Embeddings 2020 Robin Algayres
Mohamed Salah Zaïem
Benoît Sagot
Emmanuel Dupoux
2
+ The Zero Resource Speech Challenge 2019: TTS Without T 2019 Ewan Dunbar
Robin Algayres
Julien Karadayi
Mathieu Bernard
Juan Benjumea
Xuan-Nga Cao
Lucie Miskic
Charlotte Dugrain
Lucas Ondel
Alan W. Black
1
+ SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition 2019 Daniel Park
William Chan
Yu Zhang
Chung‐Cheng Chiu
Barret Zoph
Ekin D. Cubuk
Quoc V. Le
1
+ fairseq: A Fast, Extensible Toolkit for Sequence Modeling 2019 Myle Ott
Sergey Edunov
Alexei Baevski
Angela Fan
Sam Gross
Nathan Ng
David Grangier
Michael Auli
1