Michel Olvera

Follow

Generating author description...

All published works
Action Title Year Authors
+ PDF Chat TACO: Training-free Sound Prompted Segmentation via Deep Audio-visual CO-factorization 2024 Hugo Malard
Michel Olvera
Stéphane Lathuilière
Slim Essid
+ A SOUND DESCRIPTION: EXPLORING PROMPT TEMPLATES AND CLASS DESCRIPTIONS TO ENHANCE ZERO-SHOT AUDIO CLASSIFICATION 2024 Michel Olvera
Paraskevas Stamatiadis
Slim Essid
+ SALT: STANDARDIZED AUDIO EVENT LABEL TAXONOMY 2024 Paraskevas Stamatiadis
Michel Olvera
Slim Essid
+ An Eye for an Ear: Zero-shot Audio Description Leveraging an Image Captioner using Audiovisual Distribution Alignment 2024 Hugo Malard
Michel Olvera
Stéphane Lathuilière
Slim Essid
+ On The Choice of the Optimal Temporal Support for Audio Classification with Pre-Trained Embeddings 2024 Aurian Quélennec
Michel Olvera
Geoffroy Peeters
Slim Essid
+ On the choice of the optimal temporal support for audio classification with Pre-trained embeddings 2023 Aurian Quélennec
Michel Olvera
Geoffroy Peeters
Slim Essid
+ PDF Chat Foreground-Background Ambient Sound Scene Separation 2020 Michel Olvera
Emmanuel Vincent
Romain Serizel
Gilles Gasso
+ PDF Chat Asteroid: The PyTorch-Based Audio Source Separation Toolkit for Researchers 2020 Manuel Pariente
Samuele Cornell
Joris Cosentino
Sunit Sivasankaran
Efthymios Tzinis
Jens Heitkaemper
Michel Olvera
Fabian-Robert Stöter
Mathieu Hu
Juan M. Martín-Doñas
+ Foreground-Background Ambient Sound Scene Separation 2020 Michel Olvera
Emmanuel Vincent
Romain Serizel
Gilles Gasso
+ Foreground-Background Ambient Sound Scene Separation 2020 Michel Olvera
Emmanuel Vincent
Romain Serizel
Gilles Gasso
Common Coauthors
Commonly Cited References
Action Title Year Authors # of times referenced
+ PDF Chat Universal Sound Separation 2019 Ilya Kavalerov
Scott Wisdom
Hakan Erdoğan
Brian Patton
Kevin Wilson
Jonathan Le Roux
John R. Hershey
3
+ PDF Chat Permutation invariant training of deep models for speaker-independent multi-talker speech separation 2017 Dong Yu
Morten Kolbæk
Zheng‐Hua Tan
Jesper Jensen
3
+ PDF Chat Deep clustering: Discriminative embeddings for segmentation and separation 2016 John R. Hershey
Zhuo Chen
Jonathan Le Roux
Shinji Watanabe
3
+ PDF Chat Improving Universal Sound Separation Using Sound Classification 2020 Efthymios Tzinis
Scott Wisdom
John R. Hershey
Aren Jansen
Daniel P. W. Ellis
2
+ Supervised Speech Separation Based on Deep Learning: An Overview 2018 DeLiang Wang
Jitong Chen
2
+ PDF Chat Trainable frontend for robust and far-field keyword spotting 2017 Yuxuan Wang
Pascal Getreuer
T. A. Hughes
Richard F. Lyon
Rif A. Saurous
2
+ Conv-TasNet: Surpassing Ideal Time–Frequency Magnitude Masking for Speech Separation 2019 Yi Luo
Nima Mesgarani
1
+ PDF Chat Single-Channel Multi-Speaker Separation Using Deep Clustering 2016 Yusuf Ziya Işık
Jonathan Le Roux
Zhuo Chen
Shinji Watanabe
John R. Hershey
1
+ PDF Chat TaSNet: Time-Domain Audio Separation Network for Real-Time, Single-Channel Speech Separation 2018 Yi Luo
Nima Mesgarani
1
+ PDF Chat Speaker Recognition from Raw Waveform with SincNet 2018 Mirco Ravanelli
Yoshua Bengio
1
+ PDF Chat SDR – Half-baked or Well Done? 2019 Jonathan Le Roux
Scott Wisdom
Hakan Erdoğan
John R. Hershey
1
+ PDF Chat WHAM!: Extending Speech Separation to Noisy Environments 2019 Gordon Wichern
Joe Antognini
M. D. Flynn
Licheng Richard Zhu
Emmett McQuinn
Dwight Crow
Ethan Manilow
Jonathan Le Roux
1
+ PDF Chat A Comprehensive Study of Speech Separation: Spectrogram vs Waveform Separation 2019 Fahimeh Bahmaninezhad
Jian Wu
Rongzhi Gu
Shi-Xiong Zhang
Yong Xu
Yu Meng
Dong Yu
1
+ Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation 2019 Yi Luo
Zhuo Chen
Takuya Yoshioka
1
+ PDF Chat Two-Step Sound Source Separation: Training On Learned Latent Targets 2020 Efthymios Tzinis
Shrikant Venkataramani
Zhepei Wang
Cem Subakan
Paris Smaragdis
1
+ Analyzing the impact of speaker localization errors on speech separation for automatic speech recognition 2019 Sunit Sivasankaran
Emmaneul Vincent
Dominique Fohr
1
+ PDF Chat A Multi-Phase Gammatone Filterbank for Speech Separation Via Tasnet 2020 David Ditter
Timo Gerkmann
1
+ SMS-WSJ: Database, performance measures, and baseline recipe for multi-channel source separation and recognition 2019 Lukas Drude
Jens Heitkaemper
Christoph Boeddeker
Reinhold Haeb‐Umbach
1
+ The INTERSPEECH 2020 Deep Noise Suppression Challenge: Datasets, Subjective Speech Quality and Testing Framework 2020 Chandan K. Reddy
Ebrahim Beyrami
Harishchandra Dubey
Vishak Gopal
Roger Cheng
Ross Cutler
Sergiy Matusevych
Robert Aichner
Ashkan Aazami
Sebastian Braun
1
+ PDF Chat Filterbank Design for End-to-end Speech Separation 2020 Manuel Pariente
Samuele Cornell
Antoine Deleforge
Emmanuel Vincent
1
+ PDF Chat WHAMR!: Noisy and Reverberant Single-Channel Speech Separation 2020 Matthew Maciejewski
Gordon Wichern
Emmett McQuinn
Jonathan Le Roux
1
+ PDF Chat Dual-Path RNN: Efficient Long Sequence Modeling for Time-Domain Single-Channel Speech Separation 2020 Yi Luo
Zhuo Chen
Takuya Yoshioka
1
+ PDF Chat Demystifying TasNet: A Dissecting Approach 2020 Jens Heitkaemper
Darius Jakobeit
Christoph Boeddeker
Lukas Drude
Reinhold Haeb‐Umbach
1
+ PDF Chat Analyzing the impact of speaker localization errors on speech separation for automatic speech recognition 2020 Sunit Sivasankaran
Emmanuel Vincent
Dominique Fohr
1
+ PDF Chat PANNs: Large-Scale Pretrained Audio Neural Networks for Audio Pattern Recognition 2020 Qiuqiang Kong
Yin Cao
Turab Iqbal
Yuxuan Wang
Wenwu Wang
Mark D. Plumbley
1
+ What's All the FUSS About Free Universal Sound Separation Data?. 2020 Scott Wisdom
Hakan Erdoğan
Daniel P. W. Ellis
Romain Serizel
Nicolas Turpault
Eduardo Fonseca
Justin Salamon
Prem Seetharaman
John R. Hershey
1
+ PDF Chat What’s all the Fuss about Free Universal Sound Separation Data? 2021 Scott Wisdom
Hakan Erdoğan
Daniel P. W. Ellis
Romain Serizel
Nicolas Turpault
Eduardo Fonseca
Justin Salamon
Prem Seetharaman
John R. Hershey
1
+ PDF Chat Contrastive Learning of General-Purpose Audio Representations 2021 Aaqib Saeed
David Grangier
Neil Zeghidour
1
+ PDF Chat Receptive Field Regularization Techniques for Audio Classification and Tagging With Deep Convolutional Neural Networks 2021 Khaled Koutini
Hamid Eghbal-zadeh
Gerhard Widmer
1
+ PDF Chat Wavesplit: End-to-End Speech Separation by Speaker Clustering 2021 Neil Zeghidour
David Grangier
1
+ PDF Chat AST: Audio Spectrogram Transformer 2021 Yuan Gong
Yu-An Chung
James Glass
1
+ PDF Chat SUPERB: Speech Processing Universal PERformance Benchmark 2021 Shu-Wen Yang
Po-Han Chi
Yung-Sung Chuang
Cheng-I Lai
Kushal Lakhotia
Yist Y. Lin
Andy T. Liu
Jiatong Shi
Xuankai Chang
Guan-Ting Lin
1
+ PDF Chat Efficient Training of Audio Transformers with Patchout 2022 Khaled Koutini
Jan Schlüter
Hamid Eghbal-zadeh
Gerhard Widmer
1
+ PDF Chat HuBERT: Self-Supervised Speech Representation Learning by Masked Prediction of Hidden Units 2021 Wei-Ning Hsu
Benjamin Bolte
Yao-Hung Hubert Tsai
Kushal Lakhotia
Ruslan Salakhutdinov
Abdelrahman Mohamed
1
+ PDF Chat Towards Learning Universal Audio Representations 2022 Luyu Wang
Pauline Luc
Yan Wu
Adrià Recasens
Lucas Smaira
Andrew Brock
Andrew Jaegle
Jean-Baptiste Alayrac
Sander Dieleman
João Carreira
1
+ Onssen: an open-source speech separation and enhancement library 2019 Zhaoheng Ni
Michael Mandel
1
+ PyTorch: An Imperative Style, High-Performance Deep Learning Library 2019 Adam Paszke
Sam Gross
Francisco Massa
Adam Lerer
James T. Bradbury
Gregory Chanan
Trevor Killeen
Zeming Lin
Natalia Gimelshein
Luca Antiga
1
+ PDF Chat BYOL for Audio: Exploring Pre-Trained General-Purpose Audio Representations 2022 Daisuke Niizumi
Daiki Takeuchi
Yasunori Ohishi
Noboru Harada
Kunio Kashino
1
+ An Attention-Based Approach to Hierarchical Multi-Label Music Instrument Classification 2023 Zhi Zhong
Masato Hirano
Kazuki Shimada
Kazuya Tateishi
Shusuke Takahashi
Yuki Mitsufuji
1
+ PDF Chat The NumPy Array: A Structure for Efficient Numerical Computation 2011 Stéfan van der Walt
Steven C. Colbert
Gaël Varoquaux
1
+ Fine-Tuning Strategies for Faster Inference Using Speech Self-Supervised Models: A Comparative Study 2023 Salah Zaiem
Robin Algayres
Titouan Parcollet
Slim Essid
Mirco Ravanelli
1
+ PDF Chat CNN architectures for large-scale audio classification 2017 Shawn Hershey
Sourish Chaudhuri
Daniel P. W. Ellis
Jort F. Gemmeke
Aren Jansen
Robert C. Moore
Manoj Plakal
Devin Platt
Rif A. Saurous
Bryan Seybold
1
+ PDF Chat Deep attractor network for single-microphone speaker separation 2017 Zhuo Chen
Yi Luo
Nima Mesgarani
1
+ PDF Chat Multitalker Speech Separation With Utterance-Level Permutation Invariant Training of Deep Recurrent Neural Networks 2017 Morten Kolbæk
Dong Yu
Zheng‐Hua Tan
Jesper Jensen
1
+ Adaptive Pooling Operators for Weakly Labeled Sound Event Detection 2018 Brian McFee
Justin Salamon
Juan Pablo Bello
1
+ Speaker Recognition from Raw Waveform with SincNet 2018 Mirco Ravanelli
Yoshua Bengio
1