Lukas Drude

Follow

Generating author description...

All published works
Action Title Year Authors
+ Promptformer: Prompted Conformer Transducer for ASR 2024 Sergio Duarte-Torres
Arunasish Sen
Aman Rana
Lukas Drude
Alejandro Gomez-Alanis
Andreas Schwarz
L. RĂ€del
Volker Leutnant
+ Promptformer: Prompted Conformer Transducer for ASR 2024 Sergio Duarte-Torres
Arunasish Sen
Aman Rana
Lukas Drude
Alejandro Gomez-Alanis
Andreas Schwarz
L. RĂ€del
Volker Leutnant
+ PDF Chat Multi-View Frequency-Attention Alternative to CNN Frontends for Automatic Speech Recognition 2023 Belen Alastruey
Lukas Drude
Jahn Heymann
Simon Wiesler
+ Multi-View Frequency-Attention Alternative to CNN Frontends for Automatic Speech Recognition 2023 Belen Alastruey
Lukas Drude
Jahn Heymann
Simon Wiesler
+ PDF Chat Contextual-Utterance Training for Automatic Speech Recognition 2022 Alejandro Gomez-Alanis
Lukas Drude
Andreas Schwarz
Rupak Vignesh Swaminathan
Simon Wiesler
+ Contextual-Utterance Training for Automatic Speech Recognition 2022 Alejandro Gomez-Alanis
Lukas Drude
Andreas Schwarz
Rupak Vignesh Swaminathan
Simon Wiesler
+ PDF Chat Multi-Channel Opus Compression for Far-Field Automatic Speech Recognition with a Fixed Bitrate Budget 2021 Lukas Drude
Jahn Heymann
Andreas Schwarz
Jean-Marc Valin
+ Multi-channel Opus compression for far-field automatic speech recognition with a fixed bitrate budget 2021 Lukas Drude
Jahn Heymann
Andreas Schwarz
Jean-Marc Valin
+ Multi-channel Opus compression for far-field automatic speech recognition with a fixed bitrate budget 2021 Lukas Drude
Jahn Heymann
Andreas Schwarz
Jean-Marc Valin
+ PDF Chat Multi-Talker ASR for an Unknown Number of Sources: Joint Training of Source Counting, Separation and ASR 2020 Thilo von Neumann
Christoph Boeddeker
Lukas Drude
Keisuke Kinoshita
Marc Delcroix
Tomohiro Nakatani
Reinhold Haeb‐Umbach
+ PDF Chat Far-Field Automatic Speech Recognition 2020 Reinhold Haeb‐Umbach
Jahn Heymann
Lukas Drude
Shinji Watanabe
Marc Delcroix
Tomohiro Nakatani
+ PDF Chat End-to-End Training of Time Domain Audio Separation and Recognition 2020 Thilo von Neumann
Keisuke Kinoshita
Lukas Drude
Christoph Boeddeker
Marc Delcroix
Tomohiro Nakatani
Reinhold Haeb‐Umbach
+ PDF Chat Demystifying TasNet: A Dissecting Approach 2020 Jens Heitkaemper
Darius Jakobeit
Christoph Boeddeker
Lukas Drude
Reinhold Haeb‐Umbach
+ Far-Field Automatic Speech Recognition 2020 Reinhold Haeb‐Umbach
Jahn Heymann
Lukas Drude
Shinji Watanabe
Marc Delcroix
Tomohiro Nakatani
+ Demystifying TasNet: A Dissecting Approach 2019 Jens Heitkaemper
Darius Jakobeit
Christoph Boeddeker
Lukas Drude
Reinhold Haeb‐Umbach
+ PDF Chat Unsupervised Training of Neural Mask-Based Beamforming 2019 Lukas Drude
Jahn Heymann
Reinhold Haeb‐Umbach
+ PDF Chat Unsupervised Training of a Deep Clustering Model for Multichannel Blind Source Separation 2019 Lukas Drude
Daniel Hasenklever
Reinhold Haeb‐Umbach
+ Unsupervised training of neural mask-based beamforming 2019 Lukas Drude
Jahn Heymann
Reinhold Haeb‐Umbach
+ Unsupervised training of a deep clustering model for multichannel blind source separation 2019 Lukas Drude
Daniel Hasenklever
Reinhold Haeb‐Umbach
+ PDF Chat Directional Statistics and Filtering Using <b>libDirectional</b> 2019 Gerhard Kurz
Igor Gilitschenski
Florian Pfaff
Lukas Drude
Uwe D. Hanebeck
Reinhold Haeb‐Umbach
Roland Siegwart
+ SMS-WSJ: Database, performance measures, and baseline recipe for multi-channel source separation and recognition 2019 Lukas Drude
Jens Heitkaemper
Christoph Boeddeker
Reinhold Haeb‐Umbach
+ Unsupervised training of neural mask-based beamforming 2019 Lukas Drude
Jahn Heymann
Reinhold Haeb‐Umbach
+ Unsupervised training of a deep clustering model for multichannel blind source separation 2019 Lukas Drude
Daniel Hasenklever
Reinhold Haeb‐Umbach
+ Demystifying TasNet: A Dissecting Approach 2019 Jens Heitkaemper
Darius Jakobeit
Christoph Boeddeker
Lukas Drude
Reinhold Haeb‐Umbach
+ On the Computation of Complex-valued Gradients with Application to Statistically Optimum Beamforming 2017 Christoph Boeddeker
Patrick Hanebrink
Lukas Drude
Jahn Heymann
Reinhold Haeb‐Umbach
+ The Incredible Shrinking Neural Network: New Perspectives on Learning Representations Through The Lens of Pruning 2017 Nikolas Wolfe
Aditya Sharma
Lukas Drude
Bhiksha Raj
Common Coauthors
Commonly Cited References
Action Title Year Authors # of times referenced
+ PDF Chat Deep clustering: Discriminative embeddings for segmentation and separation 2016 John R. Hershey
Zhuo Chen
Jonathan Le Roux
Shinji Watanabe
7
+ PDF Chat Permutation invariant training of deep models for speaker-independent multi-talker speech separation 2017 Dong Yu
Morten KolbĂŠk
Zheng‐Hua Tan
Jesper Jensen
6
+ PDF Chat The Fifth 'CHiME' Speech Separation and Recognition Challenge: Dataset, Task and Baselines 2018 Jon Barker
Shinji Watanabe
Emmanuel Vincent
Jan Trmal
6
+ PDF Chat Single-Channel Multi-Speaker Separation Using Deep Clustering 2016 Yusuf Ziya IĆŸÄ±k
Jonathan Le Roux
Zhuo Chen
Shinji Watanabe
John R. Hershey
6
+ PDF Chat Deep attractor network for single-microphone speaker separation 2017 Zhuo Chen
Yi Luo
Nima Mesgarani
6
+ PDF Chat A Purely End-to-End System for Multi-speaker Speech Recognition 2018 Hiroshi Seki
Takaaki Hori
Shinji Watanabe
Jonathan Le Roux
John R. Hershey
5
+ Conv-TasNet: Surpassing Ideal Time–Frequency Magnitude Masking for Speech Separation 2019 Yi Luo
Nima Mesgarani
5
+ PDF Chat Multitalker Speech Separation With Utterance-Level Permutation Invariant Training of Deep Recurrent Neural Networks 2017 Morten KolbĂŠk
Dong Yu
Zheng‐Hua Tan
Jesper Jensen
5
+ PDF Chat Unsupervised Deep Clustering for Source Separation: Direct Learning from Mixtures Using Spatial Information 2019 Efthymios Tzinis
Shrikant Venkataramani
Paris Smaragdis
4
+ PDF Chat Bootstrapping Single-channel Source Separation via Unsupervised Spatial Clustering on Stereo Mixtures 2019 Prem Seetharaman
Gordon Wichern
Jonathan Le Roux
Bryan Pardo
4
+ PDF Chat Recognizing Multi-Talker Speech with Permutation Invariant Training 2017 Dong Yu
Xuankai Chang
Yanmin Qian
4
+ PDF Chat Unsupervised Training of a Deep Clustering Model for Multichannel Blind Source Separation 2019 Lukas Drude
Daniel Hasenklever
Reinhold Haeb‐Umbach
4
+ PDF Chat Phasebook and Friends: Leveraging Discrete Representations for Source Separation 2019 Jonathan Le Roux
Gordon Wichern
Shinji Watanabe
Andy M. Sarroff
John R. Hershey
4
+ PDF Chat Joint CTC-attention based end-to-end speech recognition using multi-task learning 2017 Suyoun Kim
Takaaki Hori
Shinji Watanabe
4
+ PDF Chat ESPnet: End-to-End Speech Processing Toolkit 2018 Shinji Watanabe
Takaaki Hori
Shigeki Karita
Tomoki Hayashi
Jiro Nishitoba
Yuya Unno
Nelson Enrique Yalta Soplin
Jahn Heymann
Matthew Wiesner
Nanxin Chen
4
+ PDF Chat SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition 2019 Daniel Park
William Chan
Yu Zhang
Chung‐Cheng Chiu
Barret Zoph
Ekin D. Cubuk
Quoc V. Le
4
+ PDF Chat A Comprehensive Study of Speech Separation: Spectrogram vs Waveform Separation 2019 Fahimeh Bahmaninezhad
Jian Wu
Rongzhi Gu
Shi-Xiong Zhang
Yong Xu
Yu Meng
Dong Yu
4
+ Sequence Transduction with Recurrent Neural Networks 2012 Alex Graves
3
+ Data analysis for shapes and images 1997 John T. Kent
3
+ PDF Chat Building State-of-the-art Distant Speech Recognition Using the CHiME-4 Challenge with a Setup of Speech Enhancement Baseline 2018 Szu-Jui Chen
Aswin Shanmugam Subramanian
Hainan Xu
Shinji Watanabe
3
+ PDF Chat Recursive Speech Separation for Unknown Number of Speakers 2019 Naoya Takahashi
Parthasaarathy Sudarsanam
Nabarun Goswami
Yuki Mitsufuji
3
+ PDF Chat Specaugment on Large Scale Datasets 2020 Daniel Park
Yu Zhang
Chung‐Cheng Chiu
Youzheng Chen
Bo Li
William Chan
Quoc V. Le
Yonghui Wu
3
+ PDF Chat WHAM!: Extending Speech Separation to Noisy Environments 2019 Gordon Wichern
Joe Antognini
M. D. Flynn
Licheng Richard Zhu
Emmett McQuinn
Dwight Crow
Ethan Manilow
Jonathan Le Roux
3
+ Single-Channel Multi-Speaker Separation using Deep Clustering 2016 Yusuf Ziya IĆŸÄ±k
Jonathan Le Roux
Zhuo Chen
Shinji Watanabe
John R. Hershey
2
+ PDF Chat SDR – Half-baked or Well Done? 2019 Jonathan Le Roux
Scott Wisdom
Hakan Erdoğan
John R. Hershey
2
+ PDF Chat End-to-End Multi-Speaker Speech Recognition Using Speaker Embeddings and Transfer Learning 2019 Pavel Denisov
Ngoc Thang Vu
2
+ MUSAN: A Music, Speech, and Noise Corpus 2015 David Snyder
Guoguo Chen
Daniel Povey
2
+ PDF Chat Single-channel multi-talker speech recognition with permutation invariant training 2018 Yanmin Qian
Xuankai Chang
Dong Yu
2
+ PDF Chat Speech recognition with deep recurrent neural networks 2013 Alex Graves
Abdelrahman Mohamed
Geoffrey E. Hinton
2
+ PDF Chat End-to-end Monaural Multi-speaker ASR System without Pretraining 2019 Xuankai Chang
Yanmin Qian
Kai Yu
Shinji Watanabe
2
+ PDF Chat Deep Learning Based Phase Reconstruction for Speaker Separation: A Trigonometric Perspective 2019 Zhong-Qiu Wang
Ke Tan
DeLiang Wang
2
+ Attention is All you Need 2017 Ashish Vaswani
Noam Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan N. Gomez
Ɓukasz Kaiser
Illia Polosukhin
2
+ TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation. 2018 Yi Luo
Nima Mesgarani
2
+ Pattern Recognition and Machine Learning 2007 Christopher Bishop
2
+ PDF Chat End-to-End Speech Separation with Unfolded Iterative Phase Reconstruction 2018 Zhong-Qiu Wang
Jonathan Le Roux
DeLiang Wang
John R. Hershey
2
+ Supervised Speech Separation Based on Deep Learning: An Overview 2018 DeLiang Wang
Jitong Chen
2
+ PDF Chat TaSNet: Time-Domain Audio Separation Network for Real-Time, Single-Channel Speech Separation 2018 Yi Luo
Nima Mesgarani
2
+ PDF Chat All-neural Online Source Separation, Counting, and Diarization for Meeting Analysis 2019 Thilo von Neumann
Keisuke Kinoshita
Marc Delcroix
Shoko Araki
Tomohiro Nakatani
Reinhold Haeb‐Umbach
2
+ PDF Chat Pyroomacoustics: A Python Package for Audio Room Simulation and Array Processing Algorithms 2018 Robin Scheibler
Eric Bezzam
Ivan Dokmanić
2
+ End-to-End Multi-Channel Speech Separation 2019 Rongzhi Gu
Jian Wu
Shi-Xiong Zhang
Lianwu Chen
Yong Xu
Meng Yu
Dan Su
Yuexian Zou
Dong Yu
2
+ PDF Chat A Study of Enhancement, Augmentation and Autoencoder Methods for Domain Adaptation in Distant Speech Recognition 2018 Hao Tang
Wei-Ning Hsu
François Grondin
James Glass
2
+ PDF Chat Improved Training of End-to-end Attention Models for Speech Recognition 2018 Albert Zeyer
Kazuki Irie
Ralf SchlĂŒter
Hermann Ney
2
+ FurcaNet: An end-to-end deep gated convolutional, long short-term memory, deep neural networks for single channel speech separation 2019 Ziqiang Shi
Huibin Lin
Liu Liu
Rujie Liu
Shoji Hayakawa
Shouji Harada
Jiqing Han
2
+ PDF Chat Multi-geometry Spatial Acoustic Modeling for Distant Speech Recognition 2019 Kenichi Kumatani
Minhua Wu
Shiva Sundaram
Nikko Ström
Björn Hoffmeister
2
+ PDF Chat Acoustic Modeling for Overlapping Speech Recognition: Jhu Chime-5 Challenge System 2019 Vimal Manohar
Szu-Jui Chen
Zhiqi Wang
Yusuke Fujita
Shinji Watanabe
Sanjeev Khudanpur
2
+ PDF Chat Recognizing Overlapped Speech in Meetings: A Multichannel Separation Approach Using Neural Networks 2018 Takuya Yoshioka
Hakan Erdoğan
Zhuo Chen
Xiong Xiao
Fil Alleva
2
+ On the Computation of Complex-valued Gradients with Application to Statistically Optimum Beamforming 2017 Christoph Boeddeker
Patrick Hanebrink
Lukas Drude
Jahn Heymann
Reinhold Haeb‐Umbach
2
+ PDF Chat Stream Attention-based Multi-array End-to-end Speech Recognition 2019 Xiaofei Wang
Ruizhi Li
Sri Harish Mallidi
Takaaki Hori
Shinji Watanabe
Hynek HeƙmanskĂœ
2
+ PDF Chat Deep clustering and conventional networks for music separation: Stronger together 2017 Yi Luo
Zhuo Chen
John R. Hershey
Jonathan Le Roux
Nima Mesgarani
2
+ PDF Chat Building and evaluation of a real room impulse response dataset 2019 Igor Szöke
Miroslav SkĂĄcel
Ladislav MoĆĄner
Jakub Paliesek
Jaƈ ČernockĂœ
2