Projects
Reading
People
Chat
SU\G
(đž)
/K·U
Projects
Reading
People
Chat
Sign Up
Light
Dark
System
Lukas Drude
Follow
Share
Generating author description...
All published works
Action
Title
Year
Authors
+
Promptformer: Prompted Conformer Transducer for ASR
2024
Sergio Duarte-Torres
Arunasish Sen
Aman Rana
Lukas Drude
Alejandro Gomez-Alanis
Andreas Schwarz
L. RĂ€del
Volker Leutnant
+
Promptformer: Prompted Conformer Transducer for ASR
2024
Sergio Duarte-Torres
Arunasish Sen
Aman Rana
Lukas Drude
Alejandro Gomez-Alanis
Andreas Schwarz
L. RĂ€del
Volker Leutnant
+
PDF
Chat
Multi-View Frequency-Attention Alternative to CNN Frontends for Automatic Speech Recognition
2023
Belen Alastruey
Lukas Drude
Jahn Heymann
Simon Wiesler
+
Multi-View Frequency-Attention Alternative to CNN Frontends for Automatic Speech Recognition
2023
Belen Alastruey
Lukas Drude
Jahn Heymann
Simon Wiesler
+
PDF
Chat
Contextual-Utterance Training for Automatic Speech Recognition
2022
Alejandro Gomez-Alanis
Lukas Drude
Andreas Schwarz
Rupak Vignesh Swaminathan
Simon Wiesler
+
Contextual-Utterance Training for Automatic Speech Recognition
2022
Alejandro Gomez-Alanis
Lukas Drude
Andreas Schwarz
Rupak Vignesh Swaminathan
Simon Wiesler
+
PDF
Chat
Multi-Channel Opus Compression for Far-Field Automatic Speech Recognition with a Fixed Bitrate Budget
2021
Lukas Drude
Jahn Heymann
Andreas Schwarz
Jean-Marc Valin
+
Multi-channel Opus compression for far-field automatic speech recognition with a fixed bitrate budget
2021
Lukas Drude
Jahn Heymann
Andreas Schwarz
Jean-Marc Valin
+
Multi-channel Opus compression for far-field automatic speech recognition with a fixed bitrate budget
2021
Lukas Drude
Jahn Heymann
Andreas Schwarz
Jean-Marc Valin
+
PDF
Chat
Multi-Talker ASR for an Unknown Number of Sources: Joint Training of Source Counting, Separation and ASR
2020
Thilo von Neumann
Christoph Boeddeker
Lukas Drude
Keisuke Kinoshita
Marc Delcroix
Tomohiro Nakatani
Reinhold HaebâUmbach
+
PDF
Chat
Far-Field Automatic Speech Recognition
2020
Reinhold HaebâUmbach
Jahn Heymann
Lukas Drude
Shinji Watanabe
Marc Delcroix
Tomohiro Nakatani
+
PDF
Chat
End-to-End Training of Time Domain Audio Separation and Recognition
2020
Thilo von Neumann
Keisuke Kinoshita
Lukas Drude
Christoph Boeddeker
Marc Delcroix
Tomohiro Nakatani
Reinhold HaebâUmbach
+
PDF
Chat
Demystifying TasNet: A Dissecting Approach
2020
Jens Heitkaemper
Darius Jakobeit
Christoph Boeddeker
Lukas Drude
Reinhold HaebâUmbach
+
Far-Field Automatic Speech Recognition
2020
Reinhold HaebâUmbach
Jahn Heymann
Lukas Drude
Shinji Watanabe
Marc Delcroix
Tomohiro Nakatani
+
Demystifying TasNet: A Dissecting Approach
2019
Jens Heitkaemper
Darius Jakobeit
Christoph Boeddeker
Lukas Drude
Reinhold HaebâUmbach
+
PDF
Chat
Unsupervised Training of Neural Mask-Based Beamforming
2019
Lukas Drude
Jahn Heymann
Reinhold HaebâUmbach
+
PDF
Chat
Unsupervised Training of a Deep Clustering Model for Multichannel Blind Source Separation
2019
Lukas Drude
Daniel Hasenklever
Reinhold HaebâUmbach
+
Unsupervised training of neural mask-based beamforming
2019
Lukas Drude
Jahn Heymann
Reinhold HaebâUmbach
+
Unsupervised training of a deep clustering model for multichannel blind source separation
2019
Lukas Drude
Daniel Hasenklever
Reinhold HaebâUmbach
+
PDF
Chat
Directional Statistics and Filtering Using <b>libDirectional</b>
2019
Gerhard Kurz
Igor Gilitschenski
Florian Pfaff
Lukas Drude
Uwe D. Hanebeck
Reinhold HaebâUmbach
Roland Siegwart
+
SMS-WSJ: Database, performance measures, and baseline recipe for multi-channel source separation and recognition
2019
Lukas Drude
Jens Heitkaemper
Christoph Boeddeker
Reinhold HaebâUmbach
+
Unsupervised training of neural mask-based beamforming
2019
Lukas Drude
Jahn Heymann
Reinhold HaebâUmbach
+
Unsupervised training of a deep clustering model for multichannel blind source separation
2019
Lukas Drude
Daniel Hasenklever
Reinhold HaebâUmbach
+
Demystifying TasNet: A Dissecting Approach
2019
Jens Heitkaemper
Darius Jakobeit
Christoph Boeddeker
Lukas Drude
Reinhold HaebâUmbach
+
On the Computation of Complex-valued Gradients with Application to Statistically Optimum Beamforming
2017
Christoph Boeddeker
Patrick Hanebrink
Lukas Drude
Jahn Heymann
Reinhold HaebâUmbach
+
The Incredible Shrinking Neural Network: New Perspectives on Learning Representations Through The Lens of Pruning
2017
Nikolas Wolfe
Aditya Sharma
Lukas Drude
Bhiksha Raj
Common Coauthors
Coauthor
Papers Together
Reinhold HaebâUmbach
16
Jahn Heymann
11
Christoph Boeddeker
7
Andreas Schwarz
5
Tomohiro Nakatani
4
Simon Wiesler
4
Alejandro Gomez-Alanis
4
Jens Heitkaemper
4
Marc Delcroix
4
Daniel Hasenklever
3
Jean-Marc Valin
3
Darius Jakobeit
3
Belen Alastruey
2
Shinji Watanabe
2
Thilo von Neumann
2
Andreas Schwarz
2
Arunasish Sen
2
Volker Leutnant
2
Keisuke Kinoshita
2
Rupak Vignesh Swaminathan
2
Aman Rana
2
Sergio Duarte-Torres
2
L. RĂ€del
2
Igor Gilitschenski
1
Bhiksha Raj
1
Florian Pfaff
1
Roland Siegwart
1
Patrick Hanebrink
1
Uwe D. Hanebeck
1
Aditya Sharma
1
Gerhard Kurz
1
Nikolas Wolfe
1
Commonly Cited References
Action
Title
Year
Authors
# of times referenced
+
PDF
Chat
Deep clustering: Discriminative embeddings for segmentation and separation
2016
John R. Hershey
Zhuo Chen
Jonathan Le Roux
Shinji Watanabe
7
+
PDF
Chat
Permutation invariant training of deep models for speaker-independent multi-talker speech separation
2017
Dong Yu
Morten KolbĂŠk
ZhengâHua Tan
Jesper Jensen
6
+
PDF
Chat
The Fifth 'CHiME' Speech Separation and Recognition Challenge: Dataset, Task and Baselines
2018
Jon Barker
Shinji Watanabe
Emmanuel Vincent
Jan Trmal
6
+
PDF
Chat
Single-Channel Multi-Speaker Separation Using Deep Clustering
2016
Yusuf Ziya IĆık
Jonathan Le Roux
Zhuo Chen
Shinji Watanabe
John R. Hershey
6
+
PDF
Chat
Deep attractor network for single-microphone speaker separation
2017
Zhuo Chen
Yi Luo
Nima Mesgarani
6
+
PDF
Chat
A Purely End-to-End System for Multi-speaker Speech Recognition
2018
Hiroshi Seki
Takaaki Hori
Shinji Watanabe
Jonathan Le Roux
John R. Hershey
5
+
Conv-TasNet: Surpassing Ideal TimeâFrequency Magnitude Masking for Speech Separation
2019
Yi Luo
Nima Mesgarani
5
+
PDF
Chat
Multitalker Speech Separation With Utterance-Level Permutation Invariant Training of Deep Recurrent Neural Networks
2017
Morten KolbĂŠk
Dong Yu
ZhengâHua Tan
Jesper Jensen
5
+
PDF
Chat
Unsupervised Deep Clustering for Source Separation: Direct Learning from Mixtures Using Spatial Information
2019
Efthymios Tzinis
Shrikant Venkataramani
Paris Smaragdis
4
+
PDF
Chat
Bootstrapping Single-channel Source Separation via Unsupervised Spatial Clustering on Stereo Mixtures
2019
Prem Seetharaman
Gordon Wichern
Jonathan Le Roux
Bryan Pardo
4
+
PDF
Chat
Recognizing Multi-Talker Speech with Permutation Invariant Training
2017
Dong Yu
Xuankai Chang
Yanmin Qian
4
+
PDF
Chat
Unsupervised Training of a Deep Clustering Model for Multichannel Blind Source Separation
2019
Lukas Drude
Daniel Hasenklever
Reinhold HaebâUmbach
4
+
PDF
Chat
Phasebook and Friends: Leveraging Discrete Representations for Source Separation
2019
Jonathan Le Roux
Gordon Wichern
Shinji Watanabe
Andy M. Sarroff
John R. Hershey
4
+
PDF
Chat
Joint CTC-attention based end-to-end speech recognition using multi-task learning
2017
Suyoun Kim
Takaaki Hori
Shinji Watanabe
4
+
PDF
Chat
ESPnet: End-to-End Speech Processing Toolkit
2018
Shinji Watanabe
Takaaki Hori
Shigeki Karita
Tomoki Hayashi
Jiro Nishitoba
Yuya Unno
Nelson Enrique Yalta Soplin
Jahn Heymann
Matthew Wiesner
Nanxin Chen
4
+
PDF
Chat
SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
2019
Daniel Park
William Chan
Yu Zhang
ChungâCheng Chiu
Barret Zoph
Ekin D. Cubuk
Quoc V. Le
4
+
PDF
Chat
A Comprehensive Study of Speech Separation: Spectrogram vs Waveform Separation
2019
Fahimeh Bahmaninezhad
Jian Wu
Rongzhi Gu
Shi-Xiong Zhang
Yong Xu
Yu Meng
Dong Yu
4
+
Sequence Transduction with Recurrent Neural Networks
2012
Alex Graves
3
+
Data analysis for shapes and images
1997
John T. Kent
3
+
PDF
Chat
Building State-of-the-art Distant Speech Recognition Using the CHiME-4 Challenge with a Setup of Speech Enhancement Baseline
2018
Szu-Jui Chen
Aswin Shanmugam Subramanian
Hainan Xu
Shinji Watanabe
3
+
PDF
Chat
Recursive Speech Separation for Unknown Number of Speakers
2019
Naoya Takahashi
Parthasaarathy Sudarsanam
Nabarun Goswami
Yuki Mitsufuji
3
+
PDF
Chat
Specaugment on Large Scale Datasets
2020
Daniel Park
Yu Zhang
ChungâCheng Chiu
Youzheng Chen
Bo Li
William Chan
Quoc V. Le
Yonghui Wu
3
+
PDF
Chat
WHAM!: Extending Speech Separation to Noisy Environments
2019
Gordon Wichern
Joe Antognini
M. D. Flynn
Licheng Richard Zhu
Emmett McQuinn
Dwight Crow
Ethan Manilow
Jonathan Le Roux
3
+
Single-Channel Multi-Speaker Separation using Deep Clustering
2016
Yusuf Ziya IĆık
Jonathan Le Roux
Zhuo Chen
Shinji Watanabe
John R. Hershey
2
+
PDF
Chat
SDR â Half-baked or Well Done?
2019
Jonathan Le Roux
Scott Wisdom
Hakan ErdoÄan
John R. Hershey
2
+
PDF
Chat
End-to-End Multi-Speaker Speech Recognition Using Speaker Embeddings and Transfer Learning
2019
Pavel Denisov
Ngoc Thang Vu
2
+
MUSAN: A Music, Speech, and Noise Corpus
2015
David Snyder
Guoguo Chen
Daniel Povey
2
+
PDF
Chat
Single-channel multi-talker speech recognition with permutation invariant training
2018
Yanmin Qian
Xuankai Chang
Dong Yu
2
+
PDF
Chat
Speech recognition with deep recurrent neural networks
2013
Alex Graves
Abdelrahman Mohamed
Geoffrey E. Hinton
2
+
PDF
Chat
End-to-end Monaural Multi-speaker ASR System without Pretraining
2019
Xuankai Chang
Yanmin Qian
Kai Yu
Shinji Watanabe
2
+
PDF
Chat
Deep Learning Based Phase Reconstruction for Speaker Separation: A Trigonometric Perspective
2019
Zhong-Qiu Wang
Ke Tan
DeLiang Wang
2
+
Attention is All you Need
2017
Ashish Vaswani
Noam Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan N. Gomez
Ćukasz Kaiser
Illia Polosukhin
2
+
TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation.
2018
Yi Luo
Nima Mesgarani
2
+
Pattern Recognition and Machine Learning
2007
Christopher Bishop
2
+
PDF
Chat
End-to-End Speech Separation with Unfolded Iterative Phase Reconstruction
2018
Zhong-Qiu Wang
Jonathan Le Roux
DeLiang Wang
John R. Hershey
2
+
Supervised Speech Separation Based on Deep Learning: An Overview
2018
DeLiang Wang
Jitong Chen
2
+
PDF
Chat
TaSNet: Time-Domain Audio Separation Network for Real-Time, Single-Channel Speech Separation
2018
Yi Luo
Nima Mesgarani
2
+
PDF
Chat
All-neural Online Source Separation, Counting, and Diarization for Meeting Analysis
2019
Thilo von Neumann
Keisuke Kinoshita
Marc Delcroix
Shoko Araki
Tomohiro Nakatani
Reinhold HaebâUmbach
2
+
PDF
Chat
Pyroomacoustics: A Python Package for Audio Room Simulation and Array Processing Algorithms
2018
Robin Scheibler
Eric Bezzam
Ivan DokmaniÄ
2
+
End-to-End Multi-Channel Speech Separation
2019
Rongzhi Gu
Jian Wu
Shi-Xiong Zhang
Lianwu Chen
Yong Xu
Meng Yu
Dan Su
Yuexian Zou
Dong Yu
2
+
PDF
Chat
A Study of Enhancement, Augmentation and Autoencoder Methods for Domain Adaptation in Distant Speech Recognition
2018
Hao Tang
Wei-Ning Hsu
François Grondin
James Glass
2
+
PDF
Chat
Improved Training of End-to-end Attention Models for Speech Recognition
2018
Albert Zeyer
Kazuki Irie
Ralf SchlĂŒter
Hermann Ney
2
+
FurcaNet: An end-to-end deep gated convolutional, long short-term memory, deep neural networks for single channel speech separation
2019
Ziqiang Shi
Huibin Lin
Liu Liu
Rujie Liu
Shoji Hayakawa
Shouji Harada
Jiqing Han
2
+
PDF
Chat
Multi-geometry Spatial Acoustic Modeling for Distant Speech Recognition
2019
Kenichi Kumatani
Minhua Wu
Shiva Sundaram
Nikko Ström
Björn Hoffmeister
2
+
PDF
Chat
Acoustic Modeling for Overlapping Speech Recognition: Jhu Chime-5 Challenge System
2019
Vimal Manohar
Szu-Jui Chen
Zhiqi Wang
Yusuke Fujita
Shinji Watanabe
Sanjeev Khudanpur
2
+
PDF
Chat
Recognizing Overlapped Speech in Meetings: A Multichannel Separation Approach Using Neural Networks
2018
Takuya Yoshioka
Hakan ErdoÄan
Zhuo Chen
Xiong Xiao
Fil Alleva
2
+
On the Computation of Complex-valued Gradients with Application to Statistically Optimum Beamforming
2017
Christoph Boeddeker
Patrick Hanebrink
Lukas Drude
Jahn Heymann
Reinhold HaebâUmbach
2
+
PDF
Chat
Stream Attention-based Multi-array End-to-end Speech Recognition
2019
Xiaofei Wang
Ruizhi Li
Sri Harish Mallidi
Takaaki Hori
Shinji Watanabe
Hynek HeĆmanskĂœ
2
+
PDF
Chat
Deep clustering and conventional networks for music separation: Stronger together
2017
Yi Luo
Zhuo Chen
John R. Hershey
Jonathan Le Roux
Nima Mesgarani
2
+
PDF
Chat
Building and evaluation of a real room impulse response dataset
2019
Igor Szöke
Miroslav SkĂĄcel
Ladislav MoĆĄner
Jakub Paliesek
JaĆ ÄernockĂœ
2