RWTH ASR Systems for LibriSpeech: Hybrid vs Attention
RWTH ASR Systems for LibriSpeech: Hybrid vs Attention
We present state-of-the-art automatic speech recognition (ASR) systems employing a standard hybrid DNN/HMM architecture compared to an attention-based encoder-decoder design for the LibriSpeech task. Detailed descriptions of the system development, including model design, pretraining schemes, training schedules, and optimization approaches are provided for both system architectures. Both hybrid DNN/HMM and …