Ask a Question

Prefer a chat interface with context about you and your work?

RWTH ASR Systems for LibriSpeech: Hybrid vs Attention

RWTH ASR Systems for LibriSpeech: Hybrid vs Attention

We present state-of-the-art automatic speech recognition (ASR) systems employing a standard hybrid DNN/HMM architecture compared to an attention-based encoder-decoder design for the LibriSpeech task. Detailed descriptions of the system development, including model design, pretraining schemes, training schedules, and optimization approaches are provided for both system architectures. Both hybrid DNN/HMM and …