Ask a Question

Prefer a chat interface with context about you and your work?

Exploring neural transducers for end-to-end speech recognition

Exploring neural transducers for end-to-end speech recognition

In this work, we perform an empirical comparison among the CTC, RNN-Transducer, and attention-based Seq2Seq models for end-to-end speech recognition. We show that, without any language model, Seq2Seq and RNN-Transducer models both outperform the best reported CTC models with a language model, on the popular Hub5'00 benchmark. On our internal …