An Investigation of Phone-Based Subword Units for End-to-End Speech Recognition

Phones and their context-dependent variants have been the standard modeling units for conventional speech recognition systems, while characters and subwords have demonstrated their effectiveness for end-to-end recognition systems. We investigate the use of phone-based subwords, in particular those produced by a byte pair encoder (BPE), as modeling units for end-to-end speech recognition. In addition, we also …