An investigation of phone-based subword units for end-to-end speech
recognition
Phones and their context-dependent variants have been the standard modeling units for conventional speech recognition systems, while characters and subwords have proven effective for end-to-end recognition systems. We investigate the use of phone-based subwords, in particular those obtained with byte pair encoding (BPE), as modeling units for end-to-end speech recognition. In addition, …
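To make the notion of a phone-based subword concrete, the following is a minimal sketch of standard BPE merge learning applied to phone sequences rather than characters. It is not taken from the paper: the toy lexicon, the counts, the `_` joiner, and the function names `learn_phone_bpe`, `apply_merge`, and `encode` are all hypothetical choices made for illustration.

```python
from collections import Counter


def apply_merge(units, pair):
    """Replace every adjacent occurrence of `pair` in `units` with one merged unit."""
    out = []
    i = 0
    while i < len(units):
        if i + 1 < len(units) and (units[i], units[i + 1]) == pair:
            out.append(units[i] + "_" + units[i + 1])  # "_" joiner is an assumption
            i += 2
        else:
            out.append(units[i])
            i += 1
    return tuple(out)


def learn_phone_bpe(word_phones, num_merges):
    """Learn up to `num_merges` BPE merges from {phone tuple: word count}."""
    vocab = {tuple(phones): count for phones, count in word_phones.items()}
    merges = []
    for _ in range(num_merges):
        # Count adjacent unit pairs, weighted by word frequency.
        pair_counts = Counter()
        for units, count in vocab.items():
            for a, b in zip(units, units[1:]):
                pair_counts[(a, b)] += count
        if not pair_counts:
            break
        # Most frequent pair; ties are broken by the pair itself for determinism.
        best = max(pair_counts, key=lambda p: (pair_counts[p], p))
        merges.append(best)
        new_vocab = {}
        for units, count in vocab.items():
            merged = apply_merge(units, best)
            new_vocab[merged] = new_vocab.get(merged, 0) + count
        vocab = new_vocab
    return merges


def encode(phones, merges):
    """Segment one word's phone sequence by applying merges in learned order."""
    units = tuple(phones)
    for pair in merges:
        units = apply_merge(units, pair)
    return units


# Hypothetical toy lexicon: pronunciation (phone tuple) -> corpus count.
toy_lexicon = {
    ("K", "AE", "T"): 5,  # "cat"
    ("K", "AE", "P"): 3,  # "cap"
    ("B", "AE", "T"): 2,  # "bat"
}
merges = learn_phone_bpe(toy_lexicon, num_merges=2)
print(merges)
print(encode(["K", "AE", "P"], merges))
```

In this sketch, merges are learned within word boundaries, so each resulting unit spans one or more phones of a single word. A real system would draw pronunciations from a lexicon and counts from training transcripts.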