Attention Based On-Device Streaming Speech Recognition with Large Speech Corpus
Attention Based On-Device Streaming Speech Recognition with Large Speech Corpus
In this paper, we present a new on-device automatic speech recognition (ASR) system based on monotonic chunk-wise attention (MoChA) models trained with large (> 10K hours) corpus. We attained around 90% of a word recognition rate for general domain mainly by using joint training of connectionist temporal classifier (CTC) and …