Improving Tail Performance of a Deliberation E2E ASR Model Using a Large Text Corpus
Improving Tail Performance of a Deliberation E2E ASR Model Using a Large Text Corpus
End-to-end (E2E) automatic speech recognition (ASR) systems lack the distinct language model (LM) component that characterizes traditional speech systems.While this simplifies the model architecture, it complicates the task of incorporating textonly data into training, which is important to the recognition of tail words that do not occur often in audio-text …