Ask a Question

Prefer a chat interface with context about you and your work?

Improving Tail Performance of a Deliberation E2E ASR Model Using a Large Text Corpus

Improving Tail Performance of a Deliberation E2E ASR Model Using a Large Text Corpus

End-to-end (E2E) automatic speech recognition (ASR) systems lack the distinct language model (LM) component that characterizes traditional speech systems.While this simplifies the model architecture, it complicates the task of incorporating textonly data into training, which is important to the recognition of tail words that do not occur often in audio-text …