Improving Streaming Transformer Based ASR Under a Framework of Self-Supervised Learning
Recently, self-supervised learning has emerged as an effective approach to improving the performance of automatic speech recognition (ASR). Under such a framework, the neural network is usually pre-trained with massive unlabeled data and then fine-tuned with limited labeled data. However, a non-streaming architecture such as the bidirectional transformer is usually adopted by the neural …