REBORN: Reinforcement-Learned Boundary Segmentation with Iterative
Training for Unsupervised ASR
REBORN: Reinforcement-Learned Boundary Segmentation with Iterative
Training for Unsupervised ASR
Unsupervised automatic speech recognition (ASR) aims to learn the mapping between the speech signal and its corresponding textual transcription without the supervision of paired speech-text data. A word/phoneme in the speech signal is represented by a segment of speech signal with variable length and unknown boundary, and this segmental structure …