Segmental Contrastive Predictive Coding for Unsupervised Word Segmentation
Segmental Contrastive Predictive Coding for Unsupervised Word Segmentation
Automatic detection of phoneme or word-like units is one of the core objectives in zero-resource speech processing.Recent attempts employ self-supervised training methods, such as contrastive predictive coding (CPC), where the next frame is predicted given past context.However, CPC only looks at the audio signal's frame-level structure.We overcome this limitation with …