
Automatic detection of phoneme or word-like units is one of the core objectives in zero-resource speech processing. Recent attempts employ self-supervised training methods, such as contrastive predictive coding (CPC), where the next frame is predicted given past context. However, CPC only looks at the audio signal's frame-level structure. We overcome this limitation with …
