Dual-Path Modeling for Long Recording Speech Separation in Meetings
Dual-Path Modeling for Long Recording Speech Separation in Meetings
The continuous speech separation (CSS) is a task to separate the speech sources from a long, partially overlapped recording, which involves a varying number of speakers. A straightforward extension of conventional utterance-level speech separation to the CSS task is to segment the long recording with a size-fixed window and process …