Ask a Question

Prefer a chat interface with context about you and your work?

Time-Domain Speaker Extraction Network

Time-Domain Speaker Extraction Network

Speaker extraction is to extract a target speaker's voice from multi-talker speech. It simulates humans' cocktail party effect or the selective listening ability. The prior work mostly performs speaker extraction in frequency domain, then reconstructs the signal with some phase approximation. The inaccuracy of phase estimation is inherent to the …