TaSNet: Time-Domain Audio Separation Network for Real-Time, Single-Channel Speech Separation
TaSNet: Time-Domain Audio Separation Network for Real-Time, Single-Channel Speech Separation
Robust speech processing in multi-talker environments requires effective speech separation. Recent deep learning systems have made significant progress toward solving this problem, yet it remains challenging particularly in real-time, short latency applications. Most methods attempt to construct a mask for each source in time-frequency representation of the mixture signal which …