Improving LPCNET-Based Text-to-Speech with Linear Prediction-Structured Mixture Density Network
Improving LPCNET-Based Text-to-Speech with Linear Prediction-Structured Mixture Density Network
In this paper, we propose an improved LPCNet vocoder using a linear prediction (LP)-structured mixture density network (MDN). The recently proposed LPCNet vocoder has successfully achieved high-quality and lightweight speech synthesis systems by combining a vocal tract LP filter with a WaveRNN-based vocal source (i.e., excitation) generator. However, the quality …