ExcitNet Vocoder: A Neural Excitation Model for Parametric Speech Synthesis Systems
ExcitNet Vocoder: A Neural Excitation Model for Parametric Speech Synthesis Systems
This paper proposes a WaveNet-based neural excitation model (ExcitNet) for statistical parametric speech synthesis systems. Conventional WaveNet-based neural vocoding systems significantly improve the perceptual quality of synthesized speech by statistically generating a time sequence of speech waveforms through an auto-regressive framework. However, they often suffer from noisy outputs because of …