Multi-Band MelGAN: Faster Waveform Generation for High-Quality Text-to-Speech
In this paper, we propose multi-band MelGAN, a much faster waveform generation model targeting high-quality text-to-speech. Specifically, we improve the original MelGAN in the following aspects. First, we increase the receptive field of the generator, which has been shown to benefit speech generation. Second, we substitute the feature …