Multi-Band MelGAN: Faster Waveform Generation for High-Quality Text-to-Speech

In this paper, we propose multi-band MelGAN, a much faster waveform generation model targeting high-quality text-to-speech. Specifically, we improve the original MelGAN in the following aspects. First, we increase the receptive field of the generator, which has been shown to benefit speech generation. Second, we substitute the feature …