Ask a Question

Prefer a chat interface with context about you and your work?

Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks with Guided Attention

Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks with Guided Attention

This paper describes a novel text-to-speech (TTS) technique based on deep convolutional neural networks (CNN), without use of any recurrent units. Recurrent neural networks (RNN) have become a standard technique to model sequential data recently, and this technique has been used in some cutting-edge neural TTS techniques. However, training RNN …