
We present a neural text-to-speech system for fine-grained prosody transfer from one speaker to another. Conventional approaches for end-to-end prosody transfer typically use either fixed-dimensional or variable-length prosody embeddings via a secondary attention to encode the reference signal. However, when trained on a single-speaker dataset, conventional prosody transfer systems are not …
