ProsodyFM: Unsupervised Phrasing and Intonation Control for Intelligible
Speech Synthesis
ProsodyFM: Unsupervised Phrasing and Intonation Control for Intelligible
Speech Synthesis
Prosody contains rich information beyond the literal meaning of words, which is crucial for the intelligibility of speech. Current models still fall short in phrasing and intonation; they not only miss or misplace breaks when synthesizing long sentences with complex structures but also produce unnatural intonation. We propose ProsodyFM, a …