ProsodyFM: Unsupervised Phrasing and Intonation Control for Intelligible Speech Synthesis

Prosody carries rich information beyond the literal meaning of words and is crucial for the intelligibility of speech. Current models still fall short in phrasing and intonation: they miss or misplace breaks when synthesizing long sentences with complex structures, and they produce unnatural intonation. We propose ProsodyFM, a …