Text-to-Song: Towards Controllable Music Generation Incorporating Vocals
and Accompaniment
A song is a combination of singing voice and accompaniment. However, existing works treat singing voice synthesis and music generation as independent tasks, and little attention has been paid to song synthesis. In this work, we propose a novel task called text-to-song synthesis, which incorporates both vocal and accompaniment generation. We develop …