Generalized Multi-Source Inference for Text Conditioned Music Diffusion Models
Multi-Source Diffusion Models (MSDM) allow for compositional musical generation tasks: generating a set of coherent sources, creating accompaniments, and performing source separation. Despite their versatility, they require estimating the joint distribution over the sources, necessitating pre-separated musical data, which is rarely available, and fixing the number and type of sources …