Generalized Multi-Source Inference for Text Conditioned Music Diffusion Models

Multi-Source Diffusion Models (MSDM) allow for compositional musical generation tasks: generating a set of coherent sources, creating accompaniments, and performing source separation. Despite their versatility, they require estimating the joint distribution over the sources, necessitating pre-separated musical data, which is rarely available, and fixing the number and type of sources …