Ask a Question

Prefer a chat interface with context about you and your work?

MEDIC: Zero-shot Music Editing with Disentangled Inversion Control

MEDIC: Zero-shot Music Editing with Disentangled Inversion Control

Text-guided diffusion models make a paradigm shift in audio generation, facilitating the adaptability of source audio to conform to specific textual prompts. Recent works introduce inversion techniques, like DDIM inversion, to zero-shot editing, exploiting pretrained diffusion models for audio modification. Nonetheless, our investigation exposes that DDIM inversion suffers from an …