Ask a Question

Prefer a chat interface with context about you and your work?

OmniSep: Unified Omni-Modality Sound Separation with Query-Mixup

OmniSep: Unified Omni-Modality Sound Separation with Query-Mixup

The scaling up has brought tremendous success in the fields of vision and language in recent years. When it comes to audio, however, researchers encounter a major challenge in scaling up the training data, as most natural audio contains diverse interfering signals. To address this limitation, we introduce Omni-modal Sound …