Solos: A Dataset for Audio-Visual Music Analysis

In this paper, we present a new dataset of music performance videos which can be used for training machine learning methods for multiple tasks such as audio-visual blind source separation and localization, cross-modal correspondences, cross-modal generation and, in general, any audio-visual self-supervised task. These videos, gathered from YouTube, consist of …