Ask a Question

Prefer a chat interface with context about you and your work?

LiRA: Learning Visual Speech Representations from Audio through Self-supervision

LiRA: Learning Visual Speech Representations from Audio through Self-supervision

The large amount of audiovisual content being shared online today has drawn substantial attention to the prospect of audiovisual self-supervised learning. Recent works have focused on each of these modalities separately, while others have attempted to model both simultaneously in a cross-modal fashion. However, comparatively little attention has been given …