Vision-Infused Deep Audio Inpainting

Multi-modality perception is essential for developing interactive intelligence. In this work, we consider a new task of visual information-infused audio inpainting, i.e., synthesizing missing audio segments that correspond to their accompanying videos. We identify two key aspects of a successful inpainter: (1) It is desirable to operate on spectrograms instead …