Motion-Appearance Co-memory Networks for Video Question Answering

Video Question Answering (QA) is an important task in understanding video temporal structure. We observe that there are three unique attributes of video QA compared with image QA: (1) it deals with long sequences of images containing richer information not only in quantity but also in variety; (2) motion and …