Ask a Question

Prefer a chat interface with context about you and your work?

Open-Vocabulary Video Relation Extraction

Open-Vocabulary Video Relation Extraction

A comprehensive understanding of videos is inseparable from describing the action with its contextual action-object interactions. However, many current video understanding tasks prioritize general action classification and overlook the actors and relationships that shape the nature of the action, resulting in a superficial understanding of the action. Motivated by this, …