Bridge-Prompt: Towards Ordinal Action Understanding in Instructional Videos
Bridge-Prompt: Towards Ordinal Action Understanding in Instructional Videos
Action recognition models have shown a promising capability to classify human actions in short video clips. In a real scenario, multiple correlated human actions commonly occur in particular orders, forming semantically meaningful human activities. Conventional action recognition approaches focus on analyzing single actions. However, they fail to fully reason about …