Action Unit Memory Network for Weakly Supervised Temporal Action Localization
Weakly supervised temporal action localization aims to detect and localize actions in untrimmed videos using only video-level labels during training. However, without frame-level annotations, it is challenging to achieve localization completeness and to mitigate background interference. In this paper, we present an Action Unit Memory Network (AUMN) for weakly supervised temporal …