Generalized Rank Pooling for Activity Recognition
Most popular deep models for action recognition split video sequences into short sub-sequences consisting of a few frames; frame-based features are then pooled to recognize the activity. Usually, this pooling step discards the temporal order of the frames, which could otherwise be used for better recognition. Towards this end, we …