Chained Multi-stream Networks Exploiting Pose, Motion, and Appearance for Action Classification and Detection
General human action recognition requires understanding of various visual cues. In this paper, we propose a network architecture that computes and integrates the most important visual cues for action recognition: pose, motion, and the raw images. For the integration, we introduce a Markov chain model which adds cues successively. The …