Social Scene Understanding: End-to-End Multi-person Action Localization and Collective Activity Recognition

We present a unified framework for understanding human social behaviors in raw image sequences. Our model jointly detects multiple individuals, infers their social actions, and estimates the collective actions with a single feed-forward pass through a neural network. We propose a single architecture that does not rely on external detection …