Prefer a chat interface with context about you and your work?
Weakly Supervised Human-Object Interaction Detection in Video via Contrastive Spatiotemporal Regions
We introduce the task of weakly supervised learning for detecting human and object interactions in videos. Our task poses unique challenges as a system does not know what types of human-object interactions are present in a video or the actual spatiotemporal location of the human and the object. To address …