Prefer a chat interface with context about you and your work?
Exploiting temporal information to detect conversational groups in videos and predict the next speaker
Studies in human human interaction have introduced the concept of F formation to describe the spatial arrangement of participants during social interactions. This paper has two objectives. It aims at detecting F formations in video sequences and predicting the next speaker in a group conversation. The proposed approach exploits time …