Ask a Question

Prefer a chat interface with context about you and your work?

Self-Supervised Learning for Semi-Supervised Temporal Language Grounding

Self-Supervised Learning for Semi-Supervised Temporal Language Grounding

Given a text description, Temporal Language Grounding (TLG) aims to localize temporal boundaries of the segments that contain the specified semantics in an untrimmed video. TLG is inherently a challenging task, as it requires comprehensive understanding of both sentence semantics and video contents. Previous works either tackle this task in …