Ask a Question

Prefer a chat interface with context about you and your work?

First Place Solution to the Multiple-choice Video QA Track of The Second Perception Test Challenge

First Place Solution to the Multiple-choice Video QA Track of The Second Perception Test Challenge

In this report, we present our first-place solution to the Multiple-choice Video Question Answering (QA) track of The Second Perception Test Challenge. This competition posed a complex video understanding task, requiring models to accurately comprehend and answer questions about video content. To address this challenge, we leveraged the powerful QwenVL2 …