Ask a Question

Prefer a chat interface with context about you and your work?

CRIPP-VQA: Counterfactual Reasoning about Implicit Physical Properties via Video Question Answering

CRIPP-VQA: Counterfactual Reasoning about Implicit Physical Properties via Video Question Answering

Videos often capture objects, their visible properties, their motion, and the interactions between different objects. Objects also have physical properties such as mass, which the imaging pipeline is unable to directly capture. However, these properties can be estimated by utilizing cues from relative object motion and the dynamics introduced by …