TC-Bench: Benchmarking Temporal Compositionality in Text-to-Video and
Image-to-Video Generation
Video generation has many unique challenges beyond those of image generation. The temporal dimension introduces extensive possible variations across frames, over which consistency and continuity may be violated. In this study, we move beyond evaluating simple actions and argue that generated videos should incorporate the emergence of new concepts and …