Owl-1: Omni World Model for Consistent Long Video Generation
Owl-1: Omni World Model for Consistent Long Video Generation
Video generation models (VGMs) have received extensive attention recently and serve as promising candidates for general-purpose large vision models. While they can only generate short videos each time, existing methods achieve long video generation by iteratively calling the VGMs, using the last-frame output as the condition for the next-round generation. …