DINO-WM: World Models on Pre-trained Visual Features enable Zero-shot
Planning
The ability to predict future outcomes given control actions is fundamental for physical reasoning. However, such predictive models, often called world models, have proven challenging to learn and are typically developed for task-specific solutions with online policy learning. We argue that the true potential of world models lies in their …