Prometheus: 3D-Aware Latent Diffusion Models for Feed-Forward Text-to-3D
Scene Generation
Prometheus: 3D-Aware Latent Diffusion Models for Feed-Forward Text-to-3D
Scene Generation
In this work, we introduce Prometheus, a 3D-aware latent diffusion model for text-to-3D generation at both object and scene levels in seconds. We formulate 3D scene generation as multi-view, feed-forward, pixel-aligned 3D Gaussian generation within the latent diffusion paradigm. To ensure generalizability, we build our model upon pre-trained text-to-image generation …