Direct3D: Scalable Image-to-3D Generation via 3D Latent Diffusion Transformer
Generating high-quality 3D assets from text and images has long been challenging, primarily due to the absence of scalable 3D representations capable of capturing intricate geometry distributions. In this work, we introduce Direct3D, a native 3D generative model scalable to in-the-wild input images, without requiring a multiview diffusion model or …