IPoD: Implicit Field Learning with Point Diffusion for Generalizable 3D Object Reconstruction from Single RGB-D Images

Type: Preprint

Publication Date: 2024-03-30

Citations: 0

DOI: https://doi.org/10.48550/arxiv.2404.00269

Abstract

Generalizable 3D object reconstruction from single-view RGB-D images remains a challenging task, particularly with real-world data. Current state-of-the-art methods develop Transformer-based implicit field learning, necessitating an intensive learning paradigm that requires dense query-supervision uniformly sampled throughout the entire space. We propose a novel approach, IPoD, which harmonizes implicit field learning with point diffusion. This approach treats the query points for implicit field learning as a noisy point cloud for iterative denoising, allowing for their dynamic adaptation to the target object shape. Such adaptive query points harness diffusion learning's capability for coarse shape recovery and also enhance the implicit representation's ability to delineate finer details. In addition, a self-conditioning mechanism is designed to use implicit predictions as guidance for diffusion learning, leading to a cooperative system. Experiments conducted on the CO3D-v2 dataset affirm the superiority of IPoD, which achieves a 7.8% improvement in F-score and a 28.6% improvement in Chamfer distance over existing methods. The generalizability of IPoD is also demonstrated on the MVImgNet dataset. Our project page is at https://yushuang-wu.github.io/IPoD.
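The abstract reports results in terms of F-score and Chamfer distance between point sets. As a rough illustration of what these two metrics measure, the NumPy sketch below computes both for small point clouds. It is not the paper's evaluation code: the function names, the use of unsquared Euclidean distances, the symmetric sum of the two directional means, and the threshold value of 0.1 are all assumptions made for this example, and published benchmarks vary on each of these choices.

```python
import numpy as np


def pairwise_distances(a: np.ndarray, b: np.ndarray) -> np.ndarray:
    """Euclidean distance between every point in a (N, 3) and b (M, 3)."""
    diff = a[:, None, :] - b[None, :, :]  # shape (N, M, 3) via broadcasting
    return np.linalg.norm(diff, axis=-1)  # shape (N, M)


def chamfer_distance(pred: np.ndarray, gt: np.ndarray) -> float:
    """Symmetric Chamfer distance (assumed convention: unsquared, summed)."""
    d = pairwise_distances(pred, gt)
    pred_to_gt = d.min(axis=1).mean()  # each predicted point to nearest GT point
    gt_to_pred = d.min(axis=0).mean()  # each GT point to nearest predicted point
    return float(pred_to_gt + gt_to_pred)


def f_score(pred: np.ndarray, gt: np.ndarray, threshold: float = 0.1) -> float:
    """Harmonic mean of precision and recall at a distance threshold.

    The default threshold is an arbitrary value chosen for illustration.
    """
    d = pairwise_distances(pred, gt)
    precision = (d.min(axis=1) < threshold).mean()  # predicted points near GT
    recall = (d.min(axis=0) < threshold).mean()     # GT points near prediction
    if precision + recall == 0:
        return 0.0
    return float(2 * precision * recall / (precision + recall))


if __name__ == "__main__":
    gt = np.array([[0.0, 0.0, 0.0], [1.0, 0.0, 0.0]])
    pred = gt + np.array([0.0, 0.0, 0.05])  # small offset along z
    print("Chamfer distance:", chamfer_distance(pred, gt))
    print("F-score:", f_score(pred, gt))
```

A lower Chamfer distance and a higher F-score both indicate a reconstruction closer to the ground truth. The dense (N, M) distance matrix keeps the sketch short but scales poorly, so a nearest-neighbour structure such as a k-d tree would be a more practical choice for large point clouds.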

Locations

  • arXiv (Cornell University)

Similar Works

  • CCD-3DR: Consistent Conditioning in Diffusion for Single-Image 3D Reconstruction (2023). Yan Di, Chenyangguang Zhang, Pengyuan Wang, Guangyao Zhai, Ruida Zhang, Fabian Manhardt, Benjamin Busam, Xiangyang Ji, Federico Tombari
  • $PC^2$: Projection-Conditioned Point Cloud Diffusion for Single-Image 3D Reconstruction (2023). Luke Melas-Kyriazi, Christian Rupprecht, Andrea Vedaldi
  • DiffPoint: Single and Multi-view Point Cloud Reconstruction with ViT Based Diffusion Model (2024). Yu Feng, Shi Xing, Meng‐Li Cheng, Yun Xiong
  • Ladybird: Quasi-Monte Carlo Sampling for Deep Implicit Field Based 3D Reconstruction with Symmetry (2020). Yifan Xu, Tianqi Fan, Yi Yuan, Gurprit Singh
  • A Conditional Point Diffusion-Refinement Paradigm for 3D Point Cloud Completion (2021). Zhaoyang Lyu, Zhifeng Kong, Xudong Xu, Liang Pan, Dahua Lin
  • DI-Fusion: Online Implicit 3D Reconstruction with Deep Priors (2020). Jiahui Huang, Shi-Sheng Huang, Haoxuan Song, Shi‐Min Hu
  • DI-Fusion: Online Implicit 3D Reconstruction with Deep Priors (2021). Jiahui Huang, Shi-Sheng Huang, Haoxuan Song, Shi‐Min Hu
  • PCDreamer: Point Cloud Completion Through Multi-view Diffusion Priors (2024). Guangshun Wei, Yuan Feng, L.L. Ma, Chen Wang, Yuanfeng Zhou, Changjian Li
  • Bayesian Diffusion Models for 3D Shape Reconstruction (2024). Haiyang Xu, Lei Yu, Zeyuan Chen, Xiang Zhang, Yue Zhao, Yilin Wang, Zhuowen Tu
  • MVSBoost: An Efficient Point Cloud-based 3D Reconstruction (2024). Umair Haroon, Ahmad AlMughrabi, Ricardo Paulino Marques, Petia Radeva
  • RGB2Point: 3D Point Cloud Generation from Single RGB Images (2024). Jae Joong Lee, Bedřich Beneš
  • Coherent 3D Scene Diffusion From a Single RGB Image (2024). Manuel Dahnert, Angela Dai, Norman Müller, Matthias Nießner
  • SPAR3D: Stable Point-Aware Reconstruction of 3D Objects from Single Images (2025). Zixuan Huang, Mark Boss, Aaryaman Vasishta, James M. Rehg, Varun Jampani
  • Semi-Supervised Single-View 3D Reconstruction via Prototype Shape Priors (2022). Zhen Xing, Hengduo Li, Zuxuan Wu, Yu–Gang Jiang
  • DmifNet: 3D Shape Reconstruction Based on Dynamic Multi-Branch Information Fusion (2020). Lei Li, Suping Wu
  • DmifNet: 3D Shape Reconstruction based on Dynamic Multi-Branch Information Fusion (2021). Lei Li, Suping Wu
  • The More You See in 2D, the More You Perceive in 3D (2024). Xinyang Han, Zelin Gao, Angjoo Kanazawa, Shubham Goel, Yossi Gandelsman
  • InFusionSurf: Refining Neural RGB-D Surface Reconstruction Using Per-Frame Intrinsic Refinement and TSDF Fusion Prior Learning (2023). Seunghwan Lee, Gwanmo Park, Hyewon Son, Jiwon Ryu, Han Joo Chae
  • Semi-supervised Single-view 3D Reconstruction via Multi Shape Prior Fusion Strategy and Self-Attention (2024). Wei Zhoua, Xinzhe Shia, Yvonne R. Shea, Kunlong Liua, Yongqin Zhanga
  • Neural Fields as Learnable Kernels for 3D Reconstruction (2022). Francis Williams, Žan Gojčič, Sameh Khamis, Denis Zorin, Joan Bruna, Sanja Fidler, Or Litany

Works That Cite This (0)

Works Cited by This (0)
