IPoD: Implicit Field Learning with Point Diffusion for Generalizable 3D Object Reconstruction from Single RGB-D Images

Yushuang Wu, Luyue Shi, Junhao Cai, Weihao Yuan, Lingteng Qiu, Zilong Dong, Liefeng Bo, Shuguang Cui, Xiaoguang Han

Type: Preprint

Publication Date: 2024-03-30

Citations: 0

DOI: https://doi.org/10.48550/arxiv.2404.00269

Abstract

Generalizable 3D object reconstruction from single-view RGB-D images remains a challenging task, particularly with real-world data. Current state-of-the-art methods develop Transformer-based implicit field learning, necessitating an intensive learning paradigm that requires dense query-supervision uniformly sampled throughout the entire space. We propose a novel approach, IPoD, which harmonizes implicit field learning with point diffusion. This approach treats the query points for implicit field learning as a noisy point cloud for iterative denoising, allowing for their dynamic adaptation to the target object shape. Such adaptive query points harness diffusion learning's capability for coarse shape recovery and also enhances the implicit representation's ability to delineate finer details. Besides, an additional self-conditioning mechanism is designed to use implicit predictions as the guidance of diffusion learning, leading to a cooperative system. Experiments conducted on the CO3D-v2 dataset affirm the superiority of IPoD, achieving 7.8% improvement in F-score and 28.6% in Chamfer distance over existing methods. The generalizability of IPoD is also demonstrated on the MVImgNet dataset. Our project page is at https://yushuang-wu.github.io/IPoD.

Locations

arXiv (Cornell University) - View - PDF

Similar Works

Action	Title	Year	Authors
+	CCD-3DR: Consistent Conditioning in Diffusion for Single-Image 3D Reconstruction	2023	Yan Di Chenyangguang Zhang Pengyuan Wang Guangyao Zhai Ruida Zhang Fabian Manhardt Benjamin Busam Xiangyang Ji Federico Tombari
+	$PC^2$: Projection-Conditioned Point Cloud Diffusion for Single-Image 3D Reconstruction	2023	Luke Melas-Kyriazi Christian Rupprecht Andrea Vedaldi
+ PDF Chat	DiffPoint: Single and Multi-view Point Cloud Reconstruction with ViT Based Diffusion Model	2024	Yu Feng Shi Xing Meng‐Li Cheng Yun Xiong
+	Ladybird: Quasi-Monte Carlo Sampling for Deep Implicit Field Based 3D Reconstruction with Symmetry	2020	Yifan Xu Tianqi Fan Yi Yuan Gurprit Singh
+	A Conditional Point Diffusion-Refinement Paradigm for 3D Point Cloud Completion	2021	Zhaoyang Lyu Zhifeng Kong Xudong Xu Liang Pan Dahua Lin
+	DI-Fusion: Online Implicit 3D Reconstruction with Deep Priors	2020	Jiahui Huang Shi-Sheng Huang Haoxuan Song Shi‐Min Hu
+ PDF Chat	DI-Fusion: Online Implicit 3D Reconstruction with Deep Priors	2021	Jiahui Huang Shi-Sheng Huang Haoxuan Song Shi‐Min Hu
+ PDF Chat	PCDreamer: Point Cloud Completion Through Multi-view Diffusion Priors	2024	Guangshun Wei Yuan Feng L.L. Ma Chen Wang Yuanfeng Zhou Changjian Li
+ PDF Chat	Bayesian Diffusion Models for 3D Shape Reconstruction	2024	Haiyang Xu Lei Yu Zeyuan Chen Xiang Zhang Yue Zhao Yilin Wang Zhuowen Tu
+ PDF Chat	MVSBoost: An Efficient Point Cloud-based 3D Reconstruction	2024	Umair Haroon Ahmad AlMughrabi Ricardo Paulino Marques Petia Radeva
+ PDF Chat	RGB2Point: 3D Point Cloud Generation from Single RGB Images	2024	Jae Joong Lee Bedřich Beneš
+ PDF Chat	Coherent 3D Scene Diffusion From a Single RGB Image	2024	Manuel Dahnert Angela Dai Norman Müller Matthias Nießner
+ PDF Chat	SPAR3D: Stable Point-Aware Reconstruction of 3D Objects from Single Images	2025	Zixuan Huang Mark Boss Aaryaman Vasishta James M. Rehg Varun Jampani
+	Semi-Supervised Single-View 3D Reconstruction via Prototype Shape Priors	2022	Zhen Xing Hengduo Li Zuxuan Wu Yu–Gang Jiang
+	DmifNet:3D Shape Reconstruction Based on Dynamic Multi-Branch Information Fusion	2020	Lei Li Suping Wu
+ PDF Chat	DmifNet: 3D Shape Reconstruction based on Dynamic Multi-Branch Information Fusion	2021	Lei Li Suping Wu
+ PDF Chat	The More You See in 2D, the More You Perceive in 3D	2024	Xinyang Han Zelin Gao Angjoo Kanazawa Shubham Goel Yossi Gandelsman
+	InFusionSurf: Refining Neural RGB-D Surface Reconstruction Using Per-Frame Intrinsic Refinement and TSDF Fusion Prior Learning	2023	Seunghwan Lee Gwanmo Park Hyewon Son Jiwon Ryu Han Joo Chae
+ PDF Chat	Semi-supervised Single-view 3D Reconstruction via Multi Shape Prior Fusion Strategy and Self-Attention	2024	Wei Zhoua Xinzhe Shia Yvonne R. Shea Kunlong Liua Yongqin Zhanga
+ PDF Chat	Neural Fields as Learnable Kernels for 3D Reconstruction	2022	Francis Williams Žan Gojčič Sameh Khamis Denis Zorin Joan Bruna Sanja Fidler Or Litany

Works That Cite This (0)

Action	Title	Year	Authors

Works Cited by This (0)

Action	Title	Year	Authors