A Skeleton-Bridged Deep Learning Approach for Generating Meshes of Complex Topologies From Single RGB Images

Type: Preprint

Publication Date: 2019-06-01

Citations: 79

DOI: https://doi.org/10.1109/cvpr.2019.00467

Abstract

This paper focuses on the challenging task of learning 3D object surface reconstructions from single RGB images. Existing methods achieve varying degrees of success by using different geometric representations. However, they all have their own drawbacks, and cannot well reconstruct those surfaces of complex topologies. To this end, we propose in this paper a skeleton-bridged, stage-wise learning approach to address the challenge. Our use of skeleton is due to its nice property of topology preservation, while being of lower complexity to learn. To learn skeleton from an input image, we design a deep architecture whose decoder is based on a novel design of parallel streams respectively for synthesis of curve- and surface-like skeleton points. We use different shape representations of point cloud, volume, and mesh in our stage-wise learning, in order to take their respective advantages. We also propose multi-stage use of the input image to correct prediction errors that are possibly accumulated in each stage. We conduct intensive experiments to investigate the efficacy of our proposed approach. Qualitative and quantitative results on representative object categories of both simple and complex topologies demonstrate the superiority of our approach over existing ones. We will make our ShapeNet-Skeleton dataset publicly available.

Locations

  • arXiv (Cornell University) - View - PDF
  • 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) - View

Works Cited by This (24)

Action Title Year Authors
+ Very Deep Convolutional Networks for Large-Scale Image Recognition 2014 Karen Simonyan
Andrew Zisserman
+ PDF Chat 3D ShapeNets: A deep representation for volumetric shapes 2015 Zhirong Wu
Shuran Song
Aditya Khosla
Fisher Yu
Linguang Zhang
Xiaoou Tang
Jianxiong Xiao
+ PDF Chat Going deeper with convolutions 2015 Christian Szegedy
Wei Liu
Yangqing Jia
Pierre Sermanet
Scott Reed
Dragomir Anguelov
Dumitru Erhan
Vincent Vanhoucke
Andrew Rabinovich
+ PDF Chat Deep Residual Learning for Image Recognition 2016 Kaiming He
Xiangyu Zhang
Shaoqing Ren
Jian Sun
+ PDF Chat 3D-R2N2: A Unified Approach for Single and Multi-view 3D Object Reconstruction 2016 Christopher Choy
Danfei Xu
JunYoung Gwak
Kai Chen
Silvio Savarese
+ Generative and Discriminative Voxel Modeling with Convolutional Neural Networks 2016 Andrew Brock
Theodore Lim
Jacob Ritchie
Nick Weston
+ Learning a Probabilistic Latent Space of Object Shapes via 3D Generative-Adversarial Modeling 2016 Jiajun Wu
Chengkai Zhang
Tianfan Xue
William T. Freeman
Joshua B. Tenenbaum
+ PDF Chat OctNet: Learning Deep 3D Representations at High Resolutions 2017 Gernot Riegler
Ali Osman Ulusoy
Andreas Geiger
+ PDF Chat A Point Set Generation Network for 3D Object Reconstruction from a Single Image 2017 Haoqiang Fan
Hao Su
Leonidas Guibas
+ PDF Chat Octree Generating Networks: Efficient Convolutional Architectures for High-resolution 3D Outputs 2017 Maxim Tatarchenko
Alexey Dosovitskiy
Thomas Brox