BB8: A Scalable, Accurate, Robust to Partial Occlusion Method for Predicting the 3D Poses of Challenging Objects without Using Depth

Type: Preprint

Publication Date: 2017-10-01

Citations: 774

DOI: https://doi.org/10.1109/iccv.2017.413

Download PDF

Abstract

We introduce a novel method for 3D object detection and pose estimation from color images only. We first use segmentation to detect the objects of interest in 2D even in presence of partial occlusions and cluttered background. By contrast with recent patch-based methods, we rely on a "holistic" approach: We apply to the detected objects a Convolutional Neural Network (CNN) trained to predict their 3D poses in the form of 2D projections of the corners of their 3D bounding boxes. This, however, is not sufficient for handling objects from the recent T-LESS dataset: These objects exhibit an axis of rotational symmetry, and the similarity of two images of such an object under two different poses makes training the CNN challenging. We solve this problem by restricting the range of poses used for training, and by introducing a classifier to identify the range of a pose at run-time before estimating it. We also use an optional additional step that refines the predicted poses. We improve the state-of-the-art on the LINEMOD dataset from 73.7% [2] to 89.3% of correctly registered RGB frames. We are also the first to report results on the Occlusion dataset [1] using color images only. We obtain 54% of frames passing the Pose 6D criterion on average on several sequences of the T-LESS dataset, compared to the 67% of the state-of-the-art [10] on the same sequences which uses both color and depth. The full approach is also scalable, as a single network can be trained for multiple objects simultaneously.

Locations

  • arXiv (Cornell University) - View - PDF
  • HAL (Le Centre pour la Communication Scientifique Directe) - View - PDF

Similar Works

Action Title Year Authors
+ BB8: A Scalable, Accurate, Robust to Partial Occlusion Method for Predicting the 3D Poses of Challenging Objects without Using Depth 2017 Mahdi Rad
Vincent Lepetit
+ BB8: A Scalable, Accurate, Robust to Partial Occlusion Method for Predicting the 3D Poses of Challenging Objects without Using Depth 2017 Mahdi Rad
Vincent Lepetit
+ Segmentation-driven 6D Object Pose Estimation 2018 Yinlin Hu
Joachim Hugonot
Pascal Fua
Mathieu Salzmann
+ PDF Chat Segmentation-Driven 6D Object Pose Estimation 2019 Yinlin Hu
Joachim Hugonot
Pascal Fua
Mathieu Salzmann
+ PDF Chat Real-Time Seamless Single Shot 6D Object Pose Prediction 2018 Bugra Tekin
Sudipta N. Sinha
Pascal Fua
+ Real-Time Seamless Single Shot 6D Object Pose Prediction 2017 Bugra Tekin
Sudipta N. Sinha
Pascal Fua
+ Real-Time Seamless Single Shot 6D Object Pose Prediction 2017 Bugra Tekin
Sudipta N. Sinha
Pascal Fua
+ PDF Chat Templates for 3D Object Pose Estimation Revisited: Generalization to New Objects and Robustness to Occlusions 2022 Van Nguyen Nguyen
Yinlin Hu
Yang Xiao
Mathieu Salzmann
Vincent Lepetit
+ DPOD: Dense 6D Pose Object Detector in RGB images 2019 Sergey Zakharov
Ivan Shugurov
Slobodan Ilić
+ Templates for 3D Object Pose Estimation Revisited: Generalization to New Objects and Robustness to Occlusions 2022 Van Nguyen Nguyen
Yinlin Hu
Xiao Yang
Mathieu Salzmann
Vincent Lepetit
+ PDF Chat Occlusion-Robust Object Pose Estimation with Holistic Representation 2022 Bo Chen
Tat-Jun Chin
Marius Klimavičius
+ Occlusion-Robust Object Pose Estimation with Holistic Representation 2021 Bo Chen
Tat-Jun Chin
Marius Klimavičius
+ iPose: Instance-Aware 6D Pose Estimation of Partly Occluded Objects 2017 Omid Hosseini Jafari
Siva Karthik Mustikovela
Karl Pertsch
Eric Brachmann
Carsten Rother
+ Inferring 3D Object Pose in RGB-D Images 2015 Saurabh Gupta
Pablo Arbeláez
Ross Girshick
Jitendra Malik
+ PDF Chat YCB-M: A Multi-Camera RGB-D Dataset for Object Recognition and 6DoF Pose Estimation 2020 Till Grenzdörffer
Martin Günther
Joachim Hertzberg
+ Semantic keypoint-based pose estimation from single RGB frames 2022 Karl Schmeckpeper
Philip R. Osteen
Yufu Wang
Georgios Pavlakos
Kenneth Chaney
W. H. Jordan
Xiaowei Zhou
Konstantinos G. Derpanis
Kostas Daniilidis
+ HomebrewedDB: RGB-D Dataset for 6D Pose Estimation of 3D Objects 2019 Roman Kaskman
Sergey Zakharov
Ivan Shugurov
Slobodan Ilić
+ HomebrewedDB: RGB-D Dataset for 6D Pose Estimation of 3D Objects 2019 Roman Kaskman
Sergey Zakharov
Ivan Shugurov
Slobodan Ilić
+ PDF Chat HomebrewedDB: RGB-D Dataset for 6D Pose Estimation of 3D Objects 2019 Roman Kaskman
Sergey Zakharov
Ivan Shugurov
Slobodan Ilić
+ OSOP: A Multi-Stage One Shot Object Pose Estimation Framework 2022 Ivan Shugurov
Fu Li
Benjamin Busam
Slobodan Ilić

Works That Cite This (334)

Action Title Year Authors
+ PDF Chat Deep learning-based spacecraft relative navigation methods: A survey 2021 Jianing Song
Duarte Rondao
Nabil Aouf
+ PDF Chat Templates for 3D Object Pose Estimation Revisited: Generalization to New Objects and Robustness to Occlusions 2022 Van Nguyen Nguyen
Yinlin Hu
Yang Xiao
Mathieu Salzmann
Vincent Lepetit
+ PDF Chat Bridging the Reality Gap for Pose Estimation Networks using Sensor-Based Domain Randomization 2021 Frederik Hagelskjar
Anders Glent Buch
+ PDF Chat YOLOPose V2: Understanding and improving transformer-based 6D pose estimation 2023 Arul Selvam Periyasamy
Arash Amini
Vladimir Tsaturyan
Sven Behnke
+ PDF Chat Shape-Constraint Recurrent Flow for 6D Object Pose Estimation 2023 Yang Hai
Rui Song
Jiaojiao Li
Yinlin Hu
+ Rapid Pose Label Generation through Sparse Representation of Unknown Objects. 2020 Rohan Pratap Singh
Mehdi Benallegue
Yusuke Yoshiyasu
Fumio Kanehiro
+ PDF Chat Reconstruct, Rasterize and Backprop: Dense shape and pose estimation from a single image 2020 Aniket Pokale
Aditya Aggarwal
Krishna Murthy Jatavallabhula
K. Madhava Krishna
+ PDF Chat Vision-based robotic grasping from object localization, object pose estimation to grasp estimation for parallel grippers: a review 2020 Guoguang Du
Kai Wang
Shiguo Lian
Kaiyong Zhao
+ PDF Chat On Pre-trained Image Features and Synthetic Images for Deep Learning 2019 Stefan Hinterstoißer
Vincent Lepetit
Paul Wohlhart
Kurt Konolige
+ DTF-Net: Category-Level Pose Estimation and Shape Reconstruction via Deformable Template Field 2023 Haowen Wang
Zhipeng Fan
Zhen Zhao
Zhengping Che
Zhiyuan Xu
Dong Liu
Feifei Feng
Yakun Huang
Xiuquan Qiao
Jian Tang