BB8: A Scalable, Accurate, Robust to Partial Occlusion Method for Predicting the 3D Poses of Challenging Objects without Using Depth

Mahdi Rad, Vincent Lepetit

Type: Preprint

Publication Date: 2017-10-01

Citations: 774

DOI: https://doi.org/10.1109/iccv.2017.413

Abstract

We introduce a novel method for 3D object detection and pose estimation from color images only. We first use segmentation to detect the objects of interest in 2D, even in the presence of partial occlusions and cluttered backgrounds. In contrast to recent patch-based methods, we rely on a "holistic" approach: We apply to the detected objects a Convolutional Neural Network (CNN) trained to predict their 3D poses in the form of 2D projections of the corners of their 3D bounding boxes. This, however, is not sufficient for handling objects from the recent T-LESS dataset: These objects exhibit an axis of rotational symmetry, and the similarity of two images of such an object under two different poses makes training the CNN challenging. We solve this problem by restricting the range of poses used for training, and by introducing a classifier to identify the range of a pose at run-time before estimating it. We also use an optional additional step that refines the predicted poses. We improve the state of the art on the LINEMOD dataset from 73.7% [2] to 89.3% of correctly registered RGB frames. We are also the first to report results on the Occlusion dataset [1] using color images only. We obtain 54% of frames passing the Pose 6D criterion on average on several sequences of the T-LESS dataset, compared to 67% for the state-of-the-art method [10] on the same sequences, which uses both color and depth. The full approach is also scalable, as a single network can be trained for multiple objects simultaneously.
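
The abstract describes the pose as being predicted "in the form of 2D projections of the corners of their 3D bounding boxes". The following is a minimal sketch of how such a 16-value target could be computed from a known pose, assuming a standard pinhole camera model. The function names, the box half-extents, and the intrinsics (fx, fy, cx, cy) are hypothetical and chosen only for illustration; they do not come from the paper.

```python
from itertools import product


def box_corners(half_extents):
    """Return the 8 corners of a box centred at the object origin."""
    hx, hy, hz = half_extents
    return [(sx * hx, sy * hy, sz * hz)
            for sx, sy, sz in product((-1, 1), repeat=3)]


def project(point, R, t, fx, fy, cx, cy):
    """Move a 3D point into the camera frame with (R, t), then apply
    a pinhole projection to get pixel coordinates (u, v)."""
    X = [sum(R[i][j] * point[j] for j in range(3)) + t[i] for i in range(3)]
    return (fx * X[0] / X[2] + cx, fy * X[1] / X[2] + cy)


def pose_to_corner_projections(half_extents, R, t, fx, fy, cx, cy):
    """Eight (u, v) pairs: the 2D projections of the 3D box corners."""
    return [project(c, R, t, fx, fy, cx, cy)
            for c in box_corners(half_extents)]


# Illustrative values only (assumed, not taken from the paper):
# a 10 cm cube, 1 m in front of the camera, no rotation.
R_identity = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
targets = pose_to_corner_projections(
    (0.05, 0.05, 0.05), R_identity, (0.0, 0.0, 1.0),
    fx=600.0, fy=600.0, cx=320.0, cy=240.0)
```

Given eight predicted 2D points and the known 3D corners, a 3D pose can in principle be recovered from these 2D-3D correspondences with a PnP solver; the abstract does not say which solver is used.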

Locations

arXiv (Cornell University)
HAL (Le Centre pour la Communication Scientifique Directe)

Similar Works

Title	Year	Authors
BB8: A Scalable, Accurate, Robust to Partial Occlusion Method for Predicting the 3D Poses of Challenging Objects without Using Depth	2017	Mahdi Rad, Vincent Lepetit
Segmentation-driven 6D Object Pose Estimation	2018	Yinlin Hu, Joachim Hugonot, Pascal Fua, Mathieu Salzmann
Segmentation-Driven 6D Object Pose Estimation	2019	Yinlin Hu, Joachim Hugonot, Pascal Fua, Mathieu Salzmann
Real-Time Seamless Single Shot 6D Object Pose Prediction	2018	Bugra Tekin, Sudipta N. Sinha, Pascal Fua
Real-Time Seamless Single Shot 6D Object Pose Prediction	2017	Bugra Tekin, Sudipta N. Sinha, Pascal Fua
Templates for 3D Object Pose Estimation Revisited: Generalization to New Objects and Robustness to Occlusions	2022	Van Nguyen Nguyen, Yinlin Hu, Yang Xiao, Mathieu Salzmann, Vincent Lepetit
DPOD: Dense 6D Pose Object Detector in RGB images	2019	Sergey Zakharov, Ivan Shugurov, Slobodan Ilić
Templates for 3D Object Pose Estimation Revisited: Generalization to New Objects and Robustness to Occlusions	2022	Van Nguyen Nguyen, Yinlin Hu, Xiao Yang, Mathieu Salzmann, Vincent Lepetit
Occlusion-Robust Object Pose Estimation with Holistic Representation	2022	Bo Chen, Tat-Jun Chin, Marius Klimavičius
Occlusion-Robust Object Pose Estimation with Holistic Representation	2021	Bo Chen, Tat-Jun Chin, Marius Klimavičius
iPose: Instance-Aware 6D Pose Estimation of Partly Occluded Objects	2017	Omid Hosseini Jafari, Siva Karthik Mustikovela, Karl Pertsch, Eric Brachmann, Carsten Rother
Inferring 3D Object Pose in RGB-D Images	2015	Saurabh Gupta, Pablo Arbeláez, Ross Girshick, Jitendra Malik
YCB-M: A Multi-Camera RGB-D Dataset for Object Recognition and 6DoF Pose Estimation	2020	Till Grenzdörffer, Martin Günther, Joachim Hertzberg
Semantic keypoint-based pose estimation from single RGB frames	2022	Karl Schmeckpeper, Philip R. Osteen, Yufu Wang, Georgios Pavlakos, Kenneth Chaney, W. H. Jordan, Xiaowei Zhou, Konstantinos G. Derpanis, Kostas Daniilidis
HomebrewedDB: RGB-D Dataset for 6D Pose Estimation of 3D Objects	2019	Roman Kaskman, Sergey Zakharov, Ivan Shugurov, Slobodan Ilić
OSOP: A Multi-Stage One Shot Object Pose Estimation Framework	2022	Ivan Shugurov, Fu Li, Benjamin Busam, Slobodan Ilić

Works That Cite This (334)

Title	Year	Authors
Deep learning-based spacecraft relative navigation methods: A survey	2021	Jianing Song, Duarte Rondao, Nabil Aouf
Templates for 3D Object Pose Estimation Revisited: Generalization to New Objects and Robustness to Occlusions	2022	Van Nguyen Nguyen, Yinlin Hu, Yang Xiao, Mathieu Salzmann, Vincent Lepetit
Bridging the Reality Gap for Pose Estimation Networks using Sensor-Based Domain Randomization	2021	Frederik Hagelskjar, Anders Glent Buch
YOLOPose V2: Understanding and improving transformer-based 6D pose estimation	2023	Arul Selvam Periyasamy, Arash Amini, Vladimir Tsaturyan, Sven Behnke
Shape-Constraint Recurrent Flow for 6D Object Pose Estimation	2023	Yang Hai, Rui Song, Jiaojiao Li, Yinlin Hu
Rapid Pose Label Generation through Sparse Representation of Unknown Objects	2020	Rohan Pratap Singh, Mehdi Benallegue, Yusuke Yoshiyasu, Fumio Kanehiro
Reconstruct, Rasterize and Backprop: Dense shape and pose estimation from a single image	2020	Aniket Pokale, Aditya Aggarwal, Krishna Murthy Jatavallabhula, K. Madhava Krishna
Vision-based robotic grasping from object localization, object pose estimation to grasp estimation for parallel grippers: a review	2020	Guoguang Du, Kai Wang, Shiguo Lian, Kaiyong Zhao
On Pre-trained Image Features and Synthetic Images for Deep Learning	2019	Stefan Hinterstoißer, Vincent Lepetit, Paul Wohlhart, Kurt Konolige
DTF-Net: Category-Level Pose Estimation and Shape Reconstruction via Deformable Template Field	2023	Haowen Wang, Zhipeng Fan, Zhen Zhao, Zhengping Che, Zhiyuan Xu, Dong Liu, Feifei Feng, Yakun Huang, Xiuquan Qiao, Jian Tang

Works Cited by This (12)

Title	Year	Authors
Very Deep Convolutional Networks for Large-Scale Image Recognition	2014	Karen Simonyan, Andrew Zisserman
Learning Analysis-by-Synthesis for 6D Pose Estimation in RGB-D Images	2015	Alexander Krull, Eric Brachmann, Frank Michel, Michael Ying Yang, Stefan Gumhold, Carsten Rother
Fully convolutional networks for semantic segmentation	2015	Jonathan Long, Evan Shelhamer, Trevor Darrell
ImageNet Large Scale Visual Recognition Challenge	2015	Olga Russakovsky, Jia Deng, Hao Su, Jonathan Krause, Sanjeev Satheesh, Sean Ma, Zhiheng Huang, Andrej Karpathy, Aditya Khosla, Michael S. Bernstein
PoseNet: A Convolutional Network for Real-Time 6-DOF Camera Relocalization	2015	Alex Kendall, Matthew Koichi Grimes, Roberto Cipolla
Training a Feedback Loop for Hand Pose Estimation	2015	Markus Oberweger, Paul Wohlhart, Vincent Lepetit
6D Object Detection and Next-Best-View Prediction in the Crowd	2015	Andreas Doumanoglou, Rigas Kouskouridas, Sotiris Malassiotis, Tae‐Kyun Kim
T-LESS: An RGB-D Dataset for 6D Pose Estimation of Texture-Less Objects	2017	Tomáš Hodaň, Pavel Haluza, Štěpán Obdržálek, Jiří Matas, Manolis Lourakis, Xenophon Zabulis
Hashmod: A Hashing Method for Scalable 3D Object Detection	2015	Wadim Kehl, Federico Tombari, Nassir Navab, Slobodan Ilić, Vincent Lepetit