Is Pseudo-Lidar needed for Monocular 3D Object detection?

Type: Article

Publication Date: 2021-10-01

Citations: 209

DOI: https://doi.org/10.1109/iccv48922.2021.00313

Abstract

Recent progress in 3D object detection from single images leverages monocular depth estimation as a way to produce 3D pointclouds, turning cameras into pseudo-lidar sensors. These two-stage detectors improve with the accuracy of the intermediate depth estimation network, which can itself be improved without manual labels via large-scale self-supervised learning. However, they tend to suffer from overfitting more than end-to-end methods, are more complex, and the gap with similar lidar-based detectors remains significant. In this work, we propose an end-to-end, single stage, monocular 3D object detector, DD3D, that can benefit from depth pre-training like pseudo-lidar methods, but without their limitations. Our architecture is designed for effective information transfer between depth estimation and 3D detection, allowing us to scale with the amount of unlabeled pre-training data. Our method achieves state-of-the-art results on two challenging benchmarks, with 16.34% and 9.28% AP for Cars and Pedestrians (respectively) on the KITTI-3D benchmark, and 41.5% mAP on NuScenes.
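The pseudo-lidar step mentioned in the abstract, turning a predicted depth map into a 3D point cloud, is commonly done by back-projecting each pixel through a pinhole camera model. The following is a minimal Python sketch of that idea only, not of DD3D or any code from the paper; the function name `backproject_depth`, the intrinsics `fx`, `fy`, `cx`, `cy`, and the toy depth values are illustrative assumptions.

```python
from typing import List, Tuple

Point = Tuple[float, float, float]


def backproject_depth(
    depth: List[List[float]],
    fx: float,
    fy: float,
    cx: float,
    cy: float,
) -> List[Point]:
    """Lift a dense depth map (row-major, metres) to 3D points in the camera frame.

    Assumes a pinhole camera: X = (u - cx) * Z / fx, Y = (v - cy) * Z / fy.
    Pixels with non-positive depth are treated as invalid and skipped.
    """
    points: List[Point] = []
    for v, row in enumerate(depth):        # v: pixel row index
        for u, z in enumerate(row):        # u: pixel column index, z: depth
            if z <= 0.0:
                continue
            x = (u - cx) * z / fx
            y = (v - cy) * z / fy
            points.append((x, y, z))
    return points


# Toy 2x2 depth map with hypothetical intrinsics (not taken from any dataset).
toy_depth = [[10.0, 10.0], [0.0, 20.0]]
cloud = backproject_depth(toy_depth, fx=2.0, fy=2.0, cx=0.5, cy=0.5)
```

In a two-stage pseudo-lidar pipeline, a point cloud such as `cloud` would then be passed to a lidar-style 3D detector. DD3D, as described in the abstract, is instead single-stage and end-to-end.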

Locations

  • arXiv (Cornell University)
  • 2021 IEEE/CVF International Conference on Computer Vision (ICCV)

Similar Works

Action Title Year Authors
+ Is Pseudo-Lidar needed for Monocular 3D Object detection? 2021 Dennis Park
Rareș Ambruș
Vitor Guizilini
Jie Li
Adrien Gaidon
+ Depth Is All You Need for Monocular 3D Detection 2023 Dennis Park
Jie Li
Dian Chen
Vitor Guizilini
Adrien Gaidon
+ Monocular 3D Object Detection with Pseudo-LiDAR Point Cloud 2019 Xinshuo Weng
Kris Kitani
+ Boosting Monocular 3D Object Detection With Object-Centric Auxiliary Depth Supervision 2022 Young-Seok Kim
Sanmin Kim
Sangmin Sim
Jun Won Choi
Dongsuk Kum
+ Depth Is All You Need for Monocular 3D Detection 2022 Dennis Park
Jie Li
Dian Chen
Vitor Guizilini
Adrien Gaidon
+ MonoDistill: Learning Spatial Features for Monocular 3D Object Detection 2022 Zhiyu Chong
Xinzhu Ma
Hong Zhang
Yu-Xin Yue
Haojie Li
Zhihui Wang
Wanli Ouyang
+ OBMO: One Bounding Box Multiple Objects for Monocular 3D Object Detection 2023 C.C. Huang
Tong He
Haidong Ren
Wenxiao Wang
Binbin Lin
Deng Cai
+ Self-supervised 3D Object Detection from Monocular Pseudo-LiDAR 2022 Curie Kim
Ue-Hwan Kim
Jong-Hwan Kim
+ OBMO: One Bounding Box Multiple Objects for Monocular 3D Object Detection 2022 C.C. Huang
Tong He
Haidong Ren
Wenxiao Wang
Binbin Lin
Deng Cai
+ End-to-End Pseudo-LiDAR for Image-Based 3D Object Detection 2020 Rui Qian
Divyansh Garg
Yan Wang
Yurong You
Serge Belongie
Bharath Hariharan
Mark Campbell
Kilian Q. Weinberger
Wei-Lun Chao
+ Learning Depth-Guided Convolutions for Monocular 3D Object Detection 2019 Mingyu Ding
Yuqi Huo
Hongwei Yi
Zhe Wang
Jianping Shi
Zhiwu Lu
Ping Luo
+ OPA-3D: Occlusion-Aware Pixel-Wise Aggregation for Monocular 3D Object Detection 2022 Yongzhi Su
Di Yan
Fabian Manhardt
Guangyao Zhai
Jason Rambach
Benjamin Busam
Didier Stricker
Federico Tombari
+ OPA-3D: Occlusion-Aware Pixel-Wise Aggregation for Monocular 3D Object Detection 2023 Yongzhi Su
Yan Di
Guangyao Zhai
Fabian Manhardt
Jason Rambach
Benjamin Busam
Didier Stricker
Federico Tombari
+ VFMM3D: Releasing the Potential of Image by Vision Foundation Model for Monocular 3D Object Detection 2024 Bonan Ding
Jin Xie
Jing Nie
Jiale Cao
+ Pseudo-LiDAR++: Accurate Depth for 3D Object Detection in Autonomous Driving 2019 Yurong You
Yan Wang
Wei-Lun Chao
Divyansh Garg
Geoff Pleiss
Bharath Hariharan
Mark Campbell
Kilian Q. Weinberger

Works That Cite This (97)

Action Title Year Authors
+ SSD-MonoDETR: Supervised Scale-Aware Deformable Transformer for Monocular 3D Object Detection 2023 Xuan He
Fan Yang
Kailun Yang
Jiacheng Lin
Haolong Fu
Meng Wang
Jin Yuan
Zhiyong Li
+ BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation 2023 Zhijian Liu
Haotian Tang
Alexander Amini
Xinyu Yang
Huizi Mao
Daniela Rus
Song Han
+ Lidar Point Cloud Guided Monocular 3D Object Detection 2022 Peng Liang
Fei Liu
Zhengxu Yu
Senbo Yan
Dan Deng
Zheng Yang
Haifeng Liu
Deng Cai
+ Standing Between Past and Future: Spatio-Temporal Modeling for Multi-Camera 3D Multi-Object Tracking 2023 Ziqi Pang
Jie Li
Pavel Tokmakov
Dian Chen
Sergey Zagoruyko
Yu-Xiong Wang
+ UniDistill: A Universal Cross-Modality Knowledge Distillation Framework for 3D Object Detection in Bird's-Eye View 2023 Shengchao Zhou
Weizhou Liu
Chen Hu
Shuchang Zhou
Chao Ma
+ BEVFormer v2: Adapting Modern Image Backbones to Bird's-Eye-View Recognition via Perspective Supervision 2023 Chenyu Yang
Yuntao Chen
Hao Tian
Chenxin Tao
Xizhou Zhu
Zhaoxiang Zhang
Gao Huang
Hongyang Li
Yu Qiao
Lewei Lu
+ 3D Object Detection for Autonomous Driving: A Comprehensive Survey 2023 Jiageng Mao
Shaoshuai Shi
Xiaogang Wang
Hongsheng Li
+ PolarFormer: Multi-Camera 3D Object Detection with Polar Transformer 2023 Yanqin Jiang
Zhang Li
Zhenwei Miao
Xiatian Zhu
Jin Gao
Weiming Hu
Yu-Gang Jiang
+ Visual Attention-Based Self-Supervised Absolute Depth Estimation Using Geometric Priors in Autonomous Driving 2022 Jie Xiang
Yun Wang
Lifeng An
Haiyang Liu
Zijun Wang
Jian Liu
+ Object as Query: Lifting any 2D Object Detector to 3D Detection 2023 Zitian Wang
Zehao Huang
Jiahui Fu
Naiyan Wang
Si Liu