Enhancing 3D Object Detection with 2D Detection-Guided Query Anchors

Type: Preprint

Publication Date: 2024-03-09

Citations: 0

DOI: https://doi.org/10.48550/arxiv.2403.06093

Abstract

Multi-camera 3D object detection has made notable progress in the past several years. However, we observe that there are cases (e.g., faraway regions) in which popular 2D object detectors are more reliable than state-of-the-art 3D detectors. In this paper, to improve the performance of query-based 3D object detectors, we present a novel query-generating approach termed QAF2D, which infers 3D query anchors from 2D detection results. A 2D bounding box of an object in an image is lifted to a set of 3D anchors by associating each point sampled within the box with depth, yaw angle, and size candidates. The validity of each 3D anchor is then verified by comparing its projection in the image with the corresponding 2D box, and only valid anchors are kept and used to construct queries. The class information of the 2D bounding box associated with each query is also used to match the predicted boxes with the ground truth for the set-based loss. The image feature extraction backbone is shared between the 3D detector and the 2D detector by adding a small number of prompt parameters. We integrate QAF2D into three popular query-based 3D object detectors and carry out comprehensive evaluations on the nuScenes dataset. The largest improvement that QAF2D brings on the nuScenes validation subset is $2.3\%$ NDS and $2.7\%$ mAP. Code is available at https://github.com/nullmax-vision/QAF2D.
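
The lifting and verification steps described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. The function names, the uniform grid sampling, the camera-frame convention (x right, y down, z forward), and the use of an IoU threshold as the validity test are all assumptions made for illustration; the abstract only states that the projection of each anchor is compared with its 2D box.

```python
import itertools
import numpy as np

def box_corners(center, size, yaw):
    """Return the 8 corners of a 3D box in an assumed camera frame (x right, y down, z forward)."""
    l, h, w = size
    x = np.array([1, 1, -1, -1, 1, 1, -1, -1]) * l / 2
    y = np.array([1, 1, 1, 1, -1, -1, -1, -1]) * h / 2
    z = np.array([1, -1, -1, 1, 1, -1, -1, 1]) * w / 2
    c, s = np.cos(yaw), np.sin(yaw)
    rot = np.array([[c, 0, s], [0, 1, 0], [-s, 0, c]])  # rotation about the y axis
    return (rot @ np.stack([x, y, z])).T + np.asarray(center)

def project_to_box(corners, K):
    """Project 3D corners with intrinsics K and return the enclosing 2D box [x1, y1, x2, y2]."""
    uvw = (K @ corners.T).T
    uv = uvw[:, :2] / uvw[:, 2:3]
    return np.array([uv[:, 0].min(), uv[:, 1].min(), uv[:, 0].max(), uv[:, 1].max()])

def iou_2d(a, b):
    """Intersection over union of two axis-aligned boxes [x1, y1, x2, y2]."""
    iw = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    ih = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = iw * ih
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0

def lift_box_to_anchors(box2d, K, depths, yaws, sizes, grid=3, iou_thr=0.5):
    """Lift one 2D box to candidate 3D anchors and keep those whose projection matches the box.

    Hypothetical helper: grid sampling and the IoU test are illustrative assumptions.
    Returns an (N, 7) array of rows (x, y, z, l, h, w, yaw).
    """
    x1, y1, x2, y2 = box2d
    us = np.linspace(x1, x2, grid + 2)[1:-1]  # interior sample points only
    vs = np.linspace(y1, y2, grid + 2)[1:-1]
    K_inv = np.linalg.inv(K)
    anchors = []
    for u, v, d, yaw, size in itertools.product(us, vs, depths, yaws, sizes):
        center = d * (K_inv @ np.array([u, v, 1.0]))  # back-project the pixel at depth d
        corners = box_corners(center, size, yaw)
        if corners[:, 2].min() <= 0:  # skip anchors that cross the image plane
            continue
        if iou_2d(project_to_box(corners, K), box2d) >= iou_thr:
            anchors.append((*center, *size, yaw))
    return np.array(anchors).reshape(-1, 7)
```

In this sketch, candidates with an implausible depth are rejected because their projections are much larger or smaller than the 2D box, which is the intuition behind keeping only valid anchors for query construction.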

Locations

  • arXiv (Cornell University)

Similar Works

Action Title Year Authors
+ Object as Query: Lifting any 2D Object Detector to 3D Detection 2023 Zitian Wang
Zehao Huang
Jiahui Fu
Naiyan Wang
Si Liu
+ MonoDGP: Monocular 3D Object Detection with Decoupled-Query and Geometry-Error Priors 2024 Fangling Pu
Yifan Wang
Jianan Deng
Wenming Yang
+ Priors are Powerful: Improving a Transformer for Multi-camera 3D Detection with 2D Priors 2023 Di Feng
Francesco Ferroni
+ DETR3D: 3D Object Detection from Multi-view Images via 3D-to-2D Queries 2021 Yue Wang
Vitor Campagnolo Guizilini
Tianyuan Zhang
Yilun Wang
Hang Zhao
Justin Solomon
+ DETR3D: 3D Object Detection from Multi-view Images via 3D-to-2D Queries 2021 Yue Wang
Vitor Guizilini
Tianyuan Zhang
Yilun Wang
Hang Zhao
Justin Solomon
+ Towards Fair and Comprehensive Comparisons for Image-Based 3D Object Detection 2023 Xinzhu Ma
Yongtao Wang
Yinmin Zhang
Zhiyi Xia
Yuan Meng
Zhihui Wang
Haojie Li
Wanli Ouyang
+ Shape-Aware Monocular 3D Object Detection 2022 Wei Chen
Jie Zhao
Wan‐Lei Zhao
Song-Yuan Wu
+ Far3D: Expanding the Horizon for Surround-view 3D Object Detection 2023 Xiaohui Jiang
Shuailin Li
Yingfei Liu
Shihao Wang
Fan Jia
Tiancai Wang
Lijin Han
Xiangyu Zhang
+ ConQueR: Query Contrast Voxel-DETR for 3D Object Detection 2022 Benjin Zhu
Zhe Wang
Shaoshuai Shi
Hang Xu
Lanqing Hong
Hongsheng Li
+ MDHA: Multi-Scale Deformable Transformer with Hybrid Anchors for Multi-View 3D Object Detection 2024 Michelle Adeline
Junn Yong Loo
Vishnu Monn Baskaran
+ MonoDETRNext: Next-generation Accurate and Efficient Monocular 3D Object Detection Method 2024 Pan Liao
Feng Yang
Di Wu
Bo Liu
+ Disentangling Monocular 3D Object Detection 2019 A. Simonelli
Samuel Rota Bulò
Lorenzo Porzi
Manuel López-Antequera
Peter Kontschieder
+ Disentangling Monocular 3D Object Detection 2019 Andrea Simonelli
Samuel Rota Bulò
Lorenzo Porzi
Manuel López-Antequera
Peter Kontschieder
+ A Simple Baseline for Multi-Camera 3D Object Detection 2022 Yunpeng Zhang
Wenzhao Zheng
Zheng Zhu
Guan Huang
Jie Zhou
Jiwen Lu
+ Exploring Geometric Consistency for Monocular 3D Object Detection 2021 Qing Lian
Botao Ye
Ruijia Xu
Weilong Yao
Tong Zhang
+ ConQueR: Query Contrast Voxel-DETR for 3D Object Detection 2023 Benjin Zhu
Zhe Wang
Shaoshuai Shi
Hang Xu
Lanqing Hong
Hongsheng Li
+ A Simple Baseline for Multi-Camera 3D Object Detection 2023 Yunpeng Zhang
Wenzhao Zheng
Zheng Zhu
Guan Huang
Jiwen Lu
Jie Zhou
+ Exploring Geometric Consistency for Monocular 3D Object Detection 2022 Qing Lian
Botao Ye
Ruijia Xu
Weilong Yao
Tong Zhang
