Ask a Question

Prefer a chat interface with context about you and your work?

Eliminating Cross-modal Conflicts in BEV Space for LiDAR-Camera 3D Object Detection

Eliminating Cross-modal Conflicts in BEV Space for LiDAR-Camera 3D Object Detection

Recent 3D object detectors typically utilize multi-sensor data and unify multi-modal features in the shared bird's-eye view (BEV) representation space. However, our empirical findings indicate that previous methods have limitations in generating fusion BEV features free from cross-modal conflicts. These conflicts encompass extrinsic conflicts caused by BEV feature construction and …