Ask AI a math question

Compared with the visual grounding on 2D images, the natural-language-guided 3D object localization on point clouds is more challenging. In this paper, we propose a new model, named InstanceRefer <sup xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">1</sup> , to achieve a superior 3D visual grounding through the grounding-by-matching strategy. In practice, our model first predicts …

Ask a Question