On the Robustness of Language Guidance for Low-Level Vision Tasks:
Findings from Depth Estimation
On the Robustness of Language Guidance for Low-Level Vision Tasks:
Findings from Depth Estimation
Recent advances in monocular depth estimation have been made by incorporating natural language as additional guidance. Although yielding impressive results, the impact of the language prior, particularly in terms of generalization and robustness, remains unexplored. In this paper, we address this gap by quantifying the impact of this prior and …