Probing the Mid-level Vision Capabilities of Self-Supervised Learning
Probing the Mid-level Vision Capabilities of Self-Supervised Learning
Mid-level vision capabilities - such as generic object localization and 3D geometric understanding - are not only fundamental to human vision but are also crucial for many real-world applications of computer vision. These abilities emerge with minimal supervision during the early stages of human visual development. Despite their significance, current …