Ask a Question

Prefer a chat interface with context about you and your work?

Affordances from Human Videos as a Versatile Representation for Robotics

Affordances from Human Videos as a Versatile Representation for Robotics

Building a robot that can understand and learn to interact by watching humans has inspired several vision problems. However, despite some successful results on static datasets, it remains unclear how current models can be used on a robot directly. In this paper, we aim to bridge this gap by leveraging …