VLABench: A Large-Scale Benchmark for Language-Conditioned Robotics
Manipulation with Long-Horizon Reasoning Tasks
General-purpose embodied agents are designed to understand users' natural-language instructions or intentions and to act precisely to complete universal tasks. Recently, methods based on foundation models, especially Vision-Language-Action models (VLAs), have shown substantial potential for solving language-conditioned manipulation (LCM) tasks. However, existing benchmarks do not adequately meet the …