Learning Navigation Behaviors End-to-End With AutoRL
Learning Navigation Behaviors End-to-End With AutoRL
We learn end-to-end point-to-point and pathfollowing navigation behaviors that avoid moving obstacles. These policies receive noisy lidar observations and output robot linear and angular velocities. The policies are trained in small, static environments with AutoRL, an evolutionary automation layer around reinforcement learning (RL) that searches for a deep RL reward …