Variational Dynamic for Self-Supervised Exploration in Deep Reinforcement Learning
Efficient exploration remains a challenging problem in reinforcement learning, especially for tasks where extrinsic rewards from the environment are sparse or even disregarded entirely. Methods based on intrinsic motivation show promising results in simple environments but often get stuck in environments with multimodal and stochastic dynamics. In this work, we …