Investigating the Interplay of Prioritized Replay and Generalization
Experience replay is ubiquitous in reinforcement learning, where it is used to reuse past data and improve sample efficiency. Although a variety of smart sampling schemes have been introduced to improve performance, uniform sampling remains by far the most common approach. One exception is Prioritized Experience Replay (PER), where sampling is done proportionally to …
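To make the contrast between uniform and proportional sampling concrete, the following is a minimal sketch using only the Python standard library. It assumes each stored transition carries a generic non-negative priority; how that priority is computed is left open here, and the function names `sample_uniform` and `sample_proportional` are hypothetical, chosen for illustration only.

```python
import random


def sample_uniform(buffer, batch_size, rng):
    """Draw a batch (with replacement) where every transition is equally likely."""
    if not buffer:
        raise ValueError("buffer is empty")
    return rng.choices(buffer, k=batch_size)


def sample_proportional(buffer, priorities, batch_size, rng):
    """Draw a batch (with replacement) where transition i is picked with
    probability priorities[i] / sum(priorities).

    The priorities are placeholder values (an assumption of this sketch).
    """
    if len(buffer) != len(priorities):
        raise ValueError("buffer and priorities must have the same length")
    if any(p < 0 for p in priorities) or sum(priorities) <= 0:
        raise ValueError("priorities must be non-negative with a positive sum")
    return rng.choices(buffer, weights=priorities, k=batch_size)


if __name__ == "__main__":
    rng = random.Random(0)
    transitions = ["t0", "t1", "t2", "t3"]
    placeholder_priorities = [0.1, 0.1, 0.1, 2.0]  # illustrative values only
    print(sample_uniform(transitions, 8, rng))
    print(sample_proportional(transitions, placeholder_priorities, 8, rng))
```

With the placeholder priorities above, `t3` is expected to dominate the proportional batch, whereas the uniform batch treats all four transitions alike. A practical replay buffer would typically use a more efficient structure than a flat list of weights; this sketch only illustrates the sampling rule.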