There Is No Turning Back: A Self-Supervised Approach for Reversibility-Aware Reinforcement Learning
There Is No Turning Back: A Self-Supervised Approach for Reversibility-Aware Reinforcement Learning
We propose to learn to distinguish reversible from irreversible actions for better informed decision-making in Reinforcement Learning (RL). From theoretical considerations, we show that approximate reversibility can be learned through a simple surrogate task: ranking randomly sampled trajectory events in chronological order. Intuitively, pairs of events that are always observed …