Prefer a chat interface with context about you and your work?
Guided Online Distillation: Promoting Safe Reinforcement Learning by Offline Demonstration