The Ladder in Chaos: A Simple and Effective Improvement to General DRL Algorithms by Policy Path Trimming and Boosting

Type: Preprint

Publication Date: 2023-01-01

Citations: 0

DOI: https://doi.org/10.48550/arxiv.2303.01391

Locations

  • arXiv (Cornell University)
  • DataCite API

Similar Works

Action Title Year Authors
+ Reinforcement Learning Gradients as Vitamin for Online Finetuning Decision Transformers 2024 Kai Yan
Alexander G. Schwing
Yu-Xiong Wang
+ Cautious Policy Programming: Exploiting KL Regularization in Monotonic Policy Improvement for Reinforcement Learning 2021 Lingwei Zhu
Toshinori Kitamura
Takamitsu Matsubara
+ ACERAC: Efficient Reinforcement Learning in Fine Time Discretization 2022 Jakub Łyskawa
Paweł Wawrzyński
+ NADPEx: An on-policy temporally consistent exploration method for deep reinforcement learning 2018 Sirui Xie
Junning Huang
Lanxin Lei
Chunxiao Liu
Zheng Ma
Wei Zhang
Liang Lin
+ Deep Conservative Policy Iteration 2020 Nino Vieillard
Olivier Pietquin
Matthieu Geist
+ DDPG++: Striving for Simplicity in Continuous-control Off-Policy Reinforcement Learning 2020 Rasool Fakoor
Pratik Chaudhari
Alexander J. Smola
+ The Definitive Guide to Policy Gradients in Deep Reinforcement Learning: Theory, Algorithms and Implementations 2024 Matthias Lehmann
+ Policy Mirror Descent for Regularized Reinforcement Learning: A Generalized Framework with Linear Convergence 2023 Wenhao Zhan
Shicong Cen
Baihe Huang
Yuxin Chen
Jason D. Lee
Yuejie Chi
+ Never Worse, Mostly Better: Stable Policy Improvement in Deep Reinforcement Learning 2019 Pranav Khanna
Guy Tennenholtz
Nadav Merlis
Shie Mannor
Chen Tessler
+ P3O: Policy-on Policy-off Policy Optimization 2019 Rasool Fakoor
Pratik Chaudhari
Alexander J. Smola
+ Identifying Policy Gradient Subspaces 2024 Jochen Schneider
Pierre Schumacher
Simon Guist
Le Chen
Daniel Häufle
Bernhard Schölkopf
Dieter Büchler
+ Actor-Critic Reinforcement Learning with Phased Actor 2024 Ruofan Wu
Junmin Zhong
Jennie Si
+ Comprehensive Survey of Reinforcement Learning: From Algorithms to Practical Challenges 2024 Majid Ghasemi
Amir Mousavi
Dariush Ebrahimi
+ CDSA: Conservative Denoising Score-based Algorithm for Offline Reinforcement Learning 2024 Zeyuan Liu
Kai Yang
Xiu Li
+ Regularization Matters in Policy Optimization 2019 Zhuang Liu
Xuanlin Li
Bingyi Kang
Trevor Darrell
+ The Phenomenon of Policy Churn 2022 Tom Schaul
André Sales Barreto
John Quan
Georg Ostrovski
+ Deep Conservative Policy Iteration 2019 Nino Vieillard
Olivier Pietquin
Matthieu Geist

Works That Cite This (0)

Works Cited by This (0)
