Beyond Prioritized Replay: Sampling States in Model-Based Reinforcement Learning via Simulated Priorities

Type: Preprint

Publication Date: 2020-07-19

Citations: 0

Locations

  • arXiv (Cornell University) - View

Similar Works

Action Title Year Authors
+ Understanding and Mitigating the Limitations of Prioritized Experience Replay 2020 Yangchen Pan
Jincheng Mei
Amir‐massoud Farahmand
Martha White
Hengshuai Yao
Mohsen Rohani
Luo Jun
+ Beyond Prioritized Replay: Sampling States in Model-Based RL via Simulated Priorities 2021 Jincheng Mei
Yangchen Pan
Martha White
Amir‐massoud Farahmand
Hengshuai Yao
+ Actor Prioritized Experience Replay 2022 Baturay Sağlam
Furkan B. Mutlu
Dogan C. Cicek
Süleyman S. Kozat
+ PDF Chat CUER: Corrected Uniform Experience Replay for Off-Policy Continuous Deep Reinforcement Learning Algorithms 2024 Arda Sarp Yenicesu
Furkan B. Mutlu
Süleyman S. Kozat
Ozgur S. Oguz
+ On the model-based stochastic value gradient for continuous reinforcement learning 2020 Brandon Amos
Samuel Stanton
Denis Yarats
Andrew Gordon Wilson
+ PDF Chat Actor Prioritized Experience Replay 2023 Baturay Sağlam
Furkan B. Mutlu
Dogan C. Cicek
Süleyman S. Kozat
+ PTR-PPO: Proximal Policy Optimization with Prioritized Trajectory Replay 2021 Xingxing Liang
Yang Ma
Yanghe Feng
Zhong Liu
+ Time-Efficient Reinforcement Learning with Stochastic Stateful Policies 2023 Firas Al-Hafez
Guoping Zhao
Jan Peters
Davide Tateo
+ PDF Chat Off-Policy Correction for Deep Deterministic Policy Gradient Algorithms via Batch Prioritized Experience Replay 2021 Dogan C. Cicek
Enes Duran
Baturay Sağlam
Furkan B. Mutlu
Süleyman S. Kozat
+ PDF Chat Off-Policy Correction for Deep Deterministic Policy Gradient Algorithms via Batch Prioritized Experience Replay 2021 Dogan C. Cicek
Enes Duran
Baturay Sağlam
Furkan B. Mutlu
Süleyman S. Kozat
+ Off-Policy Correction for Deep Deterministic Policy Gradient Algorithms via Batch Prioritized Experience Replay 2021 Dogan C. Cicek
Enes Duran
Baturay Sağlam
Furkan B. Mutlu
Süleyman S. Kozat
+ Variance Reduction based Experience Replay for Policy Optimization 2021 Hua Zheng
Wei Xie
M. Ben Feng
+ PDF Chat Frugal Actor-Critic: Sample Efficient Off-Policy Deep Reinforcement Learning Using Unique Experiences 2024 Nikhil Kumar Singh
Indranil Saha
+ Expected Policy Gradients for Reinforcement Learning 2018 Kamil Ciosek
Shimon Whiteson
+ PDF Chat Reusing Historical Trajectories in Natural Policy Gradient via Importance Sampling: Convergence and Convergence Rate 2024 Yifan Lin
Yuhao Wang
Enlu Zhou
+ Prioritized Experience-based Reinforcement Learning with Human Guidance: Methdology and Application to Autonomous Driving. 2021 Jingda Wu
Zhiyu Huang
Wenhui Huang
Chen Lv
+ PDF Chat Learning Diverse Policies with Soft Self-Generated Guidance 2023 Guojian Wang
Faguo Wu
Xiao Zhang
Jianxiang Liu
+ Improved Exploring Starts by Kernel Density Estimation-Based State-Space Coverage Acceleration in Reinforcement Learning. 2021 Maximilian Schenke
Oliver Wallscheid
+ Variance Reduction based Experience Replay for Policy Optimization 2022 Hua Zheng
Wei Xie
M. Ben Feng
+ Maximum Entropy Model Rollouts: Fast Model Based Policy Optimization without Compounding Errors 2020 Chi Zhang
Sanmukh R. Kuppannagari
Viktor K. Prasanna

Works That Cite This (0)

Action Title Year Authors