Beyond Prioritized Replay: Sampling States in Model-Based Reinforcement Learning via Simulated Priorities
Jincheng Mei, Yangchen Pan, Amir‐massoud Farahmand, Hengshuai Yao, Martha White
Type: Preprint
Publication Date: 2020-07-19
Citations: 0
Locations: arXiv (Cornell University)
Similar Works
Title | Year | Authors
Understanding and Mitigating the Limitations of Prioritized Experience Replay | 2020 | Yangchen Pan, Jincheng Mei, Amir‐massoud Farahmand, Martha White, Hengshuai Yao, Mohsen Rohani, Luo Jun
Beyond Prioritized Replay: Sampling States in Model-Based RL via Simulated Priorities | 2021 | Jincheng Mei, Yangchen Pan, Martha White, Amir‐massoud Farahmand, Hengshuai Yao
Actor Prioritized Experience Replay | 2022 | Baturay Sağlam, Furkan B. Mutlu, Dogan C. Cicek, Süleyman S. Kozat
CUER: Corrected Uniform Experience Replay for Off-Policy Continuous Deep Reinforcement Learning Algorithms | 2024 | Arda Sarp Yenicesu, Furkan B. Mutlu, Süleyman S. Kozat, Ozgur S. Oguz
On the model-based stochastic value gradient for continuous reinforcement learning | 2020 | Brandon Amos, Samuel Stanton, Denis Yarats, Andrew Gordon Wilson
Actor Prioritized Experience Replay | 2023 | Baturay Sağlam, Furkan B. Mutlu, Dogan C. Cicek, Süleyman S. Kozat
PTR-PPO: Proximal Policy Optimization with Prioritized Trajectory Replay | 2021 | Xingxing Liang, Yang Ma, Yanghe Feng, Zhong Liu
Time-Efficient Reinforcement Learning with Stochastic Stateful Policies | 2023 | Firas Al-Hafez, Guoping Zhao, Jan Peters, Davide Tateo
Off-Policy Correction for Deep Deterministic Policy Gradient Algorithms via Batch Prioritized Experience Replay | 2021 | Dogan C. Cicek, Enes Duran, Baturay Sağlam, Furkan B. Mutlu, Süleyman S. Kozat
Variance Reduction based Experience Replay for Policy Optimization | 2021 | Hua Zheng, Wei Xie, M. Ben Feng
Frugal Actor-Critic: Sample Efficient Off-Policy Deep Reinforcement Learning Using Unique Experiences | 2024 | Nikhil Kumar Singh, Indranil Saha
Expected Policy Gradients for Reinforcement Learning | 2018 | Kamil Ciosek, Shimon Whiteson
Reusing Historical Trajectories in Natural Policy Gradient via Importance Sampling: Convergence and Convergence Rate | 2024 | Yifan Lin, Yuhao Wang, Enlu Zhou
Prioritized Experience-based Reinforcement Learning with Human Guidance: Methodology and Application to Autonomous Driving | 2021 | Jingda Wu, Zhiyu Huang, Wenhui Huang, Chen Lv
Learning Diverse Policies with Soft Self-Generated Guidance | 2023 | Guojian Wang, Faguo Wu, Xiao Zhang, Jianxiang Liu
Improved Exploring Starts by Kernel Density Estimation-Based State-Space Coverage Acceleration in Reinforcement Learning | 2021 | Maximilian Schenke, Oliver Wallscheid
Variance Reduction based Experience Replay for Policy Optimization | 2022 | Hua Zheng, Wei Xie, M. Ben Feng
Maximum Entropy Model Rollouts: Fast Model Based Policy Optimization without Compounding Errors | 2020 | Chi Zhang, Sanmukh R. Kuppannagari, Viktor K. Prasanna
Works That Cite This (0)
Works Cited by This (28)
Title | Year | Authors
Exponential Convergence of Langevin Distributions and Their Discrete Approximations | 1996 | Gareth O. Roberts, Richard L. Tweedie
Diffusion for Global Optimization in $\mathbb{R}^n$ | 1987 | Tzuu-Shuh Chiang, Chii-Ruey Hwang, Shuenn Jyi Sheu
Robust Estimation of a Location Parameter | 1964 | Peter J. Huber
Bayesian Learning via Stochastic Gradient Langevin Dynamics | 2011 | Max Welling, Yee Whye Teh
Hindsight Experience Replay | 2017 | Marcin Andrychowicz, Filip Wolski, Alex Ray, Jonas Schneider, Rachel Fong, Peter Welinder, Bob McGrew, Josh Tobin, Pieter Abbeel, Wojciech Zaremba
A Deeper Look at Experience Replay | 2017 | Shangtong Zhang, Richard S. Sutton
Distributed Prioritized Experience Replay | 2018 | Dan Horgan, John Quan, David Budden, Gabriel Barth-Maron, Matteo Hessel, Hado van Hasselt, David Silver
The Effect of Planning Shape on Dyna-style Planning in High-dimensional State Spaces | 2018 | Gerhard Holland, Erik Talvitie, Michael Bowling
Recurrent World Models Facilitate Policy Evolution | 2018 | David Ha, Jürgen Schmidhuber
Approximate Robust Control of Uncertain Dynamical Systems | 2019 | Edouard Leurent, Y. Blanco, Denis Efimov, Odalric-Ambrym Maillard