Off-Policy Shaping Ensembles in Reinforcement Learning

Type: Preprint

Publication Date: 2014-05-21

Citations: 4

Locations

  • arXiv (Cornell University) - View

Similar Works

Action Title Year Authors
+ Off-Policy Shaping Ensembles in Reinforcement Learning 2014 Anna Harutyunyan
Tim Brys
Peter Vrancx
Ann Nowé
+ Off-Policy Reward Shaping with Ensembles 2015 Anna Harutyunyan
Tim Brys
Peter Vrancx
Ann Nowé
+ Shaping Advice in Deep Reinforcement Learning 2022 Baicen Xiao
Bhaskar Ramasubramanian
Radha Poovendran
+ Shaping Advice in Deep Multi-Agent Reinforcement Learning 2021 Baicen Xiao
Bhaskar Ramasubramanian
Radha Poovendran
+ PDF Chat Off-Policy Reinforcement Learning with Loss Function Weighted by Temporal Difference Error 2023 Bumgeun Park
Taeyoung Kim
Woohyeon Moon
Sarvar Hussain Nengroo
Dongsoo Har
+ PDF Chat Mixed Q-Functionals: Advancing Value-Based Methods in Cooperative MARL with Continuous Action Domains 2024 Yasin Fındık
S. Reza Ahmadzadeh
+ SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning 2020 Kimin Lee
Michael Laskin
Aravind Srinivas
Pieter Abbeel
+ Composing Entropic Policies using Divergence Correction 2018 Jonathan J. Hunt
André Sales Barreto
Timothy Lillicrap
Nicolas Heess
+ Entropic Policy Composition with Generalized Policy Improvement and Divergence Correction. 2018 Jonathan J. Hunt
André Barreto
Timothy Lillicrap
Nicolas Heess
+ Will it Blend? Composing Value Functions in Reinforcement Learning 2018 Benjamin van Niekerk
Steven D. James
Adam Christopher Earle
Benjamin Rosman
+ Difference Rewards Policy Gradients 2020 Jacopo Castellini
Sam Devlin
Frans A. Oliehoek
Rahul Savani
+ Composing Entropic Policies using Divergence Correction. 2018 Jonathan J. Hunt
André Barreto
Timothy Lillicrap
Nicolas Heess
+ Ensemble Reinforcement Learning in Continuous Spaces -- A Hierarchical Multi-Step Approach for Policy Training 2023 Gang Chen
Victoria Huang
+ Ensemble Reinforcement Learning in Continuous Spaces -- A Hierarchical Multi-Step Approach for Policy Training 2022 Gang Chen
Victoria Huang
+ SEERL: Sample Efficient Ensemble Reinforcement Learning 2020 Rohan Saphal
Balaraman Ravindran
Dheevatsa Mudigere
Sasikanth Avancha
Bharat Kaul
+ Stabilizing Off-Policy Reinforcement Learning with Conservative Policy Gradients 2019 Chen Tessler
Nadav Merlis
Shie Mannor
+ Improving On-policy Learning with Statistical Reward Accumulation 2018 Yubin Deng
Ke Yu
Dahua Lin
Xiaoou Tang
Chen Change Loy
+ Combining policy gradient and Q-learning 2016 Brendan O’Donoghue
Rémi Munos
Koray Kavukcuoglu
Volodymyr Mnih
+ PDF Chat Difference rewards policy gradients 2022 Jacopo Castellini
Sam Devlin
Frans A. Oliehoek
Rahul Savani
+ Ensemble Value Functions for Efficient Exploration in Multi-Agent Reinforcement Learning 2023 Lukas Schäfer
Oliver Slumbers
Stephen McAleer
Yali Du
Stefano V. Albrecht
David Mguni

Works Cited by This (1)

Action Title Year Authors
+ Off-Policy Actor-Critic 2012 Thomas Degris
Martha White
Richard S. Sutton