He Ba
All published works
Title
Year
Authors
Critic PI2: Master Continuous Planning via Policy Improvement with Path Integrals and Deep Actor-Critic Reinforcement Learning
2021
He Ba
Jiajun Fan
Xian Guo
Jianye Hao
Critic PI2: Master Continuous Planning via Policy Improvement with Path Integrals and Deep Actor-Critic Reinforcement Learning
2020
Jiajun Fan
He Ba
Xian Guo
Jianye Hao
Common Coauthors
Coauthor
Papers Together
Jianye Hao
2
Xian Guo
2
Jiajun Fan
2
Commonly Cited References
Title
Year
Authors
# of times referenced
Proximal Policy Optimization Algorithms
2017
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
2
Continuous control with deep reinforcement learning
2016
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
Nicolas Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
2
Asynchronous Methods for Deep Reinforcement Learning
2016
Volodymyr Mnih
Adrià Puigdomènech Badia
Mehdi Mirza
Alex Graves
Tim Harley
Timothy Lillicrap
David Silver
Koray Kavukcuoglu
2
Q-Prop: Sample-Efficient Policy Gradient with An Off-Policy Critic
2016
Shixiang Gu
Timothy Lillicrap
Zoubin Ghahramani
Richard E. Turner
Sergey Levine
2
Emergence of Locomotion Behaviours in Rich Environments
2017
Nicolas Heess
Dhruva TB
Sriram Srinivasan
Jay Lemmon
Josh Merel
Greg Wayne
Yuval Tassa
Tom Erez
Ziyu Wang
S. M. Ali Eslami
2
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures
2018
Lasse Espeholt
Hubert Soyer
Rémi Munos
Karen Simonyan
Volodymir Mnih
Tom Ward
Yotam Doron
Vlad Firoiu
Tim Harley
Iain Dunning
1
Trust Region Policy Optimization
2015
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
1
Continuous Deep Q-Learning with Model-based Acceleration
2016
Shixiang Gu
Timothy Lillicrap
Ilya Sutskever
Sergey Levine
1
Control of Memory, Active Perception, and Action in Minecraft
2016
Junhyuk Oh
Valliappa Chockalingam
Satinder Singh
Honglak Lee
1
Neural Network Dynamics for Model-Based Deep Reinforcement Learning with Model-Free Fine-Tuning
2018
Anusha Nagabandi
Gregory Kahn
Ronald S. Fearing
Sergey Levine
1
When to Trust Your Model: Model-Based Policy Optimization
2019
Michael Janner
Justin Fu
Marvin Zhang
Sergey Levine
1
High-Dimensional Continuous Control Using Generalized Advantage Estimation
2015
John Schulman
Philipp Moritz
Sergey Levine
Michael I. Jordan
Pieter Abbeel
1
Continuous control with deep reinforcement learning
2015
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
Nicolas Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
1
Path integral guided policy search
2017
Yevgen Chebotar
Mrinal Kalakrishnan
Ali Abdullah Yahya
Adrian Li
Stefan Schaal
Sergey Levine
1
Q-Prop: Sample-efficient policy gradient with an off-policy critic
2017
Shixiang Gu
Timothy Lillicrap
Zoubin Ghahramani
Richard E. Turner
Sergey Levine
1
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
2018
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
1