He Ba

Follow

Generating author description...

Common Coauthors
Coauthor Papers Together
Jianye Hao 2
Xian Guo 2
Jiajun Fan 2
Commonly Cited References
Action Title Year Authors # of times referenced
+ Proximal Policy Optimization Algorithms 2017 John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
2
+ Continuous control with deep reinforcement learning 2016 Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
Nicolas Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
2
+ Asynchronous Methods for Deep Reinforcement Learning 2016 Volodymyr Mnih
Adrià Puigdomènech Badia
Mehdi Mirza
Alex Graves
Tim Harley
Timothy Lillicrap
David Silver
Koray Kavukcuoglu
2
+ Q-Prop: Sample-Efficient Policy Gradient with An Off-Policy Critic 2016 Shixiang Gu
Timothy Lillicrap
Zoubin Ghahramani
Richard E. Turner
Sergey Levine
2
+ Emergence of Locomotion Behaviours in Rich Environments 2017 Nicolas Heess
Dhruva Tb
Sriram Srinivasan
Jay Lemmon
Josh Merel
Greg Wayne
Yuval Tassa
Tom Erez
Ziyu Wang
S. M. Ali Eslami
2
+ IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures 2018 Lasse Espeholt
Hubert Soyer
Rémi Munos
Karen Simonyan
Volodymir Mnih
Tom Ward
Yotam Doron
Vlad Firoiu
Tim Harley
Iain Dunning
1
+ Trust Region Policy Optimization 2015 John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
1
+ Continuous Deep Q-Learning with Model-based Acceleration 2016 Shixiang Gu
Timothy Lillicrap
Ilya Sutskever
Sergey Levine
1
+ Control of Memory, Active Perception, and Action in Minecraft 2016 Junhyuk Oh
Valliappa Chockalingam
Satinder Singh
Honglak Lee
1
+ PDF Chat Neural Network Dynamics for Model-Based Deep Reinforcement Learning with Model-Free Fine-Tuning 2018 Anusha Nagabandi
Gregory Kahn
Ronald S. Fearing
Sergey Levine
1
+ When to Trust Your Model: Model-Based Policy Optimization 2019 Michael Jänner
Justin Fu
Marvin Zhang
Sergey Levine
1
+ High-Dimensional Continuous Control Using Generalized Advantage Estimation 2015 John Schulman
Philipp Moritz
Sergey Levine
Michael I. Jordan
Pieter Abbeel
1
+ IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures 2018 Lasse Espeholt
Hubert Soyer
Rémi Munos
Karen Simonyan
Volodymir Mnih
Tom Ward
Yotam Doron
Vlad Firoiu
Tim Harley
Iain Dunning
1
+ Continuous control with deep reinforcement learning 2015 Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
Nicolas Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
1
+ PDF Chat Path integral guided policy search 2017 Yevgen Chebotar
Mrinal Kalakrishnan
Ali Abdullah Yahya
Adrian Li
Stefan Schaal
Sergey Levine
1
+ Q-PrOP: Sample-efficient policy gradient with an off-policy critic 2017 Shixiang Gu
Timothy Lillicrap
Zoubin Ghahramani
Richard E. Turner
Sergey Levine
1
+ Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor 2018 Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
1