He Ba

Follow

Generating author description...

All published works

Action	Title	Year	Authors
+ PDF Chat	Critic PI2: Master Continuous Planning via Policy Improvement with Path Integrals and Deep Actor-Critic Reinforcement Learning	2021	He Ba Jiajun Fan Xian Guo Jianye Hao
+	Critic PI2: Master Continuous Planning via Policy Improvement with Path Integrals and Deep Actor-Critic Reinforcement Learning	2020	Jiajun Fan He Ba Xian Guo Jianye Hao

Common Coauthors

Coauthor	Papers Together
Jianye Hao	2
Xian Guo	2
Jiajun Fan	2

Commonly Cited References

Action	Title	Year	Authors	# of times referenced
+	Proximal Policy Optimization Algorithms	2017	John Schulman Filip Wolski Prafulla Dhariwal Alec Radford Oleg Klimov	2
+	Continuous control with deep reinforcement learning	2016	Timothy Lillicrap Jonathan J. Hunt Alexander Pritzel Nicolas Heess Tom Erez Yuval Tassa David Silver Daan Wierstra	2
+	Asynchronous Methods for Deep Reinforcement Learning	2016	Volodymyr Mnih Adrià Puigdomènech Badia Mehdi Mirza Alex Graves Tim Harley Timothy Lillicrap David Silver Koray Kavukcuoglu	2
+	Q-Prop: Sample-Efficient Policy Gradient with An Off-Policy Critic	2016	Shixiang Gu Timothy Lillicrap Zoubin Ghahramani Richard E. Turner Sergey Levine	2
+	Emergence of Locomotion Behaviours in Rich Environments	2017	Nicolas Heess Dhruva Tb Sriram Srinivasan Jay Lemmon Josh Merel Greg Wayne Yuval Tassa Tom Erez Ziyu Wang S. M. Ali Eslami	2
+	IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures	2018	Lasse Espeholt Hubert Soyer Rémi Munos Karen Simonyan Volodymir Mnih Tom Ward Yotam Doron Vlad Firoiu Tim Harley Iain Dunning	1
+	Trust Region Policy Optimization	2015	John Schulman Sergey Levine Philipp Moritz Michael I. Jordan Pieter Abbeel	1
+	Continuous Deep Q-Learning with Model-based Acceleration	2016	Shixiang Gu Timothy Lillicrap Ilya Sutskever Sergey Levine	1
+	Control of Memory, Active Perception, and Action in Minecraft	2016	Junhyuk Oh Valliappa Chockalingam Satinder Singh Honglak Lee	1
+ PDF Chat	Neural Network Dynamics for Model-Based Deep Reinforcement Learning with Model-Free Fine-Tuning	2018	Anusha Nagabandi Gregory Kahn Ronald S. Fearing Sergey Levine	1
+	When to Trust Your Model: Model-Based Policy Optimization	2019	Michael Jänner Justin Fu Marvin Zhang Sergey Levine	1
+	High-Dimensional Continuous Control Using Generalized Advantage Estimation	2015	John Schulman Philipp Moritz Sergey Levine Michael I. Jordan Pieter Abbeel	1
+	IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures	2018	Lasse Espeholt Hubert Soyer Rémi Munos Karen Simonyan Volodymir Mnih Tom Ward Yotam Doron Vlad Firoiu Tim Harley Iain Dunning	1
+	Continuous control with deep reinforcement learning	2015	Timothy Lillicrap Jonathan J. Hunt Alexander Pritzel Nicolas Heess Tom Erez Yuval Tassa David Silver Daan Wierstra	1
+ PDF Chat	Path integral guided policy search	2017	Yevgen Chebotar Mrinal Kalakrishnan Ali Abdullah Yahya Adrian Li Stefan Schaal Sergey Levine	1
+	Q-PrOP: Sample-efficient policy gradient with an off-policy critic	2017	Shixiang Gu Timothy Lillicrap Zoubin Ghahramani Richard E. Turner Sergey Levine	1
+	Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor	2018	Tuomas Haarnoja Aurick Zhou Pieter Abbeel Sergey Levine	1