Hengshuai Yao

Follow

Generating author description...

All published works
Action Title Year Authors
+ Explainable Artificial Intelligence for Autonomous Driving: A Comprehensive Overview and Field Guide for Future Research Directions 2024 Shahin Atakishiyev
Mohammad Salameh
Hengshuai Yao
Randy Goebel
+ PDF Chat Towards Safe, Explainable, and Regulated Autonomous Driving 2023 Shahin Atakishiyev
Mohammad Salameh
Hengshuai Yao
Randy Goebel
+ PDF Chat The Sufficiency of Off-Policyness and Soft Clipping: PPO Is Still Insufficient according to an Off-Policy Measure 2023 Xing Chen
Dongcui Diao
Hechang Chen
Hengshuai Yao
Haiyin Piao
Zhixiao Sun
Zhiwei Yang
Randy Goebel
Bei Jiang
Yi Chang
+ Measuring and Mitigating Interference in Reinforcement Learning 2023 Vincent Liu
Adam White
Hengshuai Yao
Martha White
+ A new Gradient TD Algorithm with only One Step-size: Convergence Rate Analysis using $L$-$λ$ Smoothness 2023 Hengshuai Yao
+ Baird Counterexample is Solved: with an example of How to Debug a Two-time-scale Algorithm 2023 Hengshuai Yao
+ Careful at Estimation and Bold at Exploration 2023 Xing Chen
Yijun Liu
Zhaogeng Liu
Hechang Chen
Hengshuai Yao
Yi Chang
+ Learning to Accelerate by the Methods of Step-size Planning 2022 Hengshuai Yao
+ The Sufficiency of Off-Policyness and Soft Clipping: PPO is still Insufficient according to an Off-Policy Measure 2022 Xing Chen
Dongcui Diao
Hechang Chen
Hengshuai Yao
Jielong Yang
Haiyin Piao
Zhixiao Sun
Bei Jiang
Yi Chang
+ Class Interference of Deep Neural Networks 2022 Dongcui Diao
Hengshuai Yao
Bei Jiang
+ The Vanishing Decision Boundary Complexity and the Strong First Component 2022 Hengshuai Yao
+ PDF Chat A Multi-Component Framework for the Analysis and Design of Explainable Artificial Intelligence 2021 Miyoung Kim
Shahin Atakishiyev
Housam Khalifa Bashier Babiker
Nawshad Farruque
Randy Goebel
Osmar R. Zaı̈ane
Mohammad-Hossein Motallebi
Juliano Rabelo
Talat Iqbal Syed
Hengshuai Yao
+ PDF Chat Nonsmooth Low-Rank Matrix Recovery: Methodology, Theory and Algorithm 2021 Wei Tu
Peng Liu
Yi Liu
Guodong Li
Bei Jiang
Linglong Kong
Hengshuai Yao
Shangling Jui
+ Exploring the Robustness of Distributional Reinforcement Learning against Noisy State Observations. 2021 Ke Sun
Yi Liu
Yingnan Zhao
Hengshuai Yao
Shangling Jui
Linglong Kong
+ Beyond Prioritized Replay: Sampling States in Model-Based RL via Simulated Priorities 2021 Jincheng Mei
Yangchen Pan
Martha White
Amir‐massoud Farahmand
Hengshuai Yao
+ Breaking the Deadly Triad with a Target Network 2021 Shangtong Zhang
Hengshuai Yao
Shimon Whiteson
+ Towards Safe, Explainable, and Regulated Autonomous Driving 2021 Shahin Atakishiyev
Mohammad Salameh
Hengshuai Yao
Randy Goebel
+ Explainable Artificial Intelligence for Autonomous Driving: A Comprehensive Overview and Field Guide for Future Research Directions 2021 Shahin Atakishiyev
Mohammad Salameh
Hengshuai Yao
Randy Goebel
+ Exploring the Training Robustness of Distributional Reinforcement Learning against Noisy State Observations 2021 Ke Sun
Yi Liu
Yingnan Zhao
Hengshuai Yao
Shangling Jui
Linglong Kong
+ Beyond Prioritized Replay: Sampling States in Model-Based Reinforcement Learning via Simulated Priorities 2020 Jincheng Mei
Yangchen Pan
Amir‐massoud Farahmand
Hengshuai Yao
Martha White
+ PDF Chat Towards a practical measure of interference for reinforcement learning 2020 Vincent Liu
Adam White
Hengshuai Yao
Martha White
+ Weakly Supervised Few-shot Object Segmentation using Co-Attention with Visual and Semantic Embeddings 2020 Mennatullah Siam
Naren Doraiswamy
Boris N. Oreshkin
Hengshuai Yao
Martin Jägersand
+ PDF Chat Mapless Navigation among Dynamics with Social-safety-awareness: a reinforcement learning approach from 2D laser scans 2020 Jun Jin
Nhat M. Nguyen
Nazmus Sakib
Daniel Graves
Hengshuai Yao
Martin Jägersand
+ Towards a practical measure of interference for reinforcement learning 2020 Vincent Liu
Adam White
Hengshuai Yao
Martha White
+ Variance-Reduced Off-Policy Memory-Efficient Policy Search 2020 Daoming Lyu
Qi Qi
Mohammad Ghavamzadeh
Hengshuai Yao
Tianbao Yang
Bo Liu
+ Understanding and Mitigating the Limitations of Prioritized Experience Replay 2020 Yangchen Pan
Jincheng Mei
Amir‐massoud Farahmand
Martha White
Hengshuai Yao
Mohsen Rohani
Luo Jun
+ Weakly Supervised Few-shot Object Segmentation using Co-Attention with Visual and Semantic Embeddings 2020 Mennatullah Siam
Naren Doraiswamy
Boris N. Oreshkin
Hengshuai Yao
Martin Jägersand
+ Provably Convergent Off-Policy Actor-Critic with Function Approximation 2019 Shangtong Zhang
Bo Liu
Hengshuai Yao
Shimon Whiteson
+ PDF Chat Negative Log Likelihood Ratio Loss for Deep Neural Network Classification 2019 Hengshuai Yao
Donglai Zhu
Bei Jiang
Peng Yu
+ Hill Climbing on Value Estimates for Search-control in Dyna 2019 Yangchen Pan
Hengshuai Yao
Amir‐massoud Farahmand
Martha White
+ PDF Chat QUOTA: The Quantile Option Architecture for Reinforcement Learning 2019 Shangtong Zhang
Hengshuai Yao
+ PDF Chat ACE: An Actor Ensemble Algorithm for Continuous Control with Tree Search 2019 Shangtong Zhang
Hengshuai Yao
+ Distributional Reinforcement Learning for Efficient Exploration 2019 Borislav Mavrin
Hengshuai Yao
Linglong Kong
Kaiwen Wu
Yaoliang Yu
+ Reinforcing Classical Planning for Adversary Driving Scenarios. 2019 Nazmus Sakib
Hengshuai Yao
Zhang Hong
+ Deep Reinforcement Learning with Decorrelation 2019 Borislav Mavrin
Hengshuai Yao
Linglong Kong
+ Distributional Reinforcement Learning for Efficient Exploration 2019 Borislav Mavrin
Shangtong Zhang
Hengshuai Yao
Linglong Kong
Kaiwen Wu
Yaoliang Yu
+ Hill Climbing on Value Estimates for Search-control in Dyna 2019 Yangchen Pan
Hengshuai Yao
Amir‐massoud Farahmand
Martha White
+ Discounted Reinforcement Learning Is Not an Optimization Problem 2019 Abhishek Naik
Roshan Shariff
Niko Yasui
Hengshuai Yao
Richard S. Sutton
+ Is Fast Adaptation All You Need? 2019 Khurram Javed
Hengshuai Yao
Martha White
+ Single-step Options for Adversary Driving 2019 Nazmus Sakib
Hengshuai Yao
Hong Zhang
Shangling Jui
+ One-Shot Weakly Supervised Video Object Segmentation 2019 Mennatullah Siam
Naren Doraiswamy
Boris N. Oreshkin
Hengshuai Yao
Martin Jägersand
+ Provably Convergent Two-Timescale Off-Policy Actor-Critic with Function Approximation 2019 Shangtong Zhang
Bo Liu
Hengshuai Yao
Shimon Whiteson
+ PDF Chat Practical Issues of Action-Conditioned Next Image Prediction 2018 Donglai Zhu
Hao Chern
Hengshuai Yao
Masoud S. Nosrati
Peyman Yadmellat
Yunfei Zhang
+ Practical Issues of Action-conditioned Next Image Prediction 2018 Donglai Zhu
Hao Chen
Hengshuai Yao
Masoud S. Nosrati
Peyman Yadmellat
Yunfei Zhang
+ Negative Log Likelihood Ratio Loss for Deep Neural Network Classification 2018 Donglai Zhu
Hengshuai Yao
Bei Jiang
Yu Peng
+ QUOTA: The Quantile Option Architecture for Reinforcement Learning 2018 Shangtong Zhang
Borislav Mavrin
Linglong Kong
Bo Liu
Hengshuai Yao
+ ACE: An Actor Ensemble Algorithm for Continuous Control with Tree Search 2018 Shangtong Zhang
Hao Chen
Hengshuai Yao
+ Reinforcement Ranking 2013 Hengshuai Yao
Dale Schuurmans
+ Discovering and Leveraging the Most Valuable Links for Ranking 2012 Hengshuai Yao
+ PDF Chat Preconditioned temporal difference learning 2008 Hengshuai Yao
Zhiqiang Liu
+ Preconditioned Temporal Difference Learning 2007 Hengshuai Yao
Common Coauthors
Commonly Cited References
Action Title Year Authors # of times referenced
+ Continuous control with deep reinforcement learning 2016 Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
Nicolas Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
7
+ Distributed Distributional Deterministic Policy Gradients 2018 Gabriel Barth-Maron
Matthew W. Hoffman
David Budden
Will Dabney
Dan Horgan
Dhruva Tb
Alistair Muldal
Nicolas Heess
Timothy Lillicrap
5
+ Continuous control with deep reinforcement learning 2015 Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
Nicolas Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
5
+ End to End Learning for Self-Driving Cars 2016 Mariusz Bojarski
Davide Del Testa
Daniel Dworakowski
Bernhard Firner
Beat Flepp
Prasoon Goyal
Lawrence D. Jackel
Mathew Monfort
Urs Müller
Jiakai Zhang
4
+ PDF Chat The Arcade Learning Environment: An Evaluation Platform for General Agents 2013 Marc G. Bellemare
Yavar Naddaf
Joel Veness
Michael Bowling
4
+ Organizing Experience: a Deeper Look at Replay Mechanisms for Sample-Based Planning in Continuous State Domains 2018 Yangchen Pan
Muhammad Zaigham Zaheer
Adam White
Andrew Patterson
Martha White
4
+ PDF Chat Distributional Reinforcement Learning With Quantile Regression 2018 Will Dabney
Mark Rowland
Marc G. Bellemare
Rémi Munos
4
+ The Effect of Planning Shape on Dyna-style Planning in High-dimensional State Spaces 2018 Gerhard Holland
Erik Talvitie
Michael Bowling
4
+ PDF Chat Exponential Convergence of Langevin Distributions and Their Discrete Approximations 1996 Gareth O. Roberts
Richard L. Tweedie
3
+ Continuous Deep Q-Learning with Model-based Acceleration 2016 Shixiang Gu
Timothy Lillicrap
Ilya Sutskever
Sergey Levine
3
+ DeepTraffic: Crowdsourced Hyperparameter Tuning of Deep Reinforcement Learning Systems for Multi-Agent Dense Traffic Navigation 2018 Lex Fridman
Jack Terwilliger
Benedikt Jenik
3
+ PDF Chat QUOTA: The Quantile Option Architecture for Reinforcement Learning 2019 Shangtong Zhang
Hengshuai Yao
3
+ Implicit Quantile Networks for Distributional Reinforcement Learning 2018 Will Dabney
Georg Ostrovski
David Silver
Rémi Munos
3
+ PDF Chat Socially aware motion planning with deep reinforcement learning 2017 Yu Fan Chen
Michael Everett
Miao Liu
Jonathan P. How
3
+ PDF Chat Robust Estimation of a Location Parameter 1964 Peter J. Huber
3
+ Addressing Function Approximation Error in Actor-Critic Methods 2018 Scott Fujimoto
Herke van Hoof
David Meger
3
+ PDF Chat Deep Residual Learning for Image Recognition 2016 Kaiming He
Xiangyu Zhang
Shaoqing Ren
Jian Sun
3
+ Deep Exploration via Bootstrapped DQN 2016 Ian Osband
Charles Blundell
Alexander Pritzel
Benjamin Van Roy
3
+ PDF Chat Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization 2017 Ramprasaath R. Selvaraju
Michael Cogswell
Abhishek Das
Ramakrishna Vedantam
Devi Parikh
Dhruv Batra
3
+ PDF Chat Rainbow: Combining Improvements in Deep Reinforcement Learning 2018 Matteo Hessel
Joseph Modayil
Hado van Hasselt
Tom Schaul
Georg Ostrovski
Will Dabney
Dan Horgan
Bilal Piot
Mohammad Gheshlaghi Azar
David Silver
3
+ Prioritized Experience Replay 2015 Tom Schaul
John Quan
Ioannis Antonoglou
David Silver
3
+ Asynchronous Methods for Deep Reinforcement Learning 2016 Volodymyr Mnih
Adrià Puigdomènech Badia
Mehdi Mirza
Alex Graves
Tim Harley
Timothy Lillicrap
David Silver
Koray Kavukcuoglu
3
+ Bayesian Learning via Stochastic Gradient Langevin Dynamics 2011 Max Welling
Yee Whye Teh
3
+ Modelling transition dynamics in MDPs with RKHS embeddings 2012 Steffen Grünewälder
Guy Lever
Luca Baldassarre
Massi Pontil
Arthur Gretton
2
+ Towards Characterizing Divergence in Deep Q-Learning 2019 Joshua Achiam
Ethan Knight
Pieter Abbeel
2
+ TreeQN and ATreeC: Differentiable Tree-Structured Models for Deep Reinforcement Learning 2017 Gregory Farquhar
Tim Rocktäschel
Maximilian Igl
Shimon Whiteson
2
+ Approximate Robust Control of Uncertain Dynamical Systems 2019 Edouard Leurent
Y. Blanco
Denis Efimov
Odalric-Ambrym Maillard
2
+ YouTube-VOS: A Large-Scale Video Object Segmentation Benchmark 2018 Ning Xu
Linjie Yang
Yuchen Fan
Dingcheng Yue
Yuchen Liang
Shuicheng Yan
Thomas S. Huang
2
+ Giraffe: Using Deep Reinforcement Learning to Play Chess 2015 Matthew Lai
2
+ Distributional Reinforcement Learning for Efficient Exploration 2019 Borislav Mavrin
Hengshuai Yao
Linglong Kong
Kaiwen Wu
Yaoliang Yu
2
+ Deep Reinforcement Learning and the Deadly Triad 2018 Hado van Hasselt
Yotam Doron
Florian Strub
Matteo Hessel
Nicolas Sonnerat
Joseph Modayil
2
+ Exploration by Distributional Reinforcement Learning 2018 Yunhao Tang
Shipra Agrawal
2
+ Recurrent World Models Facilitate Policy Evolution 2018 David Ha
Jürgen Schmidhuber
2
+ PDF Chat Off-Policy Deep Reinforcement Learning by Bootstrapping the Covariate Shift 2019 Carles Gelada
Marc G. Bellemare
2
+ Learning to Run with Actor-Critic Ensemble 2017 Zhewei Huang
Shuchang Zhou
BoEr Zhuang
Xinyu Zhou
2
+ Learnings Options End-to-End for Continuous Action Tasks 2017 Martin Klissarov
Pierre‐Luc Bacon
Jean Harb
Doina Precup
2
+ DeepMind Control Suite 2018 Yuval Tassa
Yotam Doron
Alistair Muldal
Tom Erez
Yazhe Li
Diego de Las Casas
David Budden
Abbas Abdolmaleki
Josh Merel
Andrew Lefrancq
2
+ Parametric Return Density Estimation for Reinforcement Learning 2012 Tetsuro Morimura
Masashi Sugiyama
Hisashi Kashima
Hirotaka Hachiya
Toshiyuki Tanaka
2
+ Proximal Policy Optimization Algorithms 2017 John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
2
+ The Uncertainty Bellman Equation and Exploration 2017 Brendan O’Donoghue
Ian Osband
Rémi Munos
Volodymyr Mnih
2
+ Count-Based Exploration with Neural Density Models 2017 Georg Ostrovski
Marc G. Bellemare
Aäron van den Oord
Rémi Munos
2
+ The Option-Critic Architecture 2016 Pierre‐Luc Bacon
Jean Harb
Doina Precup
2
+ UCB Exploration via Q-Ensembles 2017 Richard Y. Chen
Szymon Sidor
Pieter Abbeel
John Schulman
2
+ CARLA: An Open Urban Driving Simulator 2017 Alexey Dosovitskiy
Germán Ros
Felipe Codevilla
Antonio M. López
Vladlen Koltun
2
+ Diffusion for Global Optimization in $\mathbb{R}^n $ 1987 Tzuu-Shuh Chiang
Chii-Ruey Hwang
Shuenn Jyi Sheu
2
+ PDF Chat Efficient Reinforcement Learning Using Recursive Least-Squares Methods 2002 Xin Xu
Haibo He
Dewen Hu
2
+ Energetic natural gradient descent 2016 Philip S. Thomas
Bruno Castro da Silva
Christoph Dann
Emma Brunskill
2
+ Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks 2017 Chelsea Finn
Pieter Abbeel
Sergey Levine
2
+ PDF Chat The Cityscapes Dataset for Semantic Urban Scene Understanding 2016 Marius Cordts
Mohamed Omran
Sebastian Ramos
Timo Rehfeld
Markus Enzweiler
Rodrigo Benenson
Uwe Franke
Stefan Roth
Bernt Schiele
2
+ PDF Chat Preconditioned temporal difference learning 2008 Hengshuai Yao
Zhiqiang Liu
2