Jianye Hao

All published works
Action Title Year Authors
+ SpatialCoT: Advancing Spatial Reasoning through Coordinate Alignment and Chain-of-Thought for Embodied Task Planning 2025 Yuecheng Liu
Dafeng Chi
Shiguang Wu
Zhanguang Zhang
Yaochen Hu
Lingfeng Zhang
Yingxue Zhang
Shuang Wu
Tongtong Cao
Guowei Huang
+ Path-of-Thoughts: Extracting and Following Paths for Robust Relational Reasoning with Large Language Models 2024 Ge Zhang
Mohammad Ali Alomrani
Hongjian Gu
Jiaming Zhou
Yaochen Hu
Bin Wang
Qun Liu
Mark Coates
Yingxue Zhang
Jianye Hao
+ The Graph's Apprentice: Teaching an LLM Low Level Knowledge for Circuit Quality Estimation 2024 Reza Moravej
S. K. Bodhe
Zhanguang Zhang
Didier Chételat
Dimitrios Tsaras
Yingxue Zhang
Hui‐Ling Zhen
Jianye Hao
Mingxuan Yuan
+ Lightweight Neural App Control 2024 Filippos Christianos
Georgios Papoudakis
Thomas Coste
Jianye Hao
Jun Wang
Kun Shao
+ SeaDAG: Semi-autoregressive Diffusion for Conditional Directed Acyclic Graph Generation 2024 Xinyi Zhou
Xing Li
Yanqing Lian
Yiwen Wang
Jason Li Chen
Mingxuan Yuan
Jianye Hao
Guangyong Chen
Pheng‐Ann Heng
+ ET-Plan-Bench: Embodied Task-level Planning Benchmark Towards Spatial-Temporal Cognition with Foundation Models 2024 Lingfeng Zhang
Yuening Wang
Hongjian Gu
Atia Hamidizadeh
Zhanguang Zhang
Yuecheng Liu
Yutong Wang
David Nogués‐Bravo
Junyi Dong
Shunbo Zhou
+ Enhancing Logical Reasoning in Large Language Models through Graph-based Synthetic Data 2024 Jiaming Zhou
Abbas Ghaddar
Ge Zhang
Liheng Ma
Yaochen Hu
Soumyasundar Pal
Mark Coates
Bin Wang
Yingxue Zhang
Jianye Hao
+ MODULI: Unlocking Preference Generalization via Diffusion Models for Offline Multi-Objective Reinforcement Learning 2024 Yifu Yuan
Zhenrui Zheng
Zibin Dong
Jianye Hao
+ GraSS: Combining Graph Neural Networks with Expert Knowledge for SAT Solver Selection 2024 Zhanguang Zhang
Didier Chételat
Joseph Cotnareanu
Amur Ghose
Wenyi Xiao
Hui‐Ling Zhen
Yingxue Zhang
Jianye Hao
Mark Coates
Mingxuan Yuan
+ Actra: Optimized Transformer Architecture for Vision-Language-Action Models in Robot Learning 2024 Yueen Ma
Dafeng Chi
Shiguang Wu
Yuecheng Liu
Yuzheng Zhuang
Jianye Hao
Irwin King
+ CellAgent: An LLM-driven Multi-Agent Framework for Automated Single-cell Data Analysis 2024 Yihang Xiao
Jinyi Liu
Yan Zheng
Xiaohan Xie
Jianye Hao
Mingzhi Li
Ruitao Wang
Fei Ni
Yuxiao Li
Jintian Luo
+ MFE-ETP: A Comprehensive Evaluation Benchmark for Multi-modal Foundation Models on Embodied Task Planning 2024 Min Zhang
Jianye Hao
Xi′an Fu
Peilong Han
Hao Zhang
Lei Shi
Hongyao Tang
Yan Zheng
+ Benchmarking End-To-End Performance of AI-Based Chip Placement Algorithms 2024 Zhihai Wang
Zijie Geng
Z. Tu
Jie Wang
Yong Qian
Z. Xu
Ziyan Liu
Siyuan Xu
Zhentao Tang
Shixiong Kai
+ ROS-LLM: A ROS framework for embodied AI with task feedback and structured reasoning 2024 Christopher E. Mower
Yuhui Wan
Hongzhan Yu
Antoine Grosnit
Jonas Gonzalez-Billandon
Matthieu Zimmer
Jinlong Wang
Xinyu Zhang
Yao Zhao
Anbang Zhai
+ EWEK-QA: Enhanced Web and Efficient Knowledge Graph Retrieval for Citation-based Question Answering Systems 2024 Mohammad Hossein Dehghan
Mohammad Ali Alomrani
Sunyam Bagga
David Alfonso-Hermelo
Khalil Bibi
Abbas Ghaddar
Yingxue Zhang
Xiaoguang Li
Jianye Hao
Qun Liu
+ CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making 2024 Zibin Dong
Yifu Yuan
Jianye Hao
Fei Ni
Yi Ma
Pengyi Li
Yan Zheng
+ iVideoGPT: Interactive VideoGPTs are Scalable World Models 2024 Jialong Wu
Yin Shao-feng
Ningya Feng
Xu He
Dong Li
Jianye Hao
Mingsheng Long
+ A Survey on Vision-Language-Action Models for Embodied AI 2024 Yueen Ma
Zixing Song
Yuzheng Zhuang
Jianye Hao
Irwin King
+ vMFER: Von Mises-Fisher Experience Resampling Based on Uncertainty of Gradient Directions for Policy Improvement 2024 Yiwen Zhu
Jinyi Liu
Wenya Wei
Qianyi Fu
Yujing Hu
Fang Zhou
Bo An
Jianye Hao
Tangjie Lv
Changjie Fan
+ SheetAgent: A Generalist Agent for Spreadsheet Reasoning and Manipulation via Large Language Models 2024 Yibin Chen
Yifu Yuan
Zeyu Zhang
Yan Zheng
Jinyi Liu
Fei Ni
Jianye Hao
+ Reinforced In-Context Black-Box Optimization 2024 Lei Song
Chenxiao Gao
Ke Xue
Chenyang Wu
Dong Li
Jianye Hao
Zongzhang Zhang
Chao Qian
+ Enhancing Robotic Manipulation with AI Feedback from Multimodal Large Language Models 2024 Jinyi Liu
Yifu Yuan
Jianye Hao
Fei Ni
Lingzhi Fu
Yibin Chen
Zheng Yan
+ MENTOR: Guiding Hierarchical Reinforcement Learning with Human Feedback and Dynamic Distance Constraint 2024 Xinglin Zhou
Yifu Yuan
Shaofu Yang
Jianye Hao
+ Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback 2024 Yifu Yuan
Jianye Hao
Yi Ma
Zibin Dong
Hebin Liang
Jinyi Liu
Zhixin Feng
Kai Zhao
Yan Zheng
+ DiffuserLite: Towards Real-time Diffusion Planning 2024 Zibin Dong
Jianye Hao
Yifu Yuan
Fei Ni
Yitian Wang
Pengyi Li
Yan Zheng
+ Machine Learning Insides OptVerse AI Solver: Design Principles and Applications 2024 Xijun Li
Fangzhou Zhu
Hui‐Ling Zhen
Weilin Luo
Meng Lu
Yimin Huang
Zhenan Fan
Zirui Zhou
Yufei Kuang
Zhihai Wang
+ Bridging Evolutionary Algorithms and Reinforcement Learning: A Comprehensive Survey 2024 Pengyi Li
Jianye Hao
Hongyao Tang
Xi′an Fu
Yan Zheng
Ke Tang
+ Robust Multiobjective Reinforcement Learning Considering Environmental Uncertainties 2024 Xiangkun He
Jianye Hao
Xu Chen
Jun Wang
Xuewu Ji
Chen Lv
+ Bridging Evolutionary Algorithms and Reinforcement Learning: A Comprehensive Survey on Hybrid Algorithms 2024 P Li
Jianye Hao
Hongyao Tang
Xi′an Fu
Yan Zheng
Ke Tang
+ RITA: Boost Driving Simulators with Realistic Interactive Traffic Flow 2023 Zhengbang Zhu
Shenyu Zhang
Yuzheng Zhuang
Yuecheng Liu
Minghuan Liu
Ziqin Gong
Shixiong Kai
Qiang Gu
Bin Wang
Siyuan Cheng
+ Pessimistic value iteration for multi-task data sharing in Offline Reinforcement Learning 2023 Chenjia Bai
Lingxiao Wang
Jianye Hao
Zhuoran Yang
Bin Zhao
Zhen Wang
Xuelong Li
+ Generalized Universal Domain Adaptation with Generative Flow Networks 2023 Didi Zhu
Yinchuan Li
Yunfeng Shao
Jianye Hao
Fei Wu
Kun Kuang
Jun Xiao
Chao Wu
+ Traj-MAE: Masked Autoencoders for Trajectory Prediction 2023 Hao Chen
Jiaze Wang
Kun Shao
Furui Liu
Jianye Hao
Chenyong Guan
Guangyong Chen
Pheng‐Ann Heng
+ BiERL: A Meta Evolutionary Reinforcement Learning Framework via Bilevel Optimization 2023 Junyi Wang
Yuanyang Zhu
Zhi Wang
Yan Zheng
Jianye Hao
Chunlin Chen
+ Generative Flow Networks for Precise Reward-Oriented Active Learning on Graphs 2023 Yinchuan Li
Zhigang Li
Wenqian Li
Yunfeng Shao
Yan Zheng
Jianye Hao
+ Uncertainty-aware Consistency Learning for Cold-Start Item Recommendation 2023 Taichi Liu
Chen Gao
Zhenyu Wang
Dong Li
Jianye Hao
Depeng Jin
Yong Li
+ Structure Aware Incremental Learning with Personalized Imitation Weights for Recommender Systems 2023 Yuening Wang
Yingxue Zhang
Antonios Valkanas
Ruiming Tang
Chen Ma
Jianye Hao
Mark Coates
+ Neighbor Auto-Grouping Graph Neural Networks for Handover Parameter Configuration in Cellular Network 2023 Mehrtash Mehrabi
Walid Masoudimansour
Yingxue Zhang
Jie Chuai
Zhitang Chen
Mark Coates
Jianye Hao
Yanhui Geng
+ Models as Agents: Optimizing Multi-Step Predictions of Interactive Local Models in Model-Based Multi-Agent Reinforcement Learning 2023 Zifan Wu
Chao Yu
Chen Chen
Jianye Hao
Hankz Hankui Zhuo
+ Debiased Recommendation with User Feature Balancing 2023 Mengyue Yang
Guohao Cai
Furui Liu
Jiarui Jin
Zhenhua Dong
Xiuqiang He
Jianye Hao
Weiqi Shao
Jun Wang
Xu Chen
+ Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain 2023 Jianye Hao
Tianpei Yang
Hongyao Tang
Chenjia Bai
Jinyi Liu
Zhaopeng Meng
Peng Liu
Zhen Wang
+ Plan To Predict: Learning an Uncertainty-Foreseeing Model for Model-Based Reinforcement Learning 2023 Zifan Wu
Chao Yu
Chen Chen
Jianye Hao
Hankz Hankui Zhuo
+ Reweighted Interacting Langevin Diffusions: an Accelerated Sampling Method for Optimization 2023 Junlong Lyu
Zhitang Chen
Wenlong Lyu
Jianye Hao
+ Spectral Augmentations for Graph Contrastive Learning 2023 Amur Ghose
Yingxue Zhang
Jianye Hao
Mark Coates
+ The Ladder in Chaos: A Simple and Effective Improvement to General DRL Algorithms by Policy Path Trimming and Boosting 2023 Hongyao Tang
Min Zhang
Jianye Hao
+ CFlowNets: Continuous Control with Generative Flow Networks 2023 Yinchuan Li
Shuang Luo
Haozhi Wang
Jianye Hao
+ DAG Matters! GFlowNets Enhanced Explainer For Graph Neural Networks 2023 Wenqian Li
Yinchuan Li
Zhigang Li
Jianye Hao
Yan Pang
+ DR-Label: Improving GNN Models for Catalysis Systems by Label Deconstruction and Reconstruction 2023 Bowen Wang
Liang Chen
Jiaze Wang
Furui Liu
Shaogang Hao
Dong Li
Jianye Hao
Guangyong Chen
Xiaolong Zou
Pheng‐Ann Heng
+ Out-of-distribution Detection with Implicit Outlier Transformation 2023 Qizhou Wang
Junjie Ye
Feng Liu
Quanyu Dai
Marcus Kalander
Tongliang Liu
Jianye Hao
Bo Han
+ Multi-agent Policy Reciprocity with Theoretical Guarantee 2023 Haozhi Wang
Yinchuan Li
Qing Wang
Yunfeng Shao
Jianye Hao
+ Generative Flow Networks for Precise Reward-Oriented Active Learning on Graphs 2023 Yinchuan Li
Zhigang Li
Wenqian Li
Yunfeng Shao
Zheng Yan
Jianye Hao
+ Learnable Behavior Control: Breaking Atari Human World Records via Sample-Efficient Behavior Selection 2023 Jiajun Fan
Yuzheng Zhuang
Yuecheng Liu
Jianye Hao
Bin Wang
Jiangcheng Zhu
Hao Wang
Shu‐Tao Xia
+ GFlowNets with Human Feedback 2023 Yinchuan Li
Shuang Luo
Yunfeng Shao
Jianye Hao
+ MetaDiffuser: Diffusion Model as Conditional Planner for Offline Meta-RL 2023 Fei Ni
Jianye Hao
Yao Mu
Yifu Yuan
Yan Zheng
Bin Wang
Zhixuan Liang
+ Hierarchical Task Network Planning for Facilitating Cooperative Multi-Agent Reinforcement Learning 2023 Xuechen Mu
Hankz Hankui Zhuo
Chen Chen
Kai Zhang
Chao Yu
Jianye Hao
+ ChiPFormer: Transferable Chip Placement via Offline Decision Transformer 2023 Yao Lai
Jinxin Liu
Zhentao Tang
Bin Wang
Jianye Hao
Ping Luo
+ Prioritized Trajectory Replay: A Replay Memory for Data-driven Reinforcement Learning 2023 Jinyi Liu
Yi Ma
Jianye Hao
Yujing Hu
Yan Zheng
Tangjie Lv
Changjie Fan
+ VOLTA: Diverse and Controllable Question-Answer Pair Generation with Variational Mutual Information Maximizing Autoencoder 2023 Yueen Ma
Dafeng Chi
Jingjing Li
Yuzheng Zhuang
Jianye Hao
Irwin King
+ Exploiting Counter-Examples for Active Learning with Partial labels 2023 Fei Zhang
Yunjie Ye
Lei Feng
Zhongwen Rao
Jieming Zhu
Marcus Kalander
Chen Gong
Jianye Hao
Bo Han
+ A Circuit Domain Generalization Framework for Efficient Logic Synthesis in Chip Design 2023 Zhihai Wang
Lei Chen
Jie Wang
Xing Li
Yinqi Bai
Xijun Li
Mingxuan Yuan
Jianye Hao
Yongdong Zhang
Feng Wu
+ AlignDiff: Aligning Diverse Human Preferences via Behavior-Customisable Diffusion Model 2023 Zibin Dong
Yifu Yuan
Jianye Hao
Fei Ni
Yao Mu
Zheng Yan
Yujing Hu
Tangjie Lv
Changjie Fan
Zhipeng Hu
+ Rethinking Decision Transformer via Hierarchical Reinforcement Learning 2023 Yi Ma
Chenjun Xiao
Hebin Liang
Jianye Hao
+ OVD-Explorer: Optimism Should Not Be the Sole Pursuit of Exploration in Noisy Environments 2023 Jinyi Liu
Zhi Wang
Yan Zheng
Jianye Hao
Chenjia Bai
Junjie Ye
Zhen Wang
Haiyin Piao
Yang Sun
+ Pangu-Agent: A Fine-Tunable Generalist Agent with Structured Reasoning 2023 Filippos Christianos
Georgios Papoudakis
Matthieu Zimmer
Thomas Coste
Zhihao Wu
Jingxuan Chen
Khyati Khandelwal
J. Doran
Xidong Feng
Jiacheng Liu
+ Contrastive-ACE: Domain Generalization Through Alignment of Causal Mechanisms 2022 Yunqi Wang
Furui Liu
Zhitang Chen
Yik‐Chung Wu
Jianye Hao
Guangyong Chen
Pheng‐Ann Heng
+ Coach-assisted multi-agent reinforcement learning framework for unexpected crashed agents 2022 Jian Zhao
Youpeng Zhao
Weixun Wang
Mingyu Yang
Xunhan Hu
Wengang Zhou
Jianye Hao
Houqiang Li
+ Empirical Policy Optimization for n-Player Markov Games 2022 Yuanheng Zhu
Weifan Li
Mengchen Zhao
Jianye Hao
Dongbin Zhao
+ PMIC: Improving Multi-Agent Reinforcement Learning with Progressive Mutual Information Collaboration 2022 Pengyi Li
Hongyao Tang
Tianpei Yang
Xiaotian Hao
Tong Sang
Zheng Yan
Jianye Hao
Matthew E. Taylor
Wenyuan Tao
Zhen Wang
+ Modeling Scale-free Graphs with Hyperbolic Geometry for Knowledge-aware Recommendation 2022 Yankai Chen
Meng‐Lin Yang
Yingxue Zhang
Mengchen Zhao
Ziqiao Meng
Jianye Hao
Irwin King
+ SEIHAI: A Sample-Efficient Hierarchical AI for the MineRL Competition 2022 Hangyu Mao
Chao Wang
Xiaotian Hao
Yihuan Mao
Yiming Lu
Chengjie Wu
Jianye Hao
Dong Li
Pingzhong Tang
+ Uncertainty-Aware Low-Rank Q-Matrix Estimation for Deep Reinforcement Learning 2022 Tong Sang
Hongyao Tang
Jianye Hao
Yan Zheng
Zhaopeng Meng
+ Breaking the Curse of Dimensionality in Multiagent State Space: A Unified Agent Permutation Framework 2022 Xiaotian Hao
Weixun Wang
Hangyu Mao
Yaodong Yang
Dong Li
Yan Zheng
Zhen Wang
Jianye Hao
+ Debiased Recommendation with User Feature Balancing 2022 Mengyue Yang
Guohao Cai
Furui Liu
Zhenhua Dong
Xiuqiang He
Jianye Hao
Jun Wang
Xu Chen
+ Generalizable Information Theoretic Causal Representation 2022 Mengyue Yang
Xinyu Cai
Furui Liu
Xu Chen
Zhitang Chen
Jianye Hao
Jun Wang
+ Introduction to The Dynamic Pickup and Delivery Problem Benchmark -- ICAPS 2021 Competition 2022 Jianye Hao
Jiawen Lu
Xijun Li
Xialiang Tong
Xiang Xiang
Mingxuan Yuan
Hankz Hankui Zhuo
+ LHNN: Lattice Hypergraph Neural Network for VLSI Congestion Prediction 2022 Bowen Wang
Guibao Shen
Dong Li
Jianye Hao
Wulong Liu
Yu Huang
Hongzhong Wu
Yibo Lin
Guangyong Chen
Pheng‐Ann Heng
+ Plan Your Target and Learn Your Skills: Transferable State-Only Imitation Learning via Decoupled Policy Optimization 2022 Minghuan Liu
Zhengbang Zhu
Yuzheng Zhuang
Weinan Zhang
Jianye Hao
Yong Yu
Jun Wang
+ Revisiting QMIX: Discriminative Credit Assignment by Gradient Entropy Regularization 2022 Jian Zhao
Yue Zhang
Xunhan Hu
Weixun Wang
Wengang Zhou
Jianye Hao
Jiangcheng Zhu
Houqiang Li
+ A Benchmark for Automatic Medical Consultation System: Frameworks, Tasks and Datasets 2022 Wei Chen
Zhiwei Li
Hongyi Fang
Qianyuan Yao
Cheng Zhong
Jianye Hao
Qi Zhang
Xuanjing Huang
Jinye Peng
Zhongyu Wei
+ PMIC: Improving Multi-Agent Reinforcement Learning with Progressive Mutual Information Collaboration 2022 Pengyi Li
Hongyao Tang
Tianpei Yang
Xiaotian Hao
Tong Sang
Zheng Yan
Jianye Hao
Matthew E. Taylor
Zhen Wang
+ Off-Beat Multi-Agent Reinforcement Learning 2022 Wei Qiu
Weixun Wang
Rundong Wang
Bo An
Yujing Hu
Svetlana Obraztsova
Zinovi Rabinovich
Jianye Hao
Yingfeng Chen
Changjie Fan
+ GALOIS: Boosting Deep Reinforcement Learning via Generalizable Logic Synthesis 2022 Yushi Cao
Zhiming Li
Tianpei Yang
Hao Zhang
Yan Zheng
Yi Li
Jianye Hao
Yang Liu
+ Branch Ranking for Efficient Mixed-Integer Programming via Offline Ranking-based Policy Learning 2022 Zeren Huang
Wenhao Chen
Weinan Zhang
Chuhan Shi
Furui Liu
Hui‐Ling Zhen
Mingxuan Yuan
Jianye Hao
Yong Yu
Jun Wang
+ A Graph-Enhanced Click Model for Web Search 2022 Jianghao Lin
Weiwen Liu
Xinyi Dai
Weinan Zhang
Shuai Li
Ruiming Tang
Xiuqiang He
Jianye Hao
Yong Yu
+ Towards A Unified Policy Abstraction Theory and Representation Learning Approach in Markov Decision Processes 2022 Min Zhang
Hongyao Tang
Jianye Hao
Zheng Yan
+ On the Convergence Theory of Meta Reinforcement Learning with Personalized Policies 2022 Haozhi Wang
Qing Wang
Yunfeng Shao
Dong Li
Jianye Hao
Yinchuan Li
+ EUCLID: Towards Efficient Unsupervised Reinforcement Learning with Multi-choice Dynamics Model 2022 Yifu Yuan
Jianye Hao
Fei Ni
Yao Mu
Yan Zheng
Yujing Hu
Jinyi Liu
Yingfeng Chen
Changjie Fan
+ Decomposed Mutual Information Optimization for Generalized Context in Meta-Reinforcement Learning 2022 Yao Mu
Yuzheng Zhuang
Fei Ni
Bin Wang
Jianyu Chen
Jianye Hao
Ping Luo
+ PTDE: Personalized Training with Distillated Execution for Multi-Agent Reinforcement Learning 2022 Yiqun Chen
Hangyu Mao
Tianle Zhang
Shiguang Wu
Bin Zhang
Jianye Hao
Dong Li
Bin Wang
Hongxing Chang
+ GFlowCausal: Generative Flow Networks for Causal Discovery 2022 Wenqian Li
Yinchuan Li
Shengyu Zhu
Yunfeng Shao
Jianye Hao
Yan Pang
+ ERL-Re$^2$: Efficient Evolutionary Reinforcement Learning with Shared State Representation and Individual Policy Representation 2022 Pengyi Li
Hongyao Tang
Jianye Hao
Yan Zheng
Xi′an Fu
Zhaopeng Meng
+ PAnDR: Fast Adaptation to New Environments from Offline Experiences via Decoupling Policy and Environment Representations 2022 Tong Sang
Hongyao Tang
Yi Ma
Jianye Hao
Yan Zheng
Zhaopeng Meng
Boyan Li
Zhen Wang
+ Prototypical context-aware dynamics generalization for high-dimensional model-based reinforcement learning 2022 Junjie Wang
Yao Mu
Dong Li
Qichao Zhang
Dongbin Zhao
Yuzheng Zhuang
Ping Luo
Bin Wang
Jianye Hao
+ State-Aware Proximal Pessimistic Algorithms for Offline Reinforcement Learning 2022 Chen Chen
Hongyao Tang
Yi Ma
Chao Wang
Qianli Shen
Dong Li
Jianye Hao
+ Planning Immediate Landmarks of Targets for Model-Free Skill Transfer across Agents 2022 Minghuan Liu
Zhengbang Zhu
Menghui Zhu
Yuzheng Zhuang
Weinan Zhang
Jianye Hao
+ Transformer in Transformer as Backbone for Deep Reinforcement Learning 2022 Hangyu Mao
Rui Zhao
Hao Chen
Jianye Hao
Yiqun Chen
Dong Li
Junge Zhang
Zhen Xiao
+ ED2: An Environment Dynamics Decomposition Framework for World Model Construction 2021 Cong Wang
Tianpei Yang
Jianye Hao
Yan Zheng
Hongyao Tang
Fazl Barez
Jinyi Liu
Jiajie Peng
Haiyin Piao
Zhixiao Sun
+ Learning State Representations via Retracing in Reinforcement Learning 2021 Changmin Yu
Dong Li
Jianye Hao
Jun Wang
Neil Burgess
+ Dynamic Bottleneck for Robust Self-Supervised Exploration 2021 Chenjia Bai
Lingxiao Wang
Lei Han
Animesh Garg
Jianye Hao
Peng Liu
Zhaoran Wang
+ Flattening Sharpness for Dynamic Gradient Projection Memory Benefits Continual Learning 2021 Danruo Deng
Guangyong Chen
Jianye Hao
Qiong Wang
Pheng‐Ann Heng
+ Learning to select cuts for efficient mixed-integer programming 2021 Zeren Huang
Kerong Wang
Furui Liu
Hui‐Ling Zhen
Weinan Zhang
Mingxuan Yuan
Jianye Hao
Yong Yu
Jun Wang
+ HyAR: Addressing Discrete-Continuous Action Reinforcement Learning via Hybrid Action Representation 2021 Boyan Li
Hongyao Tang
Yan Zheng
Jianye Hao
Pengyi Li
Zhen Wang
Zhaopeng Meng
Li Wang
+ Ordering-Based Causal Discovery with Reinforcement Learning 2021 Xiaoqiang Wang
Yali Du
Shengyu Zhu
Liangjun Ke
Zhitang Chen
Jianye Hao
Jun Wang
+ A Graph-Enhanced Click Model for Web Search 2021 Jianghao Lin
Weiwen Liu
Xinyi Dai
Weinan Zhang
Shuai Li
Ruiming Tang
Xiuqiang He
Jianye Hao
Yong Yu
+ Critic PI2: Master Continuous Planning via Policy Improvement with Path Integrals and Deep Actor-Critic Reinforcement Learning 2021 He Ba
Jiajun Fan
Xian Guo
Jianye Hao
+ Foresee then Evaluate: Decomposing Value Estimation with Latent Future Prediction 2021 Hongyao Tang
Zhaopeng Meng
Guangyong Chen
Pengfei Chen
Chen Chen
Yaodong Yang
Luo Zhang
Wulong Liu
Jianye Hao
+ Addressing Action Oscillations through Learning Policy Inertia 2021 Chen Chen
Hongyao Tang
Jianye Hao
Wulong Liu
Zhaopeng Meng
+ Towards Effective Context for Meta-Reinforcement Learning: an Approach based on Contrastive Learning 2021 Haotian Fu
Hongyao Tang
Jianye Hao
Chen Chen
Xidong Feng
Li Dong
Wulong Liu
+ Principled Exploration via Optimistic Bootstrapping and Backward Induction 2021 Chenjia Bai
Lingxiao Wang
Lei Han
Jianye Hao
Animesh Garg
Peng Liu
Zhaoran Wang
+ Automatic Web Testing Using Curiosity-Driven Reinforcement Learning 2021 Yan Zheng
Yi Liu
Xiaofei Xie
Yepang Liu
Lei Ma
Jianye Hao
Yang Liu
+ An Adversarial Imitation Click Model for Information Retrieval 2021 Xinyi Dai
Jianghao Lin
Weinan Zhang
Shuai Li
Weiwen Liu
Ruiming Tang
Xiuqiang He
Jianye Hao
Jun Wang
Yong Yu
+ Differentiable Logic Machines 2021 Matthieu Zimmer
Xuening Feng
Claire Glanois
Zhaohui Jiang
Jianyi Zhang
Paul Weng
Jianye Hao
Dong Li
Wulong Liu
+ Learning Symbolic Rules for Interpretable Deep Reinforcement Learning 2021 Zhihao Ma
Yuzheng Zhuang
Paul Weng
Hankz Hankui Zhuo
Dong Li
Wulong Liu
Jianye Hao
+ Contrastive ACE: Domain Generalization Through Alignment of Causal Mechanisms 2021 Yunqi Wang
Furui Liu
Zhitang Chen
Qing Lian
Shoubo Hu
Jianye Hao
Yik‐Chung Wu
+ Cooperative Multi-Agent Transfer Learning with Level-Adaptive Credit Assignment 2021 Tianze Zhou
Fubiao Zhang
Kun Shao
Kai Li
Wenhan Huang
Jun Luo
Weixun Wang
Yaodong Yang
Hangyu Mao
Bin Wang
+ CMML: Contextual Modulation Meta Learning for Cold-Start Recommendation 2021 Xidong Feng
Chen Chen
Dong Li
Mengchen Zhao
Jianye Hao
Jun Wang
+ Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain 2021 Tianpei Yang
Hongyao Tang
Chenjia Bai
Jinyi Liu
Jianye Hao
Zhaopeng Meng
Peng Liu
+ Ranking Cost: Building An Efficient and Scalable Circuit Routing Planner with Evolution-Based Optimization 2021 Shiyu Huang
Bin Wang
Dong Li
Jianye Hao
Ting Chen
Jun Zhu
+ Lifelong Reinforcement Learning with Temporal Logic Formulas and Reward Machines 2021 Xuejing Zheng
Chao Yu
Chen Chen
Jianye Hao
Hankz Hankui Zhuo
+ SEIHAI: A Sample-efficient Hierarchical AI for the MineRL Competition 2021 Hangyu Mao
Chao Wang
Xiaotian Hao
Yihuan Mao
Yiming Lu
Chengjie Wu
Jianye Hao
Dong Li
Pingzhong Tang
+ Uncertainty-aware Low-Rank Q-Matrix Estimation for Deep Reinforcement Learning 2021 Tong Sang
Hongyao Tang
Jianye Hao
Yan Zheng
Zhaopeng Meng
+ A Survey on Interpretable Reinforcement Learning 2021 Claire Glanois
Paul Weng
Matthieu Zimmer
Dong Li
Tianpei Yang
Jianye Hao
Wulong Liu
+ Modeling Scale-free Graphs with Hyperbolic Geometry for Knowledge-aware Recommendation 2021 Yankai Chen
Meng‐Lin Yang
Yingxue Zhang
Mengchen Zhao
Ziqiao Meng
Jianye Hao
Irwin King
+ An Empirical Study of Assumptions in Bayesian Optimisation 2020 Alexander I. Cowen-Rivers
Wenlong Lyu
Rasul Tutunov
Zhi Wang
Antoine Grosnit
Ryan‐Rhys Griffiths
Jianye Hao
Jun Wang
Jan Peters
Haitham Bou Ammar
+ Represent Your Own Policies: Reinforcement Learning with Policy-extended Value Function Approximator 2020 Hongyao Tang
Zhaopeng Meng
Jianye Hao
Chen Chen
Daniel Graves
Dong Li
Hangyu Mao
Wulong Liu
Yaodong Yang
Changmin Yu
+ KoGuN: Accelerating Deep Reinforcement Learning via Integrating Human Suboptimal Knowledge 2020 Peng Zhang
Jianye Hao
Weixun Wang
Hongyao Tang
Yi Ma
Yihai Duan
Yan Zheng
+ Efficient Deep Reinforcement Learning via Adaptive Policy Transfer 2020 Tianpei Yang
Jianye Hao
Zhaopeng Meng
Zongzhang Zhang
Yujing Hu
Yingfeng Chen
Changjie Fan
Weixun Wang
Wulong Liu
Zhaodong Wang
+ Learning to Accelerate Heuristic Searching for Large-Scale Maximum Weighted b-Matching Problems in Online Advertising 2020 Xiaotian Hao
Junqi Jin
Jianye Hao
Jin Li
Weixun Wang
Yi Ma
Zhenzhe Zheng
Han Li
Jian Xu
Kun Gai
+ Triple-GAIL: A Multi-Modal Imitation Learning Framework with Generative Adversarial Nets 2020 Cong Fei
Bin Wang
Yuzheng Zhuang
Zongzhang Zhang
Jianye Hao
Hongbo Zhang
Xuewu Ji
Wulong Liu
+ Dynamic Knapsack Optimization Towards Efficient Multi-Channel Sequential Advertising 2020 Xiaotian Hao
Zhaoqing Peng
Yi Ma
Guan Wang
Junqi Jin
Jianye Hao
Shan Chen
Rongquan Bai
Mingzhou Xie
Miao Xu
+ CausalVAE: Disentangled Representation Learning via Neural Structural Causal Models 2020 Mengyue Yang
Furui Liu
Zhitang Chen
Xinwei Shen
Jianye Hao
Jun Wang
+ Neighborhood Cognition Consistent Multi-Agent Reinforcement Learning 2020 Hangyu Mao
Wulong Liu
Jianye Hao
Jun Luo
Dong Li
Zhengchao Zhang
Jun Wang
Zhen Xiao
+ From Few to More: Large-Scale Dynamic Multiagent Curriculum Learning 2020 Weixun Wang
Tianpei Yang
Yong Liu
Jianye Hao
Xiaotian Hao
Yujing Hu
Yingfeng Chen
Changjie Fan
Yang Gao
+ Multi-Agent Game Abstraction via Graph Attention Neural Network 2020 Yong Liu
Weixun Wang
Yujing Hu
Jianye Hao
Xingguo Chen
Yang Gao
+ Continuous Multiagent Control Using Collective Behavior Entropy for Large-Scale Home Energy Management 2020 Jianwen Sun
Yan Zheng
Jianye Hao
Zhaopeng Meng
Yang Liu
+ Efficient Deep Reinforcement Learning through Policy Transfer 2020 Tianpei Yang
Jianye Hao
Zhaopeng Meng
Zongzhang Zhang
Weixun Wang
Yujing Hu
Yingfeng Cheng
Changjie Fan
Zhaodong Wang
Jiajie Peng
+ An Efficient Transfer Learning Framework for Multiagent Reinforcement Learning 2020 Tianpei Yang
Weixun Wang
Hongyao Tang
Jianye Hao
Zhaopeng Meng
Hangyu Mao
Dong Li
Wulong Liu
Chengwei Zhang
Yujing Hu
+ Falsification of Cyber-Physical Systems Using Deep Reinforcement Learning 2020 Yoriyuki Yamagata
Shuang Liu
Takumi Akazaki
Yihai Duan
Jianye Hao
+ Qatten: A General Framework for Cooperative Multiagent Reinforcement Learning 2020 Yaodong Yang
Jianye Hao
Ben Liao
Kun Shao
Guangyong Chen
Wulong Liu
Hongyao Tang
+ Q-value Path Decomposition for Deep Multiagent Reinforcement Learning 2020 Yaodong Yang
Jianye Hao
Guangyong Chen
Hongyao Tang
Yingfeng Chen
Yujing Hu
Changjie Fan
Zhongyu Wei
+ CausalVAE: Structured Causal Disentanglement in Variational Autoencoder 2020 Mengyue Yang
Furui Liu
Zhitang Chen
Xinwei Shen
Jianye Hao
Jun Wang
+ Learning to Accelerate Heuristic Searching for Large-Scale Maximum Weighted b-Matching Problems in Online Advertising 2020 Xiaotian Hao
Junqi Jin
Jianye Hao
Li Jin
Weixun Wang
Yi Ma
Zhenzhe Zheng
Han Li
Jian Xu
Kun Gai
+ Efficient Deep Reinforcement Learning via Adaptive Policy Transfer 2020 Tianpei Yang
Jianye Hao
Zhaopeng Meng
Zongzhang Zhang
Yujing Hu
Yingfeng Cheng
Changjie Fan
Weixun Wang
Wulong Liu
Zhaodong Wang
+ Dynamic Horizon Value Estimation for Model-based Reinforcement Learning 2020 Junjie Wang
Qichao Zhang
Dongbin Zhao
Mengchen Zhao
Jianye Hao
+ Towards Effective Context for Meta-Reinforcement Learning: an Approach based on Contrastive Learning 2020 Haotian Fu
Hongyao Tang
Jianye Hao
Chen Chen
Xidong Feng
Li Dong
Wulong Liu
+ Event-Triggered Multi-agent Reinforcement Learning with Communication under Limited-bandwidth Constraint 2020 Guangzheng Hu
Yuanheng Zhu
Dongbin Zhao
Mengchen Zhao
Jianye Hao
+ SMARTS: Scalable Multi-Agent Reinforcement Learning Training School for Autonomous Driving 2020 Ming Zhou
Jun Luo
Julian Villela
Yaodong Yang
David Rusu
Jiayu Miao
Weinan Zhang
Montgomery Alban
Iman Fadakar
Zheng Chen
+ Learning to Utilize Shaping Rewards: A New Approach of Reward Shaping 2020 Yujing Hu
Weixun Wang
Hangtian Jia
Wang Yi-xiang
Yingfeng Chen
Jianye Hao
Feng Wu
Changjie Fan
+ Critic PI2: Master Continuous Planning via Policy Improvement with Path Integrals and Deep Actor-Critic Reinforcement Learning 2020 Jiajun Fan
He Ba
Xian Guo
Jianye Hao
+ Learning to Utilize Shaping Rewards: A New Approach of Reward Shaping 2020 Yujing Hu
Weixun Wang
Hangtian Jia
Yixiang Wang
Yingfeng Chen
Jianye Hao
Feng Wu
Changjie Fan
+ MGHRL: Meta Goal-Generation for Hierarchical Reinforcement Learning 2020 Haotian Fu
Hongyao Tang
Jianye Hao
Wulong Liu
Chen Chen
+ An Efficient Transfer Learning Framework for Multiagent Reinforcement Learning 2020 Tianpei Yang
Weixun Wang
Hongyao Tang
Jianye Hao
Zhaopeng Meng
Hangyu Mao
Dong Li
Wulong Liu
Yingfeng Chen
Yujing Hu
+ What About Inputing Policy in Value Function: Policy Representation and Policy-extended Value Function Approximator 2020 Hongyao Tang
Zhaopeng Meng
Jianye Hao
Chen Chen
Daniel Graves
Dong Li
Changmin Yu
Hangyu Mao
Wulong Liu
Yaodong Yang
+ HEBO Pushing The Limits of Sample-Efficient Hyperparameter Optimisation 2020 Alexander I. Cowen-Rivers
Wenlong Lyu
Rasul Tutunov
Zhi Wang
Antoine Grosnit
Ryan‐Rhys Griffiths
Alexandre Max Maraval
Jianye Hao
Jun Wang
Jan Peters
+ PDF Chat There is Limited Correlation between Coverage and Robustness for Deep Neural Networks 2019 Yizhen Dong
Peixin Zhang
Jingyi Wang
Shuang Liu
Jun Sun
Jianye Hao
Xinyu Wang
Li Wang
Jin Song Dong
Ting Dai
+ PDF Chat Learning Adaptive Display Exposure for Real-Time Advertising 2019 Weixun Wang
Junqi Jin
Jianye Hao
Chunjie Chen
Chuan Yu
Weinan Zhang
Jun Wang
Xiaotian Hao
Yixi Wang
Han Li
+ PDF Chat Using deep reinforcement learning to speed up collective cell migration 2019 Hanxu Hou
Tian Gan
Yaodong Yang
Xianglei Zhu
Sen Liu
Weiming Guo
Jianye Hao
+ Efficient meta reinforcement learning via meta goal generation 2019 Haotian Fu
Hongyao Tang
Jianye Hao
+ Independent Generative Adversarial Self-Imitation Learning in Cooperative Multiagent Systems 2019 Xiaotian Hao
Weixun Wang
Jianye Hao
Yaodong Yang
+ Towards Efficient Detection and Optimal Response against Sophisticated Opponents 2019 Tianpei Yang
Jianye Hao
Zhaopeng Meng
Chongjie Zhang
Yan Zheng
Ze Zheng
+ Deep Multi-Agent Reinforcement Learning with Discrete-Continuous Hybrid Action Spaces 2019 Haotian Fu
Hongyao Tang
Jianye Hao
Zihan Lei
Yingfeng Chen
Changjie Fan
+ Action Semantics Network: Considering the Effects of Actions in Multiagent Systems 2019 Weixun Wang
Tianpei Yang
Yong Liu
Jianye Hao
Xiaotian Hao
Yujing Hu
Yingfeng Chen
Changjie Fan
Yang Gao
+ PDF Chat SA-IGA: a multiagent reinforcement learning method towards socially optimal outcomes 2019 Chengwei Zhang
Xiaohong Li
Jianye Hao
Siqi Chen
Karl Tuyls
Wanli Xue
Zhiyong Feng
+ Deep Multi-Agent Reinforcement Learning with Discrete-Continuous Hybrid Action Spaces 2019 Haotian Fu
Hongyao Tang
Jianye Hao
Zihan Lei
Yingfeng Chen
Changjie Fan
+ Disentangling Dynamics and Returns: Value Function Decomposition with Future Prediction 2019 Hongyao Tang
Jianye Hao
Guangyong Chen
Pengfei Chen
Zhaopeng Meng
Yaodong Yang
Li Wang
+ Independent Generative Adversarial Self-Imitation Learning in Cooperative Multiagent Systems 2019 Xiaotian Hao
Weixun Wang
Jianye Hao
Yaodong Yang
+ Spectral-based Graph Convolutional Network for Directed Graphs 2019 Yi Ma
Jianye Hao
Yaodong Yang
Han Li
Junqi Jin
Guangyong Chen
+ From Few to More: Large-scale Dynamic Multiagent Curriculum Learning 2019 Weixun Wang
Tianpei Yang
Yong Liu
Jianye Hao
Xiaotian Hao
Yujing Hu
Yingfeng Chen
Changjie Fan
Yang Gao
+ Diverse Behavior Is What Game AI Needs: Generating Varied Human-Like Playing Styles Using Evolutionary Multi-Objective Deep Reinforcement Learning 2019 Ruimin Shen
Yan Zheng
Jianye Hao
Yingfeng Chen
Changjie Fan
+ There is Limited Correlation between Coverage and Robustness for Deep Neural Networks 2019 Yizhen Dong
Peixin Zhang
Jingyi Wang
Shuang Liu
Jun Sun
Jianye Hao
Xinyu Wang
Li Wang
Jin Song Dong
Ting Dai
+ Multi-Agent Game Abstraction via Graph Attention Neural Network 2019 Yong Liu
Weixun Wang
Yujing Hu
Jianye Hao
Xingguo Chen
Yang Gao
+ Neighborhood Cognition Consistent Multi-Agent Reinforcement Learning 2019 Hangyu Mao
Wulong Liu
Jianye Hao
Jun Luo
Li Dong
Zhengchao Zhang
Jun Wang
Zhen Xiao
+ MGHRL: Meta Goal-generation for Hierarchical Reinforcement Learning 2019 Haotian Fu
Hongyao Tang
Jianye Hao
Wulong Liu
Chen Chen
+ Action Semantics Network: Considering the Effects of Actions in Multiagent Systems 2019 Weixun Wang
Tianpei Yang
Yong Liu
Jianye Hao
Xiaotian Hao
Yujing Hu
Yingfeng Chen
Changjie Fan
Yang Gao
+ Hierarchical Deep Multiagent Reinforcement Learning 2018 Hongyao Tang
Jianye Hao
Tangjie Lv
Yingfeng Chen
Zongzhang Zhang
Hangtian Jia
Chunxu Ren
Yan Zheng
Changjie Fan
Li Wang
+ SCC-rFMQ Learning in Cooperative Markov Games with Continuous Actions 2018 Chengwei Zhang
Xiaohong Li
Jianye Hao
Siqi Chen
Karl Tuyls
Zhiyong Feng
Wanli Xue
Rong Chen
+ Bayes-ToMoP: A Fast Detection and Best Response Algorithm Towards Sophisticated Opponents 2018 Tianpei Yang
Zhaopeng Meng
Jianye Hao
Chongjie Zhang
Yan Zheng
+ Learning to Advertise with Adaptive Exposure via Constrained Two-Level Reinforcement Learning 2018 Weixun Wang
Junqi Jin
Jianye Hao
Chunjie Chen
Chuan Yu
Weinan Zhang
Jun Wang
Yixi Wang
Han Li
Jian Xu
+ Hierarchical Heuristic Learning towards Efficient Norm Emergence 2018 Tianpei Yang
Jianye Hao
Zhaopeng Meng
Sandip Sen
Sheng Jin
+ Weighted Double Deep Multiagent Reinforcement Learning in Stochastic Cooperative Environments 2018 Yan Zheng
Jianye Hao
Zongzhang Zhang
+ SA-IGA: A Multiagent Reinforcement Learning Method Towards Socially Optimal Outcomes 2018 Chengwei Zhang
Xiaohong Li
Jianye Hao
Siqi Chen
Karl Tuyls
Wanli Xue
+ Towards Cooperation in Sequential Prisoner's Dilemmas: a Deep Multiagent Reinforcement Learning Approach 2018 Weixun Wang
Jianye Hao
Yixi Wang
Matthew E. Taylor
+ PDF Chat Falsification of Cyber-Physical Systems Using Deep Reinforcement Learning 2018 Takumi Akazaki
Shuang Liu
Yoriyuki Yamagata
Yihai Duan
Jianye Hao
+ An Optimal Rewiring Strategy for Reinforcement Social Learning in Cooperative Multiagent Systems 2018 Hongyao Tang
Li Wang
Zan Wang
Tim Baarslag
Jianye Hao
+ Towards Efficient Detection and Optimal Response against Sophisticated Opponents 2018 Tianpei Yang
Zhaopeng Meng
Jianye Hao
Chongjie Zhang
Yan Zheng
Ze Zheng
+ Hierarchical Deep Multiagent Reinforcement Learning with Temporal Abstraction 2018 Hongyao Tang
Jianye Hao
Tangjie Lv
Yingfeng Chen
Zongzhang Zhang
Hangtian Jia
Chunxu Ren
Yan Zheng
Zhaopeng Meng
Changjie Fan
+ PDF Chat Weighted Double Deep Multiagent Reinforcement Learning in Stochastic Cooperative Environments 2018 Yan Zheng
Zhaopeng Meng
Jianye Hao
Zongzhang Zhang
+ Learning Adaptive Display Exposure for Real-Time Advertising 2018 Weixun Wang
Junqi Jin
Jianye Hao
Chunjie Chen
Chuan Yu
Weinan Zhang
Jun Wang
Xiaotian Hao
Yixi Wang
Han Li
+ Hierarchical Heuristic Learning towards Efficient Norm Emergence 2018 Tianpei Yang
Jianye Hao
Zhaopeng Meng
Sandip Sen
Sheng Jin
+ Dynamic analysis of cell interactions in biological environments under multiagent social learning framework 2016 Chengwei Zhang
Xiaohong Li
Shuxin Li
Jianye Hao
+ PDF Chat Blind Image Denoising via Dependent Dirichlet Process Tree 2016 Fengyuan Zhu
Guangyong Chen
Jianye Hao
Pheng‐Ann Heng
+ Blind Image Denoising via Dependent Dirichlet Process Tree 2016 Fengyuan Zhu
Guangyong Chen
Jianye Hao
Pheng‐Ann Heng
Common Coauthors
Commonly Cited References
Action Title Year Authors # of times referenced
+ Proximal Policy Optimization Algorithms 2017 John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
28
+ Asynchronous Methods for Deep Reinforcement Learning 2016 Volodymyr Mnih
Adrià Puigdomènech Badia
Mehdi Mirza
Alex Graves
Tim Harley
Timothy Lillicrap
David Silver
Koray Kavukcuoglu
24
+ Continuous control with deep reinforcement learning 2016 Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
Nicolas Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
20
+ Continuous control with deep reinforcement learning 2015 Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
Nicolas Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
10
+ Trust Region Policy Optimization 2015 John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
10
+ High-Dimensional Continuous Control Using Generalized Advantage Estimation 2015 John Schulman
Philipp Moritz
Sergey Levine
Michael I. Jordan
Pieter Abbeel
10
+ QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning 2018 Tabish Rashid
Mikayel Samvelyan
Christian Schroeder de Witt
Gregory Farquhar
Jakob Foerster
Shimon Whiteson
8
+ Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments 2017 Ryan Lowe
Yi Wu
Aviv Tamar
Jean Harb
Pieter Abbeel
Igor Mordatch
8
+ Dream to Control: Learning Behaviors by Latent Imagination 2019 Danijar Hafner
Timothy Lillicrap
Jimmy Ba
Mohammad Norouzi
8
+ Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor 2018 Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
8
+ Deep reinforcement learning with double Q-Learning 2016 Hado van Hasselt
Arthur Guez
David Silver
7
+ Addressing Function Approximation Error in Actor-Critic Methods 2018 Scott Fujimoto
Herke van Hoof
David Meger
7
+ Generative Adversarial Imitation Learning 2016 Jonathan Ho
Stefano Ermon
7
+ QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning 2018 Tabish Rashid
Mikayel Samvelyan
Christian Schroeder de Witt
Gregory Farquhar
Jakob Foerster
Shimon Whiteson
7
+ Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments 2017 Ryan Lowe
Yi Wu
Aviv Tamar
Jean Harb
Pieter Abbeel
Igor Mordatch
6
+ Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments 2017 Ryan Lowe
Yi Wu
Aviv Tamar
Jean Harb
Pieter Abbeel
Igor Mordatch
6
+ Adam: A Method for Stochastic Optimization 2014 Diederik P. Kingma
Jimmy Ba
6
+ PDF Chat Counterfactual Multi-Agent Policy Gradients 2018 Jakob Foerster
Gregory Farquhar
Triantafyllos Afouras
Nantas Nardelli
Shimon Whiteson
6
+ CURL: Contrastive Unsupervised Representations for Reinforcement Learning 2020 Aravind Srinivas
Michael Laskin
Pieter Abbeel
6
+ PDF Chat Deep Reinforcement Learning: A Brief Survey 2017 Kai Arulkumaran
Marc Peter Deisenroth
Miles Brundage
Anil A. Bharath
5
+ Self-Supervised Exploration via Disagreement 2019 Deepak Pathak
Dhiraj Gandhi
Abhinav Gupta
5
+ PDF Chat Factorized Q-learning for large-scale multi-agent systems 2019 Ming Zhou
Yong Chen
Ying Wen
Yaodong Yang
Yufeng Su
Weinan Zhang
Dell Zhang
Jun Wang
5
+ Lenient Multi-Agent Deep Reinforcement Learning 2017 Gregory Palmer
Karl Tuyls
Daan Bloembergen
Rahul Savani
5
+ Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation 2017 Yuhuai Wu
Elman Mansimov
S. Matthew Liao
Roger Grosse
Jimmy Ba
5
+ Attention Is All You Need 2017 Ashish Vaswani
Noam Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan N. Gomez
Łukasz Kaiser
Illia Polosukhin
5
+ PDF Chat Deep reinforcement learning for robotic manipulation with asynchronous off-policy updates 2017 Shixiang Gu
Ethan Holly
Timothy Lillicrap
Sergey Levine
5
+ A Simple Framework for Contrastive Learning of Visual Representations 2020 Ting Chen
Simon Kornblith
Mohammad Norouzi
Geoffrey E. Hinton
5
+ PDF Chat An Overview of Recent Progress in the Study of Distributed Multi-Agent Coordination 2012 Yongcan Cao
Wenwu Yu
Wei Ren
Guanrong Chen
5
+ PDF Chat Weighted Double Deep Multiagent Reinforcement Learning in Stochastic Cooperative Environments 2018 Yan Zheng
Zhaopeng Meng
Jianye Hao
Zongzhang Zhang
4
+ Learning to Communicate with Deep Multi-Agent Reinforcement Learning 2016 Jakob Foerster
Ioannis Alexandros Assael
Nando de Freitas
Shimon Whiteson
4
+ The StarCraft Multi-Agent Challenge 2019 Mikayel Samvelyan
Tabish Rashid
Christian Schroeder de Witt
Gregory Farquhar
Nantas Nardelli
Tim G. J. Rudner
Chia-Man Hung
Philip H. S. Torr
Jakob Foerster
Shimon Whiteson
4
+ PDF Chat Graph Convolutional Neural Networks for Web-Scale Recommender Systems 2018 Rex Ying
Ruining He
Kaifeng Chen
Pong Eksombatchai
William L. Hamilton
Jure Leskovec
4
+ Provably Efficient Reinforcement Learning with Linear Function Approximation 2019 Chi Jin
Zhuoran Yang
Zhaoran Wang
Michael I. Jordan
4
+ PDF Chat Momentum Contrast for Unsupervised Visual Representation Learning 2020 Kaiming He
Haoqi Fan
Yuxin Wu
Saining Xie
Ross Girshick
4
+ Representation Learning with Contrastive Predictive Coding 2018 Aäron van den Oord
Yazhe Li
Oriol Vinyals
4
+ OpenAI Gym 2016 Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
4
+ PDF Chat Variational Dynamic for Self-Supervised Exploration in Deep Reinforcement Learning 2021 Chenjia Bai
Peng Liu
Kaiyu Liu
Lingxiao Wang
Yingnan Zhao
Lei Han
Zhaoran Wang
4
+ PDF Chat Real-Time Bidding by Reinforcement Learning in Display Advertising 2017 Han Cai
Kan Ren
Weinan Zhang
Kleanthis Malialis
Jun Wang
Yong Yu
Defeng Guo
4
+ PDF Chat The nested Chinese restaurant process and Bayesian nonparametric inference of topic hierarchies 2010 David M. Blei
Thomas L. Griffiths
Michael I. Jordan
4
+ Exploration by Random Network Distillation 2018 Yuri Burda
Harrison Edwards
Amos Storkey
Oleg Klimov
4
+ A Graph Autoencoder Approach to Causal Structure Learning 2019 Ignavier Ng
Shengyu Zhu
Zhitang Chen
Zhuangyan Fang
4
+ Deep Exploration via Bootstrapped DQN 2016 Ian Osband
Charles Blundell
Alexander Pritzel
Benjamin Van Roy
4
+ Hindsight Experience Replay 2017 Marcin Andrychowicz
Filip Wolski
Alex Ray
Jonas Schneider
Rachel Fong
Peter Welinder
Bob McGrew
Josh Tobin
Pieter Abbeel
Wojciech Zaremba
4
+ Methods of Qualitative Theory in Nonlinear Dynamics 2001 L. P. Shilnikov
Andrey Shilnikov
Dmitry Turaev
Leon O. Chua
4
+ Multiagent Bidirectionally-Coordinated Nets for Learning to Play StarCraft Combat Games 2017 Peng Peng
Quan Yuan
Ying Wen
Yaodong Yang
Zhenkun Tang
Haitao Long
Jun Wang
4
+ Multiagent Bidirectionally-Coordinated Nets: Emergence of Human-level Coordination in Learning to Play StarCraft Combat Games 2017 Peng Peng
Ying Wen
Yaodong Yang
Quan Yuan
Zhenkun Tang
Haitao Long
Jun Wang
4
+ Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks 2017 Chelsea Finn
Pieter Abbeel
Sergey Levine
4
+ Prioritized Experience Replay 2015 Tom Schaul
John Quan
Ioannis Antonoglou
David Silver
4
+ Parametrized Deep Q-Networks Learning: Reinforcement Learning with Discrete-Continuous Hybrid Action Space 2018 Jiechao Xiong
Qing Wang
Zhuoran Yang
Peng Sun
Lei Han
Yang Zheng
Haobo Fu
Tong Zhang
Ji Liu
Han Liu
4
+ The Option-Critic Architecture 2016 Pierre‐Luc Bacon
Jean Harb
Doina Precup
4