+
PDF
Chat
|
SpatialCoT: Advancing Spatial Reasoning through Coordinate Alignment and
Chain-of-Thought for Embodied Task Planning
|
2025
|
Yuecheng Liu
Dafeng Chi
Shiguang Wu
Zhanguang Zhang
Yaochen Hu
Lingfeng Zhang
Yingxue Zhang
Shuang Wu
Tongtong Cao
Guowei Huang
|
+
PDF
Chat
|
Path-of-Thoughts: Extracting and Following Paths for Robust Relational
Reasoning with Large Language Models
|
2024
|
Ge Zhang
Mohammad Ali Alomrani
Hongjian Gu
Jiaming Zhou
Yaochen Hu
Bin Wang
Qun Liu
Mark Coates
Yingxue Zhang
Jianye Hao
|
+
PDF
Chat
|
The Graph's Apprentice: Teaching an LLM Low Level Knowledge for Circuit
Quality Estimation
|
2024
|
Reza Moravej
S. K. Bodhe
Zhanguang Zhang
Didier Chételat
Dimitrios Tsaras
Yingxue Zhang
Hui‐Ling Zhen
Jianye Hao
Mingxuan Yuan
|
+
PDF
Chat
|
Lightweight Neural App Control
|
2024
|
Filippos Christianos
Georgios Papoudakis
Thomas Coste
Jianye Hao
Jun Wang
Kun Shao
|
+
PDF
Chat
|
SeaDAG: Semi-autoregressive Diffusion for Conditional Directed Acyclic
Graph Generation
|
2024
|
Xinyi Zhou
Xing Li
Yanqing Lian
Yiwen Wang
Jason Li Chen
Mingxuan Yuan
Jianye Hao
Guangyong Chen
Pheng‐Ann Heng
|
+
PDF
Chat
|
ET-Plan-Bench: Embodied Task-level Planning Benchmark Towards
Spatial-Temporal Cognition with Foundation Models
|
2024
|
Lingfeng Zhang
Yuening Wang
Hongjian Gu
Atia Hamidizadeh
Zhanguang Zhang
Yuecheng Liu
Yutong Wang
David Nogués‐Bravo
Junyi Dong
Shunbo Zhou
|
+
PDF
Chat
|
Enhancing Logical Reasoning in Large Language Models through Graph-based
Synthetic Data
|
2024
|
Jiaming Zhou
Abbas Ghaddar
Ge Zhang
Liheng Ma
Yaochen Hu
Soumyasundar Pal
Mark Coates
Bin Wang
Yingxue Zhang
Jianye Hao
|
+
PDF
Chat
|
MODULI: Unlocking Preference Generalization via Diffusion Models for
Offline Multi-Objective Reinforcement Learning
|
2024
|
Yifu Yuan
Zhenrui Zheng
Zibin Dong
Jianye Hao
|
+
PDF
Chat
|
GraSS: Combining Graph Neural Networks with Expert Knowledge for SAT Solver Selection
|
2024
|
Zhanguang Zhang
Didier Chételat
Joseph Cotnareanu
Amur Ghose
Wenyi Xiao
Hui‐Ling Zhen
Yingxue Zhang
Jianye Hao
Mark Coates
Mingxuan Yuan
|
+
PDF
Chat
|
Actra: Optimized Transformer Architecture for Vision-Language-Action
Models in Robot Learning
|
2024
|
Yueen Ma
Dafeng Chi
Shiguang Wu
Yuecheng Liu
Yuzheng Zhuang
Jianye Hao
Irwin King
|
+
PDF
Chat
|
CellAgent: An LLM-driven Multi-Agent Framework for Automated Single-cell
Data Analysis
|
2024
|
Yihang Xiao
Jinyi Liu
Yan Zheng
Xiaohan Xie
Jianye Hao
Mingzhi Li
Ruitao Wang
Fei Ni
Yuxiao Li
Jintian Luo
|
+
PDF
Chat
|
MFE-ETP: A Comprehensive Evaluation Benchmark for Multi-modal Foundation
Models on Embodied Task Planning
|
2024
|
Min Zhang
Jianye Hao
Xi′an Fu
Peilong Han
Hao Zhang
Lei Shi
Hongyao Tang
Yan Zheng
|
+
PDF
Chat
|
Benchmarking End-To-End Performance of AI-Based Chip Placement
Algorithms
|
2024
|
Zhihai Wang
Zijie Geng
Z. Tu
Jie Wang
Yong Qian
Z. Xu
Ziyan Liu
Siyuan Xu
Zhentao Tang
Shixiong Kai
|
+
PDF
Chat
|
ROS-LLM: A ROS framework for embodied AI with task feedback and
structured reasoning
|
2024
|
Christopher E. Mower
Yuhui Wan
Hongzhan Yu
Antoine Grosnit
Jonas Gonzalez-Billandon
Matthieu Zimmer
Jinlong Wang
Xinyu Zhang
Yao Zhao
Anbang Zhai
|
+
PDF
Chat
|
EWEK-QA: Enhanced Web and Efficient Knowledge Graph Retrieval for
Citation-based Question Answering Systems
|
2024
|
Mohammad Hossein Dehghan
Mohammad Ali Alomrani
Sunyam Bagga
David Alfonso-Hermelo
Khalil Bibi
Abbas Ghaddar
Yingxue Zhang
Xiaoguang Li
Jianye Hao
Qun Liu
|
+
PDF
Chat
|
CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models
in Decision Making
|
2024
|
Zibin Dong
Yifu Yuan
Jianye Hao
Fei Ni
Yi Ma
Pengyi Li
Yan Zheng
|
+
PDF
Chat
|
iVideoGPT: Interactive VideoGPTs are Scalable World Models
|
2024
|
Jialong Wu
Yin Shao-feng
Ningya Feng
Xu He
Dong Li
Jianye Hao
Mingsheng Long
|
+
PDF
Chat
|
A Survey on Vision-Language-Action Models for Embodied AI
|
2024
|
Yueen Ma
Zixing Song
Yuzheng Zhuang
Jianye Hao
Irwin King
|
+
PDF
Chat
|
GraSS: Combining Graph Neural Networks with Expert Knowledge for SAT
Solver Selection
|
2024
|
Zhanguang Zhang
Didier Chételat
Joseph Cotnareanu
Amur Ghose
Wenyi Xiao
Hui‐Ling Zhen
Yingxue Zhang
Jianye Hao
Mark Coates
Mingxuan Yuan
|
+
PDF
Chat
|
vMFER: Von Mises-Fisher Experience Resampling Based on Uncertainty of
Gradient Directions for Policy Improvement
|
2024
|
Yiwen Zhu
Jinyi Liu
Wenya Wei
Qianyi Fu
Yujing Hu
Fang Zhou
Bo An
Jianye Hao
Tangjie Lv
Changjie Fan
|
+
PDF
Chat
|
SheetAgent: A Generalist Agent for Spreadsheet Reasoning and
Manipulation via Large Language Models
|
2024
|
Yibin Chen
Yifu Yuan
Zeyu Zhang
Yan Zheng
Jinyi Liu
Fei Ni
Jianye Hao
|
+
PDF
Chat
|
Reinforced In-Context Black-Box Optimization
|
2024
|
Lei Song
Chenxiao Gao
Ke Xue
Chenyang Wu
Dong Li
Jianye Hao
Zongzhang Zhang
Chao Qian
|
+
PDF
Chat
|
Enhancing Robotic Manipulation with AI Feedback from Multimodal Large
Language Models
|
2024
|
Jinyi Liu
Yifu Yuan
Jianye Hao
Fei Ni
Lingzhi Fu
Yibin Chen
Zheng Yan
|
+
PDF
Chat
|
MENTOR: Guiding Hierarchical Reinforcement Learning with Human Feedback
and Dynamic Distance Constraint
|
2024
|
Xinglin Zhou
Yifu Yuan
Shaofu Yang
Jianye Hao
|
+
PDF
Chat
|
Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement
Learning with Diverse Human Feedback
|
2024
|
Yifu Yuan
Jianye Hao
Yi Ma
Zibin Dong
Hebin Liang
Jinyi Liu
Zhixin Feng
Kai Zhao
Yan Zheng
|
+
PDF
Chat
|
DiffuserLite: Towards Real-time Diffusion Planning
|
2024
|
Zibin Dong
Jianye Hao
Yifu Yuan
Fei Ni
Yitian Wang
Pengyi Li
Yan Zheng
|
+
|
Machine Learning Insides OptVerse AI Solver: Design Principles and Applications
|
2024
|
Xijun Li
Fangzhou Zhu
Hui‐Ling Zhen
Weilin Luo
Meng Lu
Yimin Huang
Zhenan Fan
Zirui Zhou
Yufei Kuang
Zhihai Wang
|
+
|
Bridging Evolutionary Algorithms and Reinforcement Learning: A Comprehensive Survey
|
2024
|
Pengyi Li
Jianye Hao
Hongyao Tang
Xi′an Fu
Yan Zheng
Ke Tang
|
+
|
Robust Multiobjective Reinforcement Learning Considering Environmental Uncertainties
|
2024
|
Xiangkun He
Jianye Hao
Xu Chen
Jun Wang
Xuewu Ji
Chen Lv
|
+
PDF
Chat
|
Bridging Evolutionary Algorithms and Reinforcement Learning: A Comprehensive Survey on Hybrid Algorithms
|
2024
|
P Li
Jianye Hao
Hongyao Tang
Xi′an Fu
Yan Zhen
Ke Tang
|
+
|
RITA: Boost Driving Simulators with Realistic Interactive Traffic Flow
|
2023
|
Zhengbang Zhu
Shenyu Zhang
Yuzheng Zhuang
Yuecheng Liu
Minghuan Liu
Ziqin Gong
Shixiong Kai
Qiang Gu
Bin Wang
Siyuan Cheng
|
+
PDF
Chat
|
Pessimistic value iteration for multi-task data sharing in Offline Reinforcement Learning
|
2023
|
Chenjia Bai
Lingxiao Wang
Jianye Hao
Zhuoran Yang
Bin Zhao
Zhen Wang
Xuelong Li
|
+
|
Generalized Universal Domain Adaptation with Generative Flow Networks
|
2023
|
Didi Zhu
Yinchuan Li
Yunfeng Shao
Jianye Hao
Fei Wu
Kun Kuang
Jun Xiao
Chao Wu
|
+
PDF
Chat
|
Traj-MAE: Masked Autoencoders for Trajectory Prediction
|
2023
|
Hao Chen
Jiaze Wang
Kun Shao
Furui Liu
Jianye Hao
Chenyong Guan
Guangyong Chen
Pheng‐Ann Heng
|
+
PDF
Chat
|
BiERL: A Meta Evolutionary Reinforcement Learning Framework via Bilevel Optimization
|
2023
|
Junyi Wang
Yuanyang Zhu
Zhi Wang
Yan Zheng
Jianye Hao
Chunlin Chen
|
+
|
Generative Flow Networks for Precise Reward-Oriented Active Learning on Graphs
|
2023
|
Yinchuan Li
Zhigang Li
Wenqian Li
Yunfeng Shao
Yan Zheng
Jianye Hao
|
+
|
Uncertainty-aware Consistency Learning for Cold-Start Item Recommendation
|
2023
|
Taichi Liu
Chen Gao
Zhenyu Wang
Dong Li
Jianye Hao
Depeng Jin
Yong Li
|
+
PDF
Chat
|
Structure Aware Incremental Learning with Personalized Imitation Weights for Recommender Systems
|
2023
|
Yuening Wang
Yingxue Zhang
Antonios Valkanas
Ruiming Tang
Chen Ma
Jianye Hao
Mark Coates
|
+
PDF
Chat
|
Neighbor Auto-Grouping Graph Neural Networks for Handover Parameter Configuration in Cellular Network
|
2023
|
Mehrtash Mehrabi
Walid Masoudimansour
Yingxue Zhang
Jie Chuai
Zhitang Chen
Mark Coates
Jianye Hao
Yanhui Geng
|
+
PDF
Chat
|
Models as Agents: Optimizing Multi-Step Predictions of Interactive Local Models in Model-Based Multi-Agent Reinforcement Learning
|
2023
|
Zifan Wu
Chao Yu
Chen Chen
Jianye Hao
Hankz Hankui Zhuo
|
+
PDF
Chat
|
Debiased Recommendation with User Feature Balancing
|
2023
|
Mengyue Yang
Guohao Cai
Furui Liu
Jiarui Jin
Zhenhua Dong
Xiuqiang He
Jianye Hao
Weiqi Shao
Jun Wang
Xu Chen
|
+
PDF
Chat
|
Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain
|
2023
|
Jianye Hao
Tianpei Yang
Hongyao Tang
Chenjia Bai
Jinyi Liu
Zhaopeng Meng
Peng Liu
Zhen Wang
|
+
|
Neighbor Auto-Grouping Graph Neural Networks for Handover Parameter Configuration in Cellular Network
|
2023
|
Mehrtash Mehrabi
Walid Masoudimansour
Yingxue Zhang
Jie Chuai
Zhitang Chen
Mark Coates
Jianye Hao
Yanhui Geng
|
+
|
Plan To Predict: Learning an Uncertainty-Foreseeing Model for Model-Based Reinforcement Learning
|
2023
|
Zifan Wu
Chao Yu
Chen Chen
Jianye Hao
Hankz Hankui Zhuo
|
+
|
Reweighted Interacting Langevin Diffusions: an Accelerated Sampling Methodfor Optimization
|
2023
|
Junlong Lyu
Zhitang Chen
Wenlong Lyu
Jianye Hao
|
+
|
Spectral Augmentations for Graph Contrastive Learning
|
2023
|
Amur Ghose
Yingxue Zhang
Jianye Hao
Mark Coates
|
+
|
The Ladder in Chaos: A Simple and Effective Improvement to General DRL Algorithms by Policy Path Trimming and Boosting
|
2023
|
Hongyao Tang
Min Zhang
Jianye Hao
|
+
|
CFlowNets: Continuous Control with Generative Flow Networks
|
2023
|
Yinchuan Li
Shuang Luo
Haozhi Wang
Jianye Hao
|
+
|
DAG Matters! GFlowNets Enhanced Explainer For Graph Neural Networks
|
2023
|
Wenqian Li
Yinchuan Li
Zhigang Li
Jianye Hao
Yan Pang
|
+
|
DR-Label: Improving GNN Models for Catalysis Systems by Label Deconstruction and Reconstruction
|
2023
|
Bowen Wang
Liang Chen
Jiaze Wang
Furui Liu
Shaogang Hao
Dong Li
Jianye Hao
Guangyong Chen
Xiaolong Zou
Pheng‐Ann Heng
|
+
|
Out-of-distribution Detection with Implicit Outlier Transformation
|
2023
|
Qizhou Wang
Junjie Ye
Feng Liu
Quanyu Dai
Marcus Kalander
Tongliang Liu
Jianye Hao
Bo Han
|
+
|
Traj-MAE: Masked Autoencoders for Trajectory Prediction
|
2023
|
Hao Chen
Jiaze Wang
Kun Shao
Furui Liu
Jianye Hao
Chenyong Guan
Guangyong Chen
Pheng‐Ann Heng
|
+
|
Models as Agents: Optimizing Multi-Step Predictions of Interactive Local Models in Model-Based Multi-Agent Reinforcement Learning
|
2023
|
Zifan Wu
Chao Yu
Chen Chen
Jianye Hao
Hankz Hankui Zhuo
|
+
|
Multi-agent Policy Reciprocity with Theoretical Guarantee
|
2023
|
Haozhi Wang
Yinchuan Li
Qing Wang
Yunfeng Shao
Jianye Hao
|
+
|
Generative Flow Networks for Precise Reward-Oriented Active Learning on Graphs
|
2023
|
Yinchuan Li
Zhigang Li
Wenqian Li
Yunfeng Shao
Zheng Yan
Jianye Hao
|
+
|
Structure Aware Incremental Learning with Personalized Imitation Weights for Recommender Systems
|
2023
|
Yuening Wang
Yingxue Zhang
Antonios Valkanas
Ruiming Tang
Chen Ma
Jianye Hao
Mark Coates
|
+
|
Generalized Universal Domain Adaptation with Generative Flow Networks
|
2023
|
Didi Zhu
Yinchuan Li
Yunfeng Shao
Jianye Hao
Fei Wu
Kun Kuang
Jun Xiao
Chao Wu
|
+
|
Learnable Behavior Control: Breaking Atari Human World Records via Sample-Efficient Behavior Selection
|
2023
|
Jiajun Fan
Yuzheng Zhuang
Yuecheng Liu
Jianye Hao
Bin Wang
Jiangcheng Zhu
Hao Wang
Shu‐Tao Xia
|
+
|
GFlowNets with Human Feedback
|
2023
|
Yinchuan Li
Shuang Luo
Yunfeng Shao
Jianye Hao
|
+
|
MetaDiffuser: Diffusion Model as Conditional Planner for Offline Meta-RL
|
2023
|
Fei Ni
Jianye Hao
Yao Mu
Yifu Yuan
Yan Zheng
Bin Wang
Zhixuan Liang
|
+
|
Hierarchical Task Network Planning for Facilitating Cooperative Multi-Agent Reinforcement Learning
|
2023
|
Xuechen Mu
Hankz Hankui Zhuo
Chen Chen
Kai Zhang
Chao Yu
Jianye Hao
|
+
|
ChiPFormer: Transferable Chip Placement via Offline Decision Transformer
|
2023
|
Yao Lai
Jinxin Liu
Zhentao Tang
Bin Wang
Jianye Hao
Ping Luo
|
+
|
Prioritized Trajectory Replay: A Replay Memory for Data-driven Reinforcement Learning
|
2023
|
Jinyi Liu
Yi Ma
Jianye Hao
Yujing Hu
Yan Zheng
Tangjie Lv
Changjie Fan
|
+
|
VOLTA: Diverse and Controllable Question-Answer Pair Generation with Variational Mutual Information Maximizing Autoencoder
|
2023
|
Yueen Ma
Dafeng Chi
Jingjing Li
Yuzheng Zhuang
Jianye Hao
Irwin King
|
+
|
Exploiting Counter-Examples for Active Learning with Partial labels
|
2023
|
Fei Zhang
Yunjie Ye
Lei Feng
Zhongwen Rao
Jieming Zhu
Marcus Kalander
Chen Gong
Jianye Hao
Bo Han
|
+
|
Uncertainty-aware Consistency Learning for Cold-Start Item Recommendation
|
2023
|
Taichi Liu
Chen Gao
Zhenyu Wang
Dong Li
Jianye Hao
Depeng Jin
Yong Li
|
+
|
BiERL: A Meta Evolutionary Reinforcement Learning Framework via Bilevel Optimization
|
2023
|
Junyi Wang
Yuanyang Zhu
Zhi Wang
Yan Zheng
Jianye Hao
Chunlin Chen
|
+
|
A Circuit Domain Generalization Framework for Efficient Logic Synthesis in Chip Design
|
2023
|
Zhihai Wang
Lei Chen
Jie Wang
Xing Li
Yinqi Bai
Xijun Li
Mingxuan Yuan
Jianye Hao
Yongdong Zhang
Feng Wu
|
+
|
AlignDiff: Aligning Diverse Human Preferences via Behavior-Customisable Diffusion Model
|
2023
|
Zibin Dong
Yifu Yuan
Jianye Hao
Fei Ni
Yao Mu
Zheng Yan
Yujing Hu
Tangjie Lv
Changjie Fan
Zhipeng Hu
|
+
|
Rethinking Decision Transformer via Hierarchical Reinforcement Learning
|
2023
|
Yi Ma
Chenjun Xiao
Hebin Liang
Jianye Hao
|
+
|
OVD-Explorer: Optimism Should Not Be the Sole Pursuit of Exploration in Noisy Environments
|
2023
|
Jinyi Liu
Zhi Wang
Yan Zheng
Jianye Hao
Chenjia Bai
Junjie Ye
Zhen Wang
Haiyin Piao
Yang Sun
|
+
|
Pangu-Agent: A Fine-Tunable Generalist Agent with Structured Reasoning
|
2023
|
Filippos Christianos
Georgios Papoudakis
Matthieu Zimmer
Thomas Coste
Zhihao Wu
Jingxuan Chen
Khyati Khandelwal
J. Doran
Xidong Feng
Jiacheng Liu
|
+
PDF
Chat
|
Contrastive-ACE: Domain Generalization Through Alignment of Causal Mechanisms
|
2022
|
Yunqi Wang
Furui Liu
Zhitang Chen
Yik‐Chung Wu
Jianye Hao
Guangyong Chen
Pheng‐Ann Heng
|
+
PDF
Chat
|
Coach-assisted multi-agent reinforcement learning framework for unexpected crashed agents
|
2022
|
Jian Zhao
Youpeng Zhao
Weixun Wang
Mingyu Yang
Xunhan Hu
Wengang Zhou
Jianye Hao
Houqiang Li
|
+
PDF
Chat
|
Empirical Policy Optimization for <i>n</i>-Player Markov Games
|
2022
|
Yuanheng Zhu
Weifan Li
Mengchen Zhao
Jianye Hao
Dongbin Zhao
|
+
PDF
Chat
|
PMIC: Improving Multi-Agent Reinforcement Learning with Progressive
Mutual Information Collaboration
|
2022
|
Pengyi Li
Hongyao Tang
Tianpei Yang
Xiaotian Hao
Tong Sang
Zheng Yan
Jianye Hao
Matthew E. Taylor
Wenyuan Tao
Zhen Wang
|
+
PDF
Chat
|
Modeling Scale-free Graphs with Hyperbolic Geometry for Knowledge-aware Recommendation
|
2022
|
Yankai Chen
Meng‐Lin Yang
Yingxue Zhang
Mengchen Zhao
Ziqiao Meng
Jianye Hao
Irwin King
|
+
PDF
Chat
|
SEIHAI: A Sample-Efficient Hierarchical AI for the MineRL Competition
|
2022
|
Hangyu Mao
Chao Wang
Xiaotian Hao
Yihuan Mao
Yiming Lu
Chengjie Wu
Jianye Hao
Dong Li
Pingzhong Tang
|
+
PDF
Chat
|
Uncertainty-Aware Low-Rank Q-Matrix Estimation for Deep Reinforcement Learning
|
2022
|
Tong Sang
Hongyao Tang
Jianye Hao
Yan Zheng
Zhaopeng Meng
|
+
|
Breaking the Curse of Dimensionality in Multiagent State Space: A Unified Agent Permutation Framework
|
2022
|
Xiaotian Hao
Weixun Wang
Hangyu Mao
Yaodong Yang
Dong Li
Yan Zheng
Zhen Wang
Jianye Hao
|
+
|
Coach-assisted Multi-Agent Reinforcement Learning Framework for Unexpected Crashed Agents
|
2022
|
Jian Zhao
Youpeng Zhao
Weixun Wang
Mingyu Yang
Xunhan Hu
Wengang Zhou
Jianye Hao
Houqiang Li
|
+
|
Debiased Recommendation with User Feature Balancing
|
2022
|
Mengyue Yang
Guohao Cai
Furui Liu
Zhenhua Dong
Xiuqiang He
Jianye Hao
Jun Wang
Xu Chen
|
+
|
Generalizable Information Theoretic Causal Representation
|
2022
|
Mengyue Yang
Xinyu Cai
Furui Liu
Xu Chen
Zhitang Chen
Jianye Hao
Jun Wang
|
+
|
Introduction to The Dynamic Pickup and Delivery Problem Benchmark -- ICAPS 2021 Competition
|
2022
|
Jianye Hao
Jiawen Lu
Xijun Li
Xialiang Tong
Xiang Xiang
Mingxuan Yuan
Hankz Hankui Zhuo
|
+
|
LHNN: Lattice Hypergraph Neural Network for VLSI Congestion Prediction
|
2022
|
Bowen Wang
Guibao Shen
Dong Li
Jianye Hao
Wulong Liu
Yu Huang
Hongzhong Wu
Yibo Lin
Guangyong Chen
Pheng‐Ann Heng
|
+
|
Plan Your Target and Learn Your Skills: Transferable State-Only Imitation Learning via Decoupled Policy Optimization
|
2022
|
Minghuan Liu
Zhengbang Zhu
Yuzheng Zhuang
Weinan Zhang
Jianye Hao
Yong Yu
Jun Wang
|
+
|
Revisiting QMIX: Discriminative Credit Assignment by Gradient Entropy Regularization
|
2022
|
Jian Zhao
Yue Zhang
Xunhan Hu
Weixun Wang
Wengang Zhou
Jianye Hao
Jiangcheng Zhu
Houqiang Li
|
+
|
A Benchmark for Automatic Medical Consultation System: Frameworks, Tasks and Datasets
|
2022
|
Wei Chen
Zhiwei Li
Hongyi Fang
Qianyuan Yao
Cheng Zhong
Jianye Hao
Qi Zhang
Xuanjing Huang
Jinye Peng
Zhongyu Wei
|
+
|
PMIC: Improving Multi-Agent Reinforcement Learning with Progressive Mutual Information Collaboration
|
2022
|
Pengyi Li
Hongyao Tang
Tianpei Yang
Xiaotian Hao
Tong Sang
Zheng Yan
Jianye Hao
Matthew E. Taylor
Zhen Wang
|
+
|
Off-Beat Multi-Agent Reinforcement Learning
|
2022
|
Wei Qiu
Weixun Wang
Rundong Wang
Bo An
Yujing Hu
Svetlana Obraztsova
Zinovi Rabinovich
Jianye Hao
Yingfeng Chen
Changjie Fan
|
+
|
GALOIS: Boosting Deep Reinforcement Learning via Generalizable Logic Synthesis
|
2022
|
Yushi Cao
Zhiming Li
Tianpei Yang
Hao Zhang
Yan Zheng
Yi Li
Jianye Hao
Yang Liu
|
+
|
Branch Ranking for Efficient Mixed-Integer Programming via Offline Ranking-based Policy Learning
|
2022
|
Zeren Huang
Wenhao Chen
Weinan Zhang
Chuhan Shi
Furui Liu
Hui‐Ling Zhen
Mingxuan Yuan
Jianye Hao
Yong Yu
Jun Wang
|
+
|
A Graph-Enhanced Click Model for Web Search
|
2022
|
Jianghao Lin
Weiwen Liu
Xinyi Dai
Weinan Zhang
Shuai Li
Ruiming Tang
Xiuqiang He
Jianye Hao
Yong Yu
|
+
|
Towards A Unified Policy Abstraction Theory and Representation Learning Approach in Markov Decision Processes
|
2022
|
Min Zhang
Hongyao Tang
Jianye Hao
Zheng Yan
|
+
|
On the Convergence Theory of Meta Reinforcement Learning with Personalized Policies
|
2022
|
Haozhi Wang
Qing Wang
Yunfeng Shao
Dong Li
Jianye Hao
Yinchuan Li
|
+
|
EUCLID: Towards Efficient Unsupervised Reinforcement Learning with Multi-choice Dynamics Model
|
2022
|
Yifu Yuan
Jianye Hao
Fei Ni
Yao Mu
Yan Zheng
Yujing Hu
Jinyi Liu
Yingfeng Chen
Changjie Fan
|
+
|
Decomposed Mutual Information Optimization for Generalized Context in Meta-Reinforcement Learning
|
2022
|
Yao Mu
Yuzheng Zhuang
Fei Ni
Bin Wang
Jianyu Chen
Jianye Hao
Ping Luo
|
+
|
PTDE: Personalized Training with Distillated Execution for Multi-Agent Reinforcement Learning
|
2022
|
Yiqun Chen
Hangyu Mao
Tianle Zhang
Shiguang Wu
Bin Zhang
Jianye Hao
Dong Li
Bin Wang
Hongxing Chang
|
+
|
GFlowCausal: Generative Flow Networks for Causal Discovery
|
2022
|
Wenqian Li
Yinchuan Li
Shengyu Zhu
Yunfeng Shao
Jianye Hao
Yan Pang
|
+
|
ERL-Re$^2$: Efficient Evolutionary Reinforcement Learning with Shared State Representation and Individual Policy Representation
|
2022
|
Pengyi Li
Hongyao Tang
Jianye Hao
Yan Zheng
Xi′an Fu
Zhaopeng Meng
|
+
|
PAnDR: Fast Adaptation to New Environments from Offline Experiences via Decoupling Policy and Environment Representations
|
2022
|
Tong Sang
Hongyao Tang
Yi Ma
Jianye Hao
Yan Zheng
Zhaopeng Meng
Boyan Li
Zhen Wang
|
+
|
Prototypical context-aware dynamics generalization for high-dimensional model-based reinforcement learning
|
2022
|
Junjie Wang
Yao Mu
Dong Li
Qichao Zhang
Dongbin Zhao
Yuzheng Zhuang
Ping Luo
Bin Wang
Jianye Hao
|
+
|
State-Aware Proximal Pessimistic Algorithms for Offline Reinforcement Learning
|
2022
|
Chen Chen
Hongyao Tang
Yi Ma
Chao Wang
Qianli Shen
Dong Li
Jianye Hao
|
+
|
Planning Immediate Landmarks of Targets for Model-Free Skill Transfer across Agents
|
2022
|
Minghuan Liu
Zhengbang Zhu
Menghui Zhu
Yuzheng Zhuang
Weinan Zhang
Jianye Hao
|
+
|
Transformer in Transformer as Backbone for Deep Reinforcement Learning
|
2022
|
Hangyu Mao
Rui Zhao
Hao Chen
Jianye Hao
Yiqun Chen
Dong Li
Junge Zhang
Zhen Xiao
|
+
PDF
Chat
|
ED2: An Environment Dynamics Decomposition Framework for World Model
Construction
|
2021
|
Cong Wang
Tianpei Yang
Jianye Hao
Yan Zheng
Hongyao Tang
Fazl Barez
Jinyi Liu
Jiajie Peng
Haiyin Piao
Zhixiao Sun
|
+
|
Learning State Representations via Retracing in Reinforcement Learning.
|
2021
|
Changmin Yu
Dong Li
Jianye Hao
Jun Wang
Neil Burgess
|
+
PDF
Chat
|
Dynamic Bottleneck for Robust Self-Supervised Exploration
|
2021
|
Chenjia Bai
Lingxiao Wang
Lei Han
Animesh Garg
Jianye Hao
Peng Liu
Zhaoran Wang
|
+
PDF
Chat
|
Flattening Sharpness for Dynamic Gradient Projection Memory Benefits
Continual Learning
|
2021
|
Danruo Deng
Guangyong Chen
Jianye Hao
Qiong Wang
Pheng‐Ann Heng
|
+
PDF
Chat
|
Learning to select cuts for efficient mixed-integer programming
|
2021
|
Zeren Huang
Kerong Wang
Furui Liu
Hui‐Ling Zhen
Weinan Zhang
Mingxuan Yuan
Jianye Hao
Yong Yu
Jun Wang
|
+
|
HyAR: Addressing Discrete-Continuous Action Reinforcement Learning via Hybrid Action Representation.
|
2021
|
Boyan Li
Hongyao Tang
Yan Zheng
Jianye Hao
Pengyi Li
Zhen Wang
Zhaopeng Meng
Li Wang
|
+
|
Ordering-Based Causal Discovery with Reinforcement Learning
|
2021
|
Xiaoqiang Wang
Yali Du
Shengyu Zhu
Liangjun Ke
Zhitang Chen
Jianye Hao
Jun Wang
|
+
PDF
Chat
|
A Graph-Enhanced Click Model for Web Search
|
2021
|
Jianghao Lin
Weiwen Liu
Xinyi Dai
Weinan Zhang
Shuai Li
Ruiming Tang
Xiuqiang He
Jianye Hao
Yong Yu
|
+
PDF
Chat
|
Critic PI2: Master Continuous Planning via Policy Improvement with Path Integrals and Deep Actor-Critic Reinforcement Learning
|
2021
|
He Ba
Jiajun Fan
Xian Guo
Jianye Hao
|
+
PDF
Chat
|
Foresee then Evaluate: Decomposing Value Estimation with Latent Future Prediction
|
2021
|
Hongyao Tang
Zhaopeng Meng
Guangyong Chen
Pengfei Chen
Chen Chen
Yaodong Yang
Luo Zhang
Wulong Liu
Jianye Hao
|
+
PDF
Chat
|
Addressing Action Oscillations through Learning Policy Inertia
|
2021
|
Chen Chen
Hongyao Tang
Jianye Hao
Wulong Liu
Zhaopeng Meng
|
+
PDF
Chat
|
Towards Effective Context for Meta-Reinforcement Learning: an Approach based on Contrastive Learning
|
2021
|
Haotian Fu
Hongyao Tang
Jianye Hao
Chen Chen
Xidong Feng
Li Dong
Wulong Liu
|
+
|
Principled Exploration via Optimistic Bootstrapping and Backward Induction
|
2021
|
Chenjia Bai
Lingxiao Wang
Lei Han
Jianye Hao
Animesh Garg
Peng Liu
Zhaoran Wang
|
+
|
Automatic Web Testing Using Curiosity-Driven Reinforcement Learning
|
2021
|
Yan Zheng
Yi Liu
Xiaofei Xie
Yepang Liu
Lei Ma
Jianye Hao
Yang Liu
|
+
PDF
Chat
|
An Adversarial Imitation Click Model for Information Retrieval
|
2021
|
Xinyi Dai
Jianghao Lin
Weinan Zhang
Shuai Li
Weiwen Liu
Ruiming Tang
Xiuqiang He
Jianye Hao
Jun Wang
Yong Yu
|
+
PDF
Chat
|
Addressing Action Oscillations through Learning Policy Inertia
|
2021
|
Chen Chen
Hongyao Tang
Jianye Hao
Wulong Liu
Zhaopeng Meng
|
+
|
Ordering-Based Causal Discovery with Reinforcement Learning
|
2021
|
Xiaoqiang Wang
Yali Du
Shengyu Zhu
Liangjun Ke
Zhitang Chen
Jianye Hao
Jun Wang
|
+
|
Differentiable Logic Machines
|
2021
|
Matthieu Zimmer
Xuening Feng
Claire Glanois
Zhaohui Jiang
Jianyi Zhang
Paul Weng
Jianye Hao
Dong Li
Wulong Liu
|
+
|
Automatic Web Testing using Curiosity-Driven Reinforcement Learning
|
2021
|
Yan Zheng
Yi Liu
Xiaofei Xie
Yepang Liu
Lei Ma
Jianye Hao
Yang Liu
|
+
|
Learning Symbolic Rules for Interpretable Deep Reinforcement Learning
|
2021
|
Zhihao Ma
Yuzheng Zhuang
Paul Weng
Hankz Hankui Zhuo
Dong Li
Wulong Liu
Jianye Hao
|
+
|
Contrastive ACE: Domain Generalization Through Alignment of Causal Mechanisms
|
2021
|
Yunqi Wang
Furui Liu
Zhitang Chen
Qing Lian
Shoubo Hu
Jianye Hao
Yik‐Chung Wu
|
+
|
Principled Exploration via Optimistic Bootstrapping and Backward Induction
|
2021
|
Chenjia Bai
Lingxiao Wang
Lei Han
Jianye Hao
Animesh Garg
Peng Liu
Zhaoran Wang
|
+
|
Cooperative Multi-Agent Transfer Learning with Level-Adaptive Credit Assignment
|
2021
|
Tianze Zhou
Fubiao Zhang
Kun Shao
Kai Li
Wenhan Huang
Jun Luo
Weixun Wang
Yaodong Yang
Hangyu Mao
Bin Wang
|
+
|
Addressing Action Oscillations through Learning Policy Inertia
|
2021
|
Chen Chen
Hongyao Tang
Jianye Hao
Wulong Liu
Zhaopeng Meng
|
+
|
Foresee then Evaluate: Decomposing Value Estimation with Latent Future Prediction
|
2021
|
Hongyao Tang
Zhaopeng Meng
Guangyong Chen
Pengfei Chen
Chen Chen
Yaodong Yang
Luo Zhang
Wulong Liu
Jianye Hao
|
+
|
CMML: Contextual Modulation Meta Learning for Cold-Start Recommendation
|
2021
|
Xidong Feng
Chen Chen
Dong Li
Mengchen Zhao
Jianye Hao
Jun Wang
|
+
|
Learning to Select Cuts for Efficient Mixed-Integer Programming
|
2021
|
Zeren Huang
Kerong Wang
Furui Liu
Hui‐Ling Zhen
Weinan Zhang
Mingxuan Yuan
Jianye Hao
Yong Yu
Jun Wang
|
+
|
Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain
|
2021
|
Tianpei Yang
Hongyao Tang
Chenjia Bai
Jinyi Liu
Jianye Hao
Zhaopeng Meng
Peng Liu
|
+
|
Ranking Cost: Building An Efficient and Scalable Circuit Routing Planner with Evolution-Based Optimization
|
2021
|
Shiyu Huang
Bin Wang
Dong Li
Jianye Hao
Ting Chen
Jun Zhu
|
+
|
Lifelong Reinforcement Learning with Temporal Logic Formulas and Reward Machines
|
2021
|
Xuejing Zheng
Chao Yu
Chen Chen
Jianye Hao
Hankz Hankui Zhuo
|
+
|
SEIHAI: A Sample-efficient Hierarchical AI for the MineRL Competition
|
2021
|
Hangyu Mao
Chao Wang
Xiaotian Hao
Yihuan Mao
Yiming Lu
Chengjie Wu
Jianye Hao
Dong Li
Pingzhong Tang
|
+
|
Flattening Sharpness for Dynamic Gradient Projection Memory Benefits Continual Learning
|
2021
|
Danruo Deng
Guangyong Chen
Jianye Hao
Qiong Wang
Pheng‐Ann Heng
|
+
|
Dynamic Bottleneck for Robust Self-Supervised Exploration
|
2021
|
Chenjia Bai
Lingxiao Wang
Lei Han
Animesh Garg
Jianye Hao
Peng Liu
Zhaoran Wang
|
+
|
Uncertainty-aware Low-Rank Q-Matrix Estimation for Deep Reinforcement Learning
|
2021
|
Tong Sang
Hongyao Tang
Jianye Hao
Yan Zheng
Zhaopeng Meng
|
+
|
ED2: An Environment Dynamics Decomposition Framework for World Model Construction
|
2021
|
Cong Wang
Tianpei Yang
Jianye Hao
Yan Zheng
Hongyao Tang
Fazl Barez
Jinyi Liu
Jiajie Peng
Haiyin Piao
Zhixiao Sun
|
+
|
A Survey on Interpretable Reinforcement Learning
|
2021
|
Claire Glanois
Paul Weng
Matthieu Zimmer
Dong Li
Tianpei Yang
Jianye Hao
Wulong Liu
|
+
|
Learning State Representations via Retracing in Reinforcement Learning
|
2021
|
Changmin Yu
Dong Li
Jianye Hao
Jun Wang
Neil Burgess
|
+
|
HyAR: Addressing Discrete-Continuous Action Reinforcement Learning via Hybrid Action Representation
|
2021
|
Boyan Li
Hongyao Tang
Yan Zheng
Jianye Hao
Pengyi Li
Zhen Wang
Zhaopeng Meng
Li Wang
|
+
|
Modeling Scale-free Graphs with Hyperbolic Geometry for Knowledge-aware Recommendation
|
2021
|
Yankai Chen
Meng‐Lin Yang
Yingxue Zhang
Mengchen Zhao
Ziqiao Meng
Jianye Hao
Irwin King
|
+
|
An Empirical Study of Assumptions in Bayesian Optimisation
|
2020
|
Alexander I. Cowen-Rivers
Wenlong Lyu
Rasul Tutunov
Zhi Wang
Antoine Grosnit
Ryan‐Rhys Griffiths
Jianye Hao
Jun Wang
Jan Peters
Haitham Bou Ammar
|
+
|
Represent Your Own Policies: Reinforcement Learning with Policy-extended Value Function Approximator
|
2020
|
Hongyao Tang
Zhaopeng Meng
Jianye Hao
Chen Chen
Daniel Graves
Dong Li
Hangyu Mao
Wulong Liu
Yaodong Yang
Changmin Yu
|
+
|
KoGuN: Accelerating Deep Reinforcement Learning via Integrating Human Suboptimal Knowledge
|
2020
|
Peng Zhang
Jianye Hao
Weixun Wang
Hongyao Tang
Yi Ma
Yihai Duan
Yan Zheng
|
+
|
Efficient Deep Reinforcement Learning via Adaptive Policy Transfer
|
2020
|
Tianpei Yang
Jianye Hao
Zhaopeng Meng
Zongzhang Zhang
Yujing Hu
Yingfeng Chen
Changjie Fan
Weixun Wang
Wulong Liu
Zhaodong Wang
|
+
|
Learning to Accelerate Heuristic Searching for Large-Scale Maximum Weighted b-Matching Problems in Online Advertising
|
2020
|
Xiaotian Hao
Junqi Jin
Jianye Hao
Jin Li
Weixun Wang
Yi Ma
Zhenzhe Zheng
Han Li
Jian Xu
Kun Gai
|
+
|
Triple-GAIL: A Multi-Modal Imitation Learning Framework with Generative Adversarial Nets
|
2020
|
Cong Fei
Bin Wang
Yuzheng Zhuang
Zongzhang Zhang
Jianye Hao
Hongbo Zhang
Xuewu Ji
Wulong Liu
|
+
|
Dynamic Knapsack Optimization Towards Efficient Multi-Channel Sequential Advertising
|
2020
|
Xiaotian Hao
Zhaoqing Peng
Yi Ma
Guan Wang
Junqi Jin
Jianye Hao
Shan Chen
Rongquan Bai
Mingzhou Xie
Miao Xu
|
+
|
CausalVAE: Disentangled Representation Learning via Neural Structural Causal Models
|
2020
|
Mengyue Yang
Furui Liu
Zhitang Chen
Xinwei Shen
Jianye Hao
Jun Wang
|
+
PDF
Chat
|
Neighborhood Cognition Consistent Multi-Agent Reinforcement Learning
|
2020
|
Hangyu Mao
Wulong Liu
Jianye Hao
Jun Luo
Dong Li
Zhengchao Zhang
Jun Wang
Zhen Xiao
|
+
PDF
Chat
|
From Few to More: Large-Scale Dynamic Multiagent Curriculum Learning
|
2020
|
Weixun Wang
Tianpei Yang
Yong Liu
Jianye Hao
Xiaotian Hao
Yujing Hu
Yingfeng Chen
Changjie Fan
Yang Gao
|
+
PDF
Chat
|
Multi-Agent Game Abstraction via Graph Attention Neural Network
|
2020
|
Yong Liu
Weixun Wang
Yujing Hu
Jianye Hao
Xingguo Chen
Yang Gao
|
+
PDF
Chat
|
Continuous Multiagent Control Using Collective Behavior Entropy for Large-Scale Home Energy Management
|
2020
|
Jianwen Sun
Yan Zheng
Jianye Hao
Zhaopeng Meng
Yang Liu
|
+
|
Efficient Deep Reinforcement Learning through Policy Transfer
|
2020
|
Tianpei Yang
Jianye Hao
Zhaopeng Meng
Zongzhang Zhang
Weixun Wang
Yujing Hu
Yingfeng Cheng
Changjie Fan
Zhaodong Wang
Jiajie Peng
|
+
PDF
Chat
|
An Efficient Transfer Learning Framework for Multiagent Reinforcement
Learning
|
2020
|
Tianpei Yang
Weixun Wang
Hongyao Tang
Jianye Hao
Zhaopeng Meng
Hangyu Mao
Dong Li
Wulong Liu
Chengwei Zhang
Yujing Hu
|
+
PDF
Chat
|
Falsification of Cyber-Physical Systems Using Deep Reinforcement Learning
|
2020
|
Yoriyuki Yamagata
Shuang Liu
Takumi Akazaki
Yihai Duan
Jianye Hao
|
+
|
Qatten: A General Framework for Cooperative Multiagent Reinforcement Learning
|
2020
|
Yaodong Yang
Jianye Hao
Ben Liao
Kun Shao
Guangyong Chen
Wulong Liu
Hongyao Tang
|
+
|
Q-value Path Decomposition for Deep Multiagent Reinforcement Learning
|
2020
|
Yaodong Yang
Jianye Hao
Guangyong Chen
Hongyao Tang
Yingfeng Chen
Yujing Hu
Changjie Fan
Zhongyu Wei
|
+
|
KoGuN: Accelerating Deep Reinforcement Learning via Integrating Human Suboptimal Knowledge
|
2020
|
Peng Zhang
Jianye Hao
Weixun Wang
Hongyao Tang
Yi Ma
Yihai Duan
Yan Zheng
|
+
|
CausalVAE: Structured Causal Disentanglement in Variational Autoencoder
|
2020
|
Mengyue Yang
Furui Liu
Zhitang Chen
Xinwei Shen
Jianye Hao
Jun Wang
|
+
|
Learning to Accelerate Heuristic Searching for Large-Scale Maximum Weighted b-Matching Problems in Online Advertising
|
2020
|
Xiaotian Hao
Junqi Jin
Jianye Hao
Li Jin
Weixun Wang
Yi Ma
Zhenzhe Zheng
Han Li
Jian Xu
Kun Gai
|
+
|
Efficient Deep Reinforcement Learning via Adaptive Policy Transfer
|
2020
|
Tianpei Yang
Jianye Hao
Zhaopeng Meng
Zongzhang Zhang
Yujing Hu
Yingfeng Cheng
Changjie Fan
Weixun Wang
Wulong Liu
Zhaodong Wang
|
+
|
Continuous Multiagent Control using Collective Behavior Entropy for Large-Scale Home Energy Management
|
2020
|
Jianwen Sun
Yan Zheng
Jianye Hao
Zhaopeng Meng
Yang Liu
|
+
|
Triple-GAIL: A Multi-Modal Imitation Learning Framework with Generative Adversarial Nets
|
2020
|
Cong Fei
Bin Wang
Yuzheng Zhuang
Zongzhang Zhang
Jianye Hao
Hongbo Zhang
Xuewu Ji
Wulong Liu
|
+
|
Dynamic Knapsack Optimization Towards Efficient Multi-Channel Sequential Advertising
|
2020
|
Xiaotian Hao
Zhaoqing Peng
Yi Ma
Guan Wang
Junqi Jin
Jianye Hao
Shan Chen
Rongquan Bai
Mingzhou Xie
Miao Xu
|
+
|
Dynamic Horizon Value Estimation for Model-based Reinforcement Learning
|
2020
|
Junjie Wang
Qichao Zhang
Dongbin Zhao
Mengchen Zhao
Jianye Hao
|
+
|
Towards Effective Context for Meta-Reinforcement Learning: an Approach based on Contrastive Learning
|
2020
|
Haotian Fu
Hongyao Tang
Jianye Hao
Chen Chen
Xidong Feng
Li Dong
Wulong Liu
|
+
|
Event-Triggered Multi-agent Reinforcement Learning with Communication under Limited-bandwidth Constraint
|
2020
|
Guangzheng Hu
Yuanheng Zhu
Dongbin Zhao
Mengchen Zhao
Jianye Hao
|
+
|
SMARTS: Scalable Multi-Agent Reinforcement Learning Training School for Autonomous Driving
|
2020
|
Ming Zhou
Jun Luo
Julian Villela
Yaodong Yang
David Rusu
Jiayu Miao
Weinan Zhang
Montgomery Alban
Iman Fadakar
Zheng Chen
|
+
|
Learning to Utilize Shaping Rewards: A New Approach of Reward Shaping
|
2020
|
Yujing Hu
Weixun Wang
Hangtian Jia
Wang Yi-xiang
Yingfeng Chen
Jianye Hao
Feng Wu
Changjie Fan
|
+
|
Critic PI2: Master Continuous Planning via Policy Improvement with Path Integrals and Deep Actor-Critic Reinforcement Learning
|
2020
|
Jiajun Fan
He Ba
Xian Guo
Jianye Hao
|
+
|
Learning to Utilize Shaping Rewards: A New Approach of Reward Shaping
|
2020
|
Yujing Hu
Weixun Wang
Hangtian Jia
Yixiang Wang
Yingfeng Chen
Jianye Hao
Feng Wu
Changjie Fan
|
+
PDF
Chat
|
MGHRL: Meta Goal-Generation for Hierarchical Reinforcement Learning
|
2020
|
Haotian Fu
Hongyao Tang
Jianye Hao
Wulong Liu
Chen Chen
|
+
|
An Efficient Transfer Learning Framework for Multiagent Reinforcement Learning
|
2020
|
Tianpei Yang
Weixun Wang
Hongyao Tang
Jianye Hao
Zhaopeng Meng
Hangyu Mao
Dong Li
Wulong Liu
Yingfeng Chen
Yujing Hu
|
+
|
What About Inputing Policy in Value Function: Policy Representation and Policy-extended Value Function Approximator
|
2020
|
Hongyao Tang
Zhaopeng Meng
Jianye Hao
Chen Chen
Daniel Graves
Dong Li
Changmin Yu
Hangyu Mao
Wulong Liu
Yaodong Yang
|
+
|
HEBO Pushing The Limits of Sample-Efficient Hyperparameter Optimisation
|
2020
|
Alexander I. Cowen-Rivers
Wenlong Lyu
Rasul Tutunov
Zhi Wang
Antoine Grosnit
Ryan‐Rhys Griffiths
Alexandre Max Maraval
Jianye Hao
Jun Wang
Jan Peters
|
+
PDF
Chat
|
There is Limited Correlation between Coverage and Robustness for Deep
Neural Networks
|
2019
|
Yizhen Dong
Peixin Zhang
Jingyi Wang
Shuang Liu
Jun Sun
Jianye Hao
Xinyu Wang
Li Wang
Jin Song Dong
Ting Dai
|
+
PDF
Chat
|
Learning Adaptive Display Exposure for Real-Time Advertising
|
2019
|
Weixun Wang
Junqi Jin
Jianye Hao
Chunjie Chen
Chuan Yu
Weinan Zhang
Jun Wang
Xiaotian Hao
Yixi Wang
Han Li
|
+
PDF
Chat
|
Using deep reinforcement learning to speed up collective cell migration
|
2019
|
Hanxu Hou
Tian Gan
Yaodong Yang
Xianglei Zhu
Sen Liu
Weiming Guo
Jianye Hao
|
+
|
Efficient meta reinforcement learning via meta goal generation
|
2019
|
Haotian Fu
Hongyao Tang
Jianye Hao
|
+
|
Independent Generative Adversarial Self-Imitation Learning in Cooperative Multiagent Systems.
|
2019
|
Xiaotian Hao
Weixun Wang
Jianye Hao
Yaodong Yang
|
+
|
Towards Efficient Detection and Optimal Response against Sophisticated Opponents
|
2019
|
Tianpei Yang
Jianye Hao
Zhaopeng Meng
Chongjie Zhang
Yan Zheng
Ze Zheng
|
+
|
Deep Multi-Agent Reinforcement Learning with Discrete-Continuous Hybrid Action Spaces
|
2019
|
Haotian Fu
Hongyao Tang
Jianye Hao
Zihan Lei
Yingfeng Chen
Changjie Fan
|
+
|
Action Semantics Network: Considering the Effects of Actions in Multiagent Systems
|
2019
|
Weixun Wang
Tianpei Yang
Yong Liu
Jianye Hao
Xiaotian Hao
Yujing Hu
Yingfeng Chen
Changjie Fan
Yang Gao
|
+
PDF
Chat
|
SA-IGA: a multiagent reinforcement learning method towards socially optimal outcomes
|
2019
|
Chengwei Zhang
Xiaohong Li
Jianye Hao
Siqi Chen
Karl Tuyls
Wanli Xue
Zhiyong Feng
|
+
|
Deep Multi-Agent Reinforcement Learning with Discrete-Continuous Hybrid Action Spaces
|
2019
|
Haotian Fu
Hongyao Tang
Jianye Hao
Zihan Lei
Yingfeng Chen
Changjie Fan
|
+
|
Disentangling Dynamics and Returns: Value Function Decomposition with Future Prediction
|
2019
|
Hongyao Tang
Jianye Hao
Guangyong Chen
Pengfei Chen
Zhaopeng Meng
Yaodong Yang
Li Wang
|
+
|
Independent Generative Adversarial Self-Imitation Learning in Cooperative Multiagent Systems
|
2019
|
Xiaotian Hao
Weixun Wang
Jianye Hao
Yaodong Yang
|
+
|
Spectral-based Graph Convolutional Network for Directed Graphs
|
2019
|
Yi Ma
Jianye Hao
Yaodong Yang
Han Li
Junqi Jin
Guangyong Chen
|
+
|
From Few to More: Large-scale Dynamic Multiagent Curriculum Learning
|
2019
|
Weixun Wang
Tianpei Yang
Yong Liu
Jianye Hao
Xiaotian Hao
Yujing Hu
Yingfeng Chen
Changjie Fan
Yang Gao
|
+
|
Diverse Behavior Is What Game AI Needs: Generating Varied Human-Like Playing Styles Using Evolutionary Multi-Objective Deep Reinforcement Learning
|
2019
|
Ruimin Shen
Yan Zheng
Jianye Hao
Yinfeng Chen
Changjie Fan
|
+
|
There is Limited Correlation between Coverage and Robustness for Deep Neural Networks
|
2019
|
Yizhen Dong
Peixin Zhang
Jingyi Wang
Shuang Liu
Jun Sun
Jianye Hao
Xinyu Wang
Li Wang
Jin Song Dong
Ting Dai
|
+
|
Multi-Agent Game Abstraction via Graph Attention Neural Network
|
2019
|
Yong Liu
Weixun Wang
Yujing Hu
Jianye Hao
Xingguo Chen
Yang Gao
|
+
|
Neighborhood Cognition Consistent Multi-Agent Reinforcement Learning
|
2019
|
Hangyu Mao
Wulong Liu
Jianye Hao
Jun Luo
Li Dong
Zhengchao Zhang
Jun Wang
Zhen Xiao
|
+
|
MGHRL: Meta Goal-generation for Hierarchical Reinforcement Learning
|
2019
|
Haotian Fu
Hongyao Tang
Jianye Hao
Wulong Liu
Chen Chen
|
+
|
Action Semantics Network: Considering the Effects of Actions in Multiagent Systems
|
2019
|
Weixun Wang
Tianpei Yang
Yong Liu
Jianye Hao
Xiaotian Hao
Yujing Hu
Yingfeng Chen
Changjie Fan
Yang Gao
|
+
|
Hierarchical Deep Multiagent Reinforcement Learning
|
2018
|
Hongyao Tang
Jianye Hao
Tangjie Lv
Yingfeng Chen
Zongzhang Zhang
Hangtian Jia
Chunxu Ren
Yan Zheng
Changjie Fan
Li Wang
|
+
|
SCC-rFMQ Learning in Cooperative Markov Games with Continuous Actions
|
2018
|
Chengwei Zhang
Xiaohong Li
Jianye Hao
Siqi Chen
Karl Tuyls
Zhiyong Feng
Wanli Xue
Rong Chen
|
+
|
Bayes-ToMoP: A Fast Detection and Best Response Algorithm Towards Sophisticated Opponents
|
2018
|
Tianpei Yang
Zhaopeng Meng
Jianye Hao
Chongjie Zhang
Yan Zheng
|
+
|
Learning to Advertise with Adaptive Exposure via Constrained Two-Level Reinforcement Learning.
|
2018
|
Weixun Wang
Junqi Jin
Jianye Hao
Chunjie Chen
Chuan Yu
Weinan Zhang
Jun Wang
Yixi Wang
Han Li
Jian Xu
|
+
|
Hierarchical Heuristic Learning towards Effcient Norm Emergence.
|
2018
|
Tianpei Yang
Jianye Hao
Zhaopeng Meng
Sandip Sen
Sheng Jin
|
+
|
Weighted Double Deep Multiagent Reinforcement Learning in Stochastic Cooperative Environments
|
2018
|
Yan Zheng
Jianye Hao
Zongzhang Zhang
|
+
|
SA-IGA: A Multiagent Reinforcement Learning Method Towards Socially Optimal Outcomes
|
2018
|
Chengwei Zhang
Xiaohong Li
Jianye Hao
Siqi Chen
Karl Tuyls
Wanli Xue
|
+
|
Towards Cooperation in Sequential Prisoner's Dilemmas: a Deep Multiagent Reinforcement Learning Approach
|
2018
|
Weixun Wang
Jianye Hao
Yixi Wang
Matthew E. Taylor
|
+
PDF
Chat
|
Falsification of Cyber-Physical Systems Using Deep Reinforcement Learning
|
2018
|
Takumi Akazaki
Shuang Liu
Yoriyuki Yamagata
Yihai Duan
Jianye Hao
|
+
|
An Optimal Rewiring Strategy for Reinforcement Social Learning in Cooperative Multiagent Systems
|
2018
|
Hongyao Tang
Li Wang
Zan Wang
Tim Baarslag
Jianye Hao
|
+
|
Towards Efficient Detection and Optimal Response against Sophisticated Opponents
|
2018
|
Tianpei Yang
Zhaopeng Meng
Jianye Hao
Chongjie Zhang
Yan Zheng
Ze Zheng
|
+
|
Hierarchical Deep Multiagent Reinforcement Learning with Temporal Abstraction
|
2018
|
Hongyao Tang
Jianye Hao
Tangjie Lv
Yingfeng Chen
Zongzhang Zhang
Hangtian Jia
Chunxu Ren
Yan Zheng
Zhaopeng Meng
Changjie Fan
|
+
PDF
Chat
|
Weighted Double Deep Multiagent Reinforcement Learning in Stochastic Cooperative Environments
|
2018
|
Yan Zheng
Zhaopeng Meng
Jianye Hao
Zongzhang Zhang
|
+
|
Learning Adaptive Display Exposure for Real-Time Advertising
|
2018
|
Weixun Wang
Junqi Jin
Jianye Hao
Chunjie Chen
Chuan Yu
Weinan Zhang
Jun Wang
Xiaotian Hao
Yixi Wang
Han Li
|
+
|
Hierarchical Heuristic Learning towards Effcient Norm Emergence
|
2018
|
Tianpei Yang
Jianye Hao
Zhaopeng Meng
Sandip Sen
Sheng Jin
|
+
|
Dynamic analysis of cell interactions in biological environments under multiagent social learning framework
|
2016
|
Chengwei Zhang
Xiaohong Li
Shuxin Li
Jianye Hao
|
+
PDF
Chat
|
Blind Image Denoising via Dependent Dirichlet Process Tree
|
2016
|
Fengyuan Zhu
Guangyong Chen
Jianye Hao
Pheng‐Ann Heng
|
+
|
Blind Image Denoising via Dependent Dirichlet Process Tree
|
2016
|
Fengyuan Zhu
Guangyong Chen
Jianye Hao
Pheng‐Ann Heng
|