Is Pessimism Provably Efficient for Offline RL?

Type: Preprint

Publication Date: 2020-01-01

Citations: 30

DOI: https://doi.org/10.48550/arxiv.2012.15085

Locations

  • arXiv (Cornell University) - View - PDF
  • DataCite API - View

Similar Works

Action Title Year Authors
+ Is Pessimism Provably Efficient for Offline RL 2020 Ying Jin
Zhuoran Yang
Zhaoran Wang
+ Pessimistic Q-Learning for Offline Reinforcement Learning: Towards Optimal Sample Complexity 2022 Laixi Shi
Gen Li
Yuting Wei
Yuxin Chen
Yuejie Chi
+ VIPeR: Provably Efficient Algorithm for Offline RL with Neural Function Approximation 2023 Thanh Nguyen-Tang
Raman Arora
+ PDF Chat Bridging Offline Reinforcement Learning and Imitation Learning: A Tale of Pessimism 2022 Paria Rashidinejad
Banghua Zhu
Cong Ma
Jiantao Jiao
Stuart Russell
+ Near-optimal Offline Reinforcement Learning with Linear Representation: Leveraging Variance Information with Pessimism 2022 Ming Yin
Yaqi Duan
Mengdi Wang
Yuxiang Wang
+ Pessimism in the Face of Confounders: Provably Efficient Offline Reinforcement Learning in Partially Observable Markov Decision Processes 2022 Miao Lu
Yifei Min
Zhaoran Wang
Zhuoran Yang
+ PDF Chat Optimistic Model Rollouts for Pessimistic Offline Policy Optimization 2024 Yuanzhao Zhai
Yiying Li
Zijian Gao
Xudong Gong
Kele Xu
Dawei Feng
Bo Ding
Huaimin Wang
+ Optimistic Model Rollouts for Pessimistic Offline Policy Optimization 2024 Yuanzhao Zhai
Yiying Li
Zijian Gao
Xudong Gong
Kele Xu
Dawei Feng
Bo Ding
Huaimin Wang
+ Bridging Offline Reinforcement Learning and Imitation Learning: A Tale of Pessimism 2021 Paria Rashidinejad
Banghua Zhu
Cong Ma
Jiantao Jiao
Stuart Russell
+ Towards Instance-Optimal Offline Reinforcement Learning with Pessimism 2021 Minghao Yin
Yu-Xiang Wang
+ Model-Based Offline Reinforcement Learning with Pessimism-Modulated Dynamics Belief 2022 Kaiyang Guo
Yunfeng Shao
Yanhui Geng
+ PDF Chat Pessimism Meets Risk: Risk-Sensitive Offline Reinforcement Learning 2024 Dake Zhang
Boxiang Lyu
Shuang Qiu
Mladen Kolar
Tong Zhang
+ Bi-Level Offline Policy Optimization with Limited Exploration 2023 Wenzhuo Zhou
+ Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning 2022 Chenjia Bai
Lingxiao Wang
Zhuoran Yang
Zhihong Deng
Animesh Garg
Peng Liu
Zhaoran Wang
+ Nearly Minimax Optimal Offline Reinforcement Learning with Linear Function Approximation: Single-Agent MDP and Markov Game 2022 Wei Xiong
Han Zhong
Chengshuai Shi
Cong Shen
Liwei Wang
Tong Zhang
+ PDF Chat Bayesian Design Principles for Offline-to-Online Reinforcement Learning 2024 Hao Hu
Yiqin Yang
Jianing Ye
Chengjie Wu
Ziqing Mai
Yujing Hu
Tangjie Lv
Changjie Fan
Qianchuan Zhao
Chongjie Zhang
+ POPO: Pessimistic Offline Policy Optimization 2020 Qiang He
Xinwen Hou
+ Towards Tractable Optimism in Model-Based Reinforcement Learning 2020 Aldo Pacchiano
Philip Ball
Jack Parker-Holder
Krzysztof Choromański
Stephen Roberts
+ MOReL : Model-Based Offline Reinforcement Learning 2020 Rahul Kidambi
Aravind Rajeswaran
Praneeth Netrapalli
Thorsten Joachims
+ PDF Chat Bellman-consistent Pessimism for Offline Reinforcement Learning 2021 Tengyang Xie
Ching-An Cheng
Nan Jiang
Paul Mineiro
Alekh Agarwal

Works That Cite This (17)

Action Title Year Authors
+ Offline Reinforcement Learning as One Big Sequence Modeling Problem. 2021 Michael Jänner
Qiyang Li
Sergey Levine
+ Towards Instance-Optimal Offline Reinforcement Learning with Pessimism 2021 Minghao Yin
Yu-Xiang Wang
+ PDF Chat Pessimistic Off-Policy Optimization for Learning to Rank 2024 Matej Cief
Branislav Kveton
Michal Kompan
+ PDF Chat Compressive Features in Offline Reinforcement Learning for Recommender Systems 2021 Minh Phạm
Hung T. Nguyen
Long H. Dang
Jennifer Adorno Nieves
+ PDF Chat Bridging Offline Reinforcement Learning and Imitation Learning: A Tale of Pessimism 2022 Paria Rashidinejad
Banghua Zhu
Cong Ma
Jiantao Jiao
Stuart Russell
+ PDF Chat Safe Policy Improvement for POMDPs via Finite-State Controllers 2023 Thiago D. Simão
Marnix Suilen
Nils Jansen
+ Compressive Features in Offline Reinforcement Learning for Recommender Systems. 2021 Hung T. Nguyen
Minh Hoang Nguyen
Long Pham
Jennifer Adorno Nieves
+ Offline Reinforcement Learning with Value-based Episodic Memory 2021 Xiaoteng Ma
Yiqin Yang
Hao Hu
Qihan Liu
Jun Yang
Chongjie Zhang
Qianchuan Zhao
Bin Liang
+ PDF Chat On Instance-Dependent Bounds for Offline Reinforcement Learning with Linear Function Approximation 2023 Thanh Nguyen-Tang
Ming Yin
Sunil Gupta
Svetha Venkatesh
Raman Arora
+ PDF Chat Optimistic Model Rollouts for Pessimistic Offline Policy Optimization 2024 Yuanzhao Zhai
Yiying Li
Zijian Gao
Xudong Gong
Kele Xu
Dawei Feng
Bo Ding
Huaimin Wang

Works Cited by This (0)

Action Title Year Authors