The Sample Complexity of Teaching-by-Reinforcement on Q-Learning

Type: Preprint

Publication Date: 2020-01-01

Citations: 1

DOI: https://doi.org/10.48550/arxiv.2006.09324

Locations

  • arXiv (Cornell University) - View - PDF
  • DataCite API - View

Similar Works

Action Title Year Authors
+ PDF Chat The Sample Complexity of Teaching by Reinforcement on Q-Learning 2021 Xuezhou Zhang
Shubham Bharti
Yuzhe Ma
Adish Singla
Xiaojin Zhu
+ The Teaching Dimension of Q-learning 2020 Xuezhou Zhang
Shubham Bharti
Yuzhe Ma
Adish Singla
Xiaojin Zhu
+ PAC Reinforcement Learning without Real-World Feedback 2019 Yuren Zhong
Aniket Anand Deshmukh
Clayton Scott
+ PDF Chat Is Q-Learning Minimax Optimal? A Tight Sample Complexity Analysis 2023 Gen Li
Changxiao Cai
Yuxin Chen
Yuting Wei
Yuejie Chi
+ Efficient Online Reinforcement Learning with Offline Data 2023 Philip Ball
Laura Smith
Ilya Kostrikov
Sergey Levine
+ Offline Reinforcement Learning at Multiple Frequencies 2022 Kaylee Burns
Tianhe Yu
Chelsea Finn
Karol Hausman
+ Understanding the Complexity Gains of Single-Task RL with a Curriculum 2022 Qiyang Li
Yuexiang Zhai
Yi Ma
Sergey Levine
+ Jump-Start Reinforcement Learning 2022 Ikechukwu Uchendu
Ted Xiao
Yao Lu
Banghua Zhu
Mengyuan Yan
Joséphine Simon
Matthew Bennice
Chuyuan Fu
Cong Ma
Jiantao Jiao
+ Sequence Modeling is a Robust Contender for Offline Reinforcement Learning 2023 Prajjwal Bhargava
Rohan Chitnis
Alborz Geramifard
Shagun Sodhani
Amy Zhang
+ Task-agnostic Exploration in Reinforcement Learning 2020 Xuezhou Zhang
Yuzhe Ma
Adish Singla
+ Demonstration-Regularized RL 2023 Daniil Tiapkin
Denis Belomestny
Daniele Calandriello
Éric Moulines
Alexey Naumov
Pierre Perrault
Michal Vaľko
Pierre Ménard
+ Unpacking Reward Shaping: Understanding the Benefits of Reward Engineering on Sample Complexity 2022 Abhishek Gupta
Aldo Pacchiano
Yuexiang Zhai
Sham M. Kakade
Sergey Levine
+ Is Inverse Reinforcement Learning Harder than Standard Reinforcement Learning? 2023 Lei Zhao
Mengdi Wang
Yu Bai
+ Guarded Policy Optimization with Imperfect Online Demonstrations 2023 Zhenghai Xue
Zhenghao Peng
Quanyi Li
Zhihan Liu
Bolei Zhou
+ Provably Feedback-Efficient Reinforcement Learning via Active Reward Learning 2023 Dingwen Kong
Lin F. Yang
+ Reward-agnostic Fine-tuning: Provable Statistical Benefits of Hybrid Reinforcement Learning 2023 Gen Li
Wenhao Zhan
Jason D. Lee
Yuejie Chi
Yuxin Chen
+ TTR-Based Reward for Reinforcement Learning with Implicit Model Priors 2019 Xubo Lyu
Mo Chen
+ PDF Chat TTR-Based Reward for Reinforcement Learning with Implicit Model Priors 2020 Xubo Lyu
Mo Chen
+ Off-Policy RL Algorithms Can be Sample-Efficient for Continuous Control via Sample Multiple Reuse 2023 Jiafei Lyu
Le Wan
Zongqing Lu
Xiu Li
+ Autonomous Reinforcement Learning via Subgoal Curricula 2021 Archit Sharma
Abhishek Gupta
Sergey Levine
Karol Hausman
Chelsea Finn