Finite-Time Performance Bounds and Adaptive Learning Rate Selection for Two Time-Scale Reinforcement Learning

Harsh Gupta, R. Srikant, Lei Ying

Type: Article

Publication Date: 2019-07-14

Citations: 49

Locations

arXiv (Cornell University)

Similar Works

Title	Year	Authors
Finite-Time Performance Bounds and Adaptive Learning Rate Selection for Two Time-Scale Reinforcement Learning	2019	Harsh Gupta, R. Srikant, Lei Ying
Tight Finite Time Bounds of Two-Time-Scale Linear Stochastic Approximation with Markovian Noise	2024	Shaan ul Haque, Sajad Khodadadian, Siva Theja Maguluri
Finite-Sample Analysis for Two Time-scale Non-linear TDC with General Smooth Function Approximation	2021	Yue Wang, Shaofeng Zou, Yi Zhou
Finite-Time Analysis and Restarting Scheme for Linear Two-Time-Scale Stochastic Approximation	2019	Thinh T. Doan
Non-Asymptotic Analysis for Two Time-scale TDC with General Smooth Function Approximation	2021	Yue Wang, Shaofeng Zou, Yi Zhou
A Tale of Two-Timescale Reinforcement Learning with the Tightest Finite-Time Bound	2020	Gal Dalal, Balázs Szörényi, Gugan Thoppe
A Finite Time Analysis of Two Time-Scale Actor Critic Methods	2020	Yue Wu, Weitong Zhang, Pan Xu, Quanquan Gu
Accelerated Multi-Time-Scale Stochastic Approximation: Optimal Complexity and Applications in Reinforcement Learning and Multi-Agent Games	2024	Sihan Zeng, Thinh T. Doan
Control Theoretic Analysis of Temporal Difference Learning	2021	Donghwan Lee
Finite-sample Analysis of Greedy-GQ with Linear Function Approximation under Markovian Noise	2020	Yue Wang, Shaofeng Zou
Finite-Time Analysis and Restarting Scheme for Linear Two-Time-Scale Stochastic Approximation	2021	Thinh T. Doan
Adaptive Temporal Difference Learning with Linear Function Approximation	2020	Tao Sun, Han Shen, Tianyi Chen, Dongsheng Li
A Tale of Two-Timescale Reinforcement Learning with the Tightest Finite-Time Bound	2019	Gal Dalal, Balázs Szörényi, Gugan Thoppe
A Two-Time-Scale Stochastic Optimization Framework with Applications in Control and Reinforcement Learning	2021	Sihan Zeng, Thinh T. Doan, Justin Romberg
Finite-Time Error Bounds for Greedy-GQ	2022	Yue Wang, Yi Zhou, Shaofeng Zou
Finite Sample Analyses for TD(0) With Function Approximation	2018	Gal Dalal, Balázs Szörényi, Gugan Thoppe, Shie Mannor
Finite Sample Analyses for TD(0) with Function Approximation	2017	Gal Dalal, Balázs Szörényi, Gugan Thoppe, Shie Mannor

Works That Cite This (43)

Title	Year	Authors
Breaking the Sample Size Barrier in Model-Based Reinforcement Learning with a Generative Model	2020	Gen Li, Yuting Wei, Yuejie Chi, Yuantao Gu, Yuxin Chen
Provably-Efficient Double Q-Learning	2020	Wentao Weng, Harsh Gupta, Niao He, Lei Ying, R. Srikant
Some Limit Properties of Markov Chains Induced by Recursive Stochastic Algorithms	2020	Abhishek Gupta, Chen Hao, Jianzong Pi, Gaurav Tendolkar
Finite-Time Analysis and Restarting Scheme for Linear Two-Time-Scale Stochastic Approximation	2019	Thinh T. Doan
Gradient Temporal Difference with Momentum: Stability and Convergence	2022	Rohan Deb, Shalabh Bhatnagar
Finite-Time Convergence Rates of Nonlinear Two-Time-Scale Stochastic Approximation under Markovian Noise	2021	Thinh T. Doan
A Two-Timescale Framework for Bilevel Optimization: Complexity Analysis and Application to Actor-Critic	2020	Mingyi Hong, Hoi To Wai, Zhaoran Wang, Zhuoran Yang
The Mean-Squared Error of Double Q-Learning	2020	Wentao Weng, Harsh Gupta, Niao He, Lei Ying, R. Srikant
Greedy-GQ with Variance Reduction: Finite-time Analysis and Improved Complexity	2021	Shaocong Ma, Ziyi Chen, Yi Zhou, Shaofeng Zou
On the Stability of Random Matrix Product with Markovian Noise: Application to Linear Stochastic Approximation and TD Learning	2021	Alain Durmus, Éric Moulines, Alexey Naumov, Sergey Samsonov, Hoi-To Wai

Works Cited by This (0)