Finite-Time Performance Bounds and Adaptive Learning Rate Selection for Two Time-Scale Reinforcement Learning
Harsh Gupta, R. Srikant, Lei Ying
Type: Article
Publication Date: 2019-07-14
Citations: 49
Locations: arXiv (Cornell University)
Similar Works
| Title | Year | Authors |
| --- | --- | --- |
| Finite-Time Performance Bounds and Adaptive Learning Rate Selection for Two Time-Scale Reinforcement Learning | 2019 | Harsh Gupta, R. Srikant, Lei Ying |
| Tight Finite Time Bounds of Two-Time-Scale Linear Stochastic Approximation with Markovian Noise | 2024 | Shaan ul Haque, Sajad Khodadadian, Siva Theja Maguluri |
| Finite-Sample Analysis for Two Time-scale Non-linear TDC with General Smooth Function Approximation | 2021 | Yue Wang, Shaofeng Zou, Yi Zhou |
| Finite-Time Analysis and Restarting Scheme for Linear Two-Time-Scale Stochastic Approximation | 2019 | Thinh T. Doan |
| Non-Asymptotic Analysis for Two Time-scale TDC with General Smooth Function Approximation | 2021 | Yue Wang, Shaofeng Zou, Yi Zhou |
| A Tale of Two-Timescale Reinforcement Learning with the Tightest Finite-Time Bound | 2020 | Gal Dalal, Balázs Szörényi, Gugan Thoppe |
| A Finite Time Analysis of Two Time-Scale Actor Critic Methods | 2020 | Yue Wu, Weitong Zhang, Pan Xu, Quanquan Gu |
| Accelerated Multi-Time-Scale Stochastic Approximation: Optimal Complexity and Applications in Reinforcement Learning and Multi-Agent Games | 2024 | Sihan Zeng, Thinh T. Doan |
| Control Theoretic Analysis of Temporal Difference Learning | 2021 | Donghwan Lee |
| Finite-sample Analysis of Greedy-GQ with Linear Function Approximation under Markovian Noise | 2020 | Yue Wang, Shaofeng Zou |
| Finite-Time Analysis and Restarting Scheme for Linear Two-Time-Scale Stochastic Approximation | 2021 | Thinh T. Doan |
| Adaptive Temporal Difference Learning with Linear Function Approximation | 2020 | Tao Sun, Han Shen, Tianyi Chen, Dongsheng Li |
| A Tale of Two-Timescale Reinforcement Learning with the Tightest Finite-Time Bound | 2019 | Gal Dalal, Balázs Szörényi, Gugan Thoppe |
| A Two-Time-Scale Stochastic Optimization Framework with Applications in Control and Reinforcement Learning | 2021 | Sihan Zeng, Thinh T. Doan, Justin Romberg |
| Finite-Time Error Bounds for Greedy-GQ | 2022 | Yue Wang, Yi Zhou, Shaofeng Zou |
| Finite Sample Analyses for TD(0) With Function Approximation | 2018 | Gal Dalal, Balázs Szörényi, Gugan Thoppe, Shie Mannor |
| Finite Sample Analyses for TD(0) with Function Approximation | 2017 | Gal Dalal, Balázs Szörényi, Gugan Thoppe, Shie Mannor |
Works That Cite This (43)
| Title | Year | Authors |
| --- | --- | --- |
| Breaking the Sample Size Barrier in Model-Based Reinforcement Learning with a Generative Model | 2020 | Gen Li, Yuting Wei, Yuejie Chi, Yuantao Gu, Yuxin Chen |
| Provably-Efficient Double Q-Learning | 2020 | Wentao Weng, Harsh Gupta, Niao He, Lei Ying, R. Srikant |
| Some Limit Properties of Markov Chains Induced by Recursive Stochastic Algorithms | 2020 | Abhishek Gupta, Chen Hao, Jianzong Pi, Gaurav Tendolkar |
| Finite-Time Analysis and Restarting Scheme for Linear Two-Time-Scale Stochastic Approximation | 2019 | Thinh T. Doan |
| Gradient Temporal Difference with Momentum: Stability and Convergence | 2022 | Rohan Deb, Shalabh Bhatnagar |
| Finite-Time Convergence Rates of Nonlinear Two-Time-Scale Stochastic Approximation under Markovian Noise | 2021 | Thinh T. Doan |
| A Two-Timescale Framework for Bilevel Optimization: Complexity Analysis and Application to Actor-Critic | 2020 | Mingyi Hong, Hoi To Wai, Zhaoran Wang, Zhuoran Yang |
| The Mean-Squared Error of Double Q-Learning | 2020 | Wentao Weng, Harsh Gupta, Niao He, Lei Ying, R. Srikant |
| Greedy-GQ with Variance Reduction: Finite-time Analysis and Improved Complexity | 2021 | Shaocong Ma, Ziyi Chen, Yi Zhou, Shaofeng Zou |
| On the Stability of Random Matrix Product with Markovian Noise: Application to Linear Stochastic Approximation and TD Learning | 2021 | Alain Durmus, Éric Moulines, Alexey Naumov, Sergey Samsonov, Hoi-To Wai |
Works Cited by This (0)