Follow
Thinh T. Doan
Title
Cited by
Cited by
Year
Finite-Time Analysis of Distributed TD(0) with Linear Function Approximation for Multi-Agent Reinforcement Learning
TT Doan, ST Maguluri, J Romberg
International Conference on Machine Learning, 2019
1642019
Performance of Q-learning with Linear Function Approximation: Stability and Finite-Time Analysis
Z Chen, S Zhang, TT Doan, ST Maguluri, JP Clarke
arXiv preprint arXiv:1905.11425, 2019
138*2019
Fast Convergence Rates of Distributed Subgradient Methods with Adaptive Quantization
TT Doan, ST Maguluri, J Romberg
arXiv preprint arXiv:1810.13245, 2018
832018
Convergence rates of distributed gradient methods under random quantization: A stochastic approximation approach
TT Doan, ST Maguluri, J Romberg
IEEE Transactions on Automatic Control 66 (10), 4469-4484, 2020
68*2020
Distributed resource allocation on dynamic networks in quadratic time
TT Doan, A Olshevsky
https://arxiv.org/abs/1507.07850, 2015
592015
Convergence of the iterates in mirror descent methods
TT Doan, S Bose, DH Nguyen, CL Beck
IEEE control systems letters 3 (1), 114-119, 2018
542018
Finite-time performance of distributed temporal-difference learning with linear function approximation
TT Doan, ST Maguluri, J Romberg
SIAM Journal on Mathematics of Data Science 3 (1), 298-320, 2021
522021
On the convergence rate of distributed gradient methods for finite-sum optimization under communication delays
TT Doan, CL Beck, R Srikant
arXiv preprint arXiv:1708.03277, 2017
49*2017
Finite-sample analysis of two-time-scale natural actor–critic algorithm
S Khodadadian, TT Doan, J Romberg, ST Maguluri
IEEE Transactions on Automatic Control 68 (6), 3273-3284, 2022
472022
A decentralized policy gradient approach to multi-task reinforcement learning
S Zeng, MA Anwar, TT Doan, A Raychowdhury, J Romberg
Uncertainty in Artificial Intelligence, 1002-1012, 2021
462021
Distributed Lagrangian Methods for Network Resource Allocation
TT Doan, CL Beck
arXiv preprint arXiv:1609.06287, 2016
46*2016
Nonlinear two-time-scale stochastic approximation: Convergence and finite-time performance
TT Doan
IEEE Transactions on Automatic Control 68 (8), 4695-4705, 2022
412022
On the Convergence Rate of Distributed Gradient Methods for Finite-Sum Optimization under Communication Delays
TT Doan, CL Beck, R Srikant
Proceedings of the ACM on Measurement and Analysis of Computing Systems 1 (2 …, 2017
392017
Finite-time analysis and restarting scheme for linear two-time-scale stochastic approximation
TT Doan
SIAM Journal on Control and Optimization 59 (4), 2798-2819, 2021
362021
Distributed resource allocation over dynamic networks with uncertainty
TT Doan, CL Beck
IEEE Transactions on Automatic Control, 2020
352020
On the geometric convergence rate of distributed economic dispatch/demand response in power networks
TT Doan, A Olshevsky
arXiv preprint arXiv:1609.06660, 2016
322016
Convergence rates of accelerated markov gradient descent with applications in reinforcement learning
TT Doan, LM Nguyen, NH Pham, J Romberg
arXiv preprint arXiv:2002.02873, 2020
282020
Byzantine fault-tolerance in decentralized optimization under 2f-redundancy
N Gupta, TT Doan, NH Vaidya
2021 American Control Conference (ACC), 3632-3637, 2021
272021
Finite-time analysis of markov gradient descent
TT Doan
IEEE Transactions on Automatic Control 68 (4), 2140-2153, 2022
232022
Finite-time analysis of stochastic gradient descent under markov randomness
TT Doan, LM Nguyen, NH Pham, J Romberg
arXiv preprint arXiv:2003.10973, 2020
222020
The system can't perform the operation now. Try again later.
Articles 1–20