Wang, S., Blanchet, J., & Glynn, P. (2023). Optimal sample complexity of reinforcement learning for uniformly ergodic discounted Markov decision processes. 3. CoRR.
Authors
Shengbo Wang, J Blanchet, P Glynn
Publication date
2023
Journal
CoRR