Wang, S., Blanchet, J., & Glynn, P. (2023). Optimal sample complexity of reinforcement learning for uniformly ergodic discounted Markov decision processes. 3. CoRR.

Authors
Shengbo Wang, J Blanchet, P Glynn
Publication date
2023
Journal
CoRR