Papers dealing with Exploration versus Exploitation in RL
Click here to return to the main page on reinforcement learning Satinder Singh.
- Near-Optimal Reinforcement Learning in Polynomial Time by Michael Kearns and Satinder Singh. In Machine Learning journal, Volume 49, Issue 2, pages 209-232, 2002.
( shorter version appears in ICML 1998).
gzipped postscript pdf.
- Near-Optimal Reinforcement Learning in Polynomial Time by Michael Kearns and Satinder Singh. In Proceedings of the Fifteenth International Conference on Machine Learning (ICML), pages 260-268, 1998.
gzipped postscript.
Click here to return to the main page on reinforcement learning Satinder Singh.