Papers dealing with Exploration versus Exploitation in RL

Click here to return to the main page on reinforcement learning Satinder Singh.

  1. Near-Optimal Reinforcement Learning in Polynomial Time by Michael Kearns and Satinder Singh. In Machine Learning journal, Volume 49, Issue 2, pages 209-232, 2002.
    ( shorter version appears in ICML 1998).
    gzipped postscript pdf.

  2. Near-Optimal Reinforcement Learning in Polynomial Time by Michael Kearns and Satinder Singh. In Proceedings of the Fifteenth International Conference on Machine Learning (ICML), pages 260-268, 1998.
    gzipped postscript.
Click here to return to the main page on reinforcement learning Satinder Singh.