
References

1
A.G. Barto, S.J. Bradtke, and S.P. Singh. Learning to act using real-time dynamic programming. Artificial Intelligence, 72:81--138, 1995.

2
D. P. Bertsekas. Dynamic Programming and Optimal Control: Vols. 1 and 2. Athena Scientific, Belmont, MA, 1995.

3
R. H. Crites and A. G. Barto. Improving elevator performance using reinforcement learning. In D. S. Touretzky, M. C. Mozer, and M. E. Hasselmo, editors, Advances in Neural Information Processing Systems 8, pages 1017--1023. MIT Press, 1996.

4
D. W. North. A tutorial introduction to decision theory. IEEE Transactions on Systems Science and Cybernetics, SSC-4(3), Sept. 1968.

5
S. Singh and D. Bertsekas. Reinforcement learning for dynamic channel allocation in cellular telephone systems. submitted.

6
G. J. Tesauro. Practical issues in temporal difference learning. Machine Learning, 8(3/4):257--277, May 1992.

7
W. Zhang and T. G. Dietterich. High-performance job-shop scheduling with a time-delay TD(lambda) network. In D. S. Touretzky, M. C. Mozer, and M. E. Hasselmo, editors, Advances in Neural Information Processing Systems 8, pages 1024--1030. MIT Press, 1996.