
Bibliography

1
E. B. Baum.
Neural nets that learn in polynomial time from examples and queries.
IEEE Transactions on Neural Networks, 2(1):5-19, 1991.

2
D. A. Cohn.
Neural network exploration using optimal experiment design.
In J. Cowan, G. Tesauro, and J. Alspector, editors, Advances in Neural Information Processing Systems 6. San Mateo, CA: Morgan Kaufmann, 1994.

3
V. V. Fedorov.
Theory of optimal experiments.
Academic Press, 1972.

4
J. Hwang, J. Choi, S. Oh, and R. J. Marks II.
Query-based learning applied to partially trained multilayer perceptrons.
IEEE Transactions on Neural Networks, 2(1):131-136, 1991.

5
L. Kaelbling.
Learning in Embedded Systems.
MIT Press, 1993.

6
D. J. C. MacKay.
Information-based objective functions for active data selection.
Neural Computation, 4(4):590-604, 1992.

7
M. Plutowski, G. Cottrell, and H. White.
Learning Mackey-Glass from 25 examples, plus or minus 2.
In J. Cowan, G. Tesauro, and J. Alspector, editors, Advances in Neural Information Processing Systems 6, pages 1135-1142. San Mateo, CA: Morgan Kaufmann, 1994.

8
J. H. Schmidhuber.
Curious model-building control systems.
In Proc. International Joint Conference on Neural Networks, Singapore, volume 2, pages 1458-1463. IEEE, 1991.

9
J. H. Schmidhuber.
A possibility for implementing curiosity and boredom in model-building neural controllers.
In J. A. Meyer and S. W. Wilson, editors, Proc. of the International Conference on Simulation of Adaptive Behavior: From Animals to Animats, pages 222-227. MIT Press/Bradford Books, 1991.

10
J. Storck.
Reinforcement-Lernen und Modellbildung in nicht-deterministischen Umgebungen.
Fortgeschrittenenpraktikum, Fakultät für Informatik, Lehrstuhl Prof. Brauer, Technische Universität München, 1994.

11
S. Thrun and K. Möller.
Active exploration in dynamic environments.
In J. E. Moody, S. J. Hanson, and R. P. Lippmann, editors, Advances in Neural Information Processing Systems 4, pages 531-538. San Mateo, CA: Morgan Kaufmann, 1992.

12
C. Watkins.
Learning from Delayed Rewards.
PhD thesis, King's College, University of Cambridge, 1989.



Juergen Schmidhuber 2003-02-28

