Bibliography

Next: About this document ... Up: REINFORCEMENT DRIVEN INFORMATION ACQUISITION Previous: 3. SIMULATIONS OF RDIA

Bibliography

1: E. B. Baum.
Neural nets that learn in polynomial time from examples and queries.
IEEE Transactions on Neural Networks, 2(1):5-19, 1991.
2: D. A. Cohn.
Neural network exploration using optimal experiment design.
In J. Cowan, G. Tesauro, and J. Alspector, editors, Advances in Neural Information Processing Systems 6. San Mateo, CA: Morgan Kaufmann, 1994.
3: V. V. Fedorov.
Theory of optimal experiments.
Academic Press, 1972.
4: J. Hwang, J. Choi, S. Oh, and R. J. Marks II.
Query-based learning applied to partially trained multilayer perceptrons.
IEEE Transactions on Neural Networks, 2(1):131-136, 1991.
5: L. Kaelbling.
Learning in Embedded Systems.
MIT Press, 1993.
6: D. J. C. MacKay.
Information-based objective functions for active data selection.
Neural Computation, 4(2):550-604, 1992.
7: M. Plutowski, G. Cottrell, and H. White.
Learning Mackey-Glass from 25 examples, plus or minus 2.
In J. Cowan, G. Tesauro, and J. Alspector, editors, Advances in Neural Information Processing Systems 6, pages 1135-1142. San Mateo, CA: Morgan Kaufmann, 1994.
8: J. H. Schmidhuber.
Curious model-building control systems.
In Proc. International Joint Conference on Neural Networks, Singapore, volume 2, pages 1458-1463. IEEE, 1991.
9: J. H. Schmidhuber.
A possibility for implementing curiosity and boredom in model-building neural controllers.
In J. A. Meyer and S. W. Wilson, editors, Proc. of the International Conference on Simulation of Adaptive Behavior: From Animals to Animats, pages 222-227. MIT Press/Bradford Books, 1991.
10: J. Storck.
Reinforcement-Lernen und Modellbildung in nicht-deterministischen Umgebungen. Fortgeschrittenenpraktikum, Fakultät für Informatik, Lehrstuhl Prof. Brauer, Technische Universität München, 1994.
11: S. Thrun and K. Möller.
Active exploration in dynamic environments.
In D. S. Lippman, J. E. Moody, and D. S. Touretzky, editors, Advances in Neural Information Processing Systems 4, pages 531-538. San Mateo, CA: Morgan Kaufmann, 1992.
12: C. Watkins.
Learning from Delayed Rewards.
PhD thesis, King's College, 1989.

Juergen Schmidhuber 2003-02-28

Back to Active Learning - Exploration - Curiosity page
Back to Reinforcement Learning page