Next: About this document ...
Up: REINFORCEMENT DRIVEN INFORMATION ACQUISITION
Previous: 3. SIMULATIONS OF RDIA
- 1
-
E. B. Baum.
Neural nets that learn in polynomial time from examples and queries.
IEEE Transactions on Neural Networks, 2(1):5-19, 1991.
- 2
-
D. A. Cohn.
Neural network exploration using optimal experiment design.
In J. Cowan, G. Tesauro, and J. Alspector, editors, Advances in
Neural Information Processing Systems 6. San Mateo, CA: Morgan Kaufmann,
1994.
- 3
-
V. V. Fedorov.
Theory of optimal experiments.
Academic Press, 1972.
- 4
-
J. Hwang, J. Choi, S. Oh, and R. J. Marks II.
Query-based learning applied to partially trained multilayer
perceptrons.
IEEE Transactions on Neural Networks, 2(1):131-136, 1991.
- 5
-
L. Kaelbling.
Learning in Embedded Systems.
MIT Press, 1993.
- 6
-
D. J. C. MacKay.
Information-based objective functions for active data selection.
Neural Computation, 4(2):550-604, 1992.
- 7
-
M. Plutowski, G. Cottrell, and H. White.
Learning Mackey-Glass from 25 examples, plus or minus 2.
In J. Cowan, G. Tesauro, and J. Alspector, editors, Advances in
Neural Information Processing Systems 6, pages 1135-1142. San Mateo, CA:
Morgan Kaufmann, 1994.
- 8
-
J. H. Schmidhuber.
Curious model-building control systems.
In Proc. International Joint Conference on Neural Networks,
Singapore, volume 2, pages 1458-1463. IEEE, 1991.
- 9
-
J. H. Schmidhuber.
A possibility for implementing curiosity and boredom in
model-building neural controllers.
In J. A. Meyer and S. W. Wilson, editors, Proc. of the
International Conference on Simulation of Adaptive Behavior: From Animals to
Animats, pages 222-227. MIT Press/Bradford Books, 1991.
- 10
-
J. Storck.
Reinforcement-Lernen und Modellbildung in nicht-deterministischen
Umgebungen. Fortgeschrittenenpraktikum, Fakultät für Informatik,
Lehrstuhl Prof. Brauer, Technische Universität München, 1994.
- 11
-
S. Thrun and K. Möller.
Active exploration in dynamic environments.
In D. S. Lippman, J. E. Moody, and D. S. Touretzky, editors, Advances in Neural Information Processing Systems 4, pages 531-538. San
Mateo, CA: Morgan Kaufmann, 1992.
- 12
-
C. Watkins.
Learning from Delayed Rewards.
PhD thesis, King's College, 1989.
Juergen Schmidhuber
2003-02-28
Back to Active Learning - Exploration - Curiosity page
Back to Reinforcement Learning page