Experiment 2: Towers of Hanoi

3 pegs: S, A, D; n disks on S; move all to D, but never a larger on a smaller. Additional primitives: movdiskSD(); exchSA(); exchAD().

Fastest solution costs 2n-1 moves.

Anderson 1986: reinforcement learning, nɜ.

Langley 1985: production systems, nɞ.

Baum & Durdanovic 1999: simpler blocks problem scales linearly, nɞ (Kwee 2001)

Nonlearning AI planners: n᝿ size < 100,000

Here: n up to 30; solution size > 1,000,000,000

Speedup: first learn a seemingly unrelated task (next slide)!

This gives us an opportunity to demonstrate incremental learning - knowledge transfer from one task to the next

Previous slide Next slide Back to first slide View graphic version

Back to J. Schmidhuber's OOPS page