Experiment 2: Towers of Hanoi
3 pegs: S, A, D; n disks on S; move all to D, but never a larger on a smaller. Additional primitives: movdiskSD(); exchSA(); exchAD().
Fastest solution costs 2n-1 moves.
Anderson 1986: reinforcement learning, nɜ.
Langley 1985: production systems, nɞ.
Baum & Durdanovic 1999: simpler blocks problem scales linearly, nɞ (Kwee 2001)
Nonlearning AI planners: n size < 100,000
Here: n up to 30; solution size > 1,000,000,000
Speedup: first learn a seemingly unrelated task (next slide)!
This gives us an opportunity to demonstrate incremental learning - knowledge transfer from one task to the next
Back to J. Schmidhuber's OOPS page