
Adaptive Levin Search (ALS)

LS is not necessarily optimal for ``incremental'' learning problems, where experience with previous problems may help to reduce future search costs. To turn non-incremental LS into an incremental search method, we introduce a simple, heuristic, adaptive LS extension (ALS) that uses experience with previous problems to adaptively modify LS's underlying probability distribution. ALS essentially works as follows: whenever LS finds a program $q$ that computes a solution for the current problem, the probabilities of $q$'s instructions $q_1, q_2, \ldots, q_{l(q)}$ are increased (here $q_i \in \{b_1,\ldots,b_{n_{ops}}\}$ denotes $q$'s $i$-th instruction, and $l(q)$ denotes $q$'s length; if LS did not find a solution, then $q$ is the empty program and $l(q)$ is defined to be 0). This increases the probability of the entire program. The size of the adjustment is controlled by a learning rate $\gamma$ ($0 < \gamma < 1$). ALS is related to the linear reward-inaction algorithm, e.g., [Narendra and Thathachar, 1974; Kaelbling, 1993]; the main difference is that ALS uses LS to search through program space as opposed to single-action space. As in the previous section, the probability distribution $D_P$ is determined by $P$. Initially, all $P_{ij} = \frac{1}{n_{ops}}$. Given a sequence of problems $(N_1, N_2, \ldots, N_r)$, however, the $P_{ij}$ may undergo changes caused by ALS:

ALS(problems $(N_1, N_2, \ldots, N_r)$, variable matrix $P$)

  for $i$ := 1 to $r$ do:
    $q$ := Levin search($N_i$, $P$); Adapt($q$, $P$).

where the procedure Adapt works as follows:

Adapt(program $q$, variable matrix $P$)

  for $i$ := 1 to $l(q)$, $j$ := 1 to $n_{ops}$ do:
    if $q_i = b_j$ then $P_{ij} := P_{ij} + \gamma (1 - P_{ij})$
    else $P_{ij} := (1 - \gamma) P_{ij}$
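For example, with $n_{ops} = 4$ and $\gamma = 0.1$, a still-uniform row $(0.25, 0.25, 0.25, 0.25)$ whose position was occupied by $b_2$ in the solution becomes $(0.225, 0.325, 0.225, 0.225)$. Each updated row of $P$ remains normalized: the chosen entry becomes $(1-\gamma)P_{ij} + \gamma$ while every other entry in the row is scaled by $(1-\gamma)$, so the row sum stays $(1-\gamma) \cdot 1 + \gamma = 1$.

For concreteness, here is a minimal Python sketch of Adapt and the ALS loop. It is an illustration under assumptions, not the original implementation: levin_search stands in for the LS procedure of the previous section, and l_max (the number of rows of $P$, i.e., the maximum program length considered) is an assumed parameter.

  import numpy as np

  def adapt(q, P, gamma=0.05):
      # q: solving program as a list of instruction indices (q[i] = j
      #    means the i-th instruction is b_j); empty list if LS failed,
      #    in which case l(q) = 0 and nothing is updated.
      # P: l_max x n_ops matrix; row i is the distribution over the
      #    instruction at program position i.
      for i, j in enumerate(q):
          # Shift probability mass toward the instruction actually used:
          # P_ij := P_ij + gamma * (1 - P_ij)
          P[i, j] += gamma * (1.0 - P[i, j])
          # Scale the alternatives at position i down:
          # P_ik := (1 - gamma) * P_ik for all k != j
          for k in range(P.shape[1]):
              if k != j:
                  P[i, k] *= 1.0 - gamma

  def als(problems, levin_search, l_max, n_ops, gamma=0.05):
      # Initially all P_ij = 1 / n_ops (uniform distribution).
      P = np.full((l_max, n_ops), 1.0 / n_ops)
      for N in problems:
          q = levin_search(N, P)  # placeholder for LS; returns a solving
                                  # program, or [] if none was found
          adapt(q, P, gamma)
      return P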

