|
The basic experiment tests the ability of MICRO-HILLARY to learn the 15-puzzle domain. Table 1 shows the learning resources and the macro statistics averaged over the 100 learning sessions. We also tested the utility of the learned knowledge by comparing the performance of the problem solver that uses the learned macros to the following problem solvers:
|
MICRO-HILLARY, after learning, solves problems much faster than the other algorithms. It uses 19 times fewer operator applications than best-first search using the MD heuristic, with solutions of comparable length. If we compare the CPU time of best-first search to that of MICRO-HILLARY we can see that MICRO-HILLARY is more than 100 times faster than best-first search. This is because best-first search has an overhead per generated node which is much higher than MICRO-HILLARY's.9 MICRO-HILLARY also has the advantage of a very small standard deviation. MICRO-HILLARY after learning is 322 times faster than MICRO-HILLARY before learning. MICRO-HILLARY is 59 times faster than WA* with W=3, but yields solutions that are twice as long. MICRO-HILLARY is more than 1,000,000 times faster than IDA* with solutions that are 2.8 times longer (than the optimal solutions found by IDA*).
During the testing phase, MICRO-HILLARY never encountered local minima. Thus, according to definition 1, all the sets of macros learned by MICRO-HILLARY were apparently complete. This was also the case with the rest of the experiments described in this paper.
We have tried using iterative deepening (ID) for escape instead of ILB. We used an improved version of ID that tests each new state against the states along the path to the root to avoid loops. Table 3 shows the resources consumed during learning. Table 4 shows the statistics of the macro sets acquired. As expected, learning with ILB is faster than with ID (by a factor of 9), but produces macros that are slightly longer (ID finds the shortest escape routes).
|
|