For this class of problems, the phase transition behavior is
illustrated in Fig. 3. Specifically, this shows the cost to solve the problem with
a simple chronological backtrack search. The cost is given in terms of
the number of search nodes considered until a solution is found. The
minimum cost, for a search that proceeds directly to a solution with no
backtrack is . The parameter distinguishing underconstrained from
overconstrained problems is the ratio
of the number of nogoods
m at level 2 given by the constraints
to the number of items N.
Figure 3: The solid curves show the classical backtrack search cost for randomly generated problems with a prespecified solution as a function of =m/N for N=10 (gray) and 20 (black) and L=N/2. Here m is the number of nogoods selected at level 2 of the search lattice. The cost is the average number of backtrack steps, starting from the empty set, required to find the first solution to the problem, averaged over 1000 problems. The error bars indicate the standard deviation of this estimate of the average value, and in most cases are smaller than the size of the plotted points. For comparison, the dashed curves show the probability for having a solution in randomly generated problems, ranging from 1 at the left to 0 at the right.
Even for these relatively small problems, a peak in the average
search cost is evident. Moreover, this peak is near the transition
region where random problems change from mostly soluble to
mostly insoluble. A simple, but approximate, theoretical value for the
location of the transition is given by the point where the expected
number of solutions is equal to one [50, 53]. Applying this to the class of
problems considered here is straightforward. Specifically, there are
complete sets for the problem, as given by
Eq. 1. A particular set
s of size
L will be good, i.e., a solution,
only if none of the nogoods selected for the problem are a subset of
s. Hence the probability it will be a
solution is given by
because there are sets of size 2 from which to choose the
m nogoods specified directly by the
constraints. The average number of solutions is then just
. If we set
and
, for large N this
becomes
where . The predicted transition
point
is then
given by
which is for the case considered here (i.e.,
). This closely matches the location of the peak in
the search cost for problems with prespecified solution, as shown in
Fig. 3, but is
about 20% larger than the location of the step in
solubility
. Furthermore, the
theory predicts there is a regime of polynomial average cost for
sufficiently few constraints [28]. This
is determined by the condition that the expected number of goods at each
level in the lattice is monotonically increasing. Repeating the above
argument for smaller levels in the lattice, we find that this condition
holds up to
which is for
.
While these estimates are only approximate, they do indicate that
the class of random soluble problems defined here behaves qualitatively
and quantitatively the same with respect to the transition behavior as a
variety of other, perhaps more realistic, problem classes. This close
correspondence with the theory (derived for the limit of large
problems), suggests that we are observing the correct transition
behavior even with these relatively small problems.
Moreover the above approximate theoretical argument suggests
that the average cost of general classical search methods scales
exponentially with the size of the problem over the full range of
. Thus this provides a good test case for the average
behavior of the quantum algorithm. As a final observation, it is
important to obtain a sufficient number of samples, especially near the
transition region. This is because there is considerable variation in
problems near the transition, specifically a highly skewed distribution
in the number of solutions. In this region, most problems have few
solutions but a few have extremely many: enough in fact to give a
substantial contribution to the average number of solutions even though
such problems are quite rare.