
Proof of Theorem 1

Since the hardness results of Theorems 1 and 3 are stated for the two-class case, we shall use the notation $\Delta_{(i)}=\vec{v}_{(i)}[1] - \vec{v}_{(i)}[0]$ for an arbitrary rule $(t_{(i)}, \vec{v}_{(i)})$, where $\vec{v}_{(i)}[0]$ is the value for class ``-'' and $\vec{v}_{(i)}[1]$ is the value for class ``+''. A positive $\Delta_{(i)}$ means that $t_{(i)}$ is in favor of class ``+'', a negative $\Delta_{(i)}$ means that $t_{(i)}$ is in favor of class ``-'', and $\Delta_{(i)}=0$ makes $t_{(i)}$ neutral with respect to the classes. We use a reduction from the $NP$-Hard problem ``Minimum Cover'' [Garey Johnson1979]: given a set $S$, a collection $C$ of subsets of $S$ and an integer $K \leq \vert C\vert$, decide whether there exists a subcollection $C' \subseteq C$ with $\vert C'\vert \leq K$ whose union is $S$.

The reduction is constructed as follows: from a ``Minimum Cover'' instance we build a learning sample $LS$ such that if there exists a cover $C'$ of $S$ with $\vert C'\vert \leq K$, then there exists a decision committee with $\vert C'\vert$ literals consistent with $LS$, and, conversely, if there exists a decision committee with $k$ literals consistent with $LS$, then there exists a cover of $S$ of size $k$. Hence, finding the smallest decision committee consistent with $LS$ is equivalent to finding the smallest $K$ for which there exists a solution to ``Minimum Cover'', and this is intractable if $P\neq NP$.
Let $c_j$ denote the $j^{th}$ element of $C$, and $s_j$ the $j^{th}$ element of $S$. We define a set of $\vert C\vert$ Boolean variables in one-to-one correspondence with the elements of $C$, which we use to describe the examples of $LS$. The corresponding set of literals is denoted $\{x_1, \overline{x}_1, x_2, \overline{x}_2, ..., x_{\vert C\vert}, \overline{x}_{\vert C\vert}\}$. The sample $LS$ contains two disjoint subsets: the set of positive examples $LS^+$, and the set of negative ones, $LS^-$. $LS^+$ contains $\vert S\vert$ examples, denoted by $e^+_1, e^+_2, ..., e^+_{\vert S\vert}$. We construct each positive example so that it encodes the membership of the corresponding element of $S$ in the elements of $C$. More precisely,
$\displaystyle \forall\, 1\leq i\leq \vert S\vert, \quad e^+_i = \left( \bigwedge_{j : s_i \in c_j} {x_j} \right) \wedge \left( \bigwedge_{j : s_i \not\in c_j} {\overline{x}_j} \right) \:\:.$     (9)

$LS^-$ contains a single negative example, defined by:
$\displaystyle e^- = \bigwedge_{j=1}^{\vert C\vert} {\overline{x}_j} \:\:.$     (10)
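As an illustration, the construction of $LS$ can be sketched in code (the function name and the Boolean-tuple encoding of examples are ours, not the paper's):

```python
# Sketch (our own encoding): build the learning sample LS from a
# "Minimum Cover" instance (S, C). Each example is a tuple of |C|
# Booleans, one per variable x_j.

def build_sample(S, C):
    """e^+_i sets x_j to True iff s_i belongs to c_j (equation (9));
    the single negative example e^- sets every x_j to False (equation (10))."""
    positives = [tuple(s in c for c in C) for s in S]
    negative = tuple(False for _ in C)
    return positives, negative

# Tiny instance: S = {1, 2, 3}, C = (c_1, c_2, c_3) = ({1,2}, {2,3}, {3})
positives, negative = build_sample([1, 2, 3], [{1, 2}, {2, 3}, {3}])
```

On this tiny instance, $e^+_1$ is $(x_1, \overline{x}_2, \overline{x}_3)$, encoded as the tuple `(True, False, False)`, and $e^-$ is the all-`False` tuple.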

$\bullet$ Suppose there exists a cover $C'$ of $S$ satisfying $\vert C'\vert \leq K$. We create a decision committee consisting of $\vert C'\vert$ monomials, each containing a single positive literal and associated with a positive $\Delta$; each monomial encodes one of the sets in $C'$. The default class is ``-''. This decision committee is consistent with the examples of $LS^+ \cup LS^-$: since $C'$ is a cover, every positive example satisfies at least one monomial (otherwise some element of $S$ would not be covered), while $e^-$ satisfies none and receives the default class. If only two values are authorized for the vectors and both are $\leq 0$, we instead create a DC consisting of one monomial of negative literals associated with a negative $\Delta$ (the value for the negative class is greater than the one for the positive class); each of the negative literals encodes one of the sets in $C'$, and the default class is ``+''.
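A minimal executable sketch of this direction (the rule representation is ours: a rule is a triple of positive literal indices, negative literal indices, and $\Delta$; the committee output sums the $\Delta$ of the satisfied rules, breaking ties with the default class):

```python
# Sketch (our representation, not the paper's): a rule is
# (pos_lits, neg_lits, delta), where literals are variable indices.

def committee_from_cover(cover_indices):
    """One single-literal monomial x_j with delta = +1 per set c_j
    in the cover C'; the default class is '-'."""
    return [({j}, set(), +1) for j in cover_indices]

def classify(committee, example, default=0):
    """Class '+' is 1, class '-' is 0. Sum delta over the monomials
    satisfied by the example; ties fall back on the default class."""
    total = sum(d for pos, neg, d in committee
                if all(example[j] for j in pos)
                and all(not example[j] for j in neg))
    return 1 if total > 0 else 0 if total < 0 else default

# Cover {c_1, c_2} of S = {1, 2, 3} with C = ({1,2}, {2,3}, {3}):
dc = committee_from_cover([0, 1])
# every e^+_i satisfies some monomial; e^- falls back on the default '-'
```

Each positive example triggers at least one monomial with $\Delta = +1$ (because $C'$ covers $S$), while the all-`False` negative example triggers none.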
$\bullet$ Suppose now that there exists a decision committee $f$ with at most $k$ literals consistent with $LS$. Denote by $t_1, t_2, ..., t_{\vert f\vert}$ the monomials of $f$, in no specific order, and by $\Delta_1, \Delta_2, ..., \Delta_{\vert f\vert}$ their associated values of $\Delta$. Each monomial of $f$ belongs to one of three types: monomials containing only positive literals, monomials containing only negative literals, and monomials containing literals of both kinds. Let us call these three classes $M$, $N$ and $MN$ respectively. Given that each monomial of $f$ can be associated with a positive or a negative $\Delta$, there exist on the whole six classes of rules, presented in Figure 5.

Figure 5: The six possible cases of rules.

                  $\Delta > 0$    $\Delta < 0$
    $M$           class I         class II
    $N$           class III       class IV
    $MN$          class V         class VI

Any monomial of $f$ containing at least one positive literal can only be satisfied by positive examples. Therefore, if there exist rules belonging to class II or VI, we can remove them without losing consistency: their $\Delta$ is negative, so they can only penalize positive examples. Furthermore, since $e^-$ contains only negative literals, removing the negative literals from all rules belonging to class V (turning them into class I rules) does not lose consistency either. As a consequence, we can suppose without loss of generality that all rules of $f$ are in class I, III, or IV.
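The simplification above can be sketched as follows (the rule representation is a hypothetical one of our own: a rule is a triple of positive literal indices, negative literal indices, and $\Delta$):

```python
# Sketch: reduce a committee to rules of classes I, III and IV only, as
# argued in the text. Class II/VI rules (at least one positive literal,
# delta < 0) are dropped; class V rules (both kinds of literals,
# delta > 0) lose their negative literals and become class I rules.

def simplify(committee):
    out = []
    for pos, neg, delta in committee:
        if pos and delta < 0:          # class II or VI: safe to remove
            continue
        if pos and neg and delta > 0:  # class V becomes class I
            neg = set()
        out.append((pos, neg, delta))
    return out

simplified = simplify([({0}, {1}, +1),    # class V  -> stripped to class I
                       ({0}, set(), -1),  # class II -> removed
                       (set(), {2}, +1)]) # class III -> kept as is
```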
We now treat independently two cases, depending on whether the default class of $f$ is ``+'' or ``-''.

  1. The default class is ``-''. Every positive example therefore satisfies at least one monomial of $f$. There can exist two types of positive examples: those satisfying at least one rule of class I, and those not satisfying any class I rule (and therefore satisfying at least one rule of class III). $e^-$ satisfies all class III and IV rules and must be assigned class ``-''. Therefore,
    $\displaystyle \sum_{(t_i,\vec{v}_i) \in f \cap (\mbox{ class III } \cup \mbox{ class IV })} {\Delta_i} \leq 0 \:\:.$     (11)

    This shows an important property, which we call P in what follows: a positive example not satisfying any class I rule cannot satisfy all class IV rules. Indeed, the only other rules such an example can satisfy are class III rules, whose $\Delta$ values are positive; if it satisfied all class IV rules, its output would be at most the sum in inequality (11), hence $\leq 0$, and the example would receive class ``-'', which is impossible by the consistency hypothesis. We now show how to build a valid solution to ``Minimum Cover'' with at most $k$ elements. Consider any positive example $e^+_i$. If $e^+_i$ satisfies some class I rule, then every literal of that rule is a positive literal $x_j$ satisfied by $e^+_i$, so by construction $s_i \in c_j$: we pick one such $c_j$. Otherwise, by property P, there exists a class IV rule not satisfied by $e^+_i$, so one of its literals $\overline{x}_j$ is falsified; hence $x_j$ holds in $e^+_i$, that is, $s_i \in c_j$, and we pick this $c_j$. Iterating the above procedure for all positive examples, we obtain a cover of $S$; since each chosen $c_j$ corresponds to a literal occurring in $f$, this cover contains at most $k$ sets.
  2. The default class is ``+''. Again $e^-$ satisfies all class III and IV rules, and since $e^-$ must not receive the default class, the inequality is now strict:
    $\displaystyle \sum_{(t_i,\vec{v}_i) \in f \cap (\mbox{ class III } \cup \mbox{ class IV })} {\Delta_i} < 0 \:\:.$     (12)

    Even though the inequality is now strict, the same argument as in the preceding case yields the same procedure for efficiently building a solution to ``Minimum Cover'' with at most $k$ elements.
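The cover-extraction procedure used in both cases can be sketched as follows (the rule representation, a triple of positive literal indices, negative literal indices, and $\Delta$, is our own; the committee is assumed to be already reduced to classes I, III and IV):

```python
# Sketch: extract a cover of S from a consistent committee whose rules
# all belong to classes I, III or IV. For each positive example, either
# a satisfied class I rule yields a positive literal x_j with s_i in c_j,
# or (by property P) some unsatisfied class IV rule has a falsified
# negative literal, i.e. a variable x_j true in the example, so s_i is
# again in c_j.

def cover_from_committee(committee, positives):
    cover = set()
    for ex in positives:
        # a class I rule (only positive literals, delta > 0) satisfied by ex
        rule = next((pos for pos, neg, d in committee
                     if d > 0 and pos and not neg
                     and all(ex[j] for j in pos)), None)
        if rule is not None:
            cover.add(next(iter(rule)))   # any literal of the rule works
            continue
        # otherwise: a class IV rule (only negative literals, delta < 0)
        # not satisfied by ex, i.e. one of its variables is true in ex
        for pos, neg, d in committee:
            if d < 0 and not pos:
                true_vars = [j for j in neg if ex[j]]
                if true_vars:
                    cover.add(true_vars[0])
                    break
    return cover
```

The returned indices name one set of $C$ per literal used, so the cover never exceeds the number of literals in the committee.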
This ends the proof of Theorem 1.


©2002 AI Access Foundation and Morgan Kaufmann Publishers. All rights reserved.