Proof of Theorem 3

Next: Proof of Theorem 4 Up: Appendix A Previous: Proof of Theorem 1

Proof of Theorem 3

We use a reduction from the -Hard problem ``2-NM-Colorability'' [Kearns et al.1987]:

Name : ``2-NM-Colorability''.
Instance : A finite set $S=\{s_1, s_2, ..., s_{\vert S\vert}\}$ and a collection of constraints over , $C=\{c_1, c_2, ..., c_{\vert C\vert}\}$ , such that $\forall i\in \{1, 2, ..., \vert C\vert\}, c_i \subseteq S$ .
Question : Does there exist a 2-NM-Coloration of the elements of , i.e. a function $\chi: S \rightarrow \{1,2\}$ such that

$\begin{eqnarray*} (\forall i \in \{1, 2, ..., \vert C\vert\}), (\exists s_k, s_l \in c_i): \chi(s_k) & \neq & \chi(s_l) \:\: ? \end{eqnarray*}$

The reduction is constructed as follows : from a ``2-NM-Colorability'' instance, we build a learning sample

such that if there exists a 2-NM-Coloration of the elements of

, then there exists a decision committee with two rules consistent with

, and, reciprocally, if there exists a decision committee with two rules consistent with

, then there exists a 2-NM-Coloration of the elements of

. Furthermore, there never exists a decision committee with only one rule consistent with

. Hence, finding the decision committee with the smallest number of rules consistent with

is at least as hard as solving ``2-NM-Colorability'', and this is intractable if $P\neq NP$ .
Let

denote the $j^{th}$ element of

, and

the $j^{th}$ element of

. We define a set of $\vert S\vert$ Boolean variables in one to one correspondence with the elements of

, which we use to describe the examples of

. The corresponding set of literals is denoted $\{x_1, \overline{x}_1, x_2, \overline{x}_2, ..., x_{\vert S\vert}, \overline{x}_{\vert S\vert}\}$ . Our reduction is made in the two-classes framework. The sample

contains two disjoint subsets : the set of positive examples

, and the set of negative ones

contains $\vert S\vert$ examples, denoted by $e^+_1, e^+_2, ..., e^+_{\vert S\vert}$ . We construct each positive example so that it represents an element of

. More precisely,

$\displaystyle \forall 1\leq i\leq \vert S\vert, e^+_i$

$\textstyle =$

$\displaystyle \overline{x}_i \wedge \bigwedge_{j=1, j\neq i}^{j=\vert S\vert} {\overline{x}_j} \:\:.$

(13)

contains $\vert C\vert$ examples, denoted by $e^+_1, e^+_2, ..., e^+_{\vert C\vert}$ . We construct each negative example so that it encodes each of the constraints of

. More precisely:

$\displaystyle \forall 1\leq i\leq \vert C\vert, e^-_i$

$\textstyle =$

$\displaystyle \left( \bigwedge_{j : s_j \in c_i} {\overline{x}_j} \right) \wedge \left( \bigwedge_{j : s_j \not\in c_i} {x_j} \right) \:\:.$

(14)

Without loss of generality, we make four assumptions on the instance of ``2-NM-Colorability'' due to the fact that it is not trivial:

There does not exist some element of present in all constraints. In this case indeed, the trivial coloration consists in giving to one of such elements one color, and the other color to all other elements of .
$\forall (i,j,k,l) \in \{1, 2, ..., \vert S\vert\}^4$ with $i \neq j$ and $k \neq l$ ,

$\displaystyle \exists o \in \{1, 2, ..., \vert C\vert\}, \{s_i,s_j\} \not\subseteq c_o$ $\textstyle \wedge$ $\displaystyle \{s_k,s_l\} \not\subseteq c_o \:\:.$ (15)

Otherwise indeed, there would exist $(i,j,k,l) \in \{1, 2, ..., \vert S\vert\}^4$ with $i \neq j$ and $k \neq l$ such that

$\displaystyle \forall o \in \{1, 2, ..., \vert C\vert\}, \{s_i,s_j\} \subseteq c_o$ $\textstyle \vee$ $\displaystyle \{s_k,s_l\} \subseteq c_o \:\:,$ (16)

and in that case, a trivial solution to ``2-NM-Colorability'' would consist in giving to one color and to the other one, and to one color and to the other one.
Each element of belongs to at least one constraint in . Otherwise, it can be removed.
Each constraint contains at least two elements from . Otherwise it can be removed.

$\bullet$ Suppose there exists a solution to ``2-NM-Colorability''. We build the DNF with two monomials of [Kearns et al.1987] consistent with the examples. Then, we build two rules by associating the two monomials to some (arbitrary) positive value. The default class is ``-''. This leads to a decision committee with two rules consistent with

.
$\bullet$ Suppose that there exists a decision committee

with at most two rules consistent with

. We now show that there exists a valid 2-NM-Coloration of the elements of

. We first show three lemmas which shall be used later on. Then, we show that the decision committee is actually equivalent to a DNF with two monomials consistent with

. We conclude by using previous results [Kearns et al.1987] on how to transform this DNF into a valid 2-NM-Coloration of the elements of

Lemma 3 If a monomial is not satisfied by any positive example,

either it contains at least two negative literals, or
it is the monomial containing all positive literals:

$\begin{displaymath}\bigwedge_{j=1}^{j=\vert S\vert} {x_j} \:\:.\end{displaymath}$

(Proof straightforward).

Lemma 4 If a monomial is satisfied by all positive examples, it is empty.

(Indeed, for any variable, there exist two positive examples having the corresponding positive literal, and the corresponding negative literal).

Lemma 5

contains exactly two rules.

Proof: Suppose that

contains one rule, whose monomial is called

. If the default class is ``-'', all positive examples satisfy

, which is impossible by Lemma 4: the monomial would be empty, and

could not be consistent. If the default class is ``+'', the negative examples are classified by

and therefore $\Delta_1<0$ . Thus, no positive example satisfies

. From Lemma 3, either $t_1=\bigwedge_{j=1}^{j=\vert S\vert} {x_j}$ and no negative example can satisfy it (impossible), or

contains at least two negative literals, and the constraints all have in common two elements of

. Thus, the instance of ``2-NM-Colorability'' is trivial, which is impossible. This ends the proof of Lemma 5. $\hbox{\vrule width 0.8pt \vbox to6pt{\hrule depth 0.8pt width 5.2pt \vfill\hrule depth 0.8pt}\vrule width 0.8pt}$
We now show that the default class of

is ``-''. For the sake of simplicity, we write the two monomials of

and

. The default class is denoted $\beta \in \{$ ``-'', ``+'' $\}$ . Making the assumption that $\beta=$ ``+'' implies that all negative examples must satisfy at least one monomial in

Suppose that $\Delta_{1}<0$ and $\Delta_{2}<0$ . Then, no positive example can satisfy either or . From the two possibilities of Lemma 3, only the first one is valid ( $\bigwedge_{j=1}^{j=\vert S\vert} {x_j}$ cannot be satisfied by any negative example). Thus, and contain each at least two negative literals:

$\displaystyle \{\overline{x}_i, \overline{x}_j\}$ $\textstyle \subseteq$ $\displaystyle t_1 \:\:,$ (17)

$\displaystyle \{\overline{x}_k, \overline{x}_l\}$ $\textstyle \subseteq$ $\displaystyle t_2 \:\:.$ (18)

We are in the second case of triviality of the instance of ``2-NM-Colorability'', since making the assumption that is consistent implies:

$\displaystyle \exists o \in \{1, 2, ..., \vert C\vert\}, \{s_i,s_j\} \not\subseteq c_o$ $\textstyle \wedge$ $\displaystyle \{s_k,s_l\} \not\subseteq c_o \:\:.$ (19)
Suppose that and . All negative examples must satisfy . is forced to be monotonous since otherwise (given that ``+'') all negative examples would share a common negative literal, thus all constraints would share a common element of , and the instance of ``2-NM-Colorability'' would be trivial. being satisfied by at least one positive example (otherwise, would be equivalent to a single-rule decision committee, and we fall in the contradiction of Lemma 5), it contains at most one negative literal. If it contains exactly one negative literal, it is satisfied by exactly one positive example, and we can replace it by the monotonous monomial with positive literals (we leave empty the position of the initial negative literal). Consequently, similarly for , we can suppose that is monotonous. We distinguish two cases.
- If $\vert\Delta_{1}\vert>\vert\Delta_{2}\vert$ , no positive example can satisfy . By fact 3, $t_1=\bigwedge_{j=1}^{j=\vert S\vert} {x_j}$ , and no negative example can satisfy it, a contradiction ( cannot be consistent).
- If $\vert\Delta_{1}\vert\leq \vert\Delta_{2}\vert$ . cannot be empty; therefore it contains a certain number of positive literals. Each positive example satisfying must also satisfy , since otherwise is not consistent; Since and are monotonous, is a generalization of , and any example satisfying (in particular, the negative examples) must satisfy , a contradiction.

Therefore $\beta=$ ``-''. This forces all positive examples to satisfy at least one monomial of

. Recall that

contains two monomials. Suppose that $\Delta_{1} > 0$ and $\Delta_{2}<0$ . It comes $t_1=\emptyset$ (Lemma 4). All negative examples must satisfy

, and we also have $\vert\Delta_{1}\vert\leq \vert\Delta_{2}\vert$ . No positive example can satisfy

, and Lemma 3 gives either $t_1=\bigwedge_{j=1}^{j=\vert S\vert} {x_j}$ (satisfied by no example, impossible) or

contains at least two negative literals, whose corresponding elements of

are shared by all constraints, and we obtain again that the instance of ``2-NM-Colorability'' is trivial.
Therefore, $\Delta_{1} > 0$ and $\Delta_{2}>0$ , and each monomial is satisfied by at least one positive example.

is thus equivalent to a DNF with the same two monomials, and we can use a previous solution [Kearns et al.1987] to build a valid 2-NM-Coloration. First, we can suppose that

is again monotonous [Kearns et al.1987]. Then, since each positive example satisfies at least one monomial ( $\beta=$ ``-''), then for all variable, there exists a monomial which does not contain the corresponding positive literal. The 2-Coloration is then

$\displaystyle \forall i\in\{1, 2, ..., \vert S\vert\}, \chi(s_i)$

$\textstyle =$

$\displaystyle \min_{j \in \{1, 2\}} {\{j : x_i \not\in t_j\}} \:\:.$

(20)

Could this be invalid ? That would mean that there exists a constraint

such that $\forall s_j \in c_i, \chi(s_j)=K=cst$ . This would mean that the corresponding negative example satisfies

, a contradiction [Kearns et al.1987]. This ends the proof of Theorem 3.

Next: Proof of Theorem 4 Up: Appendix A Previous: Proof of Theorem 1

$\displaystyle \{\overline{x}_i, \overline{x}_j\}$	$\textstyle \subseteq$	$\displaystyle t_1 \:\:,$	(17)
$\displaystyle \{\overline{x}_k, \overline{x}_l\}$	$\textstyle \subseteq$	$\displaystyle t_2 \:\:.$	(18)