Subgroup Evaluation

Next: The Expert's Role in Up: The Descriptive Induction Process Previous: Subgroup Interpretation through Visualization

3.6 Subgroup Evaluation

In order to evaluate the discovered risk groups, the medical expert has tested the induced subgroup patterns on an independent set of 70 people (50 CHD patients and 20 non-CHD cases from the same hospital). The results for these patients, summarized in Table 4, show that the patterns are successful in detecting CHD patients. About 90% of CHD patients were included into at least one of the five patterns. The detected sensitivity values (TPr) for patterns A1, B2, and C1 are significantly higher than the values computed on the set of patients used for subgroup discovery. For the other two patterns the values do not differ significantly. Note that the accuracy values are relatively high, despite the relatively high false positive rate (FPr): a lower FPr could have been achieved by selecting lower values of the generalization parameter g, at a cost of detecting subgroups with lower coverage of positive cases.

	Training set			Test set
	TPr	FPr	Accuracy	TPr	FPr	Accuracy
A1	47.5%	26.8%	59.4%	84.8%	77.8%	80.0%
A2	48.4%	6.7%	81.2%	41.2%	27.3%	70.0%
B1	28.8%	9.4%	72.7%	36.0%	20.0%	81.8%
B2	32.4%	12.6%	69.2%	42.0%	15.0%	87.5%
C1	23.4%	5.5%	78.8%	82.0%	40.0%	83.7%

Table 4: Summary of results obtained on the training set and on an independent set of 70 persons (50 CHD patients and 20 non-CHD cases from the same hospital), measured in terms of TPr, FPr and Accuracy.

Next: The Expert's Role in Up: The Descriptive Induction Process Previous: Subgroup Interpretation through Visualization