set. Each learning set includes 66 points: 8 points are labeled, 58 points are unla-
belled. For the same data the supervised LS-SVM classifier was created. Average
error rate of SS-LS-SVM classification results on test set 2.3%. This is twice lower
than the error rate calculated for the LS-SVM classifier based on 8 points that is equal
5.6%. The results prove that SS-LS-SVM classifier is independent of the selection of
data points to the learning set.
4.5 Pruning Procedure for Two Moons Problem
We tested the pruning procedure on two data sets tm_27749_20 and tm_94326. Ap-
plications of pruning procedure to the learning set tm_27749_20 yields in the super-
vised LS-SVM model that consists of 10 points (the original set comprised 66 points)
shown in Fig. 5. The test set classification (using pruned model) yielded in 100%
correctly classified examples.
Table 1. Results of pruning procedure applied to 10 permutations of tm_94326 data sets.
original classifier pruned classifier
data
set tp tn fp fn prec recall tp tn fp fn prec recall
Model
size
01 673 653 5 3 99,26 99,56 673 653 5 3 99,26 99,56 83
02 657 667 4 6 99,39 99,10 658 671 0 5 100,00 99,25 56
03 661 664 4 5 99,40 99,25 662 664 4 4 99,40 99,40 70
04 646 677 3 8 99,54 98,78 648 678 2 6 99,69 99,08 61
05 665 661 5 3 99,25 99,55 667 663 3 1 99,55 99,85 63
06 654 668 8 4 98,79 99,39 654 668 8 4 98,79 99,39 82
07 668 654 7 5 98,96 99,26 669 653 8 4 98,82 99,41 76
08 648 674 8 4 98,78 99,39 647 674 8 5 98,78 99,23 76
09 661 659 6 8 99,10 98,80 662 659 6 7 99,10 98,95 64
10 657 663 8 6 98,80 99,10 658 663 8 5 98,80 99,25 72
Average
99,13 99,28 99,18 99,34 69
10 permutations of the tm_94326 data set were generated by randomly assigning
the points to the test subset and the training set. For every permutation the supervised
LS-SVM classifier was created using the learning set. Such model is based on 666
data points. The performance of the classifiers was checked on the corresponding test
data sets (see precision and recall values in Table 1). After application of the pruning
procedure the obtained LS-SVM model was evaluated (results are shown in Table 1)
on the corresponding test sets.
The average model after pruning comprises 69 data points (versus 666 in the orig-
inal classifier before pruning procedure). The recall and precision values obtained
from the tests of the pruned classifier are similar to the values obtained for the origi-
nal classifier. The pruning procedure can be safely applied for obtaining of much
smaller model with the same excellent properties as its original counterpart.
78