most data sets, in terms of the obtained unbiased AUC
values by averaging over 10 different models. This
strengthens our claim that by emphasizing each sub-
class low variance directions will allow the best se-
paration between the target class and outliers, which
results in best performance. Also, according to Table
(4) and Table (3), the AUC values average is around
50%, this is expected as both used datasets (“MNIST
Database of Handwritten Digits” and “Optical Recog-
nition of Handwritten Digits”) are highly overlapped.
The last rows of Table (4) and Table (3) provides the
confidence intervals (in %) obtained from the perfor-
med t-tests. This confidence interval quantifies the
probability of the paired distributions being the same.
The higher the confidence interval, the lower is the
probability that the underlying distributions are sta-
tistically indifferent. As we can see, all the confi-
dence intervals are high, which shows that SCOSVM
indeed provides statistically significant accuracy im-
provements.
In terms of training computational complexity, the
COSVM algorithm uses sequential minimal optimi-
zation to solve the quadratic programming problem,
and therefore scales with is O(N
3
). According to the
Equation (7) the SCOSVM scales with same com-
plexity. However, we expect that SCOSVM has hig-
her training time, especially, as target class has several
clusters. Table (5) shows the average training times
per model for the data sets. As we expect, the running
time of the SCOSVM method is reasonably higher
than the unimodal COSVM classifier. We also present
some individual graphical results for the data set mo-
dels by plotting the actual Receiver Operating Cha-
racteristics (ROC) for the data set (mfeat zer). Figure
3 shows the ROC curves for three classifiers (OSVM,
COSVM, SCOSVM) for one out of the 10 models for
this data set. We can clearly see from Figure 3 that
SCOSVM indeed leads to a best ROC curve in terms
of performance (Nallammal and Radha, 2010).
5 CONCLUSION
In this paper, we investigate the effectiveness of a
novel SCOSVM classification approach (SCOSVM)
in Handwritten Digits Recognition. Comparatively
to the unimodal COSVM, the SCOSVM is able to
handle multi-modal target class, and takes advantage
of the target class clusters low variance directions, to
improve classification performance. The evaluation
and comparison are carried out on the relevant Hand-
written Digits datasets, namely, “The Optical Recog-
nition of Handwritten Digits” and “The MNIST Da-
tabase of Handwritten Digits”, where we compared
our method against contemporary one-class classi-
fiers. Results have shown the superiority of the met-
hod. Future work will consist in validating the propo-
sed novel SCOSVM on strong applications, such as,
face recognition, anomaly detection, etc.
REFERENCES
Bache, K. and Lichman, M. (2013). UCI machine learning
repository.
Bouguessa, M., Wang, S., and Sun, H. (2006). An objective
approach to cluster validation. Pattern Recognition
Letters, 27(13):1419–1430.
Cabral, G. G. and de Oliveira, A. L. I. (2011). A novel one-
class classification method based on feature analysis
and prototype reduction. In SMC, pages 983–988.
Cortes, C. and Vapnik, V. (1995). Support vector networks.
Machine Learning, 20:273–297.
Deng, L. (2012). The MNIST database of handwritten digit
images for machine learning research. IEEE Signal
Process. Mag., 29(6):141–142.
Ebrahimzadeh, R. and Jampour, M. (2014). Efficient hand-
written digit recognition based on histogram of orien-
ted gradients and SVM. International Journal of Com-
puter Applications., 104(9):10–13.
Fawcett, T. (2004). ROC graphs: Notes and practical consi-
derations for researchers. Machine learning, 31:1–38.
Gorgevik, D. and Cakmakov, D. (2004). An efficient three-
stage classifier for handwritten digit recognition. In
ICPR (4), pages 507–510.
Hanley, J. A. and McNeil, B. J. (1983). A method of com-
paring the areas under receiver operating characteris-
tic curves derived from the same cases. Radiology,
148(3):839–843.
Jagannathan, J., Sherajdheen, A., Muthu Vijay Deepak, R.,
and Krishnan, N. (2014). License plate character
segmentation using horizontal and vertical projection
with dynamic thresholding. In International Confe-
rence on Emerging Trends in Computing, Communi-
cation and Nanotechnology., pages 700–705.
Khan, N. M., Ksantini, R., Ahmad, I. S., and Guan, L.
(2014). Covariance-guided one-class support vector
machine. Pattern Recognition, 47(6):2165–2177.
Larroza, A., Moratal, D., Paredes-Sanchez, A., Soria-
Olivas, E., Chust, M. L., Arribas, L. A., and Arana,
E. (2015). Support vector machine classification of
brain metastasis and radiation necrosis based on tex-
ture analysis in MRI. Journal of Magnetic Resonance
Imaging., 42(5):1362–1368.
Lauer, F., Suen, C. Y., and Bloch, G. (2007). A trainable fe-
ature extractor for handwritten digit recognition. Pat-
tern Recognition, 40(6):1816–1824.
Mahmoud, S. A. and Al-Khatib, W. G. (2011). Recognition
of arabic (indian) bank check digits using log-gabor
filters. Appl.Intell., 35(3):445–456.
Malon, C., Uchida, S., and Suzuki, M. (2008). Mathemati-
cal symbol recognition with support vector machines.
Pattern Recognition Letters, 29(9):1326–1332.
A Novel Handwritten Digits Recognition Method based on Subclass Low Variances Guided Support Vector Machine
35