3.3 Handwritten Dataset
The handwritten dataset contained 2900 samples (29
classes X 100 samples). The samples were chosen
randomly from the multi-writers IFN/ENIT dataset
(400 different writers). The segmentation and the
diacritics elimination were done manually. Figure 3
presents examples from the handwritten dataset.
Figure 3: Examples from IFN/ENIT dataset.
Table 4 presents top results using handwritten
dataset. The poor performance rates are explained by
the dataset nature (handwriting) and the number of
writers (400 writers). Despite their weakness,
features combination can improve the results by
using k-NN (66.68%) as well as SVM (66.62%).
Table 4: Performance rates using handwritten dataset.
k-NN Classifier
(Canberra distance and k=1)
ZM TMM IM FMT FD AMI
Rate
45.65
34.13 32.75 26.34 22.55 22.20
Features Combination
All
ZM+TMM
+IM+AMI+
FMT
ZM+TMM
+IM+FMT
ZM+TMM
+IM+AMI
ZM+TM
M+IM+F
D
Rate
66.68
66.13 65.10 62.06 61.10
SVM Classifier
ZM FMT FD AMI IM TMM
L
55.24 33.44 17.03 14.27 13.1 3.44
P
(a)
60.34 26.62 20.55 11.17 9.10 3.44
(b)
60.62 30.96 17.79 11.86 6.41 3.44
(c)
57.72 37.37 3.44 14.82 3.44 3.44
S
(d)
50.89 3.86 2.96 7.37 3.51 3.65
(e)
54.20 3.31 3.44 8.27 3.70 3.65
(f)
51.93 3.44 3.79 7.58 3.03 3.51
R
B
F
(g)
60.68 37.17 18.62 16.20 7.86 4.68
(h)
60.89
38.13 23.03 14.13 7.72 4.68
(i)
60.34 30.48 22.89 12.68 8.06 4.75
Features Combination
ZM+FM
T
ZM+FMT+
FD
ZM+FMT+
FD+AMI
ZM+FD All
L 61.86 61.17 62.13 54.96 3.44
(h)
66.62
37.44 35.51 32.62 5.03
(b) 26.82 39.44 40.62 38.20 3.44
4 CONCLUSIONS AND FUTURE
WORK
The present paper proposed a comparative study
over Arabic optical character recognition, following
the statistic approach. We tried to highlight the
obtained results using different datasets, different
feature extraction techniques and different
classifiers. For printed and for handwritten datasets,
Zernike moments give the best recognition rate. This
conclusion can be explained by Zernike polar
coordinates which are more robust than other
coordinates types. For multi-oriented dataset, affine
moment invariants are in first position. This
conclusion can be explained by their robust
invariance to the rotation. The choice of k-NN or
SVM depends on the system needs. In future
experiments, we aim to extend our study to larger
datasets and to incorporate and to study other
different feature extraction techniques (wavelets,
fractal dimension ...) and different classifiers (neural
networks...). In future work, we will develop the
system towards Arabic words and texts recognition.
REFERENCES
Abandah, G. and Anssari, N., 2009. Novel moment
features extraction for recognizing handwritten arabic
letters. J. Comput. Sci., 5: 226-232. DOI: 10.3844/
jcssp. 2009. 226.232.
Abdul Sattar, S., Shah, S., 2012. Character Recognition of
Arabic Script Languages. ICCIT, pp. 502-506.
Aboaisha, Hosain, Xu, Zhijie and El-Feghi, Idris (2012)
An investigation on efficient feature extraction
approaches for Arabic letter recognition. In:
PQDJCEAR’ Conference 2012: CEARC’12, pp. 80-
85. ISBN 978-1-86218-106-9.
Arif, M., Brouard, T., Vincent, N., 2006. A new fuzzy k-
Nearest Neighbors rule. Pattern recognition.
Imran, K. P., Abdulbari A. A., Ramteke R. J., 2012.
Recognition of Offline Handwritten Isolated Urdu
Character. ACR ISSN: 0975-3273 & E-ISSN: 0975-
9085, Volume 4, Issue 1, pp.-117-121.
Jenabzadeh, M. R., Azmi, R., Pishgoo, B., Shirazi, S.S.,
2011. Two Methods for Recognition of Hand Written
Farsi Characters. IJIP, Vol. 5(4).
Mozaffari, S., Faez, K., Kanan, H., R., 2004. Feature
Comparison between Fractal Codes and Wavelet
Transform in Handwritten Alphanumeric Recognition
Using SVM Classifier. ICPR (2): 331-334.
Sabri A. Mahmoud, Ashraf S. Mahmoud, 2009. Arabic
Character Recognition Using Modified Fourier
Spectrum (MFS) vs. Fourier Descriptors. Cybernetics
and Systems 40(3): 189-210.
Slimane, F., Ingold, R., Kanoun, S., Alimi, M. A., and
Hennebert, J., 2009. A New Arabic Printed Text
Image Database and Evaluation Protocols. In proc. of
10th IEEE ICDAR 2009, Barcelona (Spain), July 26 -
29 2009 , pp. 946-950.
Zaghloul, R. I., AlRawashdeh, E., F., Bader, D. M. K.,
2011. Multilevel Classifier in Recognition of
Handwritten Arabic Characters. Journal of Computer
Science 7 (4): 512-518.
ICPRAM2013-InternationalConferenceonPatternRecognitionApplicationsandMethods
284