Fuzzy Classifier for Church Cyrillic Handwritten Characters
Cveta Martinovska
1
, Igor Nedelkovski
2
, Mimoza Klekovska
2
and Dragan Kaevski
3
1
Computer Science Faculty, University Goce Delcev, Tosho Arsov 14, Stip, R. Macedonia
2
Faculty of Technical Sciences, University St Kliment Ohridski, Ivo Ribar Lola bb, Bitola, R. Macedonia
3
Faculty of Electrical Engineering and Information Technologies, University St Cyril and Methodius,
Rugjer Boshkovik bb Skopje, R. Macedonia
Keywords: Handwritten Character Recognition, Historical Manuscripts Recognition, Fuzzy Decision Techniques,
Feature Extraction, Recognition Accuracy and Precision.
Abstract: This paper presents a fuzzy methodology for classification of Old Slavic Cyrillic handwritten characters.
The main idea is that the most discriminative features are extracted from the outer character segments
defined by intersections. Prototype classes are formed using fuzzy aggregation techniques applied over the
fuzzy rules that constitute the descriptions of the characters. Recognition methods use features like number
and position of spots in outer segments, compactness, symmetry, beams and columns to assign a pattern to a
prototype class. The accuracy and precision of the fuzzy classifier are evaluated experimentally. This fuzzy
recognition system is applicable to a large collection of Old Church Slavic Cyrillic manuscripts.
1 INTRODUCTION
Recognition of handwritten characters has been a
subject of intensive research in the last 20 years
(Arica and Yarman-Vural, 2001); (Vinciarelli,
2002). Different approaches for developing
handwritten character recognition systems are
proposed, like Fuzzy Logic (Malaviya and Peters,
2000); (Ranawana et al., 2004), Neural Networks
(Zhang, 2000) and Genetic Algorithms (Kim and
Kim, 2000).
This paper describes a character recognition
system developed for digitalization of a large Old
Cyrillic manuscripts collection found in Macedonian
churches and monasteries. This process cannot be
performed using the existing computer software due
to the specific properties of Old Slavic characters.
A novel classification methodology based on the
fuzzy descriptions of characters is proposed.
Number and position of spots, beams and columns
that appear in the outer segments of the topological
character map are considered as significant features.
This character recognition system is applicable to a
large historical collection of manuscripts that
originate from various periods and locations. The
manuscripts used for church liturgical purposes are
unaffected by style changes. They are written in
Constitutional Script. This Script looks like printed
text, where character contour lines can be easily
extracted.
2 CHARACTER ANALYSIS AND
FEATURE EXTRACTION
Manuscripts are converted to black and white
bitmaps. The first step of processing is extracting the
characters using contour following function (Fig. 1).
Visual prototype of a normalized character is
analyzed to determine character features and their
membership functions. Several features are
examined, such as compactness, x-y symmetry,
presence of beams and columns in three horizontal
and vertical segments and number of spots in outer
segments.
According to visual features, the characters of
the Church Slavic alphabet can be grouped in
several subsets. There is a subset whose members
are Г, В and Б that have emphasized vertical lines on
the left-side or left column. Another subset contains
characters such as П and Ш that have a right-side
and left-side column. The third subset consists of
characters like П, Г and Б that have noticeable
horizontal line in the upper segment (upper beam).
The fourth subset consisting of characters as Ш and
310
Martinovska C., Nedelkovski I., Klekovska M. and Kaevski D..
Fuzzy Classifier for Church Cyrillic Handwritten Characters.
DOI: 10.5220/0003968403100313
In Proceedings of the 14th International Conference on Enterprise Information Systems (ICEIS-2012), pages 310-313
ISBN: 978-989-8565-10-5
Copyright
c
2012 SCITEPRESS (Science and Technology Publications, Lda.)