different styles and characteristics. Therefore, the
types of touching characters vary from language to
language, which in turn require different methods
for segmenting the touching characters in each
language. For example, the handwritten cursive
characters shown in Figure 3(a) are a type of
touching characters typically found in English
handwritten manuscripts but not in Lanna
manuscripts. Only the types shown in Figures 3(b),
3(c), and 3(d) can be found in Lanna manuscripts.
The purpose of this research is to separate the
touching or overlapping Lanna characters, which
have not been effectively solved using any other
character segmentation methods.
(a) (b)
(c) (d)
Figure 3: Four types of touching characters.
Character segmentation is a process that seeks to
decompose a sequence of characters into individual
symbols. There have been substantial researches
undertaken to solve character segmentation problem,
mostly for numerals, English script, Chinese script,
Arabic script, and Bangla script. Segmentation
strategies can be divided into three main categories
(Casey and Lecolinet, 1996; Marinai, Gori, and
Soda, 2005): dissection methods, recognition-based
methods, and holistic methods. Dissection methods
decompose the image into a sequence of sub-images
using general features, e.g., character height and
width (Hoang, Tabbone, and Pham, 2009).
Recognition-based methods search the image for
components that match classes in its alphabet.
Holistic methods seek to recognize entire words as a
whole, thus avoiding the need to segment the image
into characters. Among the methods proposed for
character segmentation, Tseng and Chen (1998)
proposed a three-stage Chinese character
segmentation algorithm. Firstly, a bounding box is
created around each stroke of a Chinese character.
Secondly, the knowledge-based merging operations
are used to merge the stroke bounding boxes
together. Finally, a dynamic programming is used to
find the optimal segmentation boundaries. The
experimental results show that the proposed
algorithm is a very effective segmentation algorithm.
It works well even with touching and/or overlapping
characters. Xiao and Leedham (2000) proposed a
novel approach to English cursive script
segmentation. In the proposed approach, connected
components are split into sub-components based on
their face-up or face-down background regions.
Then the over-segmented sub-components are
merged into characters according to the knowledge
of character structures are their joining
characteristics. Bhowmik, Roy, and Roy (2005)
proposed a segmentation scheme for handwritten
Bangla words. The authors use the analysis of
directional chaincode and the positional information
to extract the features from the image, then employ
multilayer perceptron neural network to determine
the segmentation points. The authors also point out
that their segmentation result can be significantly
improved if their proposed technique is combined
with the recognition process in a holistic system.
This study focuses mainly on the dissection
methods. Projection analysis, connected component
processing, and bounding box analysis are three
widely used dissection methods (Chen, Wu, and
Lee, 1998). While projection analysis is very
effective for segmenting good quality machine
printed manuscripts (Casey and Lecolinet, 1996), it
has limited success when segmenting handwritten
manuscripts. Connected component processing and
bounding box analysis usually offer an efficient way
to segment handwritten manuscripts. However, they
might lead to incorrect segmentation when dealing
with touching characters as shown in Figure 4.
Figure 4: Segmentation results by bounding box analysis.
In this paper, the new dissection algorithm is
proposed to segment touching Lanna characters. The
performance of the proposed algorithm is measured
by the ability of the proposed algorithm to correctly
segment 6 different handwritten Lanna manuscripts.
Following this introduction, section 2 briefly
describes the general process of the proposed
character segmentation process. Section 3 explains
Touching
Characters
Bounding Box Analysis
SIGMAP 2011 - International Conference on Signal Processing and Multimedia Applications
48