EXTRACTION OF OBJECTS AND PAGE SEGMENTATION OF
COMPOSITE DOCUMENTS WITH NON-UNIFORM
BACKGROUND
Yasser Alginahi, Maher Sid-Ahmed, Majid Ahmadi
Univerisyt of Windsor, 401 Sunset Ave, Windsor, Ontario,Canada
Keywords: Statistical features, Multi-Layer Neural Networks, Non-uniform background.
Abstract: In designing page segmentation systems for documents with complex background and poor illumination,
separating the background from the objects (text and images) is very crucial for the success of such system.
The new local based neural binarization technique developed by the authors will be used to extract the
objects from document images with complex backgrounds. This algorithm uses statistical and textural
feature measures to obtain a feature vector for each pixel from a window of size
)12()12(
×+ nn
,
where
1≥n
. These features provide a local understanding of pixels from their neighbourhoods making it
easier to classify each pixel into its proper class. A Multi-Layer Perceptron Neural Network (MLP NN) is
then used to classify each pixel in the image. The results of thresholding are then passed to a block
segmentation stage. The block segmentation technique developed is a feature-based method that uses a
Neural Network classifier to automatically segment and classify the image contents into text and halftone
images. The results of page segmentation are then ready to be passed into an OCR system that will convert
the text image into a format the can be stored and modified
.
1 INTRODUCTION
Document image analysis is an important area of
research in image processing, pattern recognition
and computer vision. The goal of our research is to
process grey level document images with complex
backgrounds, bad illumination and poor contrast.
The motivation behind most of the applications of
off-line text recognition is to convert data from the
conventional media into electronic media. In this
paper, a document segmentation system is presented
to transfer grey level composite images with
complex backgrounds and poor illumination into
electronic format that is suitable for efficient storage
retrieval and interpretation. Such applications are
bank cheques, security documents and form
processing.
There are many threshold selection schemes
published in the literature, and selecting an
appropriate one can be a difficult task. Thresholding
of documents can be categorized into two main
classes: global and local thresholding. Global
thresholding techniques use a single threshold value;
on the other hand, local thresholding compute a
separate threshold based on the neighbourhood of
the pixels. (Sahoo et al., 1998) showed that the
Otsu’s (Otsu, 1979) class separability thresholding
method is the best global thresholding method. In
(Trier and Jain, 1995), Trier and Jain showed the
Niblack’s (Niblack, 1986) method to be the best
local thresholding method compared to other
methods. Few methods used NNs in thresholding
grey scale images into two levels. The technique
proposed by Koker and Sari (Koker and Sari, 2003),
uses NNs to select a global threshold value for an
industrial vision system based on the histogram of
the image. The method developed by Papamarkos
(Papamarkos, 2001) uses Self Organizing Feature
Maps (SOFM) to define two bi-level classes. Then,
the contents of these classes are used with the fuzzy
C-mean algorithm to reduce the character blurring
effect. Both methods are not suitable for
thresholding composite images with complex
backgrounds. In this paper, a new threshold selection
algorithm is proposed which handles images with
non-uniform and complex backgrounds. The new
method uses a MLP NN with statistical and textural
feature measures as inputs to the network.
344
Alginahi Y., Sid-Ahmed M. and Ahmadi M. (2005).
EXTRACTION OF OBJECTS AND PAGE SEGMENTATION OF COMPOSITE DOCUMENTS WITH NON-UNIFORM BACKGROUND.
In Proceedings of the Second International Conference on Informatics in Control, Automation and Robotics - Signal Processing, Systems Modeling and
Control, pages 344-347
DOI: 10.5220/0001167903440347
Copyright
c
SciTePress