is presented from the viewpoint of the influence of
resolution to color document analysis.
Dealing with complex color documents such as
cover books or journal covers raises some
challenging difficulties. Text is overlaid on images
or graphics and often it is impossible to spatially
define the background.
Generally, a color segmentation algorithm for
text information extraction applications must be able
to perform its task without oversegmenting
characters and still preventing fusion with the
background. Additionally, it is desirable to merge
low contrast objects with their background and
create large compact areas. This will result to a
small number of connected components, so the
outcome of a text information extraction algorithm
will be extensively improved.
2 DESCRIPTION OF THE
METHOD
In this paper, we propose an approach which
efficiently approximates the RGB color distribution
of the image by taking advantage an important
property of the edge map. Specifically, we sub
sample the image by selecting only those pixels
which are local minima in the 8-neighborhood on
the edge image. This ensures that the samples are
taken from inner points of the objects so fuzzy areas
are avoided. Also, all objects will be represented in
the obtained sample set. The benefits of this
approach are analyzed in section 4.1.
These samples are used in the next step to
initially reduce the colors of the input image with a
relatively large number of colors, usually no more
than 100 (section 4.2). The extracted image at this
stage is oversegmented.
The resulted color centers are then used by a
mean shift operation (Fukunaga, 1975), (Cheng,
1995), (Comaniciu, 2002) to locate the final points
of the RGB color space, on which the algorithm will
be based to extract the final result (sections 4.3, 4.4).
In order to deal with noisy cases and to improve
the performance of the system, a proposed edge
preserving smoothing filter is used (section 3) as a
preprocessing step.
The overall process consists of the following
stages.
1. Edge preserving smoothing.
2. Color edge detection.
3. RGB color space approximation (Sub sampling).
4. Initial color reduction.
5. Mean shift
6. Finalization of the color reduction process.
The method is implemented in a visual
environment and the computer system used for all
tests is a PENTIUM 4 PC with 2.4GHz CPU speed
and 512MB RAM.
In section 5 of this paper, experimental results
are depicted where the efficiency of the method is
demonstrated. Computation time is also mentioned.
3 EDGE PRESERVING
SMOOTHING
A common technique for removing noise from
images is blurring them by replacing the center pixel
of a window with the weighted average of the pixels
in the window (Mean, Gaussian filters). Through
this process valuable information is lost and the
details of object boundaries are deformed. A
solution to this problem is to use an anisotropic
diffusion process (Perona, 1990). In this paper we
present a filter which performs as well as anisotropic
diffusion but requires less computation time.
First we calculate the Manhattan color distances
i
d between the center pixel
c
a and the pixels
i
a in
a 3x3 window. Values are normalized in [0,1]
ci c i ci
iaa aa aa
dRR GG BB=−+−+−
(1)
To compute the coefficients for the convolution
mask of the filter the following equation is used.
(1 )
p
ii
cd=−
(2)
In words,
i
c receives larger values for smaller
values of
i
d . This concludes to the following
convolution mask
123
45
8
678
1
1
0
i
i
ccc
cc
c
ccc
=
⎤
⎥
⎥
⎥
⎦
∑
(3)
Factor
p in equation (2) scales exponentially the
color differences. Thus it controls the amount of
blurring performed on the image. As it gets larger,
coefficients with small color distance from the
center pixel increase their relative value difference
from coefficients with large color distance, so the
blurring effect decreases. A fixed value 10 is used
for all of our experiments since this resulted in very
good performance.
COLOR SEGMENTATION OF COMPLEX DOCUMENT IMAGES
221