Palumbo and Guliano (Yang Y. and Yan H., 2000)
use a fixed 9 × 9 window to evaluate the class of each
pixel within the image. The pixel value is determined
according to the five 3 × 3 local pixels within the
9 × 9 window centered on the pixel in consideration.
An initial threshold is used to determine the pixels
which definitely belong to the background whilst the
remaining pixels are classified using a different label
assignment rule requiring the specification of three
additional user defined parameters. Determining the
values of these thresholds is not straightforward since
they cannot be deduced from the image properties.
Niblack’s algorithm (Bulen S. and Mehmet S.,
2004) evaluates a threshold for each pixel within the
image according to the mean and standard deviation
of the pixels in a predetermined window W centered
at the pixel in consideration. The pixel class is deter-
mined as follows:
w(x, y) →
w
f
,p(x, y) <T(x, y)
w
b
,p(x, y) ≥ T (x, y)
(1)
where T (x, y)=µ(x, y)+k × σ(x, y), µ(x, y) is
the mean grey level of the pixels within the window,
σ(x, y) is the standard deviation of these pixels and k
is a user defined parameter. Niblack’s algorithm re-
quires two user defined parameters. The window size
determines the number of pixels from which the mean
µ(x, y) and standard deviation σ(x, y) are evaluated.
Thus, the window size should reflect the quality of the
image background, on which prior knowledge is un-
available. The value of k is used to adjust the amount
of print object boundary that is taken as part of the
foreground, and is therefore dependent on the quality
of the drawn line which is also an unknown quantity.
Eikvil’s method (Trier O. D. and Jain A. K., 2000)
is based on Otsu’s thresholding technique (Gonzalez
R. and Woods R. E., 2002), and makes use of two con-
centric windows L and S. The pixels within the larger
window L are temporarily classified into two classes
by Otsu’s threshold. The mean of these two clusters is
evaluated and their difference is compared to a para-
meter k which determines whether there there is suffi-
cient contrast between the two clusters. This indicates
the effectiveness of Otsu’s threshold on the selected
region. Thus, if the difference between the two means
is larger than k, the pixels within the smaller window
are thresholded according to Otsu’s threshold. Other-
wise, the pixels are assigned to the class whose label
is closest to the mean grey level within the smaller
window S. This method requires the specification of
three user defined parameters, of which S and L de-
fine window sizes, whilst k determines the threshold-
ing method applied to the smaller window. The size of
the smaller window S may be set to 3 which defines
the smallest window centered on a pixel. However,
the remaining parameters must be specified according
to the particular image properties.
Kamel and Zhao’s logical adaptive tech-
nique (Kamel M. and Zhao A., 1993) compares
the grey level of the pixel in consideration with
eight local averages in a pixel neighborhood of size
(2SW +1)
× (2SW +1)where SW represents
the stroke width of the line drawing. A comparison
operator is derived from these averages and is used
to determine the class of the pixel in consideration.
The algorithm requires two user defined parameters,
namely the stroke width SW and an initial threshold
T which is used to evaluate the required comparison
operator. Yang and Yan (Yang Y. and Yan H., 2000)
proposed a method by which the two parameters
SW and T are calculated adaptively. However,
the adaptive evaluation of the parameter T requires
another parameter α. Yang and Yan (Yang Y. and Yan
H., 2000) specify a range of values of α for which
suitable values of T may be obtained.
Brensen’s method (Bulen S. and Mehmet S., 2004)
may either classify a single pixel or a group of pix-
els simultaneously according to the contrast present
within a selected window. The window’s contrast is
defined as C(x, y)=Z
max
− Z
min
, where Z
max
and Z
min
are the maximum and minimum grey levels
within the window. If this contrast is smaller than a
predefined value k, the pixels within the window be-
long to the same class, and the entire window may be
assigned to a single class. However, if the contrast C
is sufficiently large, then the pixels within that win-
dow belong to two different classes. Since the win-
dow has high contrast, a simple threshold based on
the average gray level may be used to classify the pix-
els within this window. Thus, the threshold T is de-
fined as T (x, y)=
1
2
× (Z
max
+ Z
min
). This method
requires the specification of parameter k which may
be evaluated adaptively using the method proposed
in (Bartolo A. et al., 2004)
2.1 Drawbacks
Although the above methods may yield results of con-
siderably good quality, the classification processes re-
quires that a classifying criterion is evaluated for each
pixel in the image. Furthermore, these algorithms re-
quire the specification of some parameter, such as a
window size in order to evaluate the threshold. Al-
though suggested values are specified for some algo-
rithms, better results are obtained after fine-tuning the
parameter to the characteristics of the image under
test. Thus the performance of these methods is sus-
ceptible to image conditions. Methods for the adap-
tive evaluation for Brensen’s and Kamel & Zhao’s
methods have been proposed, but these require con-
siderable computational times, which slow down the
product prototyping process. In this paper, we attempt
to overcome these problems by modelling the sketch
as a trajectory being tracked in time.
IMAGE BINARISATION USING THE EXTENDED KALMAN FILTER
161