automatically for each nucleus in the image. As
there could be regions not being cell nuclei, we
apply the fuzzy C-means algorithm for the
classification of the closed regions (the result of the
GVF snakes) in the class of interest (nuclei class) or
in the class of undesired findings.
To underpin the accuracy of the GVF snake
segmentation, we construct two data sets. The first
data set comprises the areas enclosed by the initial
position of the snakes and the second data set
contains the areas under the final position of each
snake. As it is verified by the results, the
performance of the method is improved when the
data set of the area enclosed by the final snake
position are used. This is a confirmation that the
obtained contour is an accurate nucleus boundary.
The proposed method is fully automated and it can
be applied in any microscopic cervical cell sample
image.
2 MATERIALS AND METHODS
2.1 Study Group
We have collected 19 images of conventional PAP
stained cervical cell slides, which were acquired
through a microscope digital camera (Olympus
DP71) adapted to an optical microscope (Olympus
BX51). We have used a 10× magnification lens and
the acquired images were stored in JPEG format.
The total number of cell nuclei in the images, which
were identified by two expert observers is 3616.
2.2 Segmentation
The purpose of this step is firstly the detection of the
location of every nucleus in the images and secondly
the determination of the boundary of each nucleus
area. This is obtained automatically, as we follow
the method proposed in (Plissiti, 2006) and (Plissiti,
2008). This method consists of three individual steps
and it is described in the following paragraphs.
2.2.1 Detection of the Candidate Nuclei
Centroids
This step is necessary for the determination of the
location of every nucleus in each image. It is
comprised of two sequential stages: the
preprocessing and the determination of the
probable location of each nucleus. The outcome of
this step is a set of image points which indicate the
areas of the image that are occupied by the nuclei of
the cells.
In the preprossesing step, the extraction of the
background and the definition of smooth regions of
interest are achieved. We perform contrast-limited
adaptive histogram equalization and global
thresholding to the red, green and blue component of
the image. In the final binary mask, which is the
result of a logical OR operation of these three binary
images, all particles with an area smaller than a
threshold t are removed, in order to exclude objects
that may interfere in the next steps.
The parts of the image found in the
preprocessing step contain either isolated cells or
cell clusters. Considering that nuclei are darker than
the surrounding cytoplasm (Figure 1(a)), we search
for intensity valleys in the image. For the formation
of homogenous minima valleys we apply the h-
minima transform (Soille, 1999) in the red, green
and blue components of the original image. The
resulted image is used as a mask for the
morphological reconstruction of the initial image. In
the final image, we search for regional minima and
the extracted regions of the image intimate the
existence of the cell nuclei (Figure 1(b)). The
location of each candidate nucleus is determined
with the centroid
c
r
of each detected intensity valley
(Figure 1(c)).
2.2.2 Initial Approximation of the Cell
Nuclei Boundaries
After the definition of the locations of each
candidate nuclei centroid, we proceed with the initial
approximation of the nuclei boundaries, which is a
prerequisite for the application of the deformable
model. For this purpose we collect some points near
the centroid of each nucleus, which are likely lying
in the nucleus circumference.
Given the fact that the nucleus is darker than the
background, we expect high gradient of the image in
each nucleus boundaries. In order to avoid threshold
dependent techniques such as edge detectors, we
construct an image with each nucleus boundaries
pronounced. This image is a result of the subtraction
of two images. The first image is the result of the
application of an averaging filter in the initial image.
The second image is the outcome of successive
erosions of the initial image, using a flat disk-shaped
structuring element. The result of the subtraction of
these two images is an image with all cell nuclei
boundaries sharp (Figure 2). In this image, we
construct a circular searching grid centered at the
ACCURATE LOCALIZATION OF CELL NUCLEI IN PAP SMEAR IMAGES USING GRADIENT VECTOR FLOW
DEFORMABLE MODELS
285