After finding the clusters, thresholding is per-
formed to obtain a binary image. Selection of the
threshold value is based on the clusters: the first
choice is to use the minimum value of the cluster
with 2nd highest mean intensity. Pixels with inten-
sity value above the threshold are marked as tumor
(i.e. they’re assigned the value “1”), whereas pixels
with intensity value below the threshold are marked
as background (i.e. they’re assigned the value “0”). If
the thresholding operation results in a segmentation
that selects more than 15% of the area of the brain as
tumor, then the first binary image is abandoned and
the minimum value of the cluster with highest mean
intensity is chosen as the threshold value. The thresh-
old operation is repeated, producing a new binary im-
age.
Once the binary image is produced, the program
selects the largest connected component and applies
to it a neighborhood filter. This filter computes, for
each pixel, the number of tumor pixels that are 8-
connected to it, i.e. the number of tumor pixels that
touches one of the edges or corners of the pixel. If
this number is 4 or more, then the current pixel is in-
cluded. If it’s 2 or less, the current pixel is excluded
or not included in the tumor area (Gibbs et al., 1996;
Sonka et al., 2014). Following the filtering step, holes
in the segmented area are removed by applying the
morphological closing operation iteratively, using a
small disk as the structuring element.
Finally, an AdaBoost classifier is used to improve
the segmentation. The AdaBoost algorithm is a tech-
nique used to create a strong, accurate classifier by
combining weak classifiers, assumed to be better than
random guessing in correctly classifying the data. For
a training set of multidimensional data points, a clas-
sifier will assign to each data point a label, either +1
or -1. An exponential error function is used to rank
all the weak classifiers based on the number of cor-
rect and incorrect classifications. AdaBoost proceeds
by systematically extracting one classifier of the pool
in each of the iterations, by focusing on the ones that
can help with the misclassified data points. After ex-
tracting a weak classifier, AdaBoost assigns a weight
to it. The stronger classifier is then given by the group
of extracted weak classifiers combined with their as-
signed weights (Freund and Schapire, 1995; Schapire,
1999; Alpaydin, 2014).
In our method, the AdaBoost algorithm classifies
pixels according to three features: intensity value and
coordinates “x” and “y”, which determine the pixel
position in the slice. Preliminary class information,
used as training data, is given by the binary image
produced by the previous steps.
The AdaBoost classifier is applied one slice at a
time, from bottom to top. At each slice, the mean
pixel intensity and standard deviation for healthy
tissue and for tumor tissue are calculated. Non-
background pixels are selected, and data features for
each pixel are extracted. Since AdaBoost seems es-
pecially sensible to noise (Schapire, 1999; Alpay-
din, 2014), pixels that present intensity values that
are more than one standard deviation either above or
below the mean are considered outliers and are ex-
cluded. The selected pixels are then used to train
the classifier, building a model that will be used to
classify pixels in the slice immediately above. This
process continues until there are no more slices to be
classified. The largest connected component is then
selected, representing the final result of the segmenta-
tion process.
In summary, our method uses as basis for an initial
segmentation the GA-based clustering method (Kole
and Halder, 2012). Then, it refines the segmentation
by applying the neighborhood filter, the morphologi-
cal closing operation and the AdaBoost classifier.
The algorithm was implemented using MATLAB,
from The Mathworks, Inc. Tests using MRI images
were performed on a Windows 10 PC, 8 GB RAM,
Intel(R) Core(TM) i7-5500U CPU @ 2.40 GHz.
2.2 Image Dataset and Evaluation
Metric
The proposed method was used to perform “whole”
tumor segmentation of low-grade glioma tumors on
tridimensional T2-weighted FLAIR images. These
images, as well as the ground truth information, were
extracted from the 2015 Multimodal Brain Tumor
Image Segmentation Benchmark (BRATS) challenge
database, which is the largest public dataset of its
type, containing a great variety of cases. All of them
were preprocessed in order to homogenize the data
and remove the skulls, guaranteeing anonymization
of the patients (Menze et al., 2015). Two images from
the original database were excluded since the assump-
tion that the largest component represents the tumor
did not hold for them, resulting in a total of 42 test
cases.
Ground truth information was constructed based
on manual annotations performed by a team of trained
radiologists (Menze et al., 2015). Segmentation re-
sults were compared to the ground truth information
using the Dice similarity coefficient (DSC). The DSC
is based on the computation of the area of overlap be-
tween segmented region and ground truth, and it is
considered a very attractive metric because of its sim-
plicity, being widely used for evaluation of segmen-
tation algorithms. It is calculated using the following
Brain Tumor Segmentation in Magnetic Resonance Images using Genetic Algorithm Clustering and AdaBoost Classifier
79