Several methods previously presented in the lite-
rature resort to machine learning approaches. Some
methods use supervised approaches with hand-crafted
features or learned representations and some other
methods use unsupervised approaches like clustering
which aim to detect lesion voxels as outliers. Ex-
amples of supervised models used in MS segmenta-
tion tasks are k-nearest neighbour methods, artificial
neural networks, random decision forests and baye-
sian frameworks among others (Garcia-Lorenzo et al.,
2013). Examples of unsupervised models are fuzzy c-
means or Gaussian mixture models with expectation
maximization (EM) (Garcia-Lorenzo et al., 2013).
Unsupervised models suffer from non-uniformity in
the image intensities and lesion intensities since this
variability cannot be captured by a single global mo-
del (Havaei, 2016). In this respect supervised met-
hods present an advantage, potentially being able to
capture this variability with the appropriate choice of
training set or features.
Recently, Deep Learning (DL) has been very
successful in the Computer Vision area, achieving
improvements in accuracies sometimes as high as
30% (Plis et al., 2014). The main strength in DL,
also differentiating it from other machine learning
methods, is its automatic feature extraction capability.
Normally, raw data has to be processed automatically
or manually to extract meaningful and useful featu-
res through a process commonly known as ”feature
engineering”. This process requires time and careful
analysis, and includes subjectivity, which might bias
the results or produce erroneous results. However,
in DL, the feature extraction is data-driven using an
appropriate loss function and learning algorithm for
Deep Neural Networks, which removes the subjecti-
vity, randomness and expert knowledge to a certain
degree. Moreover, the features obtained are hierarchi-
cal, each network layer producing more abstract fea-
tures using the less abstract features obtained in the
previous layer. Thus feature extraction is carried out
step-by- step, which is likelier to produce more com-
plex and useful features. Another strength of DL is
its ability to represent very complex functions, which
might also be considered as its drawback since it is
prone to easily over-fit. However, the over-fitting can
be prevented with the correct guidance and regulariza-
tion methods. DL methods are also robust to outliers,
which is very common in neuroimaging data (Good-
fellow et al., 2016), (Bengio, 2012) and (Deep Lear-
ning, 2017).
Previous work on MS lesion segmentation with
DL is generally developed using voxelwise classifica-
tion (lesion vs. normal) and is done on 2D/3D patches
centered on the voxel of interest to obtain a complete
segmentation of the whole brain (Greenspan et al.,
2016). There are also some studies considering the
whole image as input and performing a segmentation
in a single step as in (Brosch et al., 2015) and (Brosch
et al., 2016). In some methods global context is pro-
vided to the network, in addition to the local con-
text, to give more information about the nature of a
voxel (Ghafoorian et al., 2017). Convolutional Neu-
ral Networks (CNNs) are commonly used as part of
the architecture due to their strong feature extraction
capabilities dealing with images (Vaidya et al., 2015),
while Restricted Boltzmann Machines (RBMs) and
Auto-encoders are generally exploited to obtain a
good initialization of the network, which might affect
the ultimate performance, as shown in (Brosch et al.,
2015) and (Brosch et al., 2016).
In this paper, we propose a MS lesion segmen-
tation method based on a voxelwise classification on
MRI with DL using a combination of different appro-
aches presented in the literature together with our own
contributions. We explore a new sub-sampling met-
hod to improve the learning process and develop a
new Convolutional Neural Networks (CNN) approach
to achieve better performance results. The aim of this
study is to achieve a method that will surpass the per-
formance of existing methods in helping the experts
in the MS lesion segmentation work and even make
their interruption minimal. We compare different ap-
proaches with DL so far applied to MS Segmentation.
2 METHODOLOGY
In this section, we present our strategies for data
pre-processing, sub-sampling of the training set, de-
signing the CNN architecture and developing diffe-
rent approaches to improve the segmentation perfor-
mance.
2.1 Data Pre-processing
We have, at our disposal, T1 and T2 MRI modali-
ties, tissue segmentation and manual lesion segmenta-
tions of 59 subjects from Hospital Cl
´
ınic (Barcelona).
The tissue segmentation was performed by Freesurfer
v.5.3.0 toolbox (Freesurfer, 2013). The manual seg-
mentation was performed by an expert / two experts
from Hospital Cl
´
ınic team. The voxel resolution of
the MRIs is 0.86mm x 0.86mm x 0.86mm and the
image size is 208 x 256 x 256. As a pre-processing of
the MRI images we apply skull stripping, bias-field
correction, tissue-segmentation and co-registration.
Additionally we apply 0-mean unit-variance normali-
Multiple Sclerosis Lesion Segmentation using Improved Convolutional Neural Networks
261