on Multi-Level Artificial neural networks (MANN) to
find functions Φ
α
: Ω × R
s
→ R (α = 1, 2, . . . , k) s.t.
the level sets:
V
α
= {x ∈ Ω| Φ
α
(x,
F (x)) = 0} α = 1,2,...k
correspond to the surface V
α
respectively.
The functions Φ
α
are learned using a training set of
segmented images and they can be used subsequently
to segment new instances.
3.2.1 Features Extraction
Given a scan S
t
: Ω → R we briefly describe how a
features function
F
t
: Ω → R
s
may be constructed.
Since the neural network will eventually use this func-
tion for the identification of image edges, it is clear
that the function
F
t
should include “edge detector”-
like clues.
The involved features can be divided into two classes.
First, low-level features are considered: they are
context-independent and do not require any knowl-
edge and/or pre-processing. Some examples are voxel
position, gray level value, gradients and other differ-
entials, texture, and so forth. Middle-level features
are also selected, since voxel classification can benefit
from more accurate clues, specific of the problem at
hand. In particular, the knowledge of the deformable
structure orientation, obtained as a byproduct in Sec-
tion 3.1, can be used to individuate an Intrinsic Ref-
erence System (IRS) suitable to describe the structure
shape. If, in addition, a priori information about the
structure shape is available, a reliable clue for detect-
ing edges in the images is given by the gradient along
the normal direction to the expected edge orientation.
Moreover, a multiscale approach is adopted: the fea-
tures are computed on blurred images, supplying in-
formation about the behavior of the voxel neighbor-
hood, which results in a more robust classification.
3.2.2 MANN-based Voxel Classification
The set of selected features are processed to accom-
plish the voxel classification by means of a Multilevel
Artificial Neural Network (MANN), which assures
several computational advantages (Di Bona et al.,
2003).
For each voxel x, its computed features vector
F
t
(x) is divided into vectors
F
i
t
(x), each one contain-
ing features of the same typology and/or correlated.
Then each
F
i
t
(x) is processed by a dedicated classi-
fier based on an unsupervised Self Organizing Maps
(SOM) architecture. The set of parallel SOM modules
constitutes the first level of the MANN which aims at
clustering each portion of the feature vector into crisp
classes, thus reducing the computational complexity.
Cluster indexes, in turn, are the input of the final de-
cisional level, operated by a single EBP network. The
output of this last module consists in a vector of mem-
bership grade of the voxel x to the various surfaces
V
α
(1 ≤ α ≤ k). The SOM modules are trained ac-
cording to Kohonen algorithm (Kohonen, 1997). For
the EBP module, a set of 3D scans should be pre-
classified by an expert observer and used for super-
vised training, performed according to the Resilient
Back-Propagation algorithm (Riedmiller and Braun,
1993).
4 STUDY CASE: LEFT
VENTRICLE SEGMENTATION
Accurate segmentation of cardiac cavities is funda-
mental in assessing cardiac function and determin-
ing quantitative parameters. Magnetic Resonance
Imaging (MRI) is a high quality and well-established
imaging modality in analyzing heart diseases and has
proved to be more reliable than other techniques,
both in supplying accurate and reproducible morpho-
logical information and in assessing heart functions
(ACC\AHA Task Force, 2005). However, due to
noise or acquisition artifacts, visual information can
be corrupted or ill defined: in a usual edge map
of a cardiac MR slice, boundaries belonging to the
LV appear broken or, even worse, close to stronger
edges of other structures. In such cases, only expert
knowledge may help: the exact location of the con-
tours cannot be based only on image evidence, but
should be learned from examples provided by expert
observers. Usually, researchers have tried to design
ad hoc algorithms able to incorporate a priori infor-
mation about the LV shape. Model based surface
detector have been widely used: for example, (De-
clerck et al., 1997) employed a Canny-Deriche edge
detector in a 3D polar map to segment endocardial
and epicardial surfaces, while (Faber et al., 1991) de-
fined a hybrid spherical-cylindrical coordinate sys-
tem. Snakes, since their introduction in the semi-
nal paper by (Kass et al., 1988), have been a pow-
erful tool in cardiac images analysis for segmenta-
tion and motion tracking. Recent improvements in
this field include works by (Jolly et al., 2001), who
reduced sensitivity to initial contour through Dijk-
stra algorithm, and by (Paragios, 2002) and (Huang
et al., 2004) who introduced deformable models influ-
enced by forces derived from image region informa-
tion. (Mitchell et al., 2002;
¨
Uz
¨
umc
¨
u et al., 2003) used
the concept of active appearance model (AAM). An
AAM is a technique of analysis by synthesis, which,
in principle, could describe any heart through a set
VISAPP 2007 - International Conference on Computer Vision Theory and Applications
218