Garamendi et al., 2013; Chambolle and Pock, 2011).
Possible applications of the proposed multiclass
model are segmentation of a given Optical Flow,
which is a 2D vectorial field, multimodal magnetic
resonance image, considering each modality as a sin-
gle channel, and in general multiclass segmentation
task. In this work, as a specific application of the
above general setting, we consider Light Field images
(LF) partition. There is a growing interest for light
field imaging applied to computer vision due to the
new hand-held cameras such as Lytro
1
or Raytrix
2
.
In fact, compared to conventional imaging, light field
imaging increases the directional information of the
scene. Plenoptic cameras (Lippman, 1908; Ng, 2006;
Ng et al., 2005; Perwass and Wietzke, 2010) capture
LF images from scene keeping the light direction in-
formation. The purpose of this cameras is capturing
the amount of light (radiance) traveling along each ray
that intersects the sensor. So, for each spatial position
in the 2-D space of the sensor, it stores the amount of
light coming from a certain space directions. So one
acquisition, or data set, has the information about how
the scene appears from a certain number of possible
viewpoints. More details can be found in (Wanner
et al., 2013a), (Reddy et al., 2013).
The mathematical modeling of the light filed im-
ages is usually done considering two planes of R
2
.
One of them defines the spatial coordinates in a sin-
gle view and the other one defines the view itself. Let
Ω ⊂ R
2
and Π ⊂ R
2
be bounded Lipschitz domains
representing the spatial image domain and the angu-
lar domain respectively, and let f be a multi-channel
data. The LF image can be modeled as the 4-D func-
tion
f : Ω × Π → R
M
,
( ¯p, ¯q) 7→ f( ¯p, ¯q)
where ¯p := (x,y) ∈ Ω and ¯q := (s,t) ∈ Π represent co-
ordinate pairs in the sensor plane spatial domain and
in the view angular domain respectively. This pro-
vides a model of a lightfield color image as a vecto-
rial 4D function, in such a way f(x, y,s,t) represents
the color (M = 3 in the case of RGB color images) at
pixel (x,y) corresponding to ray (s,t).
The structure of LF images allows for a very pre-
cisely disparity map computation with a very small
cost (Wanner et al., 2013b), so one can assumes that
this information is available as an additional feature to
the intensity (gray) color, making this modality of im-
age very well suited to segmentation. In this case, for
a color image in a RGB color space, M = 4, the three
1
www.lytro.com
2
www.raytrix.de
first components corresponding to color information
and the fourth to depth information. Notice that the
user can add other information to the channels as for
example local variance, texture, etc. This vectorial
4D dimensional structure of the images (color com-
ponents and depth) allows to test the model presented
in this work.
2 NOTATION AND DEFINITIONS
Let Ω ⊂ R
D
be a bounded Lipschitz domain repre-
senting the digital image domain and let f : Ω ⊂ R
D
→
R
M
be a given D-dimensional noisy image represent-
ing the data, where M = 1 for scalar images and M > 1
for vector valued (multichannel) images. As usual in
image processing we assume f ∈ [L
∞
(Ω)]
M
, i.e. f es-
sentially bounded.
Given an image and chosen the number N ≥ 2
of classes into which we wish to partition the given
image, the segmentation problem can be formulated
as the determination of a partition of the domain Ω
into a collection of sets
{
Ω
i
}
i=1..N
of finite perime-
ter (Cacciopoli sets) in Ω such that no overlap and
no vacuum can occur, e.g. Ω
i
∩ Ω
j
=
/
0, i 6= j, Ω =
S
N
i=1
Ω
i
∪ Γ
i
where the boundary of each class is de-
noted by Γ
i
= ∂Ω
i
∩Ω. Given a partition P(Ω) we de-
fine
¯
χ = (χ
i
), i = 1..N as the associated vectorial char-
acteristic function. For almost every point x ∈ Ω
i
⊂ Ω
we have
¯
χ : Ω → R
N
,
¯
χ(x) = ¯e
i
where ¯e
i
is a vec-
tor of the canonical base of R
N
. Notice that the di-
mension of
¯
χ depends on the number of classes and
it is independent from number of channels M. Also
N
∑
i=1
χ
i
(x) = 1,a.e.x ∈ Ω.
For a given vector valued function u : Ω → R
M
the vectorial TV norm, denoted as TV, is defined by
the finite positive measure (Ambrosio et al., 2000;
Bresson and Chan, 2008)
|Du|(Ω) =
Z
Ω
|Du|
.
= sup
P∈K
Z
Ω
hu,∇ · Pidx
(1)
where P : Ω → R
M×D
is a matrix dual function,
∇· is the divergence operator and the product h.,.i
is the Euclidean scalar product defined as hv,wi
.
=
∑
M
i=1
hv
i
,w
i
i from where hu,∇ · Pi =
∑
M
i=1
hu
i
,∇ · p
i
i.
The set K of functions of the dual variable P is
K
.
=
P ∈ C
1
c
Ω;R
M×D
: |P| ≤ 1
(2)
where | · | is the L
2
norm such that |P| =
q
∑
M
i=1
hp
i
,p
i
i.
A Multiclass Anisotropic Mumford-Shah Functional for Segmentation of D-dimensional Vectorial Images
469