2 STATE OF THE ART
The MDC method is best recognized for its error
robust property at the expense of compression ra-
tio as it adds redundancies in its temporal, spatial
or frequency domain. With the temporal MDC met-
hod, usually two descriptions are produced in or-
der to avoid a drop in the coding efficiency. The
drop in the coding efficiency is reflected when more
than two descriptions are used because the distance
between the assigned frames to each description is
increasing resulting in the motion prediction being
less effective (Liu et al., 2015; Chakareski et al.,
2005). When the network is very noisy, a higher num-
ber of descriptions are required. Therefore the tem-
poral MDC method is no longer a suitable techni-
que. The frequency MDC method partitions Discrete
Cosine Transform (DCT) coefficients between video
descriptions. Because DCT transformation provides
independent components, the descriptions will be less
dependent. To maintain the correlation of the descrip-
tions, extra transformation like Lapped Orthogonal
Transformation (LOT) needs to be applied (Chung
and Wang, 1999; Sun et al., 2009). Therefore the
complexity of frequency MDC methods is higher than
that of both the spatial and temporal MDC methods
respectively. With the spatial MDC method, each vi-
deo frame is partitioned into several lower resolution
subimages using Polyphase SubSampling (PSS) algo-
rithm (Shirani et al., 2001; Gallant et al., 2001; Ka-
zemi, 2012). It is worth mentioning that with a simple
spatial MDC method, there is no precise adjustment
tool over the redundancy in order to control the side
quality(Shirani et al., 2001; Gallant et al., 2001; Ka-
zemi, 2012). This means that there is no control for
the redundancy increase resulting in higher resistivity
to compensate for the higher noise level.
To apply the MDC method for 3D videos, the
depth map image also needs to be partitioned into dif-
ferent descriptions. It is worth mentioning that the
depth map image mainly contains depth information
of the scene’s objects. Because of the nature of the
real objects, depth information of 3D scenes rarely
contain high frequency content. Consequently, the
depth map image can be effectively compressed ef-
fectively resulting in saved bandwidth and disk space
(Fehn, 2004; Hewage, 2014). To improve compres-
sion, Karim et al. have shown that the downsampled
version of the depth map image provides an adequate
reconstruction of the 3D video in the receiver (Karim
et al., 2008). They have experimented with the spatial
MDC method for 3D videos using color plus depth
map image representation. Karim et al. have carried
out experimental tests with a scalable multiple des-
cription coding approach arriving at the same result.
Therefore, it can be said that downsampling of the
depth map image does not cause a considerable de-
gradation in the quality of a reconstructed video. This
is due to the fact that the depth map image includes
low frequency contents or more precisely, the depth
values of adjacent pixels are similar. Consequently,
one can state that the neglected pixels during downs-
ampling can be better predicted. Liu et al utilized the
fact of having similar depth values of pixels for real
objects and introduced a texture block partitioing al-
gorithm in order to perform their MDC algorithm for
wireless multi-path streaming (Liu et al., 2015).
However, multiple description coding has been in-
vestigated for 2D videos thoroughly. More investi-
gation is required to apply MDC to 3D video speci-
fically. For 2D videos, different MDC methods are
classified according to the type of data which is di-
vided into descriptions which include: temporal, spa-
tial, frequency, or compressed. For example, with a
temporal MDC method using two descriptions, one
description can be odd frames and the other descrip-
tion even frames. With a spatial MDC algorithm each
video frame is partitioned into several lower resolu-
tion subimages. With a frequency MDC method, the
frequency components divide between descriptions.
Each type of MDC method has its own advantages
and disadvantages with regard to its particular appli-
cation. The temporal MDC method is simple though
unsuitable for an application involving a network with
high packet failure due to its low capability in incre-
asing data redundancy. With the higher complexity
of frequency MDC method, the spatial MDC method
can best accommodate a live HD video conference ap-
plication over an error prone environment.
3 PROPOSED METHOD
This section describes the new proposed multiple des-
cription coding applicable for 3D videos considering
ROI. In order to be able to recognize which part of
the frame is more important or ROI map extraction, a
metric needs to be defined. To this end, two metrics
(PV and CV ) are defined and the result for each me-
tric will be compared at the end. For the first metric
(PV ), we calculated the average of the absolute varia-
tions for pixels’ values found in the depth map image
in a block wise manner:
PV
i
=
1
N
i
N
i
∑
j=1
|D
j
− µ
i
| (1)
where µ
i
is the average of depth values for block i
and PV
i
stands for the pixel variation of block i; D
j
VISAPP 2018 - International Conference on Computer Vision Theory and Applications
136