independent decoding of ROI and preserving its
coding quality.
A further issue with ROI coding is the change in
the shape, size and position of the ROI. (Wang and
Hannuksela, 2002) proposes ways to code evolving
ROI, that is, the shape of the isolated region
grows/evolves with time. FMO map type 3, 4 and 5
provide the feature of growing and evolving slice
groups. However, these map types do not cater to a
moving slice group. The slice group can grow from
its initial position but not change shape, or move
horizontally or vertically across the frames.
Therefore, these map types cannot be used for
implementing moving ROI and special handling
needs to be provided for changing/moving slice
groups. In light of the above observations and
practical significance, support for moving ROI in
H.264 SVC has been proposed in this paper.
The rest of the paper is organized as follows.
Section 2 describes the proposed algorithms for
constrained inter frame prediction across slice group
boundaries. Changing slice groups (moving ROI)
feature is presented in Section 3. Section 4 provides
the experimental results and their analysis. The
conclusion to this research is drawn in Section 5
with suggestions for future work.
2 CONSTRAINED INTER FRAME
PREDICTION
Constrained inter prediction across the slice group
boundary is a useful functionality to allow for
independent decoding of slice groups, and hence the
ROI. The independent ROI decoding can increase
the error resilience by limiting the motion search for
the ROI to the same slice group in the reference
pictures. It will restrict motion compensation from
slice groups coded at lower quality. Restricted
motion compensation, in turn, maintains the
compression quality of different slice groups. A slice
group that is coded at a lower compression rate
would maintain its quality by not referring to the
samples that are outside this slice group and are
possibly coded with higher compression.
Three different techniques to constrain the inter
prediction across slice groups boundaries are
proposed as follows.
2.1 Boundary Padding of Non-ROI
The proposed method to restrict the inter-frame
prediction across slice group boundaries is to
eliminate the possibility of any sample in one slice
group finding its best match from the other slice
group. This can be done by padding the boundaries
of the ‘non-current’ (current slice group being the
one for whose samples, a best match is being found)
slice group.
The size of padding should be equal in width to
the minimum of search range specified in the
encoder configuration and the width of ‘non-current’
slice group. The value with which this region is
padded should be some value other than a
permissible pixel sample value (both luma and
chroma). This padding shall be applied to all
reference pictures used for inter prediction, and not
the current picture. The interpolation process for the
reference frame, for creating half pixel accurate and
quarter pixel accurate sample buffers, shall be
carried out after the padding.
Figure 1 illustrates the padding process. In figure
1(b), the macroblocks from slice group B are padded
with an undefined value. Although they fall inside
the search range of the current macroblock, their
undefined value cannot provide a match for this
macroblock. Based on the implementation it is
possible to restrict the motion vectors of just one
slice group or multiple slice groups. The one draw
back of this technique is that if the reference frames
are padded only once and used for all slice groups,
then the padded slice group can effect the motion
estimation of its own samples.This is because the
padded area would become ‘inaccessible’ to the
padded slice group as well. A way to solve this can
be to pad the reference frames for each slice group
separately.
2.2 Limiting the Search Rectangle
Constrained inter prediction can be implemented by
redefining the search range for each macroblock
according to its position in the slice group. In this
algorithm, the search range of the current
macroblock is defined in a way that the rows and
columns of macroblocks belonging to other slice
groups are excluded from the search rectangle of the
current macroblock. The technique is illustrated in
figure 2.
VISAPP 2008 - International Conference on Computer Vision Theory and Applications
14