Figure 1: AVC variable block sizes.
In this paper, we benchmark the video quality
impact of each AVC/H.264 inter-frame prediction
block size for various spatial and temporal contents.
This study follows a discrete methodology, meaning
that the testing and evaluation of each block size is
performed independently and not in parallel with the
remaining block sizes. For the purpose of this paper,
in the motion estimation for AVC, there is each time
only one available block size out of the total seven
to be searched for each MB. By this way this paper
researches if the sophisticated inter-frame
predictions algorithms and variable in size blocks
have an impact on the deduced video quality or if
the expected perceptual enhancement is negligible.
If the perceptual performance of the variable block
sizes and motion compensation algorithms is very
low then it means that the required high processing
power for the execution of such algorithms in the
encoding process is practically wasted without
significant perceptual outcome.
The remainder of this paper is organized as
follows. Section 2 performs a brief description of
the inter-frame prediction, Section 3 describes the
video quality metric that used in this paper, Section
4 presents the test signals of the experimental
section, Section 5 discusses the evaluation results,
and, finally, Section 6 concludes this paper.
2 INTER-FRAME PREDICTION
For completeness of the paper, this section provides
some basic background information for the inter-
frame and motion compensation algorithm that is
implemented in the AVC reference encoder.
Motion compensation is an algorithmic
technique employed in the encoding of video data
for video compression, which describes a frame in
terms of the transformation of a reference frame to
the current picture. Variable block-size motion
compensation is the use of block motion
compensation with the ability for the encoder to
dynamically select the size of the used blocks. Thus,
the use of larger blocks can reduce the number of
bits needed to represent the motion vectors (better
compression), while the use of smaller blocks can
result in a smaller amount of prediction residual
information to encode (better prediction). Older
designs such as H.261 and MPEG-1 video typically
use a fixed block size while newer ones such as
AVC give the encoder the ability to dynamically
choose which block size fits better to a specific
region. The overall procedure, which practically
exploits the temporal redundancy of a video
sequence for compression purposes, is called inter-
frame prediction. In the motion estimation for AVC,
there are in total seven possible block sizes to be
searched for each MB (modes 1–7 denote block
sizes of 16 × 16, 16 × 8, 8 × 16, 8 × 8, 8 × 4, 4 × 8,
and 4 × 4, respectively).
The simplest algorithmic implementation for the
inter-frame prediction is the Full Search (FS)
algorithm, which checks every displacement inside
the designated search window in order to specify the
best block size out of the seven available. The FS
algorithm, which evaluates Mean Absolute
Difference (MAD) at all possible regions of a frame,
has very high computational requirements, making
necessary the development of most sophisticated
algorithms providing a better trade-off between
computational complexity and prediction efficiency.
In this paper, we have modified the inter-frame
prediction mechanism in the reference encoder of
AVC/H.264 in order to perform the search and
match as it normally does, but each time looking for
the better match of only one specific block size (out
of the total seven available). Thus, the inter-frame
prediction continues to run but modified for a fixed
block size each time. So practically with this
modification the processing power consumption and
requirements have been reduced to the minimum.
The scope is to examine the perceptual impact of
the inter-frame prediction algorithms in conjunction
with variable in size blocks in relevance to the
spatiotemporal dynamics of the content.
3 VIDEO QUALITY
ASSESSMENT
The evaluation of the video quality is a matter of
objective and subjective procedures, which take
place after the encoding process. Subjective quality
evaluation processes of video streams require large
amount of human resources, establishing it as a
time-consuming process (Pereira and Alpert, 1997),
IMPACT OF H.264 ADVANCED VIDEO CODING INTER-FRAME BLOCK SIZES ON VIDEO QUALITY
119