viewpoint density which provides the maximum
amount of visual information possible at a given
video rate. Densities lower than this maximum
might result in a decrease of perceived smoothness.
The relation between perceived smoothness,
viewpoint density, and speed of movement was
examined for two simple types of viewer (virtual
camera) motion: forward motion (i.e., moving
toward a target in a straight line) and lateral motion
(i.e., moving sideway in a straight line).
2 EXPERIMENT 1 - FORWARD
MOTION
In this experiment, we simulated what the observer
would see if she/he were moving in the environment
from point A to point B along a straight path and
looking in the same direction as that of the
movement. These “forward motion” test sequences
were constructed by selecting viewpoints that had
the same visual direction as the direction of
movement.
2.1 Generation of Video Test Material
We used four were natural sequences, captured with
a LadyBug camera (Point Grey Research Inc.), and
one synthetic sequence, created with 3D StudioMax.
The pixel resolution of all sequences was 1024 x720.
The natural sequences were captured in a
rectangular room. Two sequences, named
CastleLongFront and CastleLongBack, represented a
movement along the longest axis of the room but in
opposite directions. The other two sequences, named
CastleShortFront and CastleShortBack, represented
a movement along the shortest axis of the room, but
again in opposite directions. All four sequences were
created by capturing 4 original images per foot at
equally spaced intervals. The long sequences
encompassed a distance of 24 feet (96 original
viewpoints) whereas the short sequences
encompassed a distance of 10 feet (40 viewpoints).
The synthetic sequence, named SaharaLong,
contained several geometric shapes and a model of a
vehicle whose dimensions were used as a baseline
for the spatial dimensions of the environment.
Simulate distance and viewpoint density were the
same as those of the long natural sequences.
These original sequences were used to generate
sequences having different levels of viewpoint
density and speed of movement. There were four
levels of viewpoint density: 4, 2, 1, and 0.5
views/foot. The lower density sequences were
created by sub-sampling the 4 views/foot original
sequences and duplicating the remaining views.
Thus, the eight views (1,2,3,4,5,6,7,8) that spanned
two feet in the original sequences became
(1,1,3,3,5,5,7,7), (1,1,1,1,5,5,5,5), and
(1,1,1,1,1,1,1,1) in the 2, 1, and 0.5 densities,
respectively.
Each of the four densities was presented at three
speeds: 3.8 (slow), 7.6 (medium), and 15.2 (fast)
feet/second. The speed refers here to the speed at
which the camera (real or virtual) is moving through
the environment. In this study, this speed was
simulated by changing the speed at which the
sequences were played by a DVS HDProStation
digital disk recorder. The slowest speed
approximated the average walking speed
(Knoblauch, Pietrucha, and Nitzburg, 1996).
It was noted that at the fastest speeds the
duration of the sequences would be perhaps too
short for a proper assessment of smoothness. To
obtain sequences of sufficient temporal duration we
first increase the length of the sequence by repeating
the sequence backwards to form a cycle, i.e., from
point A to point B and vice versa (ABA). Secondly,
we concatenated these cycles proportionally to the
speed at which the sequence was to be played: once
for the slow speed, twice for the medium speed, and
four times for the fastest speed. As a result, the long
sequences had duration of 12.6 seconds and the short
sequences had duration of 5.2 seconds.
2.2 Subjective Assessment
In order to evaluate the perceived smoothness of
sequences generated as described above, we
performed a subjective assessment experiment.
Eighteen viewers participated in the experiment.
The combination of five sequences, four
viewpoint densities, and three speeds yielded 60
experimental conditions. The perceived smoothness
of these conditions was assessed using a single
stimulus method (ITU-R Recommendation BT.500,
2004). A test session involved of a series of
assessment trials, each one consisting of the
presentation of a single video sequence followed by
a blank, i.e. mid-grey, display. At the end of each
trial, the viewer was asked to provide a rating of the
perceived smoothness of the entire presentation
using a continuous line judgment scale, which was
divided into five segments. As a guide, the
adjectives “Excellent”, “Good”, “Fair”, “Poor”, and
“Bad” were aligned with the five segments of the
scale. For analysis, the viewers’ responses were
GRAPP 2008 - International Conference on Computer Graphics Theory and Applications
440