ally, key poses are displayed as a set of images po-
sitioned side by side, while another approach called
digital strobing combines all the key poses into a sin-
gle image by sharing a common background. A novel
method called spatially extended layout was intro-
duced in the paper to address the drawbacks of dig-
ital strobing. However, in the paper, the author didn’t
mention the problem of determining camera positions
and orientations in displaying mocap frames.
It seems natural that the best viewpoint is the one
that obtains the maximum information of a scene.
(Roberts and Marshall, 1998) defined the best view
as the view which direction has the smallest angular
offset from the inverse surface normals of the faces
in the scene. (V
´
azquez et al., 2001) proposed another
approach by using the probability distribution of the
projected area over the sphere of directions centered
in the viewpoint, called viewpoint entropy, to measure
the maximum information of a scene, while (Stoev
and Strasser, 2002) argued that the above methods fail
to give a good overview of the scene’s depth such as
the scene of a landscape and extended the approach by
maximizing not only the projected area of the scene,
but also the depth of the scene.
(Bares and Lester, 1999) introduced partial con-
traints which are defined by system users through an
interface to determine the best camera position auto-
matically, while (Arbel and Ferrie, 1999) and (Marc-
hand and Courty, 2000) addressed the problem of gen-
erating camera trajectories automatically based on the
current view.
Some of the rules applied in the film domain have
also been surveyed. (Drucker and Zeltzer, 1995) en-
capsulated several constraints based on the rules into
a camera module, which can be connected to another
camera module for transition. Using a similar con-
cept, (He et al., 1996) introduced film idioms, a hier-
archical finite state machine to determine transitions,
while the idioms determine which camera modules
should be used in a particular state. Another approach
(David B. Christianson, 1996) described Declarative
Camera Control Language (DCCL) to encode the
rules found in the film domain.
While we generally agree that the best view
should have the maximum information of the corre-
sponding scene, in displaying mocap, it is usually not
the projected area of the joints or bones that have to be
considered, but rather the motion itself which involves
multiple joints moving in spacetime. Further, mocap
itself naturally has no faces since it contains only the
coordinates of the joints. One may be tempted to cre-
ate a virtual face by connecting both hand joints and
feet joints, and then determine the view which will
produce the widest projection area of the virtual face
into the camera plane, but such an approach fails to
address the fundamental issue of looking for the view
that best conveys the motion itself. For example rather
than the general view which is able to display all the
joints clearly, we are naturally more interested in the
movement of the leg joints in more detail when look-
ing at a kicking motion.
The rules from the film domain cannot also be
straightforwardly applied to the problem since instead
of looking for ways to move the camera, our goal
is to determine camera positions in displaying mo-
cap frames. There are however several basic rules
that can be applied, such as the possible locations of
the camera (internal, parallel, external), and the dis-
tance of the camera with respect to the subject (ex-
treme, closeup, medium, full, long). In fact, almost
all the previous approaches described above refer to
the use of positions located on the surface of a virtual
sphere surrounding the subject as candidates for the
best camera position, which are usually represented
as spherical coordinates.
As also pointed out by (Stoev and Strasser, 2002),
we believe that until now, there are no objective mea-
surements and criteria for evaluating the goodness of
camera positions, especially in the case of display-
ing mocap. Therefore, we propose a novel approach
to determine the best camera position relative to the
subject by using a data mining approach. The use of
data mining for camera transition in computer graph-
ics community was first explored by (Singh and Bal-
akrishnan, 2004) to generate non-linear projection of
a 3d scene.
3 APPROACH
3.1 Overview
Displaying all frames of the mocap will cause in-
formation overload, no matter from which angle the
frames are rendered. Therefore, in the next subsec-
tion, we will first describe our simple method to select
the frames of one mocap. After that, we will describe
what attributes that we choose in building data mining
classifiers, or specifically the attributes of the joints,
which are used as input attributes, and the attributes
of the camera, which are used as the output or target
attributes of the classifiers. We stress in this section
that we are not concerned with discovering novel data
mining techniques, but rather, we seek to apply es-
tablished data mining techniques to a new problem
domain.
DATA MINING APPROACH FOR POSITIONING CAMERA IN DISPLAYING MOCAP SEARCH RESULTS
267