ability of the trial when it is subjected to repeated
trials. When the motor learning is in the cognitive
phase, we assume there is large variation in move-
ments. When the learning progresses and moves to
the associative phase, we assume the variation be-
comes rather small because a trainee gets to know the
check points and how to modify incorrect movements.
Finally, when it reaches the autonomous phase, we as-
sume that the variation in movements becomes quite
small.
To achieve our purpose, we use a markerless mo-
tion capture system proposed by Matsumoto et al.
(Matsumoto et al., 2012) that can obtain human poses
in 3D world coordinates from one camera input that
is robust under camera positions. This feature en-
ables motion variations to be compared from different
views for the same criteria. In this paper, we use the
golf swing as the motor learning target since there are
so many persons who enjoy the game of golf.
The reminder of this paper is organized as follows.
Section 2 overviews the markerless human motion
capture method proposed by Matsumoto et al., Sec-
tion 3 outlines our proficiency estimation method in
detail, Section 4 describes experiments we conducted,
and Section 5 concludes the paper with a brief sum-
mary.
2 GPDM-BASED MARKERLESS
MOTION CAPTURE WITH
SINGLE CAMERA
Our method uses a GPDM-based markerless motion
capture system proposed as a method for obtain-
ing human poses under robust camera position (Mat-
sumoto et al., 2012). Consisting of a training step and
a pose estimation step, it estimates the poses made
when subject movements are similar to trained move-
ments.
The method we propose has two principal fea-
tures. First, it requires only a single camera to work
well. Due to the innate characteristics of single cam-
era input, it is unable to obtain depth information.
To compensate for this, it includes a training step
in which use is made of 3D motion data captured
by a marker-based motion capture system. 3D mo-
tion data of various motions in sports can be found
in databases that are publicly available. For exam-
ple, the CMU mocap library includes motions of golf
swings, basketball dribbles, and soccer ball kicks.
Our method can employ 3D motion data from such
motion databases and thus skirt troublesome prepara-
tions. Second, it is robust against differences in the
relative positions of the camera and the target human.
The GPDM-based markerless motion capture method
learns the state dynamics of all possible views. In
the pose estimation step, it jointly estimates a 3D hu-
man pose and the camera position relative to it. This
enables robust tracking against camera position. The
following subsections describe each of these steps in
more detail.
2.1 Training Step
The GPDM-based markerless motion capture method
uses sequences of 3D human poses in the training
step. That is, it requires pose sequences obtained
by a marker-based motion capture system or a multi-
camera system. Note that since our system is de-
signed for specific motor learning, this requirement
is not especially problematic.
The training step actually comprises three steps.
The first is estimating a view-dependent trajectory. It
should be noted that the a view-dependent observa-
tion can be virtually generated from training data be-
cause it consists of 3D motion data. The second is
reducing the dimensionality of data by using GPDM
(Wang et al., 2005). The third is learning the state dy-
namics for each view. Through this training step, the
view-dependent dynamics of human movement in a
low-dimensional feature space are obtained.
2.2 Pose Estimation Step
In the pose estimation step, state parameters, i.e., view
and pose parameters trained in the training step, are
estimated by particle filtering (Isard and Blake, 1998)
of a video sequence captured by a single camera. Be-
cause this step estimates not only pose parameters but
also the view (i.e., the relative positions of human and
camera), it is robust with respect to the latter. An
HSV histogram of joints was used for the observation
model.
3 PROPOSED METHOD
This section outlines the details of the proficiency es-
timation system we propose for assisting motor learn-
ing. It has been said that motor learning can be di-
vided into three phases: cognitive, associative, and
autonomous (Schmidt and Lee, 1988). On the basis
of this knowledge, the proposed method estimates a
trainee’s proficiency in making motions from the vari-
ability of his/her own movements.
Figure 1 shows the system environment that we
assume. The proposed method uses a single cam-
icSPORTS2013-InternationalCongressonSportsScienceResearchandTechnologySupport
148