3D HUMAN TRACKING WITH GAUSSIAN PROCESS
ANNEALED PARTICLE FILTER
Leonid Raskin, Ehud Rivlin and Michael Rudzsky
Computer Science Department,Technion Israel Institute of Technology, Technion City, Haifa, Israel
Keywords: Tracking, Annealed particle filter, Gaussian fields, Latent space.
Abstract: We present an approach for tracking human body parts with prelearned motion models in 3D using multiple
cameras. We use an annealed particle filter to track the body parts and a Gaussian Process Dynamical
Model in order to reduce the dimensionality of the problem, increase the tracker's stability and learn the
motion models. We also present an improvement for the weighting function that helps to its use in occluded
scenes. We compare our results to the results achieved by a regular annealed particle filter based tracker and
show that our algorithm can track well even for low frame rate sequences.
1 INTRODUCTION
This paper presents an approach to 3D people
tracking that enables reduction in the complexity of
this model. We propose a novel algorithm, Gaussian
Process Annealed Particle Filter (GPAPF). In this
algorithm we use nonlinear dimensionality reduction
with the help a Gaussian Process Dynamical Model
(GPDM), (Lawrence (2004), Wang et al. (2005)),
and an annealed particle filter proposed by
Deutscher and Reid (2004). The annealed particle
filter has good performance when working on videos
which were shot with a high frame rate (60 fps, as
reported by Balan et al. (2005)), but performance
drops when the frame rate is lower (30fps). We
show that our approach provides good results even
for the low frame rate (30fps). An additional
advantage of our tracking algorithm is the capability
to recover after temporal loss of the target.
2 RELATED WORKS
One of the common approaches for tracking is using
a Particle Filtering. Particle Filtering uses multiple
predictions, obtained by drawing samples of the
pose and location prior and then propagating them
using the dynamic model, which are refined by
comparing them with the local image data (the
likelihood) (see, for example Isard (1998) or Bregler
et al. (1998)). The prior is typically quite diffused
(because motion can be fast) but the likelihood
function may be very peaky, containing multiple
local maxima which are hard to account for in detail.
For example, if an arm swings past an “arm-like”
pole, the correct local maximum must be found to
prevent the track from drifting (Sidenbladh (2000)).
Annealed particle filter (Deutscher and Reid (2004))
or local searches are ways to attack this difficulty.
There exist several possible strategies for
reducing the dimensionality of the configuration
space. Firstly it is possible to restrict the range of
movement of the subject. This approach has been
pursued by Rohr et al. (1997). The assumption is
that the subject is performing a specific action.
Agarwal et al. (2004) assume a constant angle of
view of the subject. Because of the restricting
assumptions the resulting trackers are not capable of
tracking general human poses.
Another way to cope with high-dimensional data
is to learn low-dimensional latent variable models.
Urtasun et al. (2006) uses a form of probabilistic
dimensionality reduction by Gaussian Process
Dynamical Model (GPDM) (Lawrence (2004), and
Wang et al. (2005)) formulate the tracking as a
nonlinear least-squares optimization problem.
Our approach is similar in spirit to the one
proposed by Urtasun et al. (2006), but we perform a
two stage process. The first stage is annealed particle
filtering in a latent space of low dimension. The
particles obtained after this step are transformed into
the data space by GPDM mapping. The second stage
is annealed particle filtering with these particles in
the data space.
459