Figure 4: Tracking error of the PIP middle finger.
is well-adapted to the 3D hand tracking. A 3D para-
metric hand model is used to achieve the tracking by
comparing its poses with the hand ones using our pro-
posed function. The depth-based function compares
3D point clouds stemming from depth maps. Each
hand point cloud is compared with several clouds of
points which correspond to different model poses in
order to obtain the model pose that is close to the hand
one. Classical functions comparing 3D point clouds
such as the Hausdorff one are not adapted for the
hand tracking problem because of the expensive time
needed to achieve both the comparison and track-
ing. To reduce the computational burden, we propose
to compute a volume of voxels from the hand point
cloud, where each voxel is characterized by its dis-
tance to that cloud. By placing any model point cloud
in the computed volume, it becomes fast to compute
its distance to the hand point cloud. We experiment
our proposed function using synthetic data obtained
from depth maps generated by means of the OpenGL
library. The preliminary results obtained so far are
very encouraging because we are able to track com-
plex hand motion such as the closing of hand. Besides
tracking complex hand motion, our proposed function
is faster than other well-known functions such as the
Hausdorff one.
We plan to extend our experimental study using
real data. For this propose, different methods could
be used to collect 3D hand point cloud. Stereo vi-
sion could be a solution to the problem of acquiring
3D cloud point of the hand but it requires the use two
video cameras. Another alternative consists to use a
new generation of video cameras, called time of flight
cameras, and provides a 3D cloud point of the ob-
served scene in real-time. However, this new technol-
ogy is deemed to be not very precise. Structured light
sensor could also be used to obtain depth maps of the
hand as it is done in (Bray et al., 2004a). This method
seems to provide accurate results but it is slower than
a time of flight camera. A comparative study between
these different methods must be performed to select
the one yielding the best results in terms of accuracy
and computing time.
REFERENCES
Bray, M., Koller-meier, E., Mller, P., Gool, L. V., and
Schraudolph, N. N. (2004a). 3d hand tracking by rapid
stochastic gradient descent using a skinning model. In
In 1st European Conference on Visual Media Produc-
tion (CVMP, pages 59–68.
Bray, M., Koller-Meier, E., Schraudolph, N. N., and Gool,
L. V. (2004b). Stochastic meta-descent for tracking
articulated structures. In CVPRW ’04: Proceedings
of the 2004 Conference on Computer Vision and Pat-
tern Recognition Workshop (CVPRW’04) Volume 1,
page 7, Washington, DC, USA. IEEE Computer So-
ciety.
de La Gorce, M., Paragios, N., and Fleet, D. J. (2008).
Model-based hand tracking with texture, shading and
self-occlusions. In CVPR.
Dennis, J. E., Jr., and Torczon, V. (1991). Direct search
methods on parallel machines. SIAM Journal on Op-
timization, 1:448–474.
Heap, T. and Hogg, D. (1996). Towards 3d hand tracking
using a deformable model. In In Face and Gesture
Recognition, pages 140–145.
Henia, O. B., Hariti, M., and Bouakaz, S. (2010). A two-
step minimization algorithm for model-based hand
tracking. In WSCG.
Huttenlocher, D. P., Klanderman, G. A., Kl, G. A., and
Rucklidge, W. J. (1993). Comparing images using
the hausdorff distance. IEEE Transactions on Pattern
Analysis and Machine Intelligence, 15:850–863.
Ike, T., Kishikawa, N., and Stenger, B. (2007). A real-time
hand gesture interface implemented on a multi-core
processor. In MVA, pages 9–12.
Kato, M. and Xu, G. (2006). Occlusion-free hand mo-
tion tracking by multiple cameras and particle filter-
ing with prediction. IJCSNS International Journal of
Computer Science and Network Security, 6(10):58–
65.
Kerdvibulvech, C. and Saito, H. (2009). Model-based hand
tracking by chamfer distance and adaptive color learn-
ing using particle filter. J. Image Video Process.,
2009:2–2.
Meijster, A., Roerdink, J., and Hesselink, W. H. (2000). A
general algorithm for computing distance transforms
in linear time. In Mathematical Morphology and its
Applications to Image and Signal Processing, pages
331–340. Kluwer.
Nelder, J. A. and Mead, R. (1965). A simplex method
for function minimization. The Computer Journal,
7(4):308–313.
Ouhaddi, H. and Horain, P. (1999). 3d hand gesture tracking
by model registration. In Proc.IWSNHC3DI99, pages
70–73.
Rosales, R., Athitsos, V., Sigal, L., and Sclaroff, S. (2001).
3d hand pose reconstruction using specialized map-
pings. In ICCV, pages 378–385.
A NEW DEPTH-BASED FUNCTION FOR 3D HAND MOTION TRACKING
%p