Ouissem Ben-Henia, Saida Bouakaz


Model-based methods to the tracking of an articulated hand in a video sequence generally use a cost function to compare the hand pose with a parametric three-dimensional (3D) hand model. This comparison allows adapting the hand model parameters and it is thus possible to reproduce the hand gestures. Many proposed cost functions exploit either silhouette or edge features. Unfortunately, these functions cannot deal with the tracking of complex hand motion. This paper presents a new depth-based function to track complex hand motion such as opening and closing hand. Our proposed function compares 3D point clouds stemming from depth maps. Each hand point cloud is compared with several clouds of points which correspond to different model poses in order to obtain the model pose that is close to the hand one. To reduce the computational burden, we propose to compute a volume of voxels from a hand point cloud, where each voxel is characterized by its distance to that cloud. When we place a model point cloud inside this volume of voxels, it becomes fast to compute its distance to the hand point cloud. Compared with other well-known functions such as the directed Hausdorff distance (Huttenlocher et al., 1993), our proposed function is more adapted to the hand tracking problem and it is faster than the Hausdorff function.


  1. Bray, M., Koller-meier, E., Mller, P., Gool, L. V., and Schraudolph, N. N. (2004a). 3d hand tracking by rapid stochastic gradient descent using a skinning model. In In 1st European Conference on Visual Media Production (CVMP, pages 59-68.
  2. Bray, M., Koller-Meier, E., Schraudolph, N. N., and Gool, L. V. (2004b). Stochastic meta-descent for tracking articulated structures. In CVPRW 7804: Proceedings of the 2004 Conference on Computer Vision and Pattern Recognition Workshop (CVPRW'04) Volume 1, page 7, Washington, DC, USA. IEEE Computer Society.
  3. de La Gorce, M., Paragios, N., and Fleet, D. J. (2008). Model-based hand tracking with texture, shading and self-occlusions. In CVPR.
  4. Dennis, J. E., Jr., and Torczon, V. (1991). Direct search methods on parallel machines. SIAM Journal on Optimization, 1:448-474.
  5. Heap, T. and Hogg, D. (1996). Towards 3d hand tracking using a deformable model. In In Face and Gesture Recognition, pages 140-145.
  6. Henia, O. B., Hariti, M., and Bouakaz, S. (2010). A twostep minimization algorithm for model-based hand tracking. In WSCG.
  7. Huttenlocher, D. P., Klanderman, G. A., Kl, G. A., and Rucklidge, W. J. (1993). Comparing images using the hausdorff distance. IEEE Transactions on Pattern Analysis and Machine Intelligence, 15:850-863.
  8. Ike, T., Kishikawa, N., and Stenger, B. (2007). A real-time hand gesture interface implemented on a multi-core processor. In MVA, pages 9-12.
  9. Kato, M. and Xu, G. (2006). Occlusion-free hand motion tracking by multiple cameras and particle filtering with prediction. IJCSNS International Journal of Computer Science and Network Security, 6(10):58- 65.
  10. Kerdvibulvech, C. and Saito, H. (2009). Model-based hand tracking by chamfer distance and adaptive color learning using particle filter. J. Image Video Process., 2009:2-2.
  11. Meijster, A., Roerdink, J., and Hesselink, W. H. (2000). A general algorithm for computing distance transforms in linear time. In Mathematical Morphology and its Applications to Image and Signal Processing, pages 331-340. Kluwer.
  12. Nelder, J. A. and Mead, R. (1965). A simplex method for function minimization. The Computer Journal, 7(4):308-313.
  13. Ouhaddi, H. and Horain, P. (1999). 3d hand gesture tracking by model registration. In Proc.IWSNHC3DI99, pages 70-73.
  14. Rosales, R., Athitsos, V., Sigal, L., and Sclaroff, S. (2001). 3d hand pose reconstruction using specialized mappings. In ICCV, pages 378-385.
  15. Shimada, N., Kimura, K., and Shirai, Y. (2001). Realtime 3-d hand posture estimation based on 2-d appearance retrieval using monocular camera. In RATFGRTS 7801: Proceedings of the IEEE ICCV Workshop on Recognition, Analysis, and Tracking of Faces and Gestures in Real-Time Systems (RATFG-RTS'01), page 23, Washington, DC, USA. IEEE Computer Society.
  16. Stenger, B., Mendonca, P. R. S., and Cipolla, R. (2001). Model-based 3d tracking of an articulated hand. In Computer Vision and Pattern Recognition, 2001. CVPR 2001. Proceedings of the 2001 IEEE Computer Society Conference on, volume 2, pages II-310-II315 vol.2.
  17. Stenger, B., Thayananthan, A., Torr, P. H. S., and Cipolla, R. (2006). Model-based hand tracking using a hierarchical bayesian filter. IEEE Trans. Pattern Analysis and Machine Intelligence(PAMI), 28(9):1372-1384.
  18. Wang, R. Y. and Popovic, J. (2009). Real-time handtracking with a color glove. In SIGGRAPH 7809: ACM SIGGRAPH 2009 papers, pages 1-8, New York, NY, USA. ACM.
  19. Wu, Y., Lin, J., and Huang, T. S. (2005). Analyzing and capturing articulated hand motion in image sequences. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 27(12):1910-1922.

Paper Citation

in Harvard Style

Ben-Henia O. and Bouakaz S. (2011). A NEW DEPTH-BASED FUNCTION FOR 3D HAND MOTION TRACKING . In Proceedings of the International Conference on Computer Vision Theory and Applications - Volume 1: VISAPP, (VISIGRAPP 2011) ISBN 978-989-8425-47-8, pages 653-658. DOI: 10.5220/0003378106530658

in Bibtex Style

author={Ouissem Ben-Henia and Saida Bouakaz},
booktitle={Proceedings of the International Conference on Computer Vision Theory and Applications - Volume 1: VISAPP, (VISIGRAPP 2011)},

in EndNote Style

JO - Proceedings of the International Conference on Computer Vision Theory and Applications - Volume 1: VISAPP, (VISIGRAPP 2011)
SN - 978-989-8425-47-8
AU - Ben-Henia O.
AU - Bouakaz S.
PY - 2011
SP - 653
EP - 658
DO - 10.5220/0003378106530658