ON THE HUMAN POSE RECOVERY BASED ON A SINGLE VIEW

Sébastien Piérard, Marc Van Droogenbroeck

2012

Abstract

Estimating the pose of the observed person is crucial for a large variety of applications including home entertainment, man-machine interaction, video surveillance, etc. Often, only a single side view is available, but authors claim that it is possible to derive the pose despite that humans evolve in a 3D environment. In addition, to decrease the sensitivity to color and texture, it is preferable to rely only on the silhouette to recover the pose. Under these conditions, we show that there is an intrinsic limitation: at least two poses correspond to the observed silhouette. We discuss this intrinsic limitation in details in this short paper. To our knowledge, this issue has been overlooked by authors in the past. We observe that this limitation has an impact on the way previous reported results should be interpreted, and it has clearly to be taken into account for designing new methods.

References

  1. Agarwal, A. and Triggs, B. (2005). Monocular human motion capture with a mixture of regressors. In IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, volume 3, San Diego, USA.
  2. Agarwal, A. and Triggs, B. (2006). Recovering 3d human pose from monocular images. IEEE Transactions on Pattern Analysis and Machine Intelligence, 28(1):44- 58.
  3. Barnich, O. and Van Droogenbroeck, M. (2011). ViBe: A universal background subtraction algorithm for video sequences. IEEE Transactions on Image Processing, 20(6):1709-1724.
  4. Elgammal, A. and Lee, C. (2009). Tracking people on a torus. IEEE Transactions on Pattern Analysis and Machine Intelligence, 31(3):520-538.
  5. Geurts, P., Ernst, D., and Wehenkel, L. (2006). Extremely randomized trees. Machine Learning, 63(1):3-42.
  6. Girshick, R., Shotton, J., Kohli, P., Criminisi, A., and Fitzgibbon, A. (2011). Efficient regression of generalactivity human poses from depth images. In International Conference on Computer Vision (ICCV), Barcelona, Spain.
  7. Gond, L., Sayd, P., Chateau, T., and Dhome, M. (2008). A 3D shape descriptor for human pose recovery. In Perales, F. and Fisher, R., editors, Articulated Motion and Deformable Objects, volume 5098 of Lecture Notes in Computer Science, pages 370-379. Springer.
  8. Howe, N. (2004). Silhouette lookup for automatic pose tracking. In IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, volume 1, pages 15-22, Washington, USA.
  9. Moeslund, T., Hilton, A., and Krüger, V. (2006). A survey of advances in vision-based human motion capture and analysis. Computer Vision and Image Understanding, 104:90-126.
  10. Peng, B. and Qian, G. (2008). Binocular dance pose recognition and body orientation estimation via multilinear analysis. In IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, Anchorage, USA.
  11. Piérard, S., Leroy, D., Hansen, J.-F., and Van Droogenbroeck, M. (2011). Estimation of human orientation in images captured with a range camera. In Advances Concepts for Intelligent Vision Systems (ACIVS), volume 6915 of Lecture Notes in Computer Science, pages 519-530. Springer.
  12. Poppe, R. and Poel, M. (2006). Comparison of silhouette shape descriptors for example-based human pose recovery. In International Conference on Automatic Face and Gesture Recognition, pages 541-546, Southampton, UK.
  13. Rosales, R. and Sclaroff, S. (2001). Learning body pose via specialized maps. In Proceedings of Neural Information Processing Systems, pages 1263-1270, Vancouver, Canada.
  14. Shakhnarovich, G., Viola, P., and Darrell, T. (2003). Fast pose estimation with parameter-sensitive hashing. In International Conference on Computer Vision (ICCV), volume 2, pages 750-757, Nice, France.
  15. Shotton, J., Fitzgibbon, A., Cook, M., Sharp, T., Finocchio, M., Moore, R., Kipman, A., and Blake, A. (2011). Real-time human pose recognition in parts from single depth images. In IEEE International Conference on Computer Vision and Pattern Recognition (CVPR), Colorado Springs.
  16. Sminchisescu, C., Kanaujia, A., Li, Z., and Metaxas, D. (2005). Discriminative density propagation for 3d human motion estimation. In IEEE International Conference on Computer Vision and Pattern Recognition (CVPR), volume 1, pages 390-397, San Diego, USA.
  17. Sminchisescu, C. and Telea, A. (2002). Human pose estimation from silhouettes - a consistent approach using distance level sets. In International Conference for Computer Graphics, Visualization and Computer Vision, volume 10, pages 413-420, Plzen?, Czech Republic.
  18. Taylor, C. (2000). Reconstruction of articulated objects from point correspondences in a single uncalibrated image. Computer Vision and Image Understanding, 80(3):349-363.
Download


Paper Citation


in Harvard Style

Piérard S. and Van Droogenbroeck M. (2012). ON THE HUMAN POSE RECOVERY BASED ON A SINGLE VIEW . In Proceedings of the 1st International Conference on Pattern Recognition Applications and Methods - Volume 2: ICPRAM, ISBN 978-989-8425-99-7, pages 310-315. DOI: 10.5220/0003731203100315


in Bibtex Style

@conference{icpram12,
author={Sébastien Piérard and Marc Van Droogenbroeck},
title={ON THE HUMAN POSE RECOVERY BASED ON A SINGLE VIEW},
booktitle={Proceedings of the 1st International Conference on Pattern Recognition Applications and Methods - Volume 2: ICPRAM,},
year={2012},
pages={310-315},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003731203100315},
isbn={978-989-8425-99-7},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 1st International Conference on Pattern Recognition Applications and Methods - Volume 2: ICPRAM,
TI - ON THE HUMAN POSE RECOVERY BASED ON A SINGLE VIEW
SN - 978-989-8425-99-7
AU - Piérard S.
AU - Van Droogenbroeck M.
PY - 2012
SP - 310
EP - 315
DO - 10.5220/0003731203100315