An Improved Approach for Depth Data based Face Pose Estimation using Particle Swarm Optimization

Xiaozheng Mou, Han Wang

2014

Abstract

This paper presents an improved approach for face pose estimation based on depth data using particle swarm optimization (PSO). In this approach, the frontal face of the system-user is first initialized and its depth image is taken as a person-specific template. Each query face of that user is rotated and translated with respect to its centroid using PSO to match with the template. Since the centroid of each query face always changes with the face pose changing, a common reference point has to be defined to measure the exact transformation of the query face. Thus, the nose tips of the optimal transformed face and the query face are localized to recompute the transformation from the query face to the optimal transformed face that matched with the template. Using the recomputed rotation and translation information, finally, the pose of the query face can be approximated by the relative pose between the query face and the template face. Experiments on public database show that the accuracy of this new method is more than 99%, which is much higher than the best performance (< 91%) of existing work.

References

  1. Back, T. (1996). Evolutionary algorithms in theory and practice: evolution strategies, evolutionary programming, genetic algorithms. Oxford University Press, USA.
  2. Besl, P. and McKay, N. (1992). A method for registration of 3d shapes. IEEE Transactions on pattern analysis and machine intelligence, 14(2):239-256.
  3. Bleiweiss, A. and Werman, M. (2010). Robust head pose estimation by fusing time-of-flight depth and color. In Proceedings of IEEE International Workshop on Multimedia Signal Processing, pages 116-121.
  4. Breitenstein, M., Kuettel, D., Weise, T., Gool, L. V., and Pfister, H. (2008). Real-time face pose estimation from single range images. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, pages 1-8.
  5. Cai, Q., Gallup, D., Zhang, C., and Zhang, Z. (2010). 3d deformable face tracking with a commodity depth camera. In Proceedings of European Conference on Computer Vision, pages 229-242.
  6. Fanelli, G., Gall, J., and Gool, L. V. (2011a). Real time head pose estimation with random regression forests. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, pages 617-624.
  7. Fanelli, G., Weise, T., Gall, J., and Gool, L. V. (2011b). Real time head pose estimation from consumer depth cameras. Pattern Recognition, pages 101-110.
  8. Ghorbel, M. B., Baklouti, M., and Couvet, S. (2010). 3d head pose estimation and tracking using particle filtering and icp algorithm. Articulated Motion and Deformable Objects, pages 224-237.
  9. Horn, B. and Harris, J. (1991). Rigid body motion from range image sequences. CVGIP: Image Understanding, 53(1):1-13.
  10. Kondori, F., Yousefi, S., Li, H., and Sonning, S. (2011). 3d head pose estimation using the kinect. In Proceedings of International Conference on Wireless Communications and Signal Processing, pages 1-4.
  11. Mora, K. F. and Odobez, J. (2012). Gaze estimation from multimodal kinect data. In Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, pages 25-30.
  12. Murphy-Chutorian, E. and Trivedi, M. (2009). Head pose estimation in computer vision: A survey. IEEE Transactions on Pattern Analysis and Machine Intelligence, 31(4):607-626.
  13. Padeleris, P., Zabulis, X., and Argyros, A. (2012). Head pose estimation on depth data based on particle swarm optimization. In Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, pages 42-49.
  14. Rajwade, A. and Levine, M. (2006). Facial pose from 3d data. Image and Vision Computing, 24(8):849-856.
  15. Seemann, E., Nickel, K., and Stiefelhagen, R. (2004). Head pose estimation using stereo vision for human-robot interaction. In Proceedings of IEEE International Conference on Automatic Face and Gesture Recognition, pages 626-631.
  16. Tang, Y., Sun, Z., and Tan, T. (2011a). Face pose estimation based on integral slice features of single depth images. In Proceedings of Asian Conference on Pattern Recognition, pages 530-534.
  17. Tang, Y., Sun, Z., and Tan, T. (2011b). Real-time head pose estimation using random regression forests. Biometric Recognition, pages 66-73.
  18. Tu, Y., Zeng, C., Yeh, C., Huang, S., Cheng, T., and Ouhyoung, M. (2011). Real-time head pose estimation using depth map for avatar control. In Proceedings of IPPR Conference on Computer Vision, Graphics, and Image Processing.
  19. Wang, H. and Ying, Y. (2012). A novel torchlight data association strategy for surface registration. In Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems, pages 1708-1713.
  20. Weise, T., Leibe, B., and Gool, L. V. (2007). Fast 3d scanning with automatic motion compensation. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, pages 1-8.
  21. Zhang, Z., Liu, Z., Adler, D., Cohen, M., Hanson, E., and Shan, Y. (2004). Robust and rapid generation of animated faces from video images: A model-based modeling approach. International Journal of Computer Vision, 58(2):93-119.
Download


Paper Citation


in Harvard Style

Mou X. and Wang H. (2014). An Improved Approach for Depth Data based Face Pose Estimation using Particle Swarm Optimization . In Proceedings of the 9th International Conference on Computer Vision Theory and Applications - Volume 2: VISAPP, (VISIGRAPP 2014) ISBN 978-989-758-004-8, pages 534-541. DOI: 10.5220/0004732305340541


in Bibtex Style

@conference{visapp14,
author={Xiaozheng Mou and Han Wang},
title={An Improved Approach for Depth Data based Face Pose Estimation using Particle Swarm Optimization},
booktitle={Proceedings of the 9th International Conference on Computer Vision Theory and Applications - Volume 2: VISAPP, (VISIGRAPP 2014)},
year={2014},
pages={534-541},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004732305340541},
isbn={978-989-758-004-8},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 9th International Conference on Computer Vision Theory and Applications - Volume 2: VISAPP, (VISIGRAPP 2014)
TI - An Improved Approach for Depth Data based Face Pose Estimation using Particle Swarm Optimization
SN - 978-989-758-004-8
AU - Mou X.
AU - Wang H.
PY - 2014
SP - 534
EP - 541
DO - 10.5220/0004732305340541