MULTIMODAL USER IDENTIFICATION FOR NETWORK-BASED INTELLIGENT ROBOTS

Keun-Chang Kwak

2011

Abstract

This paper is concerned with multimodal user identification based face and speaker recognition for Human-Robot Interaction (HRI) under network-based intelligent robot environments. Face and speaker recognition are frequently used in conjunction with HRI that can naturally interact between human and robot. For this purpose, we present Tensor-based Multilinear Principal Component Analysis (TMPCA) and Mel-Frequency Cepstral Coefficients-Gaussian Mixture Model (MFCC-GMM) to recognize with face images and speech signals obtained through network transmission, respectively. Furthermore, we investigate network-based multimodal user identification for the near future study. The experimental results on face and speaker database with distant-varying reveal that the presented method shows good performance in network-based intelligent robot environments.

References

  1. Reynolds, D. A., Rose, R. C., 1995. Robust textindependent speaker identification using Gaussian mixture speaker models. IEEE Trans. on Speech and Audio Processing, vol. 3, no. 1, pp. 72-83.
  2. Kwak, K. C., Kim, H. J., Bae, K. S., Yoon, H. S., 2007. Speaker identification and verification for intelligent service robots. In International Conference on Artificial Intelligence (ICAI2007), Las Vegas, May.
  3. Ha, Y. G., Sohn, J. C., Cho, Y. J., and Yoon, H., 2005. Towards ubiquitous robotic companion: Design and implementation of ubiquitous robotic service framework. ETRI Journal, vol. 27, no. 6, pp. 666-676.
  4. Kim, D. H., Lee, J., Yoon, H. S., and Cha, E. Y., 2007. A non-cooperative user authentication system in robot environments. IEEE Consumer Electronics, vol. 53, no. 2, pp. 804-811.
  5. Yun, W. H., Kim, D. H., and Yoon, H. S., 2007. Fast Group verification system for intelligent robot service. IEEE Trans. on Consumer Electronics, vol. 53, no. 4, pp. 1731-1735.
  6. Ji, M., Kim, S., and Kim, H., 2008. Text-independent speaker identification using soft channel selection in home robot environments. IEEE Trans. on Consumer Electronics, vol. 54, no. 1, pp. 140-144, 2008.
  7. Lu, H., Plataniotis, K. N., and Venetsanopoulos, A. N., 2008. MPCA: Multilinear principal component analysis of tensor objects. IEEE Trans. on Neural Networks, vol. 19, no. 1, pp. 18-39.
  8. Kwak, K. C., and Kim, S. S., 2008. Sound source localization with aid of excitation source information in home robot environments. IEEE Trans. on Consumer Electronics, vol. 54, no. 2, pp. 852-856.
  9. Kim, H. J., Lee, J. Y., Kwak, K. C., and Yoon, H. S., 2007. Network-based voice component framework for human-robot interaction. International Symposium on Communications and Information Technologies (ISCIT 2007), pp. 1546-1550.
Download


Paper Citation


in Harvard Style

Kwak K. (2011). MULTIMODAL USER IDENTIFICATION FOR NETWORK-BASED INTELLIGENT ROBOTS . In Proceedings of the 8th International Conference on Informatics in Control, Automation and Robotics - Volume 2: ICINCO, ISBN 978-989-8425-75-1, pages 337-340. DOI: 10.5220/0003574203370340


in Bibtex Style

@conference{icinco11,
author={Keun-Chang Kwak},
title={MULTIMODAL USER IDENTIFICATION FOR NETWORK-BASED INTELLIGENT ROBOTS},
booktitle={Proceedings of the 8th International Conference on Informatics in Control, Automation and Robotics - Volume 2: ICINCO,},
year={2011},
pages={337-340},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003574203370340},
isbn={978-989-8425-75-1},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 8th International Conference on Informatics in Control, Automation and Robotics - Volume 2: ICINCO,
TI - MULTIMODAL USER IDENTIFICATION FOR NETWORK-BASED INTELLIGENT ROBOTS
SN - 978-989-8425-75-1
AU - Kwak K.
PY - 2011
SP - 337
EP - 340
DO - 10.5220/0003574203370340