MULTIMODAL USER IDENTIFICATION FOR NETWORK-BASED INTELLIGENT ROBOTS
Keun-Chang Kwak
2011
Abstract
This paper addresses multimodal user identification based on face and speaker recognition for Human-Robot Interaction (HRI) in network-based intelligent robot environments. Face and speaker recognition are frequently used in HRI because they allow humans and robots to interact naturally. For this purpose, we present Tensor-based Multilinear Principal Component Analysis (TMPCA) for recognizing face images and a Mel-Frequency Cepstral Coefficients-Gaussian Mixture Model (MFCC-GMM) for recognizing speech signals, both obtained through network transmission. Furthermore, we investigate network-based multimodal user identification as a direction for near-future study. Experimental results on face and speaker databases recorded at varying distances show that the presented methods perform well in network-based intelligent robot environments.
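The abstract names MFCC-GMM speaker recognition without giving details. The sketch below illustrates only the general GMM decision rule commonly used for text-independent speaker identification: each enrolled speaker has a diagonal-covariance mixture model, and the speaker whose model gives the highest average frame log-likelihood is selected. It is not the paper's implementation. The function names, the toy 2-D vectors standing in for MFCC frames, and the hand-set model parameters are all assumptions made for illustration; a real system would extract MFCCs from audio and train each model with EM.

```python
import math

def log_gaussian_diag(x, mean, var):
    """Log density of a diagonal-covariance Gaussian at feature vector x."""
    return sum(
        -0.5 * (math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v)
        for xi, m, v in zip(x, mean, var)
    )

def gmm_log_likelihood(x, gmm):
    """Log p(x | gmm), where gmm is a list of (weight, mean, var) triples."""
    terms = [math.log(w) + log_gaussian_diag(x, m, v) for w, m, v in gmm]
    peak = max(terms)  # log-sum-exp trick for numerical stability
    return peak + math.log(sum(math.exp(t - peak) for t in terms))

def identify_speaker(frames, models):
    """Pick the speaker whose GMM has the highest mean frame log-likelihood."""
    scores = {
        name: sum(gmm_log_likelihood(f, gmm) for f in frames) / len(frames)
        for name, gmm in models.items()
    }
    return max(scores, key=scores.get), scores

# Hypothetical enrolled models (hand-set, not trained): two components each.
models = {
    "alice": [(0.5, [0.0, 0.0], [1.0, 1.0]), (0.5, [1.0, 1.0], [1.0, 1.0])],
    "bob": [(0.5, [5.0, 5.0], [1.0, 1.0]), (0.5, [6.0, 6.0], [1.0, 1.0])],
}

# Toy 2-D vectors standing in for MFCC frames of one test utterance.
frames = [[0.2, -0.1], [0.9, 1.2], [0.5, 0.4]]

best, scores = identify_speaker(frames, models)
print(best)
```

Averaging over frames keeps scores comparable across utterances of different lengths. Working in the log domain avoids numerical underflow when many frame likelihoods are combined.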
Paper Citation
in Harvard Style
Kwak K. (2011). MULTIMODAL USER IDENTIFICATION FOR NETWORK-BASED INTELLIGENT ROBOTS. In Proceedings of the 8th International Conference on Informatics in Control, Automation and Robotics - Volume 2: ICINCO, ISBN 978-989-8425-75-1, pages 337-340. DOI: 10.5220/0003574203370340
in Bibtex Style
@conference{icinco11,
author={Keun-Chang Kwak},
title={MULTIMODAL USER IDENTIFICATION FOR NETWORK-BASED INTELLIGENT ROBOTS},
booktitle={Proceedings of the 8th International Conference on Informatics in Control, Automation and Robotics - Volume 2: ICINCO},
year={2011},
pages={337-340},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003574203370340},
isbn={978-989-8425-75-1},
}
in EndNote Style
TY - CONF
JO - Proceedings of the 8th International Conference on Informatics in Control, Automation and Robotics - Volume 2: ICINCO
TI - MULTIMODAL USER IDENTIFICATION FOR NETWORK-BASED INTELLIGENT ROBOTS
SN - 978-989-8425-75-1
AU - Kwak K.
PY - 2011
SP - 337
EP - 340
DO - 10.5220/0003574203370340