SPEAKER’S GENDER IDENTIFICATION FOR HUMAN-ROBOT
INTERACTION
Kyung-Sook Bae, Keun-Chang Kwak, Soo-Young Chi
Intelligent Robot Research Division, Electronics and Telecommunications Research Institute(ETRI),
161 Gajeong-Dong, Yuseong-Gu, Daejeon, Korea
Keywords: Gender Identification, GMM, URC.
Abstract: This paper is concerned with a text-independent Speaker’s gender Identification (GI) for Human-Robot
Interaction (HRI). For this purpose, we perform speaker’s gender recognition based on Gaussian Mixture
Model (GMM) and use robot platform called WEVER, which is a Ubiquitous Robotic Companion (URC)
intelligent service robot developed at Intelligent Robot Research Division in Electronics and
Telecommunication Research Institute (ETRI). Furthermore, we communicate with intelligent service
robots through a Korean-based spontaneous speech recognition and text-independent speaker’s gender
identification to provide a suitable service such as selection of preferable TV channel or music for the
identified speaker’s gender. The experimental results obtained for ETRI speaker database reveal that the
approach presented in this paper yields a good identification (94.9%) performance within 3 meter.
1 INTRODUCTION
Speaker Identification has many applications and is
a topic of great interest in the speech research
community. Speaker’s Gender Identification (GI)
can be thought of as a subset of speaker
identification and also can be contributed to increase
performance of Speaker Identification as a
preprocessing. In the past, GI has been investigated
for clean speech by Wu and Childers (Wu, 1991).
Parris and Carey studied GI for different languages
using telephone speech data. In their system Paris
and Carey trained an GI system using speakers of
British English and tested their system using
speakers of British English, US English, and 10
other languages. Slomka and Sridharan proposed
text-independent GI systems capable of being
optimized for multiple adverse conditions, including
various coders, and reverberation levels (Slomka,
1997).
Recently, there has been a renewal of interest in
Human-Robot Interaction (HRI) for intelligent
robots. Among HRI components, specifically the
concern with speech-based HRI such as speech
recognition, sound source separation, and speaker
recognition has been growing. In this paper, we
present text-independent GI to develop HRI
components for Ubiquitous Robotic Companion
(URC) intelligent service robots, which exploit
strong Information Technology (IT) infrastructure
such as high-speed internet.
Here the URC means
that it will provide the necessary services at any time
and place to meet the user’s requirements. Thus, it
combines the network function with the current
concept of a robot in order to enhance mobility and
human interface. For this purpose, we perform
gender recognition based on Gaussian Mixture
Model (GMM) (Reynolds, 1995) through a
microphone equipped with WEVER robot developed
by ETRI. Furthermore, we communicate with
intelligent service robots through spontaneous
speech recognition and text-independent speaker’s
gender recognition to provide a suitable service for
the identified speaker’s gender. The experimental
results obtained for ETRI speaker database reveal
that the presented approach yields a good
identification performance at a short or long-
distance (3m-5m). Proposed GI system is shown in
Figure 1.
339
Bae K., Kwak K. and Chi S. (2006).
SPEAKER’S GENDER IDENTIFICATION FOR HUMAN-ROBOT INTERACTION.
In Proceedings of the International Conference on Signal Processing and Multimedia Applications, pages 339-342
DOI: 10.5220/0001569103390342
Copyright
c
SciTePress