Then the next step was calculating the threshold.
As the reference models were generated for each
speaker they were tested on a validation for
calculating the average distance between true
speakers and impostors. This measurement is stored
to serve as the calculation of decision of the stage of
testing.
The tests database was divided into two groups.
With a genuine phrases (the 5 other phrases of the
stage of training) and another with the phrases of
impostors (5 separate impostors). The process of
testing was also performed in 20 interactions for 16,
32 and 64 mixtures. For both groups were estimated
parameters of Maximum Likelihood.
The result of these distortions was compared to
the threshold calculated in training step and the basic
measures of error used in the system were the False
Acceptance Rate (FAR) and False Rejection Rate
(FRR) as defined below.
Overall performance can be obtained by combining
these two errors into total success rate (TSR) where:
4.5 Results
Table 2 shows a summary of the results of the use of
speaker verification. Have seen the results for 16, 32
and 64 and the results were mixed with 64 blends
the best as expected. The overall success rate (TSR)
was up 91.12%. For the false acceptance rate (FAR)
and false rejection (FRR), the results were very
similar, with no great increase in performance.
Table 2: Speakers Data Base Description.
Mixtures FAR FRR TSR
16 11,23 8,92 89,92%
32 10,98 8,47 90,27%
64 9,43 8,32 91,12%
5 CONCLUSIONS
This article show a speaker verification system
based on the GMM approach to telephony
environments. This system was integrated with the
Asterisk telephony platform and the results are very
similar to systems for quiet environments.
The tests using the GMM approach results in the
range of 90%. Other techniques being worked see
better results in the literature, however, our idea was
to show the ease of integration between the platform
and the asterisk of speaker verification system.
As future work we test in more noisy acoustic
environments with cell phones, make changes in the
extraction of features to achieve more representative
parameters and finally implement the changes as the
GMM as FGMM (Tran, 1998) and Type-2-Fuzzy
GMM - (Zeng, 2007). Thus we want to achieve
better recognition rates and robustness to noisy
environments.
REFERENCES
Naik, J.M., Netsch, L.P., Doddington, G.R. 1989.
In International Conference on Acoustics, Speech, and
Signal Processing. Speaker verification over long
distance telephone lines.
Mak, M.W., Kung, S.Y. 2002. In International
Conference on Acoustics, Speech, and Signal
Processing. Combining stochastic feature
transformation and handset Identification for
telephone-based speaker verification.
Meggelen, J. V., Madsen, L., Smith, J., 2007. Asterisk –
The Future of Telephony, O’REILLY. USA, 2
nd
edition.
Spencer, M., et al., 2003. The Asterisk Handbook, Digium,
Inc. USA, 2
nd
version.
Campbell, J.P. 1997. In Proceedings of IEEE, Speaker
Recognition: A Tutorial.
Mashao, D., Skosan, M. 2006. In Pattern Recognition,
Combining Classifier Decision for Robust Speaker
Identification.
Fechine, J.M. 2002. In PhD Thesis, UFCG-Brazil.
Reconhecimento Automático de Identidade Vocal
Utilizando Modelagem Híbrida: Paramétrica e
Estatística.
Reynolds D., R. Rose. 1995. In IEEE Trans. Speech Audio
Process. Robust text-independent speaker
identification using gaussian mixture speaker models.
Reynolds, D., T. Quatieri, R. Dunn. 2000. In Digital
Signal Process., DSP. Speaker verification using
adapted Gaussian speaker mixture models.
Tran, D., VanLe, T., Wagner M., 1998. In: Proceedings
of the International Conference on Spoken Language
Processing. Fuzzy Gaussian mixture models for
speaker recognition.
Zeng, J., Xie, L., Liu, Z.Q., 2007. In Pattern Recognition,
Type-2-Fuzzy Gaussian Mixture Models.
TSR =
FAR + FRR
Total number of accesses
100% - x 100
Number of re
ected
enuine claims
Total number of
enuine accesses
FRR =
x 100
Number of acce
ted im
oster claims
Total number of im
oster accesses
FAR =
x 100
SPEAKER VERIFICATION SYSTEM THROUGH TELEPHONE CHANNEL - An Integrated System for Telephony
Plataform Asterisk
93