ACKNOWLEDGEMENTS
This research was carried out within projects funded
by the Ministry of Science and Technology of
Spain (TEC2006-12887-C02) and the Universidad
Politécnica de Madrid (AL06-EX-PID-033). The
work has also received support from the European COST
Action 2103.
BIOSIGNALS 2009 - International Conference on Bio-inspired Systems and Signal Processing