SHORT-TERM CEPSTRAL ANALYSIS APPLIED TO VOCAL FOLD EDEMA DETECTION

Silvana Cunha Costa, Benedito G. Aguiar Neto, Joseana Macêdo Fechine, Menaka Muppa

2008

Abstract

Digital signal processing techniques have been used to perform an acoustic analysis for vocal quality assessment due to the simplicity and the non-invasive nature of the measurement procedures. Their employment is of special interest, as they can provide an objective diagnosis of pathological voices, and may be used as complementary tool in laryngoscope exams. The acoustic modeling of pathological voices is very important to discriminate normal and pathological voices. The degree of reliability and effectiveness of the discriminating process depends on the appropriate acoustic feature extraction. This paper aims at specifying and evaluating the acoustic features for vocal fold edema through a parametric modeling approach based on the resonant structure of the human speech production mechanism, and a nonparametric approach related to human auditory perception system. For this purpose, LPC and LPC-based cepstral coefficients, and mel-frequency cepstral coefficients are used. A vector-quantizing-trained distance classifier is used in the discrimination process.

References

  1. Bou-Ghazale, S.E., Hansen, J.H.L., 2000. A Comparative Study of Traditional and Newly Proposed Features for Recognition of Speech Under Stress. IEEE Transactions on Speech & Audio Processing. Vol. 8, no. 4, pp. 429-442, July.
  2. Davis, S. B, 1979. Acoustic Characteristics of Normal and Pathological Voices. Speech and Language: Advances in Basic Research and Practice. Vol. 1, pp. 271-335.
  3. Dibazar, A. A., Berger, T.W., and Narayanan, S. S., 2006. Pathological Voice Assessment. Proceedings of the 28th IEEE EMBS Annual International Conference. New York, USA, Aug. 30-Sept. 3.
  4. Furui, S., 1981. Cepstral Analysis Technique for Automatic Speaker Verification. IEEE Transactions on Acoustics, Speech and Signal Processing. Vol. 29, No. 2, pp 254-272, April.
  5. Gavidia-Ceballos, Liliana and Hansen, John H. L., 1996. Direct Speech Feature Estimation Using an Interactive EM Algorithm for Vocal Fold Pathology Detection. IEEE Trans. on Biomedical Engineering. Vol. 43, No. 4, April.
  6. Godino-Llorente, J. I., Gomes-Vilda, P. and BlancoVelasco M., 2006. Dimensionality Reduction of a Pathological Voice Quality Assessment System Based on Gaussian Mixture Models and Short-Term Cepstral Parameters. IEEE Transactions on Biomedical Engineering. Vol. 53, No. 10, pp. 1943-1953, October,
  7. Kay Elemetrics Corp. Disordered Voice Database, 1994. Model 4337, 03 Ed.
  8. Linde, Y., Buzo, A., and Gray, R. M., 1980. An Algorithm for Vector Quantizer Design, IEEE Transaction on Communications. Vol. COM-28, N0.I, pages 84-95, January.
  9. Marinaki, M., Contropoulos, C., Pitas, I., and Maglaveras, N., 2004. Automatic Detection of Vocal Fold Paralysis and Edema, Proc. of 8th Conf. Spoken Language Processing (Interspeech 2004). Jeju, Korea, October.
  10. Murphy, Peter J. and Akande, Olatunji O., 2007. Noise Estimation in Voice Signals Using Short-term Cepstral, Journal of the Acoustical Society of America. pp. 1679-1690, Vol. 121, No. 3, March.
  11. O'Shaughnessy, Douglas, 2000. Speech Communications: Human and Machine. 2nd Edition, NY, IEEE Press.
  12. Parsa, Vijay and Jamieson, Donald G., 2001. Acoustic Discrimination of Pathological Voice: Sustained Vowels versus Continuous Speech. Journal of Speech, Language, and Hearing Research. Vol. 44, pp 327- 339, April.
  13. Quek, F., M. Harper, Haciahmetoglou, Y., Chen, L. and Raming, L. O., 2002. Speech pauses and gestural holds in ParkinsoÁs disease. Proceedings of International Conference on Spoken Language Processing. pp. 2485-2488.
  14. Rabiner L. R. and Schafer R. W., 1978. Digital Processing of Speech Signals. New Jersey: Prentice-Hall.
  15. Shama, K., Krishna, A. and Cholayya, N. U., 2007. Study of Harmonics-to-Noise Ratio and Critical-Band Energy Spectrum of Speech as Acoustic Indicators of Laryngeal and Voice Pathology. EURASIP Journal on Advances in Signal Processing. Vol. 2007.
  16. Umapathy, K., Krishnan, S., Parsa, V., and Jamieson D., 2005. G. Discrimination of Pathological Voices Using a Time-Frequency Approach. IEEE Transactions on Biomedical Engineering. Vol. 52, No. 3, March.
Download


Paper Citation


in Harvard Style

Cunha Costa S., G. Aguiar Neto B., Macêdo Fechine J. and Muppa M. (2008). SHORT-TERM CEPSTRAL ANALYSIS APPLIED TO VOCAL FOLD EDEMA DETECTION . In Proceedings of the First International Conference on Bio-inspired Systems and Signal Processing - Volume 2: BIOSIGNALS, (BIOSTEC 2008) ISBN 978-989-8111-18-0, pages 110-115. DOI: 10.5220/0001066901100115


in Bibtex Style

@conference{biosignals08,
author={Silvana Cunha Costa and Benedito G. Aguiar Neto and Joseana Macêdo Fechine and Menaka Muppa},
title={SHORT-TERM CEPSTRAL ANALYSIS APPLIED TO VOCAL FOLD EDEMA DETECTION},
booktitle={Proceedings of the First International Conference on Bio-inspired Systems and Signal Processing - Volume 2: BIOSIGNALS, (BIOSTEC 2008)},
year={2008},
pages={110-115},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001066901100115},
isbn={978-989-8111-18-0},
}


in EndNote Style

TY - CONF
JO - Proceedings of the First International Conference on Bio-inspired Systems and Signal Processing - Volume 2: BIOSIGNALS, (BIOSTEC 2008)
TI - SHORT-TERM CEPSTRAL ANALYSIS APPLIED TO VOCAL FOLD EDEMA DETECTION
SN - 978-989-8111-18-0
AU - Cunha Costa S.
AU - G. Aguiar Neto B.
AU - Macêdo Fechine J.
AU - Muppa M.
PY - 2008
SP - 110
EP - 115
DO - 10.5220/0001066901100115