PROLONGATION RECOGNITION IN DISORDERED SPEECH USING CWT AND KOHONEN NETWORK

Ireneusz Codello, Wiesława Kuniszyk-Jóźkowiak, Elżbieta Smołka, Adam Kobus

Abstract

Automatic disorder recognition in speech can be very helpful for the therapist while monitoring therapy progress of the patients with disordered speech. In this article we focus on prolongations. We analyze the signal using Continuous Wavelet Transform with 22 bark scales, we divide the result into vectors (using windowing) and then we pass such vectors into Kohonen network. We have increased the recognition ratio from 54% to 81% by adding a modification into the network learning process as well as into CWT computation algorithm. All the analysis was performed and the results were obtained using the authors’ program – “WaveBlaster”. It is very important that the recognition ratio above 80% was obtained by a fully automatic algorithm (without a teacher). The presented problem is part of our research aimed at creating an automatic prolongation recognition system.

References

  1. Akansu A.N, Haddad R. A., 2001, Multiresolution signal decomposition, Academic Press.
  2. Barro S., Marin R., 2002, Fuzzy Logic in Medicine, Physica-Verlag Heidenberg, New York
  3. Codello I., Kuniszyk-Józkowiak W., 2007, Wavelet analysis of speech signal, Annales UMCS Informatica, 2007, AI 6, Pages 103-115.
  4. Codello I., Kuniszyk-Józkowiak W., Kobus A., 2010, Kohonen network application in speech analysis algorithm, Annales UMCS Informatica, (Accepted paper).
  5. Garfield, S., M. Elshaw, and S. Wermter. 2001, Selforgazizing networks for classification learning from normal and aphasic speech. In The 23rd Conference of the Cognitive Science Society. Edinburgh, Scotland
  6. Gold, B., Morgan, N., 2000. Speech and audio signal processing, JOHN WILEY & SONS, INC.
  7. Goupillaud P., Grossmann A., Morlet J., 1984- 1985.Cycle-octave and related transforms in seismic signal analysis'', Geoexploration, 23, 85-102
  8. Huang, X., Acero, A., 2001. Spoken Language Processing: A Guide to Theory, Algorithm and System Development, Prentice-Hall Inc.
  9. Kohonen, T., 2001, Self-Organizing Maps, 34:p.2173- 2179
  10. Nayak J., Bhat P. S., Acharya R., Aithal U. V., 2005, Classification and analysis of speech abnormalities, Volume 26, Issues 5-6, Pages 319-327, Elsevier SAS
  11. Smith J., Abel J, 1999, Bark and ERB Bilinear Transforms, IEEE Transactions on Speech and Audio Processing, November, 1999.
  12. Szczurowska, I, W. Kuniszyk-Józkowiak, and E. Smolka, 2006,The application of Kohonen and Multilayer Perceptron network in the speech nonfluency analysis. Archives of Acoustics. 31 (4 (Supplement)): p. 205- 210
  13. Szczurowska, I, W. Kuniszyk-Józkowiak, and E. Smolka, 2007, Application of Artificial Neural Networks In Speech Nonfluency Recognition. Polish Jurnal of Environmental Studies, 2007 16(4A): p. 335-338.
  14. Traunmüller H., 1990 "Analytical expressions for the tonotopic sensory scale" J. Acoust. Soc. Am. 88: 97- 100.
Download


Paper Citation


in Harvard Style

Codello I., Kuniszyk-Jóźkowiak W., Smołka E. and Kobus A. (2010). PROLONGATION RECOGNITION IN DISORDERED SPEECH USING CWT AND KOHONEN NETWORK . In Proceedings of the International Conference on Fuzzy Computation and 2nd International Conference on Neural Computation - Volume 1: ICNC, (IJCCI 2010) ISBN 978-989-8425-32-4, pages 392-398. DOI: 10.5220/0003057903920398


in Bibtex Style

@conference{icnc10,
author={Ireneusz Codello and Wiesława Kuniszyk-Jóźkowiak and Elżbieta Smołka and Adam Kobus},
title={PROLONGATION RECOGNITION IN DISORDERED SPEECH USING CWT AND KOHONEN NETWORK},
booktitle={Proceedings of the International Conference on Fuzzy Computation and 2nd International Conference on Neural Computation - Volume 1: ICNC, (IJCCI 2010)},
year={2010},
pages={392-398},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003057903920398},
isbn={978-989-8425-32-4},
}


in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Fuzzy Computation and 2nd International Conference on Neural Computation - Volume 1: ICNC, (IJCCI 2010)
TI - PROLONGATION RECOGNITION IN DISORDERED SPEECH USING CWT AND KOHONEN NETWORK
SN - 978-989-8425-32-4
AU - Codello I.
AU - Kuniszyk-Jóźkowiak W.
AU - Smołka E.
AU - Kobus A.
PY - 2010
SP - 392
EP - 398
DO - 10.5220/0003057903920398