PROLONGATION RECOGNITION IN DISORDERED SPEECH USING CWT AND KOHONEN NETWORK
Ireneusz Codello, Wiesława Kuniszyk-Jóźkowiak, Elżbieta Smołka, Adam Kobus
2010
Abstract
Automatic recognition of speech disorders can help therapists monitor the progress of patients with disordered speech. In this article we focus on prolongations. We analyze the signal using the Continuous Wavelet Transform (CWT) with 22 Bark scales, divide the result into vectors by windowing, and pass these vectors to a Kohonen network. By modifying both the network learning process and the CWT computation algorithm, we increased the recognition ratio from 54% to 81%. All analyses were performed, and all results obtained, with the authors' program "WaveBlaster". Importantly, a recognition ratio above 80% was achieved by a fully automatic algorithm (without a teacher). The presented work is part of our research aimed at creating an automatic prolongation recognition system.
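The pipeline outlined in the abstract (CWT on 22 Bark scales, windowing into vectors, Kohonen network) can be illustrated with a minimal NumPy sketch. It is not the WaveBlaster implementation and does not include the authors' modifications to the learning process or the CWT algorithm, which the abstract does not specify. Several choices are assumptions made only for illustration: the complex Morlet wavelet, Traunmüller's Bark approximation for the centre frequencies, per-frame averaging as the windowing step, a one-dimensional map, and all names and parameters (`bark_to_hz`, `morlet_cwt`, `window_vectors`, `train_som`, `winning_units`, `w0`, `frame_len`, `n_units`).

```python
import numpy as np

def bark_to_hz(z):
    """Invert Traunmueller's approximation z = 26.81 f / (1960 + f) - 0.53.
    (Assumed mapping; the abstract only says '22 bark scales'.)"""
    z = np.asarray(z, dtype=float)
    return 1960.0 * (z + 0.53) / (26.28 - z)

def morlet_cwt(signal, fs, freqs, w0=6.0):
    """Magnitude scalogram: one row per centre frequency (all below fs / 2).
    The complex Morlet wavelet is an assumption, not taken from the abstract."""
    out = np.empty((len(freqs), len(signal)))
    for i, f in enumerate(freqs):
        scale = w0 / (2.0 * np.pi * f)           # seconds; wavelet oscillates at f Hz
        half = int(np.ceil(4.0 * scale * fs))    # support of +/- 4 standard deviations
        t = np.arange(-half, half + 1) / fs
        wavelet = np.exp(1j * w0 * t / scale) * np.exp(-0.5 * (t / scale) ** 2)
        wavelet /= np.sqrt(scale * fs)           # energy normalisation across scales
        # Correlation with the wavelet = convolution with its reversed conjugate.
        kernel = np.conj(wavelet)[::-1]
        out[i] = np.abs(np.convolve(signal, kernel, mode="same"))
    return out

def window_vectors(scalogram, frame_len):
    """Cut the scalogram into non-overlapping frames of frame_len samples and
    average each band within a frame, giving one vector per frame."""
    n_bands, n_samples = scalogram.shape
    n_frames = n_samples // frame_len
    trimmed = scalogram[:, :n_frames * frame_len]
    return trimmed.reshape(n_bands, n_frames, frame_len).mean(axis=2).T

def train_som(vectors, n_units=5, epochs=30, seed=0):
    """Plain unsupervised training of a 1-D Kohonen map (no teacher labels)."""
    rng = np.random.default_rng(seed)
    weights = vectors[rng.integers(0, len(vectors), n_units)].copy()
    positions = np.arange(n_units)
    for epoch in range(epochs):
        frac = epoch / epochs
        lr = 0.5 * (1.0 - frac)                              # decaying learning rate
        radius = max(n_units / 2.0 * (1.0 - frac), 0.5)      # shrinking neighbourhood
        for v in vectors[rng.permutation(len(vectors))]:
            winner = np.argmin(np.linalg.norm(weights - v, axis=1))
            influence = np.exp(-((positions - winner) ** 2) / (2.0 * radius ** 2))
            weights += lr * influence[:, None] * (v - weights)
    return weights

def winning_units(vectors, weights):
    """Index of the best-matching unit for every frame vector."""
    dists = np.linalg.norm(vectors[:, None, :] - weights[None, :, :], axis=2)
    return dists.argmin(axis=1)
```

A possible usage, under the same assumptions: compute `freqs = bark_to_hz(np.arange(1, 23))`, pass a mono signal sampled at 22050 Hz through `morlet_cwt`, cut the result with `window_vectors(scalogram, frame_len=512)`, train the map with `train_som`, and inspect the sequence returned by `winning_units`. How that sequence is turned into a prolongation decision is not stated in the abstract and is left open here.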
Paper Citation
in Harvard Style
Codello I., Kuniszyk-Jóźkowiak W., Smołka E. and Kobus A. (2010). PROLONGATION RECOGNITION IN DISORDERED SPEECH USING CWT AND KOHONEN NETWORK. In Proceedings of the International Conference on Fuzzy Computation and 2nd International Conference on Neural Computation - Volume 1: ICNC, (IJCCI 2010) ISBN 978-989-8425-32-4, pages 392-398. DOI: 10.5220/0003057903920398
in Bibtex Style
@conference{icnc10,
author={Ireneusz Codello and Wiesława Kuniszyk-Jóźkowiak and Elżbieta Smołka and Adam Kobus},
title={PROLONGATION RECOGNITION IN DISORDERED SPEECH USING CWT AND KOHONEN NETWORK},
booktitle={Proceedings of the International Conference on Fuzzy Computation and 2nd International Conference on Neural Computation - Volume 1: ICNC, (IJCCI 2010)},
year={2010},
pages={392-398},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003057903920398},
isbn={978-989-8425-32-4},
}
in EndNote Style
TY - CONF
JO - Proceedings of the International Conference on Fuzzy Computation and 2nd International Conference on Neural Computation - Volume 1: ICNC, (IJCCI 2010)
TI - PROLONGATION RECOGNITION IN DISORDERED SPEECH USING CWT AND KOHONEN NETWORK
SN - 978-989-8425-32-4
AU - Codello I.
AU - Kuniszyk-Jóźkowiak W.
AU - Smołka E.
AU - Kobus A.
PY - 2010
SP - 392
EP - 398
DO - 10.5220/0003057903920398