5 Conclusions and Future Work
At the moment, a medium size database for Romanian language has been created. On
short term, the goal is to enlarge the database with common words and with specific
words to be used in remote monitoring.
The big number of triphones that has been obtained is specific to spontaneous
speech. There are cases when some triphones combinations have a bigger number of
occurrences compared to others. For instance, triphones “d+e”, “d-e” and “i_+n” have
more than 1500 occurrences. Triphones with lower number of occurrences will not be
ignored. 2903 triphones have less than 10 occurrences and they represent 57% from
the total amount of triphones. When enlarging the size of the database it is expected
that the triphones with less occurrences will have an increased number of entries. Due
to the nature of spontaneous speech, it is expected to obtain a completely different
view over the triphones occurrences, on database size enlarging. Not ignoring
triphones that have a small number of occurrences is assumed to be a characteristic of
spontaneous speech and as such they have to be taken into consideration.
The corpus and the spontaneous speech recognition results will have applicability
in medical monitoring from remote locations and will help the persons with mobility,
communication and writing difficulties. As the already existent recognition tools
which make use of continuous speech are frequently used, the spontaneous speech
recognition tool intended to be implemented will be highly appreciated. It will ease
the work of the user, taking off some of the existent constraints, like forcing to speak
grammatically correct, stressing a dyslectic who is not able to pick up correctly his
words, etc.
References
1. Rialle, V., Lamy, J.B., Noury, N., Bajolle, L.: Remote monitoring of patients at home: A
Software Agent approach. Computer Methods and Programs in Biomedicine (2003)
2. Vacher, M., Serignat, J.F., Chaillol, S., Istrate, D., Popescu, V.: Speech and Sound Use in a
Remote Monitoring System for Health Care
3. Russell, S., Norvig, P.: Artificial Intelligence – A Modern Approach, Prentice Hall, Second
Edition (2003)
4. Martin, J., Jurafsky, D.: Spoken Language Processing, Prentice Hall, Third Edition (2007)
5. Huang, X., Acero, A., Wuen-Hon, H.: Spoken Language Processing – A Guide to Theory,
Algorithm, and System Development, Prentice Hall (2001)
6. Burileanu, D.: Basic research and implementation Decisions for a text-to-speech synthesis
system in Romanian International Journal of Speech Technology (2002)
7. Young, S., Evermann, G., Gales, M., Hain, T., Kershaw, D., Liu, X. (Andrew), Moore, G.,
Odell, J., Ollason, D., Povey, D., Valtchev, V., Woodland, P.: The HTK Book (2005)
8. Munteanu, D.: Contribuţii la realizarea sistemelor de recunoaştere a vorbirii continue pentru
limba română, Teză de doctorat, Academia Tehnică Militară din Bucureşti (2006)
9. Schultz, T., Kirchhoff, K.: Multilingual Speech Processing, Academic Press (2006)
10. http://www.dyslexic.com/
11. http://www.dragontalk.com/
86