ANALYTICAL DESCRIPTION OF THE PRODUCTION OF FORMATS IN HUMAN SPEECH

Damyan Damyanov

2013

Abstract

For the purposes. of speech synthesis, biometrics , medical and psychological diagnosis, the functioning of the glottis during phonation has been studied many times. At present, a relatively trivial solution proposed by Fant has established itself, regardless of the actual purpose of the system, using the "pusle source - filter" model. The model of Fant allows the linear prediction method to perform reconstruction of the current form of the vocal tract and the excitation of glottal volume velocity. But the practice shows that the fluctuations of the speech tract due to psycho-physiological effect on the functioning of the facial muscles in most cases are negligible. Thus, they are below the accuracy, which the linear model allows, using approximation with a cascade of coaxial cylindrical sections of equal length and constant cross-section. This requires more complex algorithms, and thus additional information is extracted from the pattern of air volume velocity after glottis. In this study, it is to be shown, that the model of Fant actually allows depiction of the psychophysiological changes in the spectral features of the speech signal without the use of additional models. For this purpose it is sufficient to analyze the relationships of the main parameters of the excitation pulse of the source with the frequency response of the filter. In the current practice, these correlations are not considered and the source and the filter are examined separately.

References

  1. Chiba, T., Kajiyama, M.1995,The vowel, Its Nature and Structure. Tokyo-Kaiseikan, Tokyo 1995
  2. Damyanov, D., Galabov, V., 2012a, Characteristics of the model of Fant of second order on speech production, Proceedings of the Technical University - Sofia, Volume 62, Issue 2, pp. 181-188, ISSN 1311-0829 , Sofia, 2012,
  3. Damyanov, D., Galabov, V., 2012b, On the Impact of duration of the phase of open glottis on the spectral characteristics of the phonation process, Proceedings of the Technical University - Sofia, Volume 62, Issue 2, pp. 173-180, ISSN 1311-0829 , Sofia, 2012,
  4. Damyanov, D., Galabov, V., 2012c, Some effects of the assumption of an all-pole filter, used to describe processes of type "pulse source, 1-st International Conference on Telecommunications and Remote Sensing, August, 29-30, pp. 139-145, ISBN 978-989- 8565-28-0 , Sofia, 2012,
  5. Ekman, P., Friesen W., 1978, The Facial Action Coding System, Consulting Psychologist Press, San Francisco. CA, 1978
  6. Fant, G., 1990, Acoustic Theory of Speech Production, Mouton&Co, Hauge
  7. Flannagan, J., 1992, Speech analysis, Synthesis and Perception, Springer, Berlin, 1992
  8. Hayes, M., 1999, Schaum's Outline of Theory and Problems of Digital Signal Processing, Singapore, McGraw-Hill, 1999
  9. Kalat, James W. 2012, Biological Psychology, Wadswoth. Cengage Learning, 10-th edition, Belmont, 2009
  10. Pfister, B., Kaufmann, T., 2008, Sprachverarbeitung - Grundlagen und Methoden der Sprachsynthese und Spracherkennung, Springer Verlag, Heidelberg, 2008
  11. Pickett, J.M. 1982, The sounds of speech communication , Univercity Park Press, Baltimore, 1982
  12. Proakis, J.,2000, Discrete Time Processing of Speech Signals, New Jersey, JohnWiley&Sons, IEEE Press, 2000
  13. Rabiner, L., Schafer R. 1992, Digital processing of speech signals, Prentice-Hall Inc, Engelwood Cliffs, New Jersey, 1992
  14. Reuter-Lorenz, Patricia A., et.al., 2010, The Cognitive neuroscience of mind: A tribute to Michael S. Gazzaniga, MIT Press, April, London, 2010
  15. Trashlieva, V., Puleva, T., 2011, Model building for optimal administrative process management, International Conference Automatics and Informatics'11, Bulgaria, 3-7.10.2011, pp B-263-B266, ISSN 1313-1850, Sofia, 2011
Download


Paper Citation


in Harvard Style

Damyanov D. (2013). ANALYTICAL DESCRIPTION OF THE PRODUCTION OF FORMATS IN HUMAN SPEECH . In Proceedings of the Second International Conference on Telecommunications and Remote Sensing - Volume 1: ICTRS, ISBN 978-989-8565-57-0, pages 79-84. DOI: 10.5220/0004785600790084


in Bibtex Style

@conference{ictrs13,
author={Damyan Damyanov},
title={ANALYTICAL DESCRIPTION OF THE PRODUCTION OF FORMATS IN HUMAN SPEECH},
booktitle={Proceedings of the Second International Conference on Telecommunications and Remote Sensing - Volume 1: ICTRS,},
year={2013},
pages={79-84},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004785600790084},
isbn={978-989-8565-57-0},
}


in EndNote Style

TY - CONF
JO - Proceedings of the Second International Conference on Telecommunications and Remote Sensing - Volume 1: ICTRS,
TI - ANALYTICAL DESCRIPTION OF THE PRODUCTION OF FORMATS IN HUMAN SPEECH
SN - 978-989-8565-57-0
AU - Damyanov D.
PY - 2013
SP - 79
EP - 84
DO - 10.5220/0004785600790084