Didactic Speech Synthesizer: Acoustic Module - Formants Model

João Paulo Teixeira, Anildo P. Fernandes

Abstract

Text-to-speech synthesis is the main subject treated in this work. It will be presented the constitution of a generic text-to-speech system conversion, explained the functions of the various modules and described the development techniques using the formants model. The development of a didactic formant synthesiser under Matlab environment will also be described. This didactic synthesiser is intended for a didactic understanding of the formant model of speech production.

References

  1. Barbosa P., Bailly G. (1994). Characterisation of rhythmic patterns for text-to-speech synthesis, in Speech Communication, 15: 127-137.
  2. Barros, M. J., (2002). "Estudo Comparativo e Técnicas de Geração de sinal para Síntese de Fala ". Master dissertation, Faculdade de Engenharia da Universidade do Porto.
  3. Fujisaki, H.. (1983). Dynamic characteristics of voice fundamental frequency in speech and singing. In MacNeilage. In P. F., Editor. The Production of Speech, pages 39-55. Springer-Verlag.
  4. Hirst, D. and Di Cristo, A.. (1998). Intonation Systems - A Survey of Twenty Languages. Cambridge University Press.
  5. Klatt, DH (1987). Review of text-to-speech conversion for English - Journal of the Acoustical Society of America, 82 (3) - 1987. Pages 737-793.
  6. Pierrehumbert, J. B. (1980). The Phonology and Phonetics of English Intonation. PhD thesis, Massachusetts Institute of Technology.
  7. Saraswathi, S., (2010). Design of Multilingual Speech Synthesis System. Academic journal article from Intelligent Information Management, Vol. 2, No. 1.
  8. Sproat, Richard W. (1997). Multilingual Text-to-Speech Synthesis: The Bell Labs Approach. Springer.
  9. Taylor, P. (2000). Analysis and Synthesis of Intonation using the Tilt Model. Journal of the Acoustical Society of America. vol 1073, pp. 1697-1714.
  10. Teixeira, J. P. (2012). Prosody Generation Model for TTS Systems - Segmental Durations and F0 Contours with Fujisaki Model. LAP LAMBERT Academic Publishing ISBN-13: 978-3-659-16277-0.
  11. Teixeira, J. P., (1995). "Modelização Paramétrica de Sinais para Aplicação em Sistemas de Conversão Texto-Fala." Master Dissertation, FEUP - Porto.
  12. Teixeira, J. P.,Barros, M. J. and Freitas, D., (2003). "Sistemas de Conversão Texto-Fala." Procedings of CLME, Maputo.
Download


Paper Citation


in Harvard Style

Paulo Teixeira J. and P. Fernandes A. (2013). Didactic Speech Synthesizer: Acoustic Module - Formants Model . In Proceedings of the International Conference on Bio-inspired Systems and Signal Processing - Volume 1: BIOSIGNALS, (BIOSTEC 2013) ISBN 978-989-8565-36-5, pages 356-359. DOI: 10.5220/0004249603560359


in Bibtex Style

@conference{biosignals13,
author={João Paulo Teixeira and Anildo P. Fernandes},
title={Didactic Speech Synthesizer: Acoustic Module - Formants Model},
booktitle={Proceedings of the International Conference on Bio-inspired Systems and Signal Processing - Volume 1: BIOSIGNALS, (BIOSTEC 2013)},
year={2013},
pages={356-359},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004249603560359},
isbn={978-989-8565-36-5},
}


in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Bio-inspired Systems and Signal Processing - Volume 1: BIOSIGNALS, (BIOSTEC 2013)
TI - Didactic Speech Synthesizer: Acoustic Module - Formants Model
SN - 978-989-8565-36-5
AU - Paulo Teixeira J.
AU - P. Fernandes A.
PY - 2013
SP - 356
EP - 359
DO - 10.5220/0004249603560359