Didactic Speech Synthesizer: Acoustic Module - Formants Model
João Paulo Teixeira, Anildo P. Fernandes
2013
Abstract
Text-to-speech synthesis is the main subject treated in this work. It will be presented the constitution of a generic text-to-speech system conversion, explained the functions of the various modules and described the development techniques using the formants model. The development of a didactic formant synthesiser under Matlab environment will also be described. This didactic synthesiser is intended for a didactic understanding of the formant model of speech production.
References
- Barbosa P., Bailly G. (1994). Characterisation of rhythmic patterns for text-to-speech synthesis, in Speech Communication, 15: 127-137.
- Barros, M. J., (2002). "Estudo Comparativo e Técnicas de Geração de sinal para Síntese de Fala ". Master dissertation, Faculdade de Engenharia da Universidade do Porto.
- Fujisaki, H.. (1983). Dynamic characteristics of voice fundamental frequency in speech and singing. In MacNeilage. In P. F., Editor. The Production of Speech, pages 39-55. Springer-Verlag.
- Hirst, D. and Di Cristo, A.. (1998). Intonation Systems - A Survey of Twenty Languages. Cambridge University Press.
- Klatt, DH (1987). Review of text-to-speech conversion for English - Journal of the Acoustical Society of America, 82 (3) - 1987. Pages 737-793.
- Pierrehumbert, J. B. (1980). The Phonology and Phonetics of English Intonation. PhD thesis, Massachusetts Institute of Technology.
- Saraswathi, S., (2010). Design of Multilingual Speech Synthesis System. Academic journal article from Intelligent Information Management, Vol. 2, No. 1.
- Sproat, Richard W. (1997). Multilingual Text-to-Speech Synthesis: The Bell Labs Approach. Springer.
- Taylor, P. (2000). Analysis and Synthesis of Intonation using the Tilt Model. Journal of the Acoustical Society of America. vol 1073, pp. 1697-1714.
- Teixeira, J. P. (2012). Prosody Generation Model for TTS Systems - Segmental Durations and F0 Contours with Fujisaki Model. LAP LAMBERT Academic Publishing ISBN-13: 978-3-659-16277-0.
- Teixeira, J. P., (1995). "Modelização Paramétrica de Sinais para Aplicação em Sistemas de Conversão Texto-Fala." Master Dissertation, FEUP - Porto.
- Teixeira, J. P.,Barros, M. J. and Freitas, D., (2003). "Sistemas de Conversão Texto-Fala." Procedings of CLME, Maputo.
Paper Citation
in Harvard Style
Paulo Teixeira J. and P. Fernandes A. (2013). Didactic Speech Synthesizer: Acoustic Module - Formants Model . In Proceedings of the International Conference on Bio-inspired Systems and Signal Processing - Volume 1: BIOSIGNALS, (BIOSTEC 2013) ISBN 978-989-8565-36-5, pages 356-359. DOI: 10.5220/0004249603560359
in Bibtex Style
@conference{biosignals13,
author={João Paulo Teixeira and Anildo P. Fernandes},
title={Didactic Speech Synthesizer: Acoustic Module - Formants Model},
booktitle={Proceedings of the International Conference on Bio-inspired Systems and Signal Processing - Volume 1: BIOSIGNALS, (BIOSTEC 2013)},
year={2013},
pages={356-359},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004249603560359},
isbn={978-989-8565-36-5},
}
in EndNote Style
TY - CONF
JO - Proceedings of the International Conference on Bio-inspired Systems and Signal Processing - Volume 1: BIOSIGNALS, (BIOSTEC 2013)
TI - Didactic Speech Synthesizer: Acoustic Module - Formants Model
SN - 978-989-8565-36-5
AU - Paulo Teixeira J.
AU - P. Fernandes A.
PY - 2013
SP - 356
EP - 359
DO - 10.5220/0004249603560359