ALLOPHONE GROUP SELECTION FACTORS FOR POLISH SPEECH SYNTHESIS

Bożena Kozłowska; Janusz Rafałko; Mariusz Rybnik

doi:10.5220/0003889804510456

2012

Abstract

This article concerns the selection of allophone groups for Polish speech synthesis. It describes the factors to take into consideration when dividing allophones into groups and presents the classification suggested by the authors. Although the described factors concern the Polish language, they may facilitate similar divisions for other languages. Each language has its own pronunciation characteristics, so allophone groups must be chosen to suit that language. Nevertheless, a precise description of which elements deserve special attention, and of where pronunciation problems may later appear, should make the work easier for anyone preparing a similar division for a different language.

Paper Citation

in Harvard Style

Kozłowska B., Rafałko J. and Rybnik M. (2012). ALLOPHONE GROUP SELECTION FACTORS FOR POLISH SPEECH SYNTHESIS. In Proceedings of the 4th International Conference on Agents and Artificial Intelligence - Volume 1: ICAART, ISBN 978-989-8425-95-9, pages 451-456. DOI: 10.5220/0003889804510456

in Bibtex Style

@conference{icaart12,
author={Bożena Kozłowska and Janusz Rafałko and Mariusz Rybnik},
title={ALLOPHONE GROUP SELECTION FACTORS FOR POLISH SPEECH SYNTHESIS},
booktitle={Proceedings of the 4th International Conference on Agents and Artificial Intelligence - Volume 1: ICAART},
year={2012},
pages={451-456},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003889804510456},
isbn={978-989-8425-95-9},
}

in EndNote Style

TY - CONF
JO - Proceedings of the 4th International Conference on Agents and Artificial Intelligence - Volume 1: ICAART
TI - ALLOPHONE GROUP SELECTION FACTORS FOR POLISH SPEECH SYNTHESIS
SN - 978-989-8425-95-9
AU - Kozłowska B.
AU - Rafałko J.
AU - Rybnik M.
PY - 2012
SP - 451
EP - 456
DO - 10.5220/0003889804510456