An Attempt to Create Speech Synthesis Model That Retains Lombard Effect Characteristics

Gražina Korvel, Olga Kurasova, Bożena Kostek

Abstract

Speech produced under the Lombard effect has been extensively studied in the context of speech recognition or speech enhancement. However, few studies have investigated the Lombard effect in the context of speech synthesis. The aim of this paper is to create mathematical models that retain the Lombard effect. These models could be used as the basis of a formant speech synthesizer. The proposed models are based on dividing the speech signal into harmonics and modeling them as the output of a SISO system whose transfer function poles are multiple and whose inputs vary in time. The Lombard effect in the synthesized signal is analyzed on the noise residual. The synthesized signal residual is described by vectors of acoustic parameters related to the Lombard effect. To test the performance of the created models in various noise conditions, two classifiers are employed, namely kNN and Naive Bayes. For comparison, we created models of sinusoids based on frequency tracks. The results show that a model based on the residual sinewave sum demonstrates the possibility of retaining the Lombard effect. Finally, future work directions are outlined in the conclusions.


Paper Citation


in Harvard Style

Korvel G., Kurasova O. and Kostek B. (2019). An Attempt to Create Speech Synthesis Model That Retains Lombard Effect Characteristics. In Proceedings of the 16th International Joint Conference on e-Business and Telecommunications - Volume 1: SIGMAP, ISBN 978-989-758-378-0, pages 280-289. DOI: 10.5220/0007854302800289


in Bibtex Style

@conference{sigmap19,
author={Gražina Korvel and Olga Kurasova and Bożena Kostek},
title={An Attempt to Create Speech Synthesis Model That Retains Lombard Effect Characteristics},
booktitle={Proceedings of the 16th International Joint Conference on e-Business and Telecommunications - Volume 1: SIGMAP},
year={2019},
pages={280-289},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0007854302800289},
isbn={978-989-758-378-0},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 16th International Joint Conference on e-Business and Telecommunications - Volume 1: SIGMAP
TI - An Attempt to Create Speech Synthesis Model That Retains Lombard Effect Characteristics
SN - 978-989-758-378-0
AU - Korvel G.
AU - Kurasova O.
AU - Kostek B.
PY - 2019
SP - 280
EP - 289
DO - 10.5220/0007854302800289