Paper

Authors: Gražina Korvel 1 ; Olga Kurasova 1 and Bożena Kostek 2

Affiliations: 1 Institute of Data Science and Digital Technologies, Vilnius University, Akademijos str. 4, LT-04812, Vilnius, Lithuania ; 2 Audio Acoustics Laboratory, Faculty of Electronics, Telecommunications and Informatics, Gdańsk University of Technology, G. Narutowicza 11/12, 80-233 Gdańsk, Poland

ISBN: 978-989-758-378-0

Keyword(s): Speech Analysis and Synthesis, Lombard Effect, SISO (Single-Input and Single-Output) System, Sinusoidal Model.

Related Ontology Subjects/Areas/Topics: Multidimensional Signal Processing ; Multimedia ; Multimedia Signal Processing ; Multimodal Signal Processing ; Perceptual/Human Audiovisual System Modeling ; Telecommunications

Abstract: Speech with the Lombard effect has been extensively studied in the context of speech recognition or speech enhancement. However, few studies have investigated the Lombard effect in the context of speech synthesis. The aim of this paper is to create a mathematical model that allows for retaining the Lombard effect. These models could be used as a basis for a formant speech synthesizer. The proposed models are based on dividing the speech signal into harmonics and modeling them as the output of a SISO system whose transfer function poles are multiple and whose inputs vary in time. An analysis of the Lombard effect of the synthesized signal is performed on the noise residual. The synthesized signal residual is described by vectors of acoustic parameters related to the Lombard effect. For testing the performance of the created models in various noise conditions, two classifiers are employed, namely kNN and Naive Bayes. For comparison of results, we created models of sinusoids based on frequency tracks. The results show that a model based on the residual sinewave sum demonstrates the possibility of retaining the Lombard effect. Finally, future work directions are outlined in the conclusions.
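The abstract refers to splitting a signal into harmonics and analysing what is left over as a residual. The sketch below illustrates only that general idea with a plain sinusoidal model. It is not the authors' SISO model with multiple poles, and every name and value in it (`fit_harmonics`, `synthesize`, `rms`, the 8000 Hz sampling rate, the 100 Hz fundamental, the 450 Hz non-harmonic component) is an assumption made for illustration. It assumes the frame holds a whole number of fundamental periods, so that each harmonic can be estimated by projecting the frame onto a cosine and a sine.

```python
import math

def fit_harmonics(frame, f0, fs, n_harmonics):
    """Estimate (frequency, amplitude, phase) of each harmonic k*f0.

    Assumes the frame spans a whole number of periods of f0, so the
    cosine/sine projections of different harmonics do not interfere.
    """
    n = len(frame)
    params = []
    for k in range(1, n_harmonics + 1):
        w = 2.0 * math.pi * k * f0 / fs          # radians per sample
        a = 2.0 / n * sum(x * math.cos(w * i) for i, x in enumerate(frame))
        b = 2.0 / n * sum(x * math.sin(w * i) for i, x in enumerate(frame))
        # x[i] = A*cos(w*i + phi) gives a = A*cos(phi), b = -A*sin(phi)
        params.append((k * f0, math.hypot(a, b), math.atan2(-b, a)))
    return params

def synthesize(params, fs, n):
    """Rebuild the harmonic part as a sum of sinusoids."""
    return [
        sum(amp * math.cos(2.0 * math.pi * f * i / fs + phi)
            for f, amp, phi in params)
        for i in range(n)
    ]

def rms(values):
    """Root-mean-square level of a sequence."""
    return math.sqrt(sum(v * v for v in values) / len(values))

# Hypothetical test frame: two harmonics of 100 Hz plus a weak
# non-harmonic component at 450 Hz standing in for the residual.
fs, f0, n = 8000, 100.0, 800                     # 800 samples = 10 periods
frame = [
    1.0 * math.cos(2.0 * math.pi * 100.0 * i / fs + 0.3)
    + 0.5 * math.cos(2.0 * math.pi * 200.0 * i / fs - 1.0)
    + 0.05 * math.cos(2.0 * math.pi * 450.0 * i / fs)
    for i in range(n)
]

params = fit_harmonics(frame, f0, fs, n_harmonics=3)
model = synthesize(params, fs, n)
res = [x - m for x, m in zip(frame, model)]      # residual after the harmonics

for f, amp, phi in params:
    print(f"{f:6.1f} Hz  amplitude {amp:.3f}")
print(f"residual RMS {rms(res):.4f}")
```

In this toy setup the residual contains only the 450 Hz component, because it is the one part of the frame that the three fitted harmonics cannot represent. Acoustic parameters for a classifier such as kNN would then be computed from `res` rather than from the full frame.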

License: CC BY-NC-ND 4.0

Paper citation in several formats:
Korvel, G.; Kurasova, O. and Kostek, B. (2019). An Attempt to Create Speech Synthesis Model That Retains Lombard Effect Characteristics. In Proceedings of the 16th International Joint Conference on e-Business and Telecommunications - Volume 1: SIGMAP, ISBN 978-989-758-378-0, pages 280-289. DOI: 10.5220/0007854302800289

@conference{sigmap19,
author={Gražina Korvel and Olga Kurasova and Bożena Kostek},
title={An Attempt to Create Speech Synthesis Model That Retains Lombard Effect Characteristics},
booktitle={Proceedings of the 16th International Joint Conference on e-Business and Telecommunications - Volume 1: SIGMAP},
year={2019},
pages={280-289},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0007854302800289},
isbn={978-989-758-378-0},
}

TY - CONF
JO - Proceedings of the 16th International Joint Conference on e-Business and Telecommunications - Volume 1: SIGMAP
TI - An Attempt to Create Speech Synthesis Model That Retains Lombard Effect Characteristics
SN - 978-989-758-378-0
AU - Korvel, G.
AU - Kurasova, O.
AU - Kostek, B.
PY - 2019
SP - 280
EP - 289
DO - 10.5220/0007854302800289
