Authors: Asma Bougrine (1); Philippe Ravier (1); Abdenour Hacine-Gharbi (2) and Hanane Ouachour (1)

Affiliations: (1) PRISME Laboratory, University of Orleans, 12 Rue de Blois, 45067 Orleans, France; (2) LMSE Laboratory, University of Bordj Bou Arréridj, Elanasser, 34030 Bordj Bou Arréridj, Algeria

Keyword(s): Speech Injunction Classification, Massive Wild Oral Corpus, Prosodic Features, Static and Dynamic Features, SVM, K-NN, Long Short Term Memory (LSTM).

Abstract: The classification of injunction in French oral speech is a difficult task, since no standard linguistic structure for it is known in the French language. Prosodic features of the speech, especially the logarithmic energy, could therefore serve as indicators for this task. Our first aim is to validate the predominance of the log energy prosodic feature by using conventional classifiers such as SVM or K-NN. Second, we intend to improve the classification rates by using a deep LSTM recurrent network. When applied to the RAVIOLI database, the log energy feature did indeed yield the best classification rates (CR) for all classifiers, with CR = 82% for SVM and CR = 71.42% for K-NN. When the LSTM network was applied to our data using the log energy feature alone, the CR did not improve, reaching 79.49%. More surprisingly, the CR increased significantly to 96.15% when all 6 prosodic features were used. We conclude that deep learning methods need as much input data as possible to reach high performance, including the less informative features, especially when the dataset is small. The drawback of deep learning methods remains the difficulty of tuning their parameters optimally.
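The abstract singles out the logarithmic energy as the most informative prosodic feature but does not say how it is computed. As a rough illustration only, a frame-wise log energy can be sketched as below. The function name `frame_log_energy`, the frame length of 400 samples, the hop of 160 samples (25 ms and 10 ms at 16 kHz), and the small constant `eps` are all assumptions chosen for the example, not settings taken from the paper.

```python
import math

def frame_log_energy(signal, frame_len=400, hop=160, eps=1e-10):
    """Return the natural-log energy of each frame of a 1-D signal.

    frame_len, hop and eps are illustrative assumptions, not the
    paper's settings. Trailing samples that do not fill a whole
    frame are ignored.
    """
    energies = []
    for start in range(0, len(signal) - frame_len + 1, hop):
        frame = signal[start:start + frame_len]
        energy = sum(x * x for x in frame)      # sum of squared samples
        energies.append(math.log(energy + eps))  # eps avoids log(0) on silence
    return energies

# Synthetic signal: 1 s of a 200 Hz tone sampled at 16 kHz,
# quiet in the first half and five times louder in the second half.
sr = 16000
tone = [
    (0.1 if n < sr // 2 else 0.5) * math.sin(2 * math.pi * 200 * n / sr)
    for n in range(sr)
]
log_e = frame_log_energy(tone)
```

In this sketch the louder second half produces higher log energy values than the quiet first half. A sequence such as `log_e` is the kind of per-frame trajectory that could be summarised for an SVM or K-NN classifier, or passed frame by frame to a recurrent network.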

CC BY-NC-ND 4.0

Paper citation in several formats:
Bougrine, A.; Ravier, P.; Hacine-Gharbi, A. and Ouachour, H. (2022). LSTM Network based on Prosodic Features for the Classification of Injunction in French Oral Utterances. In Proceedings of the 11th International Conference on Pattern Recognition Applications and Methods - ICPRAM; ISBN 978-989-758-549-4; ISSN 2184-4313, SciTePress, pages 730-736. DOI: 10.5220/0010910500003122

@conference{icpram22,
author={Asma Bougrine and Philippe Ravier and Abdenour Hacine{-}Gharbi and Hanane Ouachour},
title={LSTM Network based on Prosodic Features for the Classification of Injunction in French Oral Utterances},
booktitle={Proceedings of the 11th International Conference on Pattern Recognition Applications and Methods - ICPRAM},
year={2022},
pages={730--736},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0010910500003122},
isbn={978-989-758-549-4},
issn={2184-4313},
}

TY - CONF

JO - Proceedings of the 11th International Conference on Pattern Recognition Applications and Methods - ICPRAM
TI - LSTM Network based on Prosodic Features for the Classification of Injunction in French Oral Utterances
SN - 978-989-758-549-4
IS - 2184-4313
AU - Bougrine, A.
AU - Ravier, P.
AU - Hacine-Gharbi, A.
AU - Ouachour, H.
PY - 2022
SP - 730
EP - 736
DO - 10.5220/0010910500003122
PB - SciTePress