parameters can be optimized, the total number of
epochs is set to 300 for all tests, the best network
throughout the training is adopted.
First, we show in Table 3 the volatility of the CR
results given by LSTM through different testing con-
figuration of the hyper-parameters. These results have
been obtained considering the 6 prosodic features. By
empirically optimizing the hyper-parameters set using
few combinations, it is thus possible to increase the
CR to the very high CR of 96.15%.
Second, using this optimization strategy, we show
CR results given by LSTM in Table 4, using increas-
ing input sizes (number of prosodic features). A first
test is to train the network using the logarithmic en-
ergy only, thus the size of the input layer sequences is
equal to 1. We used the adaptive moment estimation
(ADAM) as optimizer with a learning rate of 0.001
and a momentum constant equal to 0.9, to minimize
the binary-cross-entropy loss function. The CR given
by the trained network is 79.49%. As the input data
are still small, we trained the LSTM network by using
two prosodic features E, PI and the six prosodic fea-
tures to increase our data collection volume. To train
this new network, The ADAM optimizer doesn’t give
good training, we therefore used the Stochastic Gradi-
ent Descent with momentum (SGDM) optimizer, with
a learning rate of 0.001 and a momentum equal to 0.9.
The size of the mini-batch is 10. Using E and PI,
the CR is 80.77% while using the 6 features, the CR
reaches 96.15%.
5 CONCLUSIONS
In the present paper, we used two conventional classi-
fiers, namely the support vector machine (SVM) and
the k-nearest neighbors (K-NN) with a voting rule de-
cision strategy for classifying INJ vs NINJ utterances
in RAVIOLI database. By optimally tuning their pa-
rameters, these classifiers all gave the best CR val-
ues when applied with the log energy feature only
compared with six features namely, the energy and
the pitch with their first and second derivatives. The
best classification rate of 82% was given with SVM
method. When applying the Long Short-Term Mem-
ory (LSTM) network on our data, the CR reach the
not better value of 79.49% by using the log energy
feature alone. More surprisingly, the CR significantly
increased to 96.15% by using the 6 prosodic features.
Albeit more test are necessary we conclude that deep
learning methods need as much data as possible for
reaching high performance, even the less informa-
tive ones, especially when the dataset is small. The
counterpart of deep learning methods remains the dif-
ficulty of optimal parameters tuning. Future work
will investigate larger INJ and NINJ dataset extracted
from RAVIOLI database as well as a injunction cate-
gorization within the injunction of the dataset.
ACKNOWLEDGEMENTS
This work is funded by the r
´
egion Centre-Val de
Loire, France, under the contract APR IA Ravioli
#17055LLL (lotfi.abouda@univ-orleans.fr, leader).
This collaborative work implies three laboratories of
University of Orl
´
eans (LLL, PRISME/IRAuS, LIFO)
and one laboratory from University of Tours (LIFAT).
All the persons involved in this project are acknowl-
edged for their participation.
REFERENCES
Bengacemi, H., Hacine-Gharbi, A., Ravier, P., and abed
meraim, K. (2021). Surface emg signal classification
for parkinson’s disease using wcc descriptor and ann
classifier. pages 287–294.
Boersma, P. and Weenink, D. (2013). Praat: doing phonet-
ics by computer [computer program]. version 5.3. 51.
Online: http://www. praat. org.
Cortes, C. and Vapnik, V. (1995). Support-vector networks.
Machine learning, 20(3):273–297.
Fix, E. and Hodges, J. L. (1989). Discriminatory analy-
sis. nonparametric discrimination: Consistency prop-
erties. International Statistical Review/Revue Interna-
tionale de Statistique, 57(3):238–247.
Ghazali, F., Hacine-Gharbi, A., and Ravier, P. (2020).
Statistical features extraction based on the discrete
wavelet transform for electrical appliances identifica-
tion. In Proceedings of the 1st International Confer-
ence on Intelligent Systems and Pattern Recognition,
pages 22–26, Virtual Event Tunisia. ACM.
Ghazali, F., Hacine-Gharbi, A., and Ravier, P. (2021). Se-
lection of statistical wavelet features using wrapper
approach for electrical appliances identification based
on knn classifier combined with voting rules method.
International Journal of Computational Systems En-
gineering, page In Press.
Ghazali, F., Hacine-Gharbi, A., Ravier, P., and Mohamadi,
T. (2019). Extraction and selection of statistical har-
monics features for electrical appliances identifica-
tion using k-nn classifier combined with voting rules
method. Turkish Journal of Electrical Engineering &
Computer Sciences, 27:2980–2997.
Hacine-Gharbi, A., Petit, M., Ravier, P., and N
´
emo, F.
(2015). Prosody based Automatic Classification of the
Uses of French ‘Oui’ as Convinced or Unconvinced
Uses:. In Proceedings of the International Confer-
ence on Pattern Recognition Applications and Meth-
LSTM Network based on Prosodic Features for the Classification of Injunction in French Oral Utterances
735