Speech Emotion Classification Accuracy.
INTERSPEECH 2018: 257-261.
Naim Terbeh, Mounir Zrigui. 2017. A Robust Algorithm
for PathologicalSpeech Correction. PACLING 2017:
341-351.
Promod Yenigalla, Abhay Kumar, Suraj Tripathi, Chirag
Singh, Sibsambhu Kar, Jithendra Vepa. 2018. Speech
Emotion Recognition Using Spectrogram & Phoneme
Embedding. INTERSPEECH 2018: 3688-3692
Mustaqeem, Muhammad Sajjad, Soonil Kwon. 2020.
Clustering-Based Speech Emotion Recognition by
Incorporating Learned Features and Deep BiLSTM.
IEEE Access 8: 79861-79875
Nithya Roopa S., Prabhakaran M, Betty.P. 2018. Speech
Emotion Recognition using Deep Learning.
International Journal of Recent Technology and
Engineering (IJRTE) ISSN: 2277-3878, Volume-7
Issue-4S, November 2018.
Mohamed Labidi, Mohsen Maraoui, Mounir Zrigui. 2017.
Unsupervised Method for Im-proving Arabic Speech
Recognition Systems. PACLIC 2017: 161-168
Hugo Touvron, Matthieu Cord, Matthijs Douze, Francisco
Massa, Alexandre Sablayrolles, Hervé Jégou.2021.
Training data-efficient image transformers &
distillation through atten-tion. ICML 2021: 10347-
10357
Anwer Slimi, Henri Nicolas and Mounir Zrigui. 2022.
Detection of Emotion Categories’ Change in Speeches.
In Proceedings of the 14th International Conference on
Agents and Artificial Intelligence - Volume 3, ISBN
978-989-758-547-0, ISSN 2184-433X, pages 597-604.
DOI: 10.5220/0010868100003116 .
Alexey Dosovitskiy, Lucas Beyer, Alexander Kolesnikov,
Dirk Weissenborn, Xiaohua Zhai, Thomas Unterthiner,
Mostafa Dehghani, Matthias Minderer, Georg Heigold,
Sylvain Gelly, Jakob Uszkoreit, Neil Houlsby. 2021.
An Image is Worth 16x16 Words: Transformers for
Image Recognition at Scale. ICLR 2021.
Jindong Gu, Volker Tresp, Yao Qin. 2021. Are Vision
Transformers Robust to Patch Per-turbations? CoRR
abs/2111.10659 (2021)
Pavel Karpov, Guillaume Godin, Igor V. Tetko:. 2020.
Transformer-CNN: Swiss knife for QSAR modeling
and interpretation. J. Cheminformatics 12(1): 17 (2020)
Jiang Zhang, Chen Li, Ganwanming Liu, Min Min, Chong
Wang, Jiyi Li, Yuting Wang, Hongmei Yan, Zhentao
Zuo, Wei Huang, Huafu Chen. 2022. A
CNNtransformer hybrid approach for decoding visual
neural activity into text, Computer Methods and
Programs in Biomedicine, Volume 214, 2022, 106586,
ISSN 0169- 2607, https://doi.org/10.1016/j.cmpb.20
21.106586.
Yun Liu, Guolei Sun, Yu Qiu, Le Zhang, Ajad Chhatkuli,
Luc Van Gool. 2021. Transform-er in Convolutional
Neural Networks. CoRR abs/2106.03180 (2021).
Meddeb Ons., Maraoui M ohsen and Zrigui, Mounir. 2021.
Personalized Smart Learning Recommendation System
for Arabic Users in Smart Campus. International
Journal of Web-Based Learning and Teaching
Technologies (IJWLTT), 16(6), 1-21.
http://doi.org/10.4018/IJWLTT.20211101.oa9
Mustaqeem, Soonil Kwon. 2020a. A CNN-Assisted
Enhanced Audio Signal Processing for Speech Emotion
Recognition. Sensors 20(1): 183
Mustaqeem, Soonil Kwon. 2020b. CLSTM: Deep Feature-
Based Speech Emotion Recognition Using the
Hierarchical ConvLSTM Network. Mathematics 2020,
8, 2133. https://doi.org/10.3390/math8122133
Noushin Hajarolasvadi and Hasan Demirel. 2019. 3D
CNN-Based Speech Emotion Recognition Using K-
Means Clustering and Spectrograms. Entropy 2019, 21,
497.
Bellagha Med Lazhar and Zrigui Mounir. 2020. Speaker
Naming in TV programs Based on Speaker Role
Recognition. 2020 IEEE/ACS 17th International
Conference on Computer Systems and Applications
(AICCSA), 1-8.
Seo Minji, Kim Myungho. 2020. Fusing Visual Attention
CNN and Bag of Visual Words for Cross-Corpus
Speech Emotion Recognition. Sensors 20, no. 19: 5559.
https://doi.org/10.3390/s20195559.
Leonardo Pepino, Pablo Riera, Luciana Ferrer. 2021.
Emotion Recognition from Speech Using Wav2vec 2.0
Embeddings. CoRR abs/2104.03502
Aneesh Muppidi, Martin Radfar. 2021. Speech Emotion
Recognition Using Quaternion Convolutional Neural
Networks. ICASSP 2021: 6309-6313
Mr. N. Ratna Kanth and Dr. S. Saraswathi: A Survey on
Speech Emotion Recognition. Advances in Computer
Science and Information Technology (ACSIT) Print
ISSN: 2393-9907; Online ISSN: 2393-9915; Volume 1,
Number 3; November, 2014 pp. 135-139.
Daniel S. Park, William Chan, Yu Zhang, Chung-Cheng
Chiu, Barret Zoph, Ekin D. Cubuk, Quoc V. Le:
SpecAugment: A Simple Data Augmentation Method
for Automatic Speech Recognition. CoRR
abs/1904.08779 (2019)
S.S. Stevens and J. Volkman: A scale for the Measurement
of the Psychological Magnitude Pitch. J.A.S.A January
1937, Volume 8.
Jianfeng Zhao, Xia Mao, Lijiang Chena: Speech emotion
recognition using deep 1D & 2D CNN LSTM networks.
Biomedical Signal Processing and Control 47 (2019)
312–323.
Livingstone SR, Russo FA. 2018. The Ryerson Audio-
Visual Database of Emotional Speech and Song
(RAVDESS): A dynamic, multimodal set of facial and
vocal expressions in North American English. PLoS
ONE 13(5): e0196391.
Adnen Mahmoud and Mounir Zrigui. 2021. Hybrid
Attention-based Approach for Arabic Paraphrase
Detection, Applied Artificial Intelligence, 35:15, 1271-
1286, DOI: 10.1080/08839514.2021.1975880