Emotion recognition in human-computer interaction.
IEEE Signal processing magazine, 18(1):32–80.
Ekman, P., Friesen, W. V., and Scherer, K. R. (1976). Body
movement and voice pitch in deceptive interaction.
Semiotica, 16(1):23–28.
El Ayadi, M., Kamel, M. S., and Karray, F. (2011). Sur-
vey on speech emotion recognition: Features, classi-
fication schemes, and databases. Pattern Recognition,
44(3):572–587.
Fairbanks, G. and Hoaglin, L. W. (1941). An experimen-
tal study of the durational characteristics of the voice
during the expression of emotion. Communications
Monographs, 8(1):85–90.
France, D. J., Shiavi, R. G., Silverman, S., Silverman,
M., and Wilkes, M. (2000). Acoustical properties
of speech as indicators of depression and suicidal
risk. IEEE transactions on Biomedical Engineering,
47(7):829–837.
Graciarena, M., Shriberg, E., Stolcke, A., Enos, F.,
Hirschberg, J., and Kajarekar, S. (2006). Combin-
ing prosodic lexical and cepstral systems for deceptive
speech detection. In 2006 IEEE International Confer-
ence on Acoustics Speech and Signal Processing Pro-
ceedings, volume 1, pages I–I. IEEE.
Haque, S., Togneri, R., and Zaknich, A. (2005). A zero-
crossing perceptual model for robust speech recogni-
tion. In Inter-University Postgraduate Electrical En-
gineering Symposium, Curtin University.
Jackson, P. and Haq, S. (2014). Surrey audio-visual ex-
pressed emotion (savee) database. University of Sur-
rey: Guildford, UK.
Kirchh
¨
ubel, C. and Howard, D. M. (2013). Detecting sus-
picious behaviour using speech: Acoustic correlates
of deceptive speech–an exploratory investigation. Ap-
plied ergonomics, 44(5):694–702.
Koolagudi, S. G. and Rao, K. S. (2012). Emotion recogni-
tion from speech: a review. International journal of
speech technology, 15(2):99–117.
Lee, C. M., Narayanan, S. S., et al. (2005). Toward detect-
ing emotions in spoken dialogs. IEEE transactions on
speech and audio processing, 13(2):293–303.
Livingstone, S. R. and Russo, F. A. (2018). The ryerson
audio-visual database of emotional speech and song
(ravdess): A dynamic, multimodal set of facial and
vocal expressions in north american english. PloS one,
13(5):e0196391.
Ma, J., Jin, H., Yang, L. T., and Tsai, J. J.-P. (2006). Ubiqui-
tous Intelligence and Computing: Third International
Conference, UIC 2006, Wuhan, China, September 3-
6, 2006, Proceedings (Lecture Notes in Computer Sci-
ence). Springer-Verlag.
Pantic, M. and Rothkrantz, L. J. (2003). Toward an
affect-sensitive multimodal human-computer interac-
tion. Proceedings of the IEEE, 91(9):1370–1390.
P
´
erez-Rosas, V., Abouelenien, M., Mihalcea, R., and
Burzo, M. (2015). Deception detection using real-
life trial data. In Proceedings of the 2015 ACM on
International Conference on Multimodal Interaction,
pages 59–66. ACM.
Rong, J., Li, G., and Chen, Y.-P. P. (2009). Acoustic
feature selection for automatic emotion recognition
from speech. Information processing & management,
45(3):315–328.
Scherer, K. R. (1986). Vocal affect expression: A review
and a model for future research. Psychological bul-
letin, 99(2):143.
Schuller, B., Rigoll, G., and Lang, M. (2004). Speech
emotion recognition combining acoustic features and
linguistic information in a hybrid support vector
machine-belief network architecture. In 2004 IEEE
International Conference on Acoustics, Speech, and
Signal Processing, volume 1, pages I–577. IEEE.
Shafer, G. (1976). A mathematical theory of evidence, vol-
ume 42. Princeton university press.
Sokolova, M. and Lapalme, G. (2009). A systematic analy-
sis of performance measures for classification tasks.
Information processing & management, 45(4):427–
437.
Theodoridis, S. and Koutroumbas, K. (2009). Pattern recog-
nition. 2003. Elsevier Inc.
Ververidis, D., Kotropoulos, C., and Pitas, I. (2004). Auto-
matic emotional speech classification. In 2004 IEEE
International Conference on Acoustics, Speech, and
Signal Processing, volume 1, pages I–593. IEEE.
Wang, G., Chen, H., and Atabakhsh, H. (2004). Crim-
inal identity deception and deception detection in
law enforcement. Group Decision and Negotiation,
13(2):111–127.
VISAPP 2020 - 15th International Conference on Computer Vision Theory and Applications
720