Curtis, K., Jones, G. J., and Campbell, N. (2015). Effects of
good speaking techniques on audience engagement. In
Proceedings of the 2015 ACM on International Conference
on Multimodal Interaction, pages 35–42.
Damian, I., Tan, C. S., Baur, T., Schöning, J., Luyten, K.,
and André, E. (2015). Augmenting social interactions:
Realtime behavioural feedback using social signal
processing techniques. In Proceedings of the 33rd
annual ACM conference on Human factors in computing
systems, pages 565–574.
Ebbinghaus, H. (1913). Memory: a contribution to
experimental psychology. 1885. New York: Teachers
College, Columbia University.
Eyben, F., Scherer, K. R., Schuller, B. W., Sundberg,
J., André, E., Busso, C., Devillers, L. Y., Epps, J.,
Laukka, P., Narayanan, S. S., and Truong, K. P.
(2016). The Geneva minimalistic acoustic parameter
set (GeMAPS) for voice research and affective
computing. IEEE Transactions on Affective Computing,
7(2):190–202.
Eyben, F., Wöllmer, M., and Schuller, B. (2010).
openSMILE: the Munich versatile and fast open-source
audio feature extractor. In Proceedings of the 18th ACM
international conference on Multimedia, pages 1459–1462.
Haider, F., Koutsombogera, M., Conlan, O., Vogel, C.,
Campbell, N., and Luz, S. (2020). An active data
representation of videos for automatic scoring of oral
presentation delivery skills and feedback generation.
Frontiers in Computer Science, 2:1.
Hemamou, L., Felhi, G., Vandenbussche, V., Martin, J.-C.,
and Clavel, C. (2019). HireNet: A hierarchical attention
model for the automatic analysis of asynchronous
video job interviews. In Proceedings of the AAAI
Conference on Artificial Intelligence, volume 33,
pages 573–581.
Hemamou, L., Guillon, A., Martin, J.-C., and Clavel, C.
(2021). Multimodal hierarchical attention neural
network: Looking for candidates behaviour which impact
recruiter’s decision. IEEE Transactions on Affective
Computing.
Hirschberg, J. B. and Rosenberg, A. (2005).
Acoustic/prosodic and lexical correlates of charismatic
speech.
Hongwei, Z. et al. (2020). Analysis of the persuasive
methods in Barack Obama’s speeches from the social
psychology’s perspectives. The Frontiers of Society,
Science and Technology, 2(10).
Lundberg, S. M. and Lee, S.-I. (2017). A unified approach
to interpreting model predictions. Advances in neural
information processing systems, 30.
Nguyen, A.-T., Chen, W., and Rauterberg, M. (2012).
Online feedback system for public speakers. In 2012
IEEE Symposium on E-Learning, E-Management and
E-Services, pages 1–5. IEEE.
Nguyen, L. S. and Gatica-Perez, D. (2015). I would hire you
in a minute: Thin slices of nonverbal behavior in job
interviews. In Proceedings of the 2015 ACM on
international conference on multimodal interaction, pages
51–58.
Nojavanasghari, B., Gopinath, D., Koushik, J., Baltrušaitis,
T., and Morency, L.-P. (2016). Deep multimodal fusion
for persuasiveness prediction. In Proceedings
of the 18th ACM International Conference on
Multimodal Interaction, pages 284–288.
Park, S., Shim, H. S., Chatterjee, M., Sagae, K., and
Morency, L.-P. (2014). Computational analysis of
persuasiveness in social multimedia: A novel dataset and
multimodal prediction approach. In Proceedings of
the 16th International Conference on Multimodal
Interaction, pages 50–57.
Pennebaker, J. W., Boyd, R. L., Jordan, K., and Blackburn,
K. (2015). The development and psychometric
properties of LIWC2015. Technical report.
Ramanarayanan, V., Leong, C. W., Chen, L., Feng, G., and
Suendermann-Oeft, D. (2015). Evaluating speech,
face, emotion and body movement time-series features
for automated multimodal presentation scoring.
In Proceedings of the 2015 ACM on International
Conference on Multimodal Interaction, pages 23–30.
Scherer, S., Layher, G., Kane, J., Neumann, H., and
Campbell, N. (2012). An audiovisual political speech
analysis incorporating eye-tracking and perception data. In
LREC, pages 1114–1120.
Sharma, G. and Sharma, P. (2010). Importance of soft skills
development in 21st century curriculum. International
Journal of Education & Allied Sciences, 2(2).
Strangert, E. and Gustafson, J. (2008). What makes a good
speaker? subject ratings, acoustic measurements and
perceptual evaluations. In Ninth Annual Conference of
the International Speech Communication Association.
Tanveer, M. I., Lin, E., and Hoque, M. (2015). Rhema:
A real-time in-situ intelligent interface to help people
with public speaking. In Proceedings of the 20th
international conference on intelligent user interfaces,
pages 286–295.
Tillfors, M. and Furmark, T. (2007). Social phobia in
Swedish university students: prevalence, subgroups
and avoidant behavior. Social psychiatry and
psychiatric epidemiology, 42(1):79–86.
Wörtwein, T., Chollet, M., Schauerte, B., Morency, L.-P.,
Stiefelhagen, R., and Scherer, S. (2015). Multimodal
public speaking performance assessment. In Proceedings
of the 2015 ACM on International Conference on
Multimodal Interaction, pages 43–50.
Zhao, R., Li, V., Barbosa, H., Ghoshal, G., and Hoque,
M. E. (2017). Semi-automated and collaborative
online training module for improving communication
skills. Proceedings of the ACM on Interactive, Mobile,
Wearable and Ubiquitous Technologies, 1(2):1–20.
HUCAPP 2023 - 7th International Conference on Human Computer Interaction Theory and Applications