loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Tanay Agrawal 1 ; Dhruv Agarwal 2 ; 1 ; Michal Balazia 1 ; 3 ; Neelabh Sinha 4 ; 1 and François Bremond 1 ; 3

Affiliations: 1 INRIA Sophia Antipolis - Méditerranée, France ; 2 Indian Institute of Information Technology, Allahabad, India ; 3 Université Côte d’Azur, France ; 4 Birla Institute of Technology and Science, Pilani, India

Keyword(s): Multimodal Transformer, Multimodal Data, Feature Engineering, Personality Recognition.

Abstract: Personality computing and affective computing have gained recent interest in many research areas. The datasets for the task generally have multiple modalities like video, audio, language and bio-signals. In this paper, we propose a flexible model for the task which exploits all available data. The task involves complex relations and to avoid using a large model for video processing specifically, we propose the use of behaviour encoding which boosts performance with minimal change to the model. Cross-attention using transformers has become popular in recent times and is utilised for fusion of different modalities. Since long term relations may exist, breaking the input into chunks is not desirable, thus the proposed model processes the entire input together. Our experiments show the importance of each of the above contributions.

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 13.59.196.41

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Agrawal, T., Agarwal, D., Balazia, M., Sinha, N. and Bremond, F. (2022). Multimodal Personality Recognition using Cross-attention Transformer and Behaviour Encoding. In Proceedings of the 17th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2022) - Volume 5: VISAPP; ISBN 978-989-758-555-5; ISSN 2184-4321, SciTePress, pages 501-508. DOI: 10.5220/0010841400003124

@conference{visapp22,
author={Tanay Agrawal and Dhruv Agarwal and Michal Balazia and Neelabh Sinha and Fran\c{c}ois Bremond},
title={Multimodal Personality Recognition using Cross-attention Transformer and Behaviour Encoding},
booktitle={Proceedings of the 17th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2022) - Volume 5: VISAPP},
year={2022},
pages={501-508},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0010841400003124},
isbn={978-989-758-555-5},
issn={2184-4321},
}

TY - CONF

JO - Proceedings of the 17th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2022) - Volume 5: VISAPP
TI - Multimodal Personality Recognition using Cross-attention Transformer and Behaviour Encoding
SN - 978-989-758-555-5
IS - 2184-4321
AU - Agrawal, T.
AU - Agarwal, D.
AU - Balazia, M.
AU - Sinha, N.
AU - Bremond, F.
PY - 2022
SP - 501
EP - 508
DO - 10.5220/0010841400003124
PB - SciTePress