Authors:
Long Nguyen-Phuoc 1,2; Renald Gaboriau 2; Dimitri Delacroix 2 and Laurent Navarro 1
Affiliations:
1 Mines Saint-Étienne, University of Lyon, University Jean Monnet, Inserm, U 1059 Sainbiose, Centre CIS, 42023 Saint-Étienne, France
2 MJ Lab, MJ INNOV, 42000 Saint-Etienne, France
Keyword(s):
Cognitive Load Assessment, Multimodal-Multitask Learning, Multihead Attention.
Abstract:
This paper introduces the M&M model, a novel multimodal-multitask learning framework, applied to the AVCAffe dataset for cognitive load assessment (CLA). M&M uniquely integrates audiovisual cues through a dual-pathway architecture, featuring specialized streams for audio and video inputs. A key innovation lies in its cross-modality multihead attention mechanism, which fuses the two modalities for synchronized multitasking. Another notable feature is the model's three specialized branches, each tailored to a specific cognitive load label, enabling nuanced, task-specific analysis. While its performance is modest compared to AVCAffe's single-task baseline, M&M demonstrates a promising framework for integrated multimodal processing. This work paves the way for future enhancements in multimodal-multitask learning systems, emphasizing the fusion of diverse data types for complex task handling.
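The abstract outlines a dual-pathway, cross-attention, multi-branch design. The following PyTorch sketch illustrates how such a structure could be wired together; the encoder projections, layer sizes, number of heads, and the three task heads are illustrative assumptions and do not reproduce the authors' exact configuration.

```python
import torch
import torch.nn as nn


class MMSketch(nn.Module):
    """Hypothetical sketch of a dual-pathway, cross-modality multihead
    attention model with three task-specific branches, in the spirit of
    the architecture described in the abstract (dimensions are assumed)."""

    def __init__(self, audio_dim=128, video_dim=512, d_model=256,
                 n_heads=4, n_tasks=3, n_classes=2):
        super().__init__()
        # Dual-pathway encoders: one projection stream per modality.
        self.audio_proj = nn.Linear(audio_dim, d_model)
        self.video_proj = nn.Linear(video_dim, d_model)
        # Cross-modality multihead attention: each modality attends to the other.
        self.a2v_attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
        self.v2a_attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
        # Three specialized branches, one per cognitive load label.
        self.heads = nn.ModuleList(
            nn.Sequential(nn.Linear(2 * d_model, d_model), nn.ReLU(),
                          nn.Linear(d_model, n_classes))
            for _ in range(n_tasks)
        )

    def forward(self, audio_feats, video_feats):
        # audio_feats: (B, Ta, audio_dim); video_feats: (B, Tv, video_dim)
        a = self.audio_proj(audio_feats)
        v = self.video_proj(video_feats)
        # Fuse modalities via cross-attention, then pool over time.
        a_fused, _ = self.a2v_attn(a, v, v)   # audio queries, video keys/values
        v_fused, _ = self.v2a_attn(v, a, a)   # video queries, audio keys/values
        joint = torch.cat([a_fused.mean(dim=1), v_fused.mean(dim=1)], dim=-1)
        # One prediction per cognitive load task (synchronized multitasking).
        return [head(joint) for head in self.heads]


if __name__ == "__main__":
    model = MMSketch()
    audio = torch.randn(2, 50, 128)   # dummy audio features
    video = torch.randn(2, 30, 512)   # dummy video features
    outputs = model(audio, video)
    print([o.shape for o in outputs])  # three (2, n_classes) logit tensors
```

In this sketch, fusion happens symmetrically (audio attends to video and vice versa) before the pooled joint representation is routed to the per-label branches; the actual M&M model may order or parameterize these steps differently.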