A Data Augmentation Approach for Improving the Performance of Speech Emotion Recognition

Georgia Paraskevopoulou; Evaggelos Spyrou; Evaggelos Spyrou; Stavros Perantonis

Research.Publish.Connect.

*Please fill out at least one Field. *Value must be an number!

Title:
ISBN:
Year:
Acronym:
Subject:

Advanced Search Proceedings Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Title:
Author:
Affiliation:
Subject:

Advanced Search Papers Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Name:
Affiliation:
Country:
Conference:
Subject:

Advanced Search Authors Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Name:
Country:
Subject:

Advanced Search Affiliations Search

If you're looking for an exact phrase use quotation marks on text fields.

Proceedings

Proceedings Search *Please fill out at least one Field. *Value must be an number!

Title:
ISBN:
Year:
Acronym:
Subject:

Advanced Search Proceedings Search

If you're looking for an exact phrase use quotation marks on text fields.

Papers

Papers Search *Please fill out at least one Field.

Title:
Author:
Affiliation:
Subject:

Advanced Search Papers Search

If you're looking for an exact phrase use quotation marks on text fields.

Authors

Authors Search *Please fill out at least one Field.

Name:
Affiliation:
Country:
Conference:
Subject:

Advanced Search Authors Search

If you're looking for an exact phrase use quotation marks on text fields.

Advanced Search

Paper

A Data Augmentation Approach for Improving the Performance of Speech Emotion Recognition

Topics: Emotional and Social Signals in Multimedia ; Music, Speech and Audio Processing

In Proceedings of the 19th International Conference on Signal Processing and Multimedia Applications SIGMAP - Volume 1, 61-69, 2022 , Lisbon, Portugal

Authors: Georgia Paraskevopoulou ¹ ; Evaggelos Spyrou ^{2

;

3} and Stavros Perantonis ³

Affiliations: ¹ Department of History & Philosophy of Science, National and Kapodistrian University of Athens, Athens, Greece ; ² Department of Computer Science and Telecommunications, University of Thessaly, Lamia, Greece ; ³ Institute of Informatics and Telecommunications, National Center for Scientific Research - “Demokritos,” Athens, Greece

Keyword(s): Emotion Recognition, Convolutional Neural Network, Spectrograms, Data Augmentation.

Abstract: The recognition of the emotions of humans is crucial for various applications related to human-computer interaction or for understanding the users’ mood in several tasks. Typical machine learning approaches used towards this goal first extract a set of linguistic features from raw data, which are then used to train supervised learning models. Recently, Convolutional Neural Networks (CNNs), which unlike traditional approaches, learn to extract the appropriate features of their inputs, have also been applied as emotion recognition classifiers. In this work, we adopt a CNN architecture that uses spectrograms, extracted from audio signals as inputs and we propose data augmentation techniques to boost the classification performance. The proposed data augmentation approach includes noise addition, shifting of the audio signal, and changing its pitch or its speed. Experimental results indicate that the herein presented approach outperforms previous work which not use augmented data.

CC BY-NC-ND 4.0

Guest: Register as new SciTePress user now for free.

SciTePress user: please login.

My Papers

You are not signed in, therefore limits apply to your IP address 216.73.216.119

In the current month:

Recent papers: 100 available of 100 total

2⁺ years older papers: 200 available of 200 total

Paper citation in several formats:

Paraskevopoulou, G., Spyrou, E., Perantonis and S. (2022). A Data Augmentation Approach for Improving the Performance of Speech Emotion Recognition. In Proceedings of the 19th International Conference on Signal Processing and Multimedia Applications - SIGMAP; ISBN 978-989-758-591-3; ISSN 2184-9471, SciTePress, pages 61-69. DOI: 10.5220/0011148000003289

@conference{sigmap22,
author={Georgia Paraskevopoulou and Evaggelos Spyrou and Stavros Perantonis},
title={A Data Augmentation Approach for Improving the Performance of Speech Emotion Recognition},
booktitle={Proceedings of the 19th International Conference on Signal Processing and Multimedia Applications - SIGMAP},
year={2022},
pages={61-69},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0011148000003289},
isbn={978-989-758-591-3},
issn={2184-9471},
}

TY - CONF

JO - Proceedings of the 19th International Conference on Signal Processing and Multimedia Applications - SIGMAP
TI - A Data Augmentation Approach for Improving the Performance of Speech Emotion Recognition
SN - 978-989-758-591-3
IS - 2184-9471
AU - Paraskevopoulou, G.
AU - Spyrou, E.
AU - Perantonis, S.
PY - 2022
SP - 61
EP - 69
DO - 10.5220/0011148000003289
PB - SciTePress