Authors: Osman Büyük 1 and Levent M. Arslan 2 ; 3

Affiliations: 1 Department of Electronics and Communications Engineering, Kocaeli University, Kocaeli, Turkey ; 2 Department of Electrical and Electronics Engineering, Bogazici University, Istanbul, Turkey ; 3 Sestek Speech Enabled Software Technologies Incorporation, Istanbul, Turkey

Keyword(s): Age Classification from Voice, Multi-language, Feed-forward Deep Neural Networks, Support Vector Machines, Gaussian Mixture Models.

Abstract: In this paper, we investigate the use of deep neural networks (DNN) for a multi-language age classification task using the speaker’s voice. For this purpose, speech databases in two different languages are combined to construct a multi-language database. Mel-frequency cepstral coefficients (MFCC) are extracted for each utterance. Gaussian mixture model (GMM), support vector machine (SVM) and feed-forward deep neural network (DNN) systems are trained using the features. In the SVM and DNN methods, the GMM means are concatenated to obtain a GMM supervector. The supervectors are fed into the SVM and DNN for age classification. In the experiments, we observe that multi-language training does not degrade the performance of the SVM and DNN methods compared to matched training, where the train and test languages are the same. On the other hand, the performance of the traditional GMM method is degraded. Additionally, the SVM and DNN significantly outperform the GMM in the multi-language train-test scenario. The absolute performance improvement with the SVM and DNN is approximately 12% and 7% for female and male speakers, respectively.
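
The supervector step mentioned in the abstract can be illustrated with a minimal sketch. The abstract only states that the GMM means are concatenated; how those means are obtained per utterance is not given here. The sketch below assumes the common approach of MAP-adapting the means of a background GMM to each utterance, and the relevance factor, the function names and the toy numbers are all hypothetical, not taken from the paper. MFCC extraction and the SVM/DNN classifiers are left out.

```python
import math

def log_gauss_diag(x, mean, var):
    """Log-density of a diagonal-covariance Gaussian at frame x."""
    return -0.5 * sum(
        math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v
        for xi, m, v in zip(x, mean, var)
    )

def posteriors(x, weights, means, variances):
    """Responsibility of each mixture component for frame x."""
    logs = [math.log(w) + log_gauss_diag(x, m, v)
            for w, m, v in zip(weights, means, variances)]
    top = max(logs)  # subtract the maximum for numerical stability
    exps = [math.exp(l - top) for l in logs]
    total = sum(exps)
    return [e / total for e in exps]

def supervector(frames, weights, means, variances, relevance=16.0):
    """MAP-adapt the background means to one utterance, then concatenate them.

    Assumption: mean-only MAP adaptation with a hypothetical relevance
    factor; the abstract does not specify the adaptation scheme.
    """
    k, d = len(means), len(means[0])
    counts = [0.0] * k                      # soft frame count per component
    sums = [[0.0] * d for _ in range(k)]    # posterior-weighted frame sums
    for x in frames:
        for c, g in enumerate(posteriors(x, weights, means, variances)):
            counts[c] += g
            for j in range(d):
                sums[c][j] += g * x[j]
    vec = []
    for c in range(k):
        alpha = counts[c] / (counts[c] + relevance)
        for j in range(d):
            frame_mean = sums[c][j] / counts[c] if counts[c] > 0 else means[c][j]
            vec.append(alpha * frame_mean + (1.0 - alpha) * means[c][j])
    return vec

# Toy background model: 2 components, 2-dimensional features (made-up values).
weights = [0.5, 0.5]
means = [[0.0, 0.0], [5.0, 5.0]]
variances = [[1.0, 1.0], [1.0, 1.0]]
frames = [[0.2, -0.1], [4.8, 5.3], [5.1, 4.9]]

sv = supervector(frames, weights, means, variances)
print(len(sv))  # number of components times feature dimension
```

Each utterance yields one fixed-length vector regardless of its duration, which is what allows a classifier such as an SVM or a feed-forward network to consume it directly.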

CC BY-NC-ND 4.0

Paper citation in several formats:
Büyük, O. and Arslan, L. (2019). An Investigation of Multi-Language Age Classification from Voice. In Proceedings of the 12th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2019) - BIOSIGNALS; ISBN 978-989-758-353-7; ISSN 2184-4305, SciTePress, pages 85-92. DOI: 10.5220/0007237600850092

@conference{biosignals19,
author={Osman Büyük and Levent M. Arslan},
title={An Investigation of Multi-Language Age Classification from Voice},
booktitle={Proceedings of the 12th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2019) - BIOSIGNALS},
year={2019},
pages={85-92},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0007237600850092},
isbn={978-989-758-353-7},
issn={2184-4305},
}

TY - CONF
JO - Proceedings of the 12th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2019) - BIOSIGNALS
TI - An Investigation of Multi-Language Age Classification from Voice
SN - 978-989-758-353-7
IS - 2184-4305
AU - Büyük, O.
AU - Arslan, L.
PY - 2019
SP - 85
EP - 92
DO - 10.5220/0007237600850092
PB - SciTePress