DNN-based Models for Speaker Age and Gender Classification

Zakariya Qawaqneh, Arafat Abu Mallouh, Buket D. Barkana

2017

Abstract

Automatic speaker age and gender classification is an active research field due to the continuous and rapid development of applications related to humans’ life and health. In this paper, we propose a new method for speaker age and gender classification, which utilizes deep neural networks (DNNs) as feature extractor and classifier. The proposed method creates a model for each speaker. For each test speech utterance, the similarity between the test model and the speaker class models are compared. Two feature sets have been used: Mel-frequency cepstral coefficients (MFCCs) and shifted delta cepstral (SDC) coefficients. The proposed model by using the SDC feature set achieved better classification results than that of MFCCs. The experimental results showed that the proposed SDC speaker model + SDC class model outperformed all the other systems by achieving 57.21% overall classification accuracy.

Download


Paper Citation


in Harvard Style

Qawaqneh Z., Abu Mallouh A. and Barkana B. (2017). DNN-based Models for Speaker Age and Gender Classification. In - BIOSIGNALS, (BIOSTEC 2017) ISBN , pages 0-0. DOI: 10.5220/0006096400001488


in Bibtex Style

@conference{biosignals17,
author={Zakariya Qawaqneh and Arafat Abu Mallouh and Buket D. Barkana},
title={DNN-based Models for Speaker Age and Gender Classification},
booktitle={ - BIOSIGNALS, (BIOSTEC 2017)},
year={2017},
pages={},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0006096400001488},
isbn={},
}


in EndNote Style

TY - CONF

JO - - BIOSIGNALS, (BIOSTEC 2017)
TI - DNN-based Models for Speaker Age and Gender Classification
SN -
AU - Qawaqneh Z.
AU - Abu Mallouh A.
AU - Barkana B.
PY - 2017
SP - 0
EP - 0
DO - 10.5220/0006096400001488