COMPARATIVE STUDY OF SEVERAL NOVEL ACOUSTIC FEATURES FOR SPEAKER RECOGNITION

Vladimir Pervouchine, Graham Leedham, Haishan Zhong, David Cho, Haizhou Li

Abstract

Finding good features that represent speaker identity is an important problem in speaker recognition area. Recently a number of novel acoustic features have been proposed for speaker recognition. The researchers use different data sets and sometimes different classifiers to evaluate the features and compare them to the baselines such as MFCC or LPCC. However, due to different experimental conditions direct comparison of those features to each other is difficult or impossible. This paper presents a study of five new recently proposed acoustic features using the same data (NIST 2001 SRE), and the same UBM-GMM classifier. The results are presented as DET curves with equal error ratios indicated. Also, an SVM-based combination of GMM scores produced on different features has been made to determine if the new features carry any complimentary information. The results for different features as well as for their combinations are directly comparable to each other and to those obtained with the baseline MFCC features.

Download


Paper Citation


in Harvard Style

Pervouchine V., Leedham G., Zhong H., Cho D. and Li H. (2008). COMPARATIVE STUDY OF SEVERAL NOVEL ACOUSTIC FEATURES FOR SPEAKER RECOGNITION . In Proceedings of the First International Conference on Bio-inspired Systems and Signal Processing - Volume 1: BIOSIGNALS, (BIOSTEC 2008) ISBN 978-989-8111-18-0, pages 220-223. DOI: 10.5220/0001060302200223


in Bibtex Style

@conference{biosignals08,
author={Vladimir Pervouchine and Graham Leedham and Haishan Zhong and David Cho and Haizhou Li},
title={COMPARATIVE STUDY OF SEVERAL NOVEL ACOUSTIC FEATURES FOR SPEAKER RECOGNITION},
booktitle={Proceedings of the First International Conference on Bio-inspired Systems and Signal Processing - Volume 1: BIOSIGNALS, (BIOSTEC 2008)},
year={2008},
pages={220-223},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001060302200223},
isbn={978-989-8111-18-0},
}


in EndNote Style

TY - CONF
JO - Proceedings of the First International Conference on Bio-inspired Systems and Signal Processing - Volume 1: BIOSIGNALS, (BIOSTEC 2008)
TI - COMPARATIVE STUDY OF SEVERAL NOVEL ACOUSTIC FEATURES FOR SPEAKER RECOGNITION
SN - 978-989-8111-18-0
AU - Pervouchine V.
AU - Leedham G.
AU - Zhong H.
AU - Cho D.
AU - Li H.
PY - 2008
SP - 220
EP - 223
DO - 10.5220/0001060302200223