Authors:
Amirhossein Tavanaei
1
;
Alireza Ghasemi
2
;
Mohammad Tavanaei
3
;
Hossein Sameti
1
and
Mohammad T. Manzuri
1
Affiliations:
1
Sharif University of Technology, Iran, Islamic Republic of
;
2
École Polytechnique Fédérale de Lausanne, Switzerland
;
3
SAIPA Company, Iran, Islamic Republic of
Keyword(s):
Speech recognition, Machine learning, Pattern recognition, Mel frequency discrete wavelet transform, One-class learning, Support vector data description.
Related
Ontology
Subjects/Areas/Topics:
Artificial Intelligence
;
Biomedical Engineering
;
Biomedical Signal Processing
;
Data Manipulation
;
Health Engineering and Technology Applications
;
Human-Computer Interaction
;
Methodologies and Methods
;
Neurocomputing
;
Neurotechnology, Electronics and Informatics
;
Pattern Recognition
;
Physiological Computing Systems
;
Sensor Networks
;
Soft Computing
;
Speech Recognition
;
Wavelet Transform
Abstract:
A classifier based on Support Vector Data Description (SVDD) is proposed for spoken digit recognition. We use the Mel Frequency Discrete Wavelet Coefficients (MFDWC) and the Mel Frequency cepstral Coefficients (MFCC) as the feature vectors. The proposed classifier is compared to the HMM and results are promising and we show the HMM and SVDD classifiers have equal accuracy rates. The performance of the proposed features and SVDD classifier with several kernel functions are evaluated and compared in clean and noisy speech. Because of multi resolution and localization of the Wavelet Transform (WT) and using SVDD, experiments on the spoken digit recognition systems based on MFDWC features and SVDD with weighted polynomial kernel function give better results than the other methods.