SUPPORT VECTOR DATA DESCRIPTION FOR SPOKEN DIGIT RECOGNITION

Amirhossein Tavanaei, Alireza Ghasemi, Mohammad Tavanaei, Hossein Sameti, Mohammad T. Manzuri

2012

Abstract

A classifier based on Support Vector Data Description (SVDD) is proposed for spoken digit recognition. We use the Mel Frequency Discrete Wavelet Coefficients (MFDWC) and the Mel Frequency cepstral Coefficients (MFCC) as the feature vectors. The proposed classifier is compared to the HMM and results are promising and we show the HMM and SVDD classifiers have equal accuracy rates. The performance of the proposed features and SVDD classifier with several kernel functions are evaluated and compared in clean and noisy speech. Because of multi resolution and localization of the Wavelet Transform (WT) and using SVDD, experiments on the spoken digit recognition systems based on MFDWC features and SVDD with weighted polynomial kernel function give better results than the other methods.

References

  1. Bresolin A. et. al., "Digit recognition using wavelet and SVM in Brazilian Portuguese". ICASSP 2008, pp.
  2. J. N. Gowdy and Z. Tufekci, "Mel-scaled discrete wavelet coefficients for speech recognition", ICASSP 2000, pp. 1351:1354, Istanbul 2000.
  3. Rabiner L. and Juang B., "An introduction to hidden Markov models", IEEE ASSP, 3(1):4-16, 1986.
  4. Rabiner L., "A tutorial on Hidden Markov Models and selected applications in speech recognition". Proceedings of the IEEE 77 (2): 257286, 1989.
  5. Ganapathiraju A. et. al., "Applications of Support Vector Machines to Speech Recognition", IEEE Transactions on Signal Processing, 52(8):2348-2355, 2004.
  6. Sonkamble B. A et. al., "An Overview of Speech Recognition System based on the Support Vector Machines", ICCCE 2008, pp. 768-771 Kuala Lumpur 2008.
  7. Bai J. et. al., "Speech Recognition Based on A Compound Kernel Support Vector Machine", ICCT 2008, pp. 696-699 China 2008.
  8. Khan S. S.,A, "Survey of Recent Trends in One Class Classification". AICS 2009, pp. 188-197, Ireland 2009.
  9. DeMenthon Y. X. and Doermann D. S., "Support Vector Data Description for image categorization from Internet images". ICPR 2008, pp. 1-4, Florida U.S.A 2008.
  10. Tax D. and Duin. R., "Support Vector Data Description, Machine Learning", 54(1):45-66, 2004.
  11. Taylor J. S. and Cristianini. N., “Kernel Methods for Pattern Analysis” Cambridge University Press, 2004.
Download


Paper Citation


in Harvard Style

Tavanaei A., Ghasemi A., Tavanaei M., Sameti H. and T. Manzuri M. (2012). SUPPORT VECTOR DATA DESCRIPTION FOR SPOKEN DIGIT RECOGNITION . In Proceedings of the International Conference on Bio-inspired Systems and Signal Processing - Volume 1: BIOSIGNALS, (BIOSTEC 2012) ISBN 978-989-8425-89-8, pages 32-37. DOI: 10.5220/0003764400320037


in Bibtex Style

@conference{biosignals12,
author={Amirhossein Tavanaei and Alireza Ghasemi and Mohammad Tavanaei and Hossein Sameti and Mohammad T. Manzuri},
title={SUPPORT VECTOR DATA DESCRIPTION FOR SPOKEN DIGIT RECOGNITION},
booktitle={Proceedings of the International Conference on Bio-inspired Systems and Signal Processing - Volume 1: BIOSIGNALS, (BIOSTEC 2012)},
year={2012},
pages={32-37},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003764400320037},
isbn={978-989-8425-89-8},
}


in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Bio-inspired Systems and Signal Processing - Volume 1: BIOSIGNALS, (BIOSTEC 2012)
TI - SUPPORT VECTOR DATA DESCRIPTION FOR SPOKEN DIGIT RECOGNITION
SN - 978-989-8425-89-8
AU - Tavanaei A.
AU - Ghasemi A.
AU - Tavanaei M.
AU - Sameti H.
AU - T. Manzuri M.
PY - 2012
SP - 32
EP - 37
DO - 10.5220/0003764400320037