DISCRETE SPEECH RECOGNITION USING A HAUSDORFF BASED METRIC - An automatic word-based speech recognition approach

Tudor Barbu

Abstract

In this work we present an automatic, speaker-independent, word-based approach to discrete speech recognition. The proposed method consists of several processing levels. First, a word-based audio segmentation is performed; feature extraction is then applied to the resulting segments. The speech feature vectors are computed using a delta-delta mel cepstral analysis of the vocal sound. A minimum-distance supervised classifier is then proposed. Because the speech feature vectors have different dimensions, we construct a Hausdorff-based nonlinear metric to measure the distance between them.
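The classification step described above can be illustrated with a minimal sketch. The abstract does not give the exact form of the paper's Hausdorff-based metric, so this example assumes the classical Hausdorff distance with a Euclidean ground distance between frames. The names `hausdorff`, `classify` and `templates`, and the toy two-dimensional frames, are hypothetical and chosen only for illustration; real inputs would be delta-delta mel cepstral frame vectors.

```python
import math


def euclidean(u, v):
    """Euclidean distance between two frames of equal length."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def directed_hausdorff(A, B):
    """Largest distance from a frame of A to its nearest frame in B."""
    return max(min(euclidean(a, b) for b in B) for a in A)


def hausdorff(A, B):
    """Classical Hausdorff distance between two frame sequences.

    A and B may contain different numbers of frames, which is why a
    set-based distance is convenient here (assumption: the paper's
    metric is a variant of this, not necessarily this exact formula).
    """
    return max(directed_hausdorff(A, B), directed_hausdorff(B, A))


def classify(segment, templates):
    """Minimum-distance supervised classification.

    templates maps each word label to a list of training frame
    sequences; the label of the closest training sequence wins.
    """
    best_label, best_dist = None, math.inf
    for label, examples in templates.items():
        for example in examples:
            d = hausdorff(segment, example)
            if d < best_dist:
                best_label, best_dist = label, d
    return best_label, best_dist


# Toy training set: sequences of different lengths (3 frames vs. 2 frames).
templates = {
    "yes": [[(0.0, 0.0), (1.0, 0.0), (2.0, 0.0)]],
    "no": [[(0.0, 5.0), (1.0, 5.0)]],
}
segment = [(0.1, 0.1), (1.9, 0.2)]
label, distance = classify(segment, templates)
print(label)
```

Because the distance compares sets of frames rather than aligned vectors, the input segment and the training sequences do not need the same number of frames.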



Paper Citation


in Harvard Style

Barbu T. (2004). DISCRETE SPEECH RECOGNITION USING A HAUSDORFF BASED METRIC - An automatic word-based speech recognition approach. In Proceedings of the First International Conference on E-Business and Telecommunication Networks - Volume 3: ICETE, ISBN 972-8865-15-5, pages 363-368. DOI: 10.5220/0001381803630368


in Bibtex Style

@conference{icete04,
author={Tudor Barbu},
title={DISCRETE SPEECH RECOGNITION USING A HAUSDORFF BASED METRIC - An automatic word-based speech recognition approach},
booktitle={Proceedings of the First International Conference on E-Business and Telecommunication Networks - Volume 3: ICETE},
year={2004},
pages={363-368},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001381803630368},
isbn={972-8865-15-5},
}

