A ROBUST SPEECH COMMAND RECOGNIZER FOR EMBEDDED APPLICATIONS
Alexandre Maciel, Arlindo Veiga, Cláudio Neves, José Lopes, Carla Lopes, Fernando Perdigão, Luís Sá
2008
Abstract
This paper describes a command-based robust speech recognition system for the Portuguese language. Due to an efficient noise reduction algorithm the system can be operated in adverse noise environments such as in cars or factories. The recognizer was trained and tested with a speech database with 250 commands spoken by 345 speakers in clean and noisy conditions. The system incorporates a user friendly application programming interface and was optimized for embedded platforms with limited computational resources. Performance tests for the recognizer are presented.
References
- ETSI, 2003. ETSI ES 202 050 v1.1.3. Speech Processing, Transmission and Quality Aspects (STQ); Distributed Speech Recognition; Advanced Front-end Feature Extraction Algorithm; Compression Algorithms. Technical Report ETSI ES 202 050, ETSI.
- HTK3, 2006. The HTK book (for HTK version 3.4). Technical report, Cambridge University. England. http://htk.eng.cam.ac.uk/.
- Li, J.-Y., Liu, B., Wang, R.-H., and Dai L.-R., 2004. A Complexity Reduction of ETSI Advanced Front-end for DSR. In proc. of ICASSP'2004, vol. I, pp. 61-64. Montreal, Canada.
- Neves, C., Veiga, A., Sá, L., and Perdigão, F., 2008. Efficient Noise-Robust Speech Recognition Front-end Based on the ETSI Standard. Submitted to INTERSPEECH'2008. Brisbane, Australia.
- Peinado, A., and Segura, J., 2006. Speech Recognition over Digital Channels: Robustness and Standards, John Wiley & Sons, Ltd. England.
- Tecnovoz, 2008. http://www.tecnovoz.pt/web/home.asp.
- Yu, D., Ju, Y., Wang, Y.-Y., and Alex, W., 2006. N-Gram Based Filler Model for Robust Grammar Authoring. In proc. of ICASSP'2006, vol. I, pp. 565-568. Toulouse, France.
Paper Citation
in Harvard Style
Maciel A., Veiga A., Neves C., Lopes J., Lopes C., Perdigão F. and Sá L. (2008). A ROBUST SPEECH COMMAND RECOGNIZER FOR EMBEDDED APPLICATIONS . In Proceedings of the International Conference on Signal Processing and Multimedia Applications - Volume 1: SIGMAP, (ICETE 2008) ISBN 978-989-8111-60-9, pages 92-95. DOI: 10.5220/0001938700920095
in Bibtex Style
@conference{sigmap08,
author={Alexandre Maciel and Arlindo Veiga and Cláudio Neves and José Lopes and Carla Lopes and Fernando Perdigão and Luís Sá},
title={A ROBUST SPEECH COMMAND RECOGNIZER FOR EMBEDDED APPLICATIONS},
booktitle={Proceedings of the International Conference on Signal Processing and Multimedia Applications - Volume 1: SIGMAP, (ICETE 2008)},
year={2008},
pages={92-95},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001938700920095},
isbn={978-989-8111-60-9},
}
in EndNote Style
TY - CONF
JO - Proceedings of the International Conference on Signal Processing and Multimedia Applications - Volume 1: SIGMAP, (ICETE 2008)
TI - A ROBUST SPEECH COMMAND RECOGNIZER FOR EMBEDDED APPLICATIONS
SN - 978-989-8111-60-9
AU - Maciel A.
AU - Veiga A.
AU - Neves C.
AU - Lopes J.
AU - Lopes C.
AU - Perdigão F.
AU - Sá L.
PY - 2008
SP - 92
EP - 95
DO - 10.5220/0001938700920095