EARS: ELECTROMYOGRAPHICAL AUTOMATIC RECOGNITION OF SPEECH

Szu-Chen Stan Jou, Tanja Schultz

2008

Abstract

In this paper, we present our research on automatic speech recognition of surface electromyographic signals that are generated by the human articulatory muscles. With parallel recorded audible speech and electromyographic signals, experiments are conducted to show the anticipatory behavior of electromyographic signals with respect to speech signals. Additionally, we demonstrate how to develop phone-based speech recognizers with carefully designed electromyographic feature extraction methods. We show that articulatory feature (AF) classifiers can also benefit from the novel feature, which improve the F-score of the AF classifiers from 0.467 to 0.686. With a stream architecture, the AF classifiers are then integrated into the decoding framework. Overall, the word error rate improves from 86.8% to 29.9% on a 100 word vocabulary recognition task.

References

  1. Jorgensen, C. and Binsted, K. (2005). Web browser control using EMG based sub vocal speech recognition. In Proc. HICSS, Hawaii.
  2. Jorgensen, C., Lee, D., and Agabon, S. (2003). Sub auditory speech recognition based on EMG signals. In Proc. IJCNN, Portland, Oregon.
  3. Maier-Hein, L., Metze, F., Schultz, T., and Waibel, A. (2005). Session independent non-audible speech recognition using surface electromyography. In Proc. ASRU, San Juan, Puerto Rico.
  4. Manabe, H., Hiraiwa, A., and Sugimura, T. (2003). Unvoiced speech recognition using EMG-Mime speech recognition. In Proc. CHI, Ft. Lauderdale, Florida.
  5. Manabe, H. and Zhang, Z. (2004). Multi-stream HMM for EMG-based speech recognition. In Proc. IEEE EMBS, San Francisco, California.
  6. Metze, F. and Waibel, A. (2002). A flexible stream architecture for ASR using articulatory features. In Proc. ICSLP, Denver, CO.
  7. Yu, H. and Waibel, A. (2000). Streaming the front-end of a speech recognizer. In Proc. ICSLP, Beijing, China.
Download


Paper Citation


in Harvard Style

Stan Jou S. and Schultz T. (2008). EARS: ELECTROMYOGRAPHICAL AUTOMATIC RECOGNITION OF SPEECH . In Proceedings of the First International Conference on Bio-inspired Systems and Signal Processing - Volume 1: BIOSIGNALS, (BIOSTEC 2008) ISBN 978-989-8111-18-0, pages 3-12. DOI: 10.5220/0001067100030012


in Bibtex Style

@conference{biosignals08,
author={Szu-Chen Stan Jou and Tanja Schultz},
title={EARS: ELECTROMYOGRAPHICAL AUTOMATIC RECOGNITION OF SPEECH},
booktitle={Proceedings of the First International Conference on Bio-inspired Systems and Signal Processing - Volume 1: BIOSIGNALS, (BIOSTEC 2008)},
year={2008},
pages={3-12},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001067100030012},
isbn={978-989-8111-18-0},
}


in EndNote Style

TY - CONF
JO - Proceedings of the First International Conference on Bio-inspired Systems and Signal Processing - Volume 1: BIOSIGNALS, (BIOSTEC 2008)
TI - EARS: ELECTROMYOGRAPHICAL AUTOMATIC RECOGNITION OF SPEECH
SN - 978-989-8111-18-0
AU - Stan Jou S.
AU - Schultz T.
PY - 2008
SP - 3
EP - 12
DO - 10.5220/0001067100030012