Preliminary Evaluation of a Silent Speech Interface based on Intra-Oral Magnetic Sensing
Lam A. Cheah, Jie Bai, Jose A. Gonzalez, James M. Gilbert, Stephen R. Ell, Phil D. Green, Roger K. Moore
2016
Abstract
This paper addresses the hardware challenges faced in developing a practical silent speech interface (SSI) for post-laryngectomy speech rehabilitation. Although a number of SSIs have been developed, many are still deemed impractical because of their high degree of intrusiveness and discomfort, which limits their transition out of the laboratory environment. The aim of this paper is to build upon our previous work by developing a user-centric prototype and enhancing its desirable features. A new Permanent Magnet Articulography (PMA) system is presented which fits within the palatal cavity of the user’s mouth, giving an unobtrusive appearance and high portability. The prototype comprises a miniaturised circuit constructed using commercial off-the-shelf (COTS) components and is implemented in the form of a dental retainer, which is mounted under the roof of the user’s mouth and firmly clasps onto the upper teeth. Preliminary evaluation via speech recognition experiments demonstrates that the intraoral prototype achieves a word recognition accuracy of 75.7%, slightly lower than that of its predecessor. Nonetheless, the intraoral design is expected to improve the stability and robustness of the PMA system and offers a much improved appearance, since it can be completely hidden inside the user’s mouth.
Paper Citation
in Harvard Style
Cheah L., Bai J., Gonzalez J., Gilbert J., Ell S., Green P. and Moore R. (2016). Preliminary Evaluation of a Silent Speech Interface based on Intra-Oral Magnetic Sensing. In Proceedings of the 9th International Joint Conference on Biomedical Engineering Systems and Technologies - Volume 1: BIODEVICES, (BIOSTEC 2016) ISBN 978-989-758-170-0, pages 108-116. DOI: 10.5220/0005824501080116
in Bibtex Style
@conference{biodevices16,
author={Lam A. Cheah and Jie Bai and Jose A. Gonzalez and James M. Gilbert and Stephen R. Ell and Phil D. Green and Roger K. Moore},
title={Preliminary Evaluation of a Silent Speech Interface based on Intra-Oral Magnetic Sensing},
booktitle={Proceedings of the 9th International Joint Conference on Biomedical Engineering Systems and Technologies - Volume 1: BIODEVICES, (BIOSTEC 2016)},
year={2016},
pages={108-116},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005824501080116},
isbn={978-989-758-170-0},
}
in EndNote Style
TY - CONF
JO - Proceedings of the 9th International Joint Conference on Biomedical Engineering Systems and Technologies - Volume 1: BIODEVICES, (BIOSTEC 2016)
TI - Preliminary Evaluation of a Silent Speech Interface based on Intra-Oral Magnetic Sensing
SN - 978-989-758-170-0
AU - Cheah L.
AU - Bai J.
AU - Gonzalez J.
AU - Gilbert J.
AU - Ell S.
AU - Green P.
AU - Moore R.
PY - 2016
SP - 108
EP - 116
DO - 10.5220/0005824501080116