Time-segmentation- and Position-free Recognition from Video of Air-drawn Gestures and Characters

Yuki Nitsuma, Syunpei Torii, Yuichi Yaguchi, Ryuichi Oka

Abstract

We report on the recognition from video streams of isolated alphabetic characters and connected cursive textual characters, such as alphabetic, hiragana a kanji characters, drawn in the air. This topic involves a number of difficult problems in computer vision, such as the segmentation and recognition of complex motion from video. We utilize an algorithm called time-space continuous dynamic programming (TSCDP) that can realize both time- and location-free (spotting) recognition. Spotting means that prior segmentation of input video is not required. Each of the reference (model) characters used is represented by a single stroke composed of pixels. We conducted two experiments involving the recognition of 26 isolated alphabetic characters and 23 Japanese hiragana and kanji air-drawn characters. Moreover we conducted gesture recognition experiments based on TSCDP and showed that TSCDP was free from many restrictions imposed upon conventional methods.

References

  1. Alon, J. (2006). Spatiotemporal Gesture Segmentation. Dissertation for Doctor of Philosophy, Boston University.
  2. Chen, F., Fu, C., and Huang., C. (2003). Hand gesture recognition using a real-time tracking method and hidden markov models. Image and Video Computing, 21(8):745-758.
  3. Ezaki, N., Sugimoto, M., Kiyota, K., and Yamamoto, S. (2010). Character recognition by using acceleration sensor: Proposing a character input method using wiimote [in Japanese]. Meeting on Image Recognition and Understanding, IS2-48:1094-1098.
  4. Gao, W., Ma, J., J.Wu, and Wang, C. (2000). Sign language recognition based on hmm/ann/dp. International Journal of Pattern Recognition and Artificial Intelligence, 14(5):587-602.
  5. Horo, T. and Inaba, M. (2006). A handwriting recognition system using multiple cameras [in Japanese]. Workshop on Interactive Systems and Software (WISS2006).
  6. Kolsch, M. and Turk, M. (2004). Robust hand detection. In Proc. Sixth IEEE International Conference on Automatic Face and Gesture Recognition, pages 614-619.
  7. Nakai, M. and Yonezawa, H. (2009). Aerial handwritten character recognition using an acceleration sensor [in Japanese]. Forum on Information Technology, H19:133-134.
  8. Oka, R. (1998). Spotting method for classification of real world data. The Computer Journal, 41(8):559-565.
  9. Oka, R. and Matsuzaki, T. (2012). Robustness for time-spatial deformation of an occlusion realized in time-space continuous dynamic programming [in Japanese]. Joint Technical Meeting on Information Processing and Innovative Industrial Systems, 27(6):873-891.
  10. Okada, T. and Muraoka, Y. (2003). Letter input system for handwriting gestures [in Japanese]. Transactions of the Institute of Electronics, Information and Communication Engineers, D-II J86-D-II(7):1015-1025.
  11. Ong, S. C. W. and Ranganath, S. (2005). Automatic sign language analysis: A survey and the future beyond lexical meaning. Pattern Analysis and Machine Intelligence, 27(6):873-891.
  12. Sato, A., Shinoda, K., and Furui, S. (2010). Sign language recognition using time-of-flight camera [in Japanese]. Meeting on Image Recognition and Understanding, IS3-44:1861-1868.
  13. Sclaroff, S., Betke, M., Kollios, G., Alon, J., Athitsos, V., Li, R., Magee, J., and Tian, T.-P. (2005). Tracking analysis and recognition of human gestures in video. ICDAR: Int. Conf. on Document Analysis and Recognition.
  14. Yang, M., Ahuja, N., and Tabb, M. (2002). Extraction of 2d motion trajectories and its application to hand gesture recognition. Pattern Analysis and Machine Intelligence, 24(8):1061-1074.
Download


Paper Citation


in Harvard Style

Nitsuma Y., Torii S., Yaguchi Y. and Oka R. (2014). Time-segmentation- and Position-free Recognition from Video of Air-drawn Gestures and Characters . In Proceedings of the 3rd International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM, ISBN 978-989-758-018-5, pages 588-599. DOI: 10.5220/0004816805880599


in Bibtex Style

@conference{icpram14,
author={Yuki Nitsuma and Syunpei Torii and Yuichi Yaguchi and Ryuichi Oka},
title={Time-segmentation- and Position-free Recognition from Video of Air-drawn Gestures and Characters},
booktitle={Proceedings of the 3rd International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM,},
year={2014},
pages={588-599},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004816805880599},
isbn={978-989-758-018-5},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 3rd International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM,
TI - Time-segmentation- and Position-free Recognition from Video of Air-drawn Gestures and Characters
SN - 978-989-758-018-5
AU - Nitsuma Y.
AU - Torii S.
AU - Yaguchi Y.
AU - Oka R.
PY - 2014
SP - 588
EP - 599
DO - 10.5220/0004816805880599