seventh is SVM on combined feature from MHI and
its histogram. The last one is the results SVM based
on Haar wavelet transform of MHI and histogram of
MHI. This last result achieves the best overall perfor-
mance of approximately 71% correct classification.
6 CONCLUSION
In this paper, we proposed a system for fast human ac-
tion recognition. Potential applications include secu-
rity systems, man-machine communication, and ubiq-
uitous vision systems. The proposed method does
not rely on accurate tracking as many other works
do, since many tracking algorithms incur a prohibitive
computational cost for the system. Our system is
based on simple features in order to achieve high-
speed recognition, particularly in real-time embedded
vision applications.
In comparison with local SVM methods by
Schuldt (Schuldt et al., 2004), our feature vector is
much easier to obtain because we don’t need to find
interest points in each frame. We also don’t need a
validation dataset for parameter tuning.
In comparison with Meng’s (Meng et al., 2006b)
(Meng et al., 2006a) methods, we use a Haar wavelet
transform and histogram methods to build a new fea-
ture vector from the MHI representation. This new
feature vector contains the important information of
the MHI and also has a lower dimension. Experimen-
tal results demonstrate that these techniques made a
significant improvement on the human action recog-
nition performance compared to other methods.
If the learning part of the system is conducted off-
line, this system has great potential for implementa-
tion in small, embedded computing devices, typically
FPGA or DSP based systems, which can be embedded
in the application and give real-time performance.
REFERENCES
Aggarwal, J. K. and Cai, Q. (1999). Human motion
analysis: a review. Comput. Vis. Image Underst.,
73(3):428–440.
Bobick, A. F. and Davis, J. W. (2001). The recognition
of human movement using temporal templates. IEEE
Trans. Pattern Anal. Mach. Intell., 23(3):257–267.
Dalal, N., Triggs, B., and Schmid, C. (2006). Human de-
tection using oriented histograms of flow and appear-
ance. In LNCS, volume 3952. ECCV 2006.
Farnell, B. (1999). Moving bodies, acting selves. Annual
Review of Anthropology, 28:341–373.
Farquhar, J. D. R., Hardoon, D. R., Meng, H., Shawe-
Taylor, J., and Szedmak, S. (2005). Two view learn-
ing: Svm-2k, theory and practice. In NIPS.
Joachims, T. (1999). Making large-scale svm learning prac-
tical. In Advances in Kernel Methods - Support Vector
Learning, USA. MIT-Press. Oikonomopoulos, Anto-
nios and Patras, Ioannis and Pantic, Maja eds.
Ke, Y., Sukthankar, R., and Hebert., M. (2005). Efficient vi-
sual event detection using volumetric features. In Pro-
ceedings of International Conference on Computer Vi-
sion, pages 166–173. Beijing, China, Oct. 15-21,
2005.
Mallat, S. (1989). A theory for multiresolution signal de-
composition: the wavelet representation. IEEE Trans-
actions on Pattern Analysis and Machine Intelligence,
11:674–693.
Meng, H., Pears, N., and Bailey, C. (2006a). Human action
classification using svm
2k classifier on motion fea-
tures. In Lecture Notes in Computer Science (LNCS),
volume 4105, pages 458–465, Istanbul, Turkey. Inter-
national workshop on Multimedia Content Represen-
tation,Classification and Security (MRCS 2006).
Meng, H., Pears, N., and Bailey, C. (2006b). Recogniz-
ing human actions based on motion information and
svm. In 2nd IET International Conference on Intel-
ligent Environments, pages 239–245, Athens, Greece.
IET.
Meng, H., Shawe-Taylor, J., Szedmak, S., and Farquhar, J.
D. R. (2004). Support vector machine to synthesise
kernels. In Deterministic and Statistical Methods in
Machine Learning, pages 242–255.
Ogata, T., Tan, J. K., and Ishikawa, S. (2006). High-speed
human motion recognition based on a motion history
image and an eigenspace. IEICE Transactions on In-
formation and Systems, E89(1):281–289.
Oikonomopoulos, A., Patras, I., and Pantic, M. (2006).
Kernel-based recognition of human actions using spa-
tiotemporal salient points. In Proceedings of IEEE
Int’l Conf. on Computer Vision and Pattern Recogni-
tion 2006, volume 3.
Schuldt, C., Laptev, I., and Caputo, B. (2004). Recogniz-
ing human actions: a local SVM approach. In Proc.
Int. Conf. Pattern Recognition (ICPR’04), Cambridge,
U.K.
Strang, G. and Nguyen, T. (1996). Wavelets and Filter
Banks. Wellesley Cambridge Press.
Weinland, D., Ronfard, R., and Boyer, E. (2005). Motion
history volumes for free viewpoint action recognition.
In IEEE International Workshop on modeling People
and Human Interaction (PHI’05).
Wong, S.-F. and Cipolla, R. (2005). Real-time adaptive
hand motion recognition using a sparse bayesian clas-
sifier. In ICCV-HCI, pages 170–179.