REFERENCES
Barnachon, M., Bouakaz, S., Boufama, B., and Guillou, E.
(2013). A real-time system for motion retrieval and
interpretation. Pattern Recognition Letters.
Bengio, Y., Lamblin, P., Popovici, D., and Larochelle, H.
(2007). Greedy layer-wise training of deep networks.
In Sch¨olkopf, B., Platt, J., and Hoffman, T., editors,
Advances in Neural Information Processing Systems
19, pages 153–160. MIT Press, Cambridge, MA.
Bengio, Y. and LeCun, Y. (2007). Scaling learning algo-
rithms towards AI. In Bottou, L., Chapelle, O., De-
Coste, D., and Weston, J., editors, Large Scale Kernel
Machines. MIT Press.
Chen, X. and Koskela, M. (2013). Classification of rgb-d
and motion capture sequences using extreme learning
machine. In Proceedings of 18th Scandinavian Con-
ference on Image Analysis.
Chung, H. and Yang, H.-D. (2013). Conditional random
field-based gesture recognition with depth informa-
tion. Optical Engineering, 52(1):017201–017201.
Haykin, S. (2009). Neural Networks and Learning Ma-
chines. Pearson Education, 3rd edition.
Hinton, G. and Salakhutdinov, R. (2006). Reducing the di-
mensionality of data with neural networks. Science,
313(5786):504–507.
Huang, G.-B., Zhu, Q.-Y., and Siew, C.-K. (2006). Extreme
learning machine: Theory and applications. Neuro-
computing, 70(1-3):489–501.
Larochelle, H. and Bengio, Y. (2008). Classification us-
ing discriminative restricted Boltzmann machines. In
Proceedings of the 25th international conference on
Machine learning (ICML 2008), pages 536–543, New
York, NY, USA. ACM.
M¨uller, M. and Roder, T. (2006). Motion templates for
automatic classification and retrieval of motion cap-
ture data. In Proceedings of the Eurographics/ACM
SIGGRAPH symposium on Computer animation, vol-
ume 2, pages 137–146, Vienna, Austria.
M¨uller, M., R¨oder, T., Clausen, M., Eberhardt, B., Kr¨uger,
B., and Weber, A. (2007). Documentation mocap
database HDM05. Technical Report CG-2007-2,
U. Bonn.
Ofli, F., Chaudhry, R., Kurillo, G., Vidal, R., and Bajcsy, R.
(2013). Berkeley mhad: A comprehensive multimodal
human action database. In Applications of Computer
Vision (WACV), 2013 IEEE Workshop on, pages 53–
60.
Raptis, M., Kirovski, D., and Hoppe, H. (2011). Real-
time classification of dance gestures from skeleton
animation. In Proceedings of the 2011 ACM SIG-
GRAPH/Eurographics Symposium on Computer An-
imation, pages 147–156. ACM.
Raptis, M., Wnuk, K., Soatto, S., et al. (2008).
Flexible dictionaries for action classification. In
Proc. MLVMA’08.
Rumelhart, D. E., Hinton, G., and Williams, R. J. (1986).
Learning representations by back-propagating errors.
Nature, 323(Oct):533–536.
Shotton, J., Fitzgibbon, A., Cook, M., Sharp, T., Finocchio,
M., Moore, R., Kipman, A., and Blake, A. (2011).
Real-time human pose recognition in parts from single
depth images. In Proc. Computer Vision and Pattern
Recognition.
Vieira, A., Lewiner, T., Schwartz, W., and Campos, M.
(2012). Distance matrices as invariant features for
classifying MoCap data. In 21st International Confer-
ence on Pattern Recognition (ICPR), Tsukuba, Japan.
Wang, J., Liu, Z., Wu, Y., and Yuan, J. (2012). Mining
actionlet ensemble for action recognition with depth
cameras. In Computer Vision and Pattern Recogni-
tion (CVPR), 2012 IEEE Conference on, pages 1290–
1297. IEEE.
Zeiler, M. D. (2012). ADADELTA: An adaptive learning
rate method. arXiv:
1212.5701 [cs.LG]
.
Zhao, X., Li, X., Pang, C., and Wang, S. (2013). Human
action recognition based on semi-supervised discrimi-
nant analysis with global constraint. Neurocomputing,
105(0):45 – 50.
APPENDIX: 65 Actions in HDM05
Dataset
1. cartwheelLHandStart1Reps
cartwheelLHandStart2Reps
cartwheelRHandStart1Reps
2. clap1Reps
clap5Reps
3. clapAboveHead1Reps
clapAboveHead5Reps
4. depositFloorR
5. depositHighR
6. depositLowR
7. depositMiddleR
8. elbowToKnee1RepsLelbowStart
elbowToKnee1RepsRelbowStart
elbowToKnee3RepsLelbowStart
elbowToKnee3RepsRelbowStart
9. grabFloorR
10. grabHighR
11. grabLowR
12. grabMiddleR
13. hitRHandHead
14. hopBothLegs1hops
hopBothLegs2hops
hopBothLegs3hops
15. hopLLeg1hops
hopLLeg2hops
hopLLeg3hops
16. hopRLeg1hops
hopRLeg2hops
hopRLeg3hops
17. jogLeftCircle4StepsRstart
jogLeftCircle6StepsRstart
ClassifyingandVisualizingMotionCaptureSequencesusingDeepNeuralNetworks
129