Collaborative Activities Understanding from 3D Data

Fabrizio Natola, Valsamis Ntouskos, Fiora Pirri

Abstract

Our work consists in finding a way to recognize activities performed by two people that collaborate in a working environment. Starting from results obtained in the past years by Gong, Medioni and other authors, we go a step forward, trying to construct a learning function that is able to generalize the model provided by the authors cited before. Moreover, we search for a space in which we can map the points corresponding to the poses, over time, of the skeletons of the two subjects, so that no information is lost.

References

  1. Ali, S. and Shah, M. (2010). Human action recognition in videos using kinematic features and multiple instance learning. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 32(2):288-303.
  2. Cuturi, M., Vert, J., Birkenes, O., and Matsui, T. (2007). A kernel for time series based on global alignments. In Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on, volume 2, pages II-413-II-416.
  3. Gong, D. and Medioni, G. (2011). Dynamic manifold warping for view invariant action recognition. In Computer Vision (ICCV), 2011 IEEE International Conference on, pages 571-578.
  4. Gong, D., Medioni, G., and Zhao, X. (2014). Structured time series analysis for human action segmentation and recognition. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 36(7):1414-1427.
  5. Gong, D., Medioni, G., Zhu, S., and Zhao, X. (2012). Kernelized temporal cut for online temporal segmentation and recognition. In Computer Vision-ECCV 2012, pages 229-243. Springer.
  6. Junejo, I., Dexter, E., Laptev, I., and Perez, P. (2011). View-independent action recognition from temporal self-similarities. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 33(1):172-185.
  7. Lawrence, N. D. (2004). Gaussian process latent variable models for visualisation of high dimensional data. Advances in neural information processing systems, 16:329-336.
  8. Lawrence, N. D. and Quinonero-Candela, J. (2006). Local distance preservation in the gp-lvm through back constraints. In Proceedings of the 23rd international conference on Machine learning, pages 513-520. ACM.
  9. Li, R., Chellappa, R., and Zhou, S. K. (2013). Recognizing interactive group activities using temporal interaction matrices and their riemannian statistics. International journal of computer vision, 101(2):305-328.
  10. Li, W., Zhang, Z., and Liu, Z. (2010). Action recognition based on a bag of 3d points. In Computer Vision and Pattern Recognition Workshops (CVPRW), 2010 IEEE Computer Society Conference on, pages 9-14.
  11. Liao, W.-K. and Medioni, G. (2008). 3d face tracking and expression inference from a 2d sequence using manifold learning. In Computer Vision and Pattern Recognition, 2008. CVPR 2008. IEEE Conference on, pages 1-8.
  12. Lv, F. and Nevatia, R. (2006). Recognition and segmentation of 3-d human action using hmm and multi-class adaboost. In Computer Vision-ECCV 2006, pages 359-372. Springer.
  13. Lv, F. and Nevatia, R. (2007). Single view human action recognition using key pose matching and viterbi path searching. In Computer Vision and Pattern Recognition, 2007. CVPR 7807. IEEE Conference on, pages 1- 8.
  14. Mordohai, P. and Medioni, G. (2010). Dimensionality estimation, manifold learning and function approximation using tensor voting. The Journal of Machine Learning Research, 11:411-450.
  15. Ning, H., Xu, W., Gong, Y., and Huang, T. (2008). Latent pose estimator for continuous action recognition. In Computer Vision-ECCV 2008, pages 419- 433. Springer.
  16. Noma, H. S. K.-i. (2002). Dynamic time-alignment kernel in support vector machine. Advances in neural information processing systems, 14:921.
  17. Ntouskos, V., Papadakis, P., Pirri, F., et al. (2013). Discriminative sequence back-constrained gp-lvm for mocap based action recognition. In International Conference on Pattern Recognition Applications and Methods.
  18. Rakotomamonjy, A., Bach, F., Canu, S., Grandvalet, Y., et al. (2008). Simplemkl. Journal of Machine Learning Research, 9:2491-2521.
  19. Roweis, S. T. and Saul, L. K. (2000). Nonlinear dimensionality reduction by locally linear embedding. Science, 290(5500):2323-2326.
  20. Ryoo, M. and Aggarwal, J. (2011). Stochastic representation and recognition of high-level group activities. International journal of computer Vision, 93(2):183- 200.
  21. Tenenbaum, J. B., De Silva, V., and Langford, J. C. (2000). A global geometric framework for nonlinear dimensionality reduction. Science, 290(5500):2319-2323.
  22. Vemulapalli, R., Pillai, J., and Chellappa, R. (2013). Kernel learning for extrinsic classification of manifold features. In Computer Vision and Pattern Recognition (CVPR), 2013 IEEE Conference on, pages 1782- 1789.
  23. Wang, J., Fleet, D., and Hertzmann, A. (2008). Gaussian process dynamical models for human motion. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 30(2):283-298.
  24. Weinland, D., O zuysal, M., and Fua, P. (2010). Making action recognition robust to occlusions and viewpoint changes. In Computer Vision-ECCV 2010, pages 635-648. Springer.
  25. Zhang, X. and Fan, G. (2011). Joint gait-pose manifold for video-based human motion estimation. In Computer Vision and Pattern Recognition Workshops (CVPRW), 2011 IEEE Computer Society Conference on, pages 47-54.
Download


Paper Citation


in Harvard Style

Natola F., Ntouskos V. and Pirri F. (2015). Collaborative Activities Understanding from 3D Data . In Doctoral Consortium - DCPRAM, (ICPRAM 2015) ISBN , pages 19-23


in Bibtex Style

@conference{dcpram15,
author={Fabrizio Natola and Valsamis Ntouskos and Fiora Pirri},
title={Collaborative Activities Understanding from 3D Data},
booktitle={Doctoral Consortium - DCPRAM, (ICPRAM 2015)},
year={2015},
pages={19-23},
publisher={SciTePress},
organization={INSTICC},
doi={},
isbn={},
}


in EndNote Style

TY - CONF
JO - Doctoral Consortium - DCPRAM, (ICPRAM 2015)
TI - Collaborative Activities Understanding from 3D Data
SN -
AU - Natola F.
AU - Ntouskos V.
AU - Pirri F.
PY - 2015
SP - 19
EP - 23
DO -