Towards a 3D Pipeline for Monitoring and Tracking People in an Indoor Scenario using Multiple RGBD Sensors

Konstantinos Amplianitis, Michele Adduci, Ralf Reulke

2015

Abstract

Human monitoring and tracking has been a prominent research area for many scientists around the globe. Several algorithms have been introduced and improved over the years, eliminating false positives and enhancing monitoring quality. While the majority of approaches are restricted to the 2D and 2.5D domain, 3D still remains an unexplored field. Microsoft Kinect is a low cost commodity sensor extensively used by the industry and research community for several indoor applications. Within this framework, an accurate and fast-to-implement pipeline is introduced working in two main directions: pure 3D foreground extraction of moving people in the scene and interpretation of the human movement using an ellipsoid as a mathematical reference model. The proposed work is part of an industrial transportation research project whose aim is to monitor the behavior of people and make a distinction between normal and abnormal behaviors in public train wagons. Ground truth was generated by the OpenNI human skeleton tracker and used for evaluating the performance of the proposed method.

References

  1. Baum, M. and Hanebeck, U. D. (2013). Extended object tracking with random hypersurface models. CoRR, abs/1304.5084.
  2. Brown, D. C. (1971). Close-range camera calibration. Photogrammetric Engineering, 37(8):855-866.
  3. Buys, K., Cagniart, C., Baksheev, A., De Laet, T., De Schutter, J., and Pantofaru, C. (2014). An adaptable system for rgb-d based human body detection and pose estimation. J. Vis. Comun. Image Represent., 25(1):39- 52.
  4. Faion, F., Baum, M., and Hanebeck, U. D. (2012). Tracking 3D Shapes in Noisy Point Clouds with Random Hypersurface Models. In Proceedings of the 15th International Conference on Information Fusion (Fusion 2012), Singapore.
  5. Hegger, F., Hochgeschwender, N., Kraetzschmar, G., and Ploeger, P. (2013). People Detection in 3d Point Clouds Using Local Surface Normals, volume 7500 of Lecture Notes in Computer Science, book section 15, pages 154-165. Springer Berlin Heidelberg.
  6. Kammerl, J., Blodow, N., Rusu, R. B., Gedikli, S., Beetz, M., and Steinbach, E. (2012). Real-time compression of point cloud streams. In IEEE International Conference on Robotics and Automation (ICRA), Minnesota, USA.
  7. Lepetit, V., F.Moreno-Noguer, and P.Fua (2009). Epnp: An accurate o(n) solution to the pnp problem. International Journal Computer Vision, 81(2).
  8. Moshtagh, N. (2005). Minimum volume enclosing ellipsoid.
  9. Munaro, M., Basso, F., and Menegatti, E. (2012). Tracking people within groups with rgb-d data. In IROS, pages 2101-2107. IEEE.
  10. Shotton, J., Girshick, R., Fitzgibbon, A., Sharp, T., Cook, M., Finocchio, M., Moore, R., Kohli, P., Criminisi, A., Kipman, A., and Blake, A. (2013). Efficient human pose estimation from single depth images. IEEE Transactions on Pattern Analysis and Machine Intelligence, 35(12):2821-2840.
  11. Sigalas, M., Pateraki, M., Oikonomidis, I., and Trahanias, P. (2013). Robust model-based 3d torso pose estimation in rgb-d sequences. In The IEEE International Conference on Computer Vision (ICCV) Workshops.
  12. Todd, M. J. and Yildirim, E. A. (2007). On khachiyan's algorithm for the computation of minimumvolume enclosing ellipsoids. Discrete Appl. Math., 155(13):1731-1744.
  13. Ziegler, J., Nickel, K., and Stiefelhagen, R. (2006). Tracking of the articulated upper body on multi-view stereo image sequences. In CVPR (1), pages 774-781. IEEE Computer Society.
Download


Paper Citation


in Harvard Style

Amplianitis K., Adduci M. and Reulke R. (2015). Towards a 3D Pipeline for Monitoring and Tracking People in an Indoor Scenario using Multiple RGBD Sensors . In Proceedings of the 10th International Conference on Computer Vision Theory and Applications - Volume 3: VISAPP, (VISIGRAPP 2015) ISBN 978-989-758-091-8, pages 192-200. DOI: 10.5220/0005356601920200


in Bibtex Style

@conference{visapp15,
author={Konstantinos Amplianitis and Michele Adduci and Ralf Reulke},
title={Towards a 3D Pipeline for Monitoring and Tracking People in an Indoor Scenario using Multiple RGBD Sensors},
booktitle={Proceedings of the 10th International Conference on Computer Vision Theory and Applications - Volume 3: VISAPP, (VISIGRAPP 2015)},
year={2015},
pages={192-200},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005356601920200},
isbn={978-989-758-091-8},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 10th International Conference on Computer Vision Theory and Applications - Volume 3: VISAPP, (VISIGRAPP 2015)
TI - Towards a 3D Pipeline for Monitoring and Tracking People in an Indoor Scenario using Multiple RGBD Sensors
SN - 978-989-758-091-8
AU - Amplianitis K.
AU - Adduci M.
AU - Reulke R.
PY - 2015
SP - 192
EP - 200
DO - 10.5220/0005356601920200