4 CONCLUSIONS
In this paper, we propose a computational system for
a passive detection of a 5-star human skeleton based
on raw video. The system includes two main mod-
ules, segmentation and star skeleton detection, and
it was adjusted and evaluated using a semi-restricted
realistic environment. The realistic videos are re-
lated with interior and exterior human walking ar-
eas with varying illumination, clutter, and other un-
controlled conditions (e.g., weather). Several com-
puter vision methods were explored for the segmenta-
tion and star skeleton modules. The best results were
achieved using simpler approaches: background sub-
traction and shadow and highlight removal using HSV
color space; smoothed Euclidean distance to centroid
and zero-crossing of distance differences to detect the
human extremes. In future work, we intend to use
motion and memory to estimate the position of hu-
man parts (e.g., hands) that might be temporarily hid-
den. Also, we plan to create motion skeleton features
(e.g., velocity, acceleration) in order to train a ma-
chine learning classifier such that it can learn to detect
human actions (e.g., walking, making a cellular call).
ACKNOWLEDGEMENTS
This work has been supported by COMPETE:
POCI-01-0145-FEDER-007043 and FCT - Fundac¸
˜
ao
para a Ci
ˆ
encia e Tecnologia within the Project
Scope: UID/CEC/00319/2013 and research grant
FCT SFRH/BD/84939/2012.
REFERENCES
Afsar, P., Cortez, P., and Santos, H. (2015a). Automatic
human action recognition from video using hidden
markov model. In Computational Science and Engi-
neering (CSE), 2015 IEEE 18th International Confer-
ence on, pages 105–109. IEEE.
Afsar, P., Cortez, P., and Santos, H. (2015b). Automatic
visual detection of human behavior: A review from
2000 to 2014. Expert Systems with Applications,
42(20):6935–6956.
Chen, H.-S., Chen, H.-T., Chen, Y.-W., and Lee, S.-Y.
(2006). Human action recognition using star skele-
ton. In Proceedings of the 4th ACM international
workshop on Video surveillance and sensor networks,
pages 171–178. ACM.
Cortez, P., Matos, L. M., Pereira, P. J., Santos, N., and
Duque, D. (2016). Forecasting store foot traffic us-
ing facial recognition, time series and support vector
machines. In Soft Computing Models in Industrial and
Environmental Applications (SOCO), Advances in In-
telligent and Soft Computing (AISC) Vol. 527, pages
267–276, San Sebastian, Spain. Springer.
Duque, D., Santos, H., and Cortez, P. (2005). Moving object
detection unaffected by cast shadows, highlights and
ghosts. In IEEE International Conference on Image
Processing 2005, volume 3, pages III–413. IEEE.
Enzweiler, M. and Gavrila, D. M. (2009). Monocular pedes-
trian detection: Survey and experiments. IEEE trans-
actions on pattern analysis and machine intelligence,
31(12):2179–2195.
Fujiyoshi, H., Lipton, A. J., and Kanade, T. (2004). Real-
time human motion analysis by image skeletonization.
IEICE TRANSACTIONS on Information and Systems,
87(1):113–120.
Geronimo, D., Lopez, A. M., Sappa, A. D., and Graf, T.
(2010). Survey of pedestrian detection for advanced
driver assistance systems. IEEE transactions on pat-
tern analysis and machine intelligence, 32(7):1239–
1258.
Kaewtrakulpong, P. and Bowden, R. (2002). An im-
proved adaptive background mixture model for real-
time tracking with shadow detection. In Video-based
surveillance systems, pages 135–144. Springer.
Orrite-Urunuela, C., del Rincon, J. M., Herrero-Jaraba,
J. E., and Rogez, G. (2004). 2d silhouette and 3d
skeletal models for human detection and tracking. In
Pattern Recognition, 2004. ICPR 2004. Proceedings
of the 17th International Conference on, volume 4,
pages 244–247. IEEE.
Otsu, N. (1975). A threshold selection method from gray-
level histograms. Automatica, 11(285-296):23–27.
Poppe, R. (2007). Vision-based human motion analysis: An
overview. Computer vision and image understanding,
108(1):4–18.
Turaga, P., Chellappa, R., Subrahmanian, V. S., and Udrea,
O. (2008). Machine recognition of human activities:
A survey. IEEE Transactions on Circuits and Systems
for Video Technology, 18(11):1473–1488.
Vemulapalli, R., Arrate, F., and Chellappa, R. (2016). R3dg
features: Relative 3d geometry-based skeletal repre-
sentations for human action recognition. Computer
Vision and Image Understanding.
Yang, X. and Tian, Y. (2014). Effective 3d action recogni-
tion using eigenjoints. Journal of Visual Communica-
tion and Image Representation, 25(1):2–11.
Human Skeleton Detection from Semi-constrained Environment Video
389