Spatio-temporal Video Autoencoder for Human Action Recognition

Anderson Santos, Helio Pedrini

Abstract

The demand for automatic systems for action recognition has increased significantly due to the development of surveillance cameras with high sampling rates, low cost, small size and high resolution. These systems can effectively support human operators to detect events of interest in video sequences, reducing failures and improving recognition results. In this work, we develop and analyze a method to learn two-dimensional (2D) representations from videos through an autoencoder framework. A multi-stream network is used to incorporate spatial and temporal information for action recognition purposes. Experiments conducted on the challenging UCF101 and HMDB51 data sets indicate that our representation is capable of achieving competitive accuracy rates compared to the literature approaches.

Download


Paper Citation


in Harvard Style

Santos A. and Pedrini H. (2019). Spatio-temporal Video Autoencoder for Human Action Recognition.In Proceedings of the 14th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 5: VISAPP, ISBN 978-989-758-354-4, pages 114-123. DOI: 10.5220/0007409401140123


in Bibtex Style

@conference{visapp19,
author={Anderson Santos and Helio Pedrini},
title={Spatio-temporal Video Autoencoder for Human Action Recognition},
booktitle={Proceedings of the 14th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 5: VISAPP,},
year={2019},
pages={114-123},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0007409401140123},
isbn={978-989-758-354-4},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 14th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 5: VISAPP,
TI - Spatio-temporal Video Autoencoder for Human Action Recognition
SN - 978-989-758-354-4
AU - Santos A.
AU - Pedrini H.
PY - 2019
SP - 114
EP - 123
DO - 10.5220/0007409401140123