Figure 5: Comparison of LDS and H-LDS descriptor
performance on our dataset.
Figure 6: Comparison of LDS and H-LDS descriptor
performance on MSRC-12 dataset.
As seen in Figures 5 and 6 the histogram of
LDSs offer an improvement in classification results
compared to using a single descriptor for the whole
motion. Additionally, the h-LDS descriptor clearly
outperforms the simple LDS descriptor in each case.
This extends to the case of histogram of LDSs,
where the same behavior can be observed.
4 CONCLUSIONS
In this paper, we introduced a higher order linear
dynamical systems (h-LDS) descriptor for extracting
dynamics from multidimensional time evolving data.
By applying higher order decomposition in the
observation data, we showed that we can achieve
higher detection rates than standard linear dynamical
systems both in the case of dynamic texture analysis
and human action recognition. In the future, we are
planning to use data from different sources, e.g.,
multispectral imaging in the case of flame detection
or skeletal data and depth data in the case of human
action recognition.
ACKNOWLEDGEMENTS
The research leading to these results has received
funding from the European Community's Seventh
Framework Programme (FP7-ICT-2011-9) under
grant agreement no FP7-ICT-600676 ''i-Treasures:
Intangible Treasures - Capturing the Intangible
Cultural Heritage and Learning the Rare Know-How
of Living Human Treasures''.
REFERENCES
Avgerinakis, K., Briassouli, A., Kompatsiaris, I., 2012.
"Smoke Detection Using Temporal HOGHOF
Descriptors and Energy Colour Statistics from Video,"
in Int'l Workshop on Multi-Sensor Systems and
Networks for Fire Detection and Management.
Barmpoutis, P., Dimitropoulos, K., Grammalidis, N.,
2014. "Smoke Detection Using Spatio-Temporal
Analysis, Motion Modeling and Dynamic Texture
Recognition", 22nd European Signal Processing
Conference (EUSIPCO 2014), Lisbon, Portugal, 1-5
September.
Boots, B., 2009. Learning stable linear dynamical systems.
M.S. Thesis in Machine Learning, Carnegie Mellon
University.
Chan, A., Vasconcelos, N., 2005. "Probabilistic Kernels
for the Classification of Auto-Regressive Visual
Processes," in IEEE Conf. Computer Vision and
Pattern Recognition.
Chan, A., Vasconcelos, N., 2007. "Classifying Video with
Kernel Dynamic Textures," in IEEE Conf. Computer
Vision and Pattern Recognition.
Cock, K. D., Moor, B. D., 2002. "Subspace angles and
distances between ARMA models," System and
Control Letters, vol. 4, pp. 265-270.
Dimitropoulos, K., Tsalakanidou, F., Grammalidis, N.,
2012. "Flame detection for video-based early fire
warning systems and 3D visualization of fire
propagation," in 13th IASTED Int'l Conf. on Computer
Graphics and Imaging.
Dimitropoulos, K., Barboutis, P., Grammalidis, N., 2015.
"Spatio-Temporal Flame Modeling and Dynamic
Texture Analysis for Automatic Video-Based Fire
Detection", IEEE Transactions on Circuits and
Systems for Video Technology, vol. 25, no. 2, pp.
339-351.
Doretto, G., Chiuso, A., Wu, Y. N., Soatto, S., 2003.
"Dynamic Textures," Int'l J. of Computer Vision, vol.
51, no. 2, pp. 91-109.
Fothergill, S., Mentis, H. M., Kohli, P., Nowozin, S.,
2012. Instructing people for training gestural
interactive systems. In J. A. Konstan, E. H. Chi, and
K. Hook, editors, CHI, pages 1737–1746. ACM.
Kaufman, L., Rousseeuw, P.J., 1987. Clustering by means
of Medoids. In Statistical Data Analysis Based on the
L1–Norm and Related Methods, edited by Y. Dodge,