View-invariant 3D Skeleton-based Human Activity Recognition based on Transformer and Spatio-temporal Features

Ahmed Snoun, Tahani Bouchrika, Olfa Jemai

2022

Abstract

With the emergence of depth sensors, real-time 3D human skeleton estimation have become easier to accomplish. Thus, methods for human activity recognition (HAR) based on 3D skeleton have become increasingly accessible. In this paper, we introduce a new approach for human activity recognition using 3D skeletal data. Our approach generates a set of spatio-temporal and view-invariant features from the skeleton joints. Then, the extracted features are analyzed using a typical Transformer encoder in order to recognize the activity. In fact, Transformers, which are based on self-attention mechanism, have been successful in many domains in the last few years, which makes them suitable for HAR. The proposed approach shows promising performance on different well-known datasets that provide 3D skeleton data, namely, KARD, Florence 3D, UTKinect Action 3D and MSR Action 3D.

Download


Paper Citation


in Harvard Style

Snoun A., Bouchrika T. and Jemai O. (2022). View-invariant 3D Skeleton-based Human Activity Recognition based on Transformer and Spatio-temporal Features. In Proceedings of the 11th International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM, ISBN 978-989-758-549-4, pages 706-715. DOI: 10.5220/0010895300003122


in Bibtex Style

@conference{icpram22,
author={Ahmed Snoun and Tahani Bouchrika and Olfa Jemai},
title={View-invariant 3D Skeleton-based Human Activity Recognition based on Transformer and Spatio-temporal Features},
booktitle={Proceedings of the 11th International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM,},
year={2022},
pages={706-715},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0010895300003122},
isbn={978-989-758-549-4},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 11th International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM,
TI - View-invariant 3D Skeleton-based Human Activity Recognition based on Transformer and Spatio-temporal Features
SN - 978-989-758-549-4
AU - Snoun A.
AU - Bouchrika T.
AU - Jemai O.
PY - 2022
SP - 706
EP - 715
DO - 10.5220/0010895300003122