tion demonstrates that combining datasets with this
method is possible, which creates a foundation to find
commonalities and suited features for activities across
different settings.
7 CONCLUSIONS
This paper introduces a novel way of classifying hu-
man activities based on unique combinations of in-
terpretable high-level features for each activity. The
extracted features are interpretable and easily exten-
sible, and allow comparison and combination of dif-
ferent datasets in the high-level feature space. The
three approaches using HMMs, Random Forests, and
custom high-level feature functions for feature ex-
traction were proposed and evaluated, of which the
Hidden Markov Model performed best across the two
datasets CSL-SHARE and UniMiB SHAR. Classifi-
cation experiments investigated how well the extrac-
tion combined with a Random Forest for final classi-
fication could perform. In a person-independent 10-
fold cross-validation, they performed well, at 85% on
the CSL-SHARE compared to 93.6% state-of-the-art
and 64% compared to 77.0% state-of-the-art on the
UniMiB SHAR dataset (Hartmann et al., 2021). Fur-
thermore, few-shot learning experiments were con-
ducted, where one-shot learning did not succeed, but
low extraction error rates and increasing f1 scores in
few-shot learning are encouraging. Additionally, an
experiment combining the two datasets showed the
potential and promise of developing human activity
recognition systems across data sources. Remarkable
is that the errors in both classification experiments and
few-shot learning experiments are attributable, and
the next steps for increased performance are clear:
deepened development and choice of features along
with their extraction methods.
Furthermore, the next two major steps are clear:
further investigate and develop high-level features and
extract these as sequences rather than as vectors to
enable online recognition. High-level features will
be developed borrowing from previous HAR work,
sports knowledge, and even utilizing findings and
criteria from dance from decades of previous work.
The main challenges for sequence extraction are cre-
ating the ground truth and addressing varying sam-
pling rates across datasets. The ground truth creation
likely requires manual data annotation and might only
be possible for certain high-level features and if the
dataset provides video examples. A sliding window
approach could address the sequence problem when
extracting features for a single dataset, but it does not
scale easily across datasets due to different sampling
rates. The slow nature of high-level features might en-
able re-sampling and should be investigated in future
work. These and other topics, including estimating
a performance ceiling with Neural Networks and ex-
tending to further datasets and modalities, are future
work for these high-level features.
REFERENCES
Amma, C., Gehrig, D., and Schultz, T. (2010). Airwrit-
ing recognition using wearable motion sensors. In
First Augmented Human International Conference,
page 10. ACM.
Arifoglu, D. and Bouchachia, A. (2017). Activity recog-
nition and abnormal behaviour detection with recur-
rent neural networks. Procedia Computer Science,
110:86–93.
Bakis, R. (1976). Continuous speech recognition via cen-
tisecond acoustic states. The Journal of the Acoustical
Society of America, 59(S1):S97–S97.
Chen, C., Liaw, A., and Breiman, L. (2004). Using Random
Forest to Learn Imbalanced Data. Technical report.
Dickinson, S. J., Leonardis, A., Schiele, B., and Tarr, M. J.
(2009). Object categorization: computer and human
vision perspectives. Cambridge University Press.
Guest, A. H. (1977). Labanotation: Or, Kinetography La-
ban : the System of Analyzing and Recording Move-
ment. Number 27. Taylor & Francis.
Hartmann, Y., Liu, H., and Schultz, T. (2021). Feature
space reduction for human activity recognition based
on multi-channel biosignals. In Proceedings of the
14th International Joint Conference on Biomedical
Engineering Systems and Technologies, pages 215–
222. INSTICC, SciTePress.
Liu, H., Hartmann, Y., and Schultz, T. (2021a). CSL-
SHARE: A multimodal wearable sensor-based human
activity dataset. Frontiers in Computer Science.
Liu, H., Hartmann, Y., and Schultz, T. (2021b). Motion
Units: Generalized sequence modeling of human ac-
tivities for sensor-based activity recognition. In EU-
SIPCO 2021 - 29th European Signal Processing Con-
ference. IEEE.
Meinel, K. and Schnabel, G. (1987). Bewegungslehre -
Sportmotorik: Abriß einer Theorie der sportlichen
Motorik unter p
¨
adagogischem Aspekt. Meyer &
Meyer Verlag, Aachen, 12., erg
¨
anzte auflage edition.
Micucci, D., Mobilio, M., and Napoletano, P. (2017).
UniMiB SHAR: A dataset for human activity recog-
nition using acceleration data from smartphones. Ap-
plied Sciences, 7(10):1101.
Oniga, S. and S
¨
ut
˝
o, J. (2014). Human activity recognition
using neural networks. In Proceedings of the 15th
International Carpathian Control Conference, pages
403–406. IEEE.
Ord
´
o
˜
nez, F. J. and Roggen, D. (2016). Deep convolutional
and LSTM recurrent neural networks for multimodal
wearable activity recognition. Sensors, 16(1):115.
BIOSIGNALS 2022 - 15th International Conference on Bio-inspired Systems and Signal Processing
48