tivity count, contains a wider variety of activities,
while the Daily Sports dataset has much less vari-
ety in its constituent activities. As a result, it can
also be observed from the tables that the Gomaa-1
dataset comes after the HAD-AW and REALDISP
datasets in transfer learning performance. The HAPT
dataset contains the fewest activities with a low va-
riety/heterogeneity. As such, it shows the highest
degradation in the resulting transfer learning perfor-
mance of all the considered datasets, as can be ob-
served from Tables 6 and 7. This is in consonance
with the behavior observed and discussed in Section
6 previously.
We believe that the results obtained from these ex-
periments further highlight the benefit of training such
transfer learning models on activity-diverse datasets.
7.2 Comparison to Similar Transfer
Learning Schemes
In order to clearly illustrate the benefit of VersaTL,
we also compare it to (Chikhaoui et al., 2018), which
is the most similar method to it. Due to constraints
on space and challenges with one of the considered
datasets, we consider six evaluations from their work
which are concerned with transfer-learning perfor-
mance between devices placed at three different lo-
cations on the body. We segment the two datasets
concerned (MobiAct (Chatzaki et al., 2016) and Re-
alWorld HAR (Sztyler and Stuckenschmidt, 2016)) as
described in (Chikhaoui et al., 2018) and derive the
average F-metric obtained from the scenarios they in-
vestigated. The results of these experiments are re-
ported on a normalized scale and shown in Table 8.
The baseline results (i.e from (Chikhaoui et al.,
2018)) are shown in the ”Baseline F-Metric” col-
umn. It can be seen that the results obtained from
our method (displayed in the ”Obtained F-Metric”
column) shows an improvement of up to 40% in the
F-metric. This illustrates the superiority of VersaTL
relative to the scheme proposed in (Chikhaoui et al.,
2018).
8 CONCLUSION
In this work we propose VersaTL, a transfer learning
scheme for activity recognition data using convolu-
tional neural networks (CNNs). We design a small
CNN and include a spatial pyramid pooling layer to
allow for inputs of any type (i.e fixed or varying-
length). We then train the network (initially for clas-
sification) using standard methods. Subsequently, we
use the convolutional filters and the pooling layer as a
feature extractor. The feature extractor is then reused
for other datasets (different than the one used to train
the network initially) and dataset-specific feedfor-
ward neural networks are then trained to classify the
generated feature vectors.
The results obtained from our evaluations indicate
that VersaTL yields comparable performance (within
5%) to CNN-based classifiers trained from scratch,
while requiring a fraction of the training time. This
highlights the utility of our transfer learning tech-
nique in this domain. We believe that this could pave
the way for the widespread use of pretrained models
in activity recognition, similar to the way pretrained
models are used in computer vision, e.g., AlexNet,
VGGNet, etc.
In the future we intend to investigate the general-
izability of VersaTL to other types of timeseries data
which share the same number of axes/channels. The
restriction on the number of axes/channels is required
due to the convolutional filter-based frontend of our
method. Going further we may also investigate spe-
cific training of the feature extraction portion of the
network with a view to optimizing the learnt features
for discriminability.
REFERENCES
Altun, K., Barshan, B., and Tunc¸el, O. (2010). Comparative
study on classifying human activities with miniature
inertial and magnetic sensors. Pattern Recognition,
43(10):3605–3620.
Bai, S., Kolter, J. Z., and Koltun, V. (2018). An em-
pirical evaluation of generic convolutional and recur-
rent networks for sequence modeling. arXiv preprint
arXiv:1803.01271.
Ba
˜
nos, O., Damas, M., Pomares, H., Rojas, I., T
´
oth, M. A.,
and Amft, O. (2012). A benchmark dataset to eval-
uate sensor displacement in activity recognition. In
Proceedings of the 2012 ACM Conference on Ubiqui-
tous Computing, pages 1026–1035. ACM.
Banos, O., Toth, M. A., Damas, M., Pomares, H., and Ro-
jas, I. (2014). Dealing with the effects of sensor dis-
placement in wearable activity recognition. Sensors,
14(6):9995–10023.
Bengio, Y. (2012). Deep learning of representations for
unsupervised and transfer learning. In Proceedings
of ICML Workshop on Unsupervised and Transfer
Learning, pages 17–36.
Chatzaki, C., Pediaditis, M., Vavoulas, G., and Tsiknakis,
M. (2016). Human daily activity and fall recognition
using a smartphone’s acceleration sensor. In Inter-
national Conference on Information and Communica-
tion Technologies for Ageing Well and e-Health, pages
100–118. Springer.
Chikhaoui, B., Gouineau, F., and Sotir, M. (2018). A
cnn based transfer learning model for automatic ac-
tivity recognition from accelerometer sensors. In
VersaTL: Versatile Transfer Learning for IMU-based Activity Recognition using Convolutional Neural Networks
515