Table 3: POI detection quality for three CTSs (TP, FP, FN
correspond to the number of true positives, false positives,
and false negatives, respectively).
No. TP FP FN Precision Recall F1
1 165 43 52 0.79 0.76 0.78
2 110 17 35 0.87 0.76 0.80
3 106 8 24 0.93 0.82 0.87
5 DISCUSSION
The distributions of the movement pattern types of
D
M
and D
E
are skewed. Hence, F1 is our main met-
ric of interest as it combines precision and recall. We
also use F1 for assessing the POI detection, which
performs reasonably well with a mean F1 score of
approx. 0.82 (D
E
). Our classification model performs
even better with an F1 score of approx. 0.86 (D
E
).
However, there exist several threats to validity.
The number of available CTSs and, as a consequence,
the dataset size are rather low. This also leads to a lim-
ited size of the evaluation dataset D
E
, which might
explain that F1 for D
M
is lower than for D
E
. For
the POI detection, the clustering technique DBSCAN
provided the best results for the given dataset. Other
clustering techniques might be better suited for larger
datasets. Furthermore, in order to minimize the label-
ing effort, the CTS group only had to label the activ-
ities they considered as regular. Not labeling presum-
ably non-regular activities can lead to more errors as
each CTS might not be aware of all her actual regu-
larities.
Moreover, the location data is recorded by a rather
homogeneous group of CTSs that are very simi-
lar in terms of worksite affiliation, working hours,
and age. In contrast, the travel behavior of distinct
user groups differs, e.g. between home-based per-
sons (like homemakers) and persons who travel to
their workplace (Kutter, 1973). The proposed features
may therefore be not as effective for different group
compositions. Furthermore, our work in progress ap-
proach was only validated for timeslots with 1-week
periodicities. We will further investigate the robust-
ness of the features and our approach, especially for
additional timeslot types, in our future work.
6 CONCLUSIONS
Digitization in the automotive industry causes the
change from car manufacturers to mobility service
providers. For example, to propose meaningful MaaS
offerings to interested and consenting individuals,
movement regularities have to be identified. Our pro-
posed approach can model daily travel routines and
predict regularities using the machine learning algo-
rithm XGBoost. We demonstrate that already small
datasets enable acceptable performance for POI de-
tection and future movement prediction (F1 > 0.8).
In our future work, we will investigate the remain-
ing movement pattern types, further temporal period-
icities, and sub-weekday time unit granularity. More-
over, we will explore how the evolution of movement
behavior over time can be incorporated.
REFERENCES
Ashbrook, D. et al. (2002). Learning significant locations
and predicting user movement with GPS. In Proc. 6th
Int’l Symp. on Wearable Computers, pages 101–108.
Bachi, R. (2005). Standard Distance Measures and Related
Models for Spatial Analysis. Papers in Regional Sci-
ence, 10:83–132.
Chen, T. et al. (2016). XGBoost: A Scalable Tree Boosting
System. Proc. of the 22nd ACM SIGKDD Int’l Conf.
on Knowledge Discovery and Data Mining - KDD ’16.
Ester, M. et al. (1996). A Density-Based Algorithm for
Discovering Clusters in Large Spatial Databases with
Noise. In Proc. of the Second Int’l Conf. on Knowl-
edge Discovery and Data Mining, KDD’96, page
226–231. AAAI Press.
Gr
¨
uner, T. (2019). Detecting Complex Spatio-temporal
Movement Patterns Using Historical Location Data.
DHBW Stuttgart, Germany. Bachelor’s Thesis.
Hall, R. (2012). Handbook of Transportation Science. Int’l
Series in Operations Research & Management Sci-
ence. Springer US.
Jittrapirom, P. et al. (2017). Mobility as a Service: A Crit-
ical Review of Definitions, Assessments of Schemes,
and Key Challenges. Urban Planning, 2(2):13–25.
Kitamura, R. et al. (1997). Generation of Synthetic Daily
Activity-Travel Patterns. Transportation Research
Record, 1607(1):154–162.
Kjærgaard, M. B. et al. (2010). Indoor Positioning Using
GPS Revisited. In Pervasive Computing, pages 38–
56.
Kutter, E. (1973). A Model for Individual Travel Behaviour.
Urban Studies, 10(2):235–258.
Liu, F. T. et al. (2008). Isolation Forest. In 2008 Eighth
IEEE Int’l Conf. on Data Mining, pages 413–422.
Merah, A. F. et al. (2013). A Sequential Patterns Data
Mining Approach Towards Vehicular Route Predic-
tion in VANETs. Mobile Networks and Applications,
18:788–802.
Vintan, L. et al. (2004). Person Movement Prediction Using
Neural Networks. In First Workshop on Modeling and
Retrieval of Context, pages 1–12.
Vukovic, M. et al. (2007). Predicting user movement for
advanced location-aware services. In 2007 15th Int’l
Conf. on Software, Telecommunications and Com-
puter Networks, pages 1–5.
VEHITS 2020 - 6th International Conference on Vehicle Technology and Intelligent Transport Systems
128