
2 RELATED WORK
Over recent years, various strategies have been em-
ployed for pattern learning on human movement, uti-
lizing activity and spatial routines on daily or weekly
cycles. In the context of pattern learning, topic mod-
eling emerges as a relevant technique. Huynh et al.
(2008) contributed to the early application of topic
modeling for recognizing daily routines using time-
series activity data. Their approach begins by cluster-
ing features relevant to routines, followed by applying
Latent Dirichlet Allocation (LDA) to uncover hidden
topics within these routines. Similarly, a related study
from Tang (2021) explored the use of LDA to identify
location patterns in user trip behaviors using Global
Positioning System (GPS) coordinates. However, this
study did not consider the temporal factor of patterns.
Steinhauer et al. (2019) adopted a similar approach
in their study of telecommunication networks, inter-
preting run-time variable readings as words in LDA to
obtain representative topics. Furthermore, Seiter et al.
(2014) employed LDA to discover activity routines of
stroke rehabilitation patients, subsequently applying
KNN to map the discovered topics to specific rou-
tines. Sun et al. (2021) used a two-dimensional prob-
ability distribution, a variation of LDA, to discover
routine patterns in individual travel behavior. This
work does not consider the order of occurrences, es-
sential for routine comprehension.
In the context of anomaly detection in human rou-
tines, Sun et al. (2021) also explored the detection
of anomalies by using perplexity as a scoring func-
tion to assess whether new behaviors were predictable
(thus normal) or anomalous. Steinhauer et al. (2019)
also aimed to detect anomalies following topic mod-
eling by establishing a ”normal model” and relying
on an expert’s visual comparison to confirm the pres-
ence and nature of anomalies. In their study on de-
tecting patterns in anomalous clusters, Abraham and
Nair (2018) applied LDA to define the normal topics
within the data, subsequently using a statistical test
to detect anomalies. Similarly, Thornton et al. (2020)
used LDA to identify normal patterns in avionics net-
works and employed a combination of variational in-
ference and statistical analysis to detect anomalies.
Based on studies on routine pattern learning, topic
modeling has shown promising results for in-depth
analysis of human movement routines. This work will
therefore explore its potential in complex time series
analysis, focusing on a critical use case: lone worker
safety. Human routines are complex, and a few days
or weeks may be insufficient to capture them accu-
rately. To the best of our knowledge, no study has
addressed long-term routine learning in human move-
ment using both activity patterns and GPS data and
only a few have focused on anomaly detection within
individual routines, which is the aim of this study.
3 METHODS
The primary aim of this work is to develop a gener-
alized approach to predictive risk assessment based
on deviations from a user’s routine within the time-
series domain. To achieve this, a lone worker use
case was selected, and a general approach for rou-
tine learning, anomaly detection, and model updating
was developed. Figure 1 provides an overview of the
proposed approach. For the routine learning stage, a
topic modeling technique was applied, consisting of a
clustering step, document creation, LDA model train-
ing, and topic activation. For the anomaly detection
stage, a statistical approach was developed. Since the
solution is designed for real-life implementation, rou-
tine evolution was also considered. Therefore, model
updates were explored, using both retraining and In-
cremental Learning (IL) approaches.
3.1 Datasets and Preprocessing
For this study’s dataset, three lone workers volun-
teered to collect data on their daily work routines dur-
ing the first semester of 2024. The typical daily rou-
tine of LW1 includes a commuting period via sub-
way or car and a work period in an office. LW2 com-
mutes by car and has work periods that include being
in an office and walking outdoors within a specific re-
gion. Lastly, LW3 is a manufacturing worker who has
a long commuting period by car and spends part of
the workday walking through facilities.
The recording protocol involved installing a log-
ger Android application on the workers’ smartphones,
which was activated by the user at the beginning and
end of each workday. During the collection period,
the lone workers did not record data every weekday
due to holidays, application issues, battery drain, and
forgetfulness in activating the application. As a re-
sult, this led to a total of 45, 46, and 81 days of usable
data for routine learning for LW1, LW2, and LW3,
respectively. Additionally, the volunteers were asked
to annotate any days on which activities and/or loca-
tions occurred that were not part of their routine. This
request was made to prevent the contamination of rou-
tine learning with anomalous patterns. Therefore, the
test days were selected based on these annotations,
corresponding to a percentage between 23% and 34%
of all available days. Table 1 summarizes the dataset
details for the data collected from each lone worker.
BIOSIGNALS 2025 - 18th International Conference on Bio-inspired Systems and Signal Processing
936