strategy, in which the K-Means algorithm has shown
to be the best. The most challenging part of the clus-
tering phase was to assign meaning (trip style) to the
clusters, which implied further human analysis and
inspection on the clustering results. Thus, the pro-
posed solution was found to be able to identify the
trip styles in a satisfactory way.
As future work, we may use different clustering
algorithms and perform a deeper analysis of the ev-
idence accumulation algorithm. We can use feature
selection techniques instead of feature reduction, to
identify the most relevant original features for the
trip style identification task and the smallest subset
of these features that yield robust classification.
ACKNOWLEDGEMENTS
This work was co-funded by the EU H2020
i-DREAMS project (Project Number: 814761)
funded by European Commission under the MG-
2-1-2018 Research and Innovation Action (RIA),
www.idreamsproject.eu, and by the European Re-
gional Development Fund (FEDER), through Por-
tugal 2020, under the Operational Competitiveness
and Internationalization (COMPETE 2020) and Lis-
boa 2020 programmes (grants no. 069918 “Cardi-
oLeather”).
REFERENCES
Almeida, A., Br
´
as, S., Sargento, S., and Oliveira, I. (2022).
Using bus tracking data to detect potential hazard driv-
ing zones. In Pattern Recognition and Image Analysis,
pages 667–679, Cham. Springer International Pub-
lishing.
Bishop, C. and Nasrabadi, N. (2006). Pattern recognition
and machine learning, volume 4. Springer.
Cali
´
nski, T. and Harabasz, J. (1974). A dendrite method
for cluster analysis. Communications in Statistics,
3(1):1–27.
Davies, D. and Bouldin, D. (1979). A cluster separation
measure. IEEE Transactions on Pattern Analysis and
Machine Intelligence, PAMI-1(2):224–227.
Ester, M., Kriegel, H., Sander, J., and Xu, X. (1996). A
density-based algorithm for discovering clusters in
large spatial databases with noise. In Knowlede Dis-
covery in Databases (KDD), volume 96, pages 226–
231.
et al, I. S. (2011). Towards a large scale European naturalis-
tic driving study: final report of PROLOGUE: deliver-
able D4.2. Technical report, SWOV Institute for Road
Safety Research.
European Comission (2021). Road safety,
https://ec.europa.eu/transport.
Fred, A. and Jain, A. (2002). Data clustering using evidence
accumulation. In International Conference on Pattern
Recognition, volume 4, pages 276–280 vol.4.
Hartigan, J. and Wong, M. (1979). Algorithm as 136:
A k-means clustering algorithm. Journal of the
Royal Statistical Society. series c (applied statistics),
28(1):100–108.
Hutson, D. (2021). How car insurance premiums are
calculated https://www.comparethemarket.com/car-
insurance/content/what-impacts-upon-your-car-
insurance/.
i-DREAMS Team (2021). A smart driver and road environ-
ment assessment and monitoring system.
Jolliffe, I. (2002). Principal component analysis. Springer
Series in Statistics.
Joshi, A., Kale, S., Chandel, S., and Pal, D. (2015). Lik-
ert scale: Explored and explained. British Journal of
Applied Science & Technology, 7(4):396–403.
Mobileye (2021). Autonomous driving & ADAS (Ad-
vanced Driver Assistance Systems).
NationMaster (2014). Motor vehicle ownership per 1000
inhabitants, https://ourworldindata.org/.
Okun, O. and Valentini, G. (2008). Supervised and Un-
supervised ensemble methods and their applications,
volume SCI, 126. Springer.
Peck, R. and Kuan, J. (1983). A statistical model of individ-
ual accident risk prediction using driver record, terri-
tory and other biographical factors. Accident Analysis
& Prevention, 15(5):371–393.
Reynolds, D. (2009). Gaussian mixture models. Encyclo-
pedia of biometrics, 741(659-663).
Rousseeuw, P. (1987). Silhouettes: A graphical aid to
the interpretation and validation of cluster analysis.
Journal of Computational and Applied Mathematics,
20(1):53–65.
Sagberg, F., Selpi, Piccinini, G., and Engstr
¨
om, J. (2015).
A review of research on driving styles and road safety.
Human factors, 57(7):1248–1275.
Sander, J., Ester, M., Kriegel, H., and Xu, X. (1998).
Density-based clustering in spatial databases: The al-
gorithm GDBSCAN and its applications. Data Mining
and Knowledge Discovery, 2(2):169–194.
Shahid, A., Wilkinson, K., Marcu, S., and Shapiro, C.
(2011). Karolinska sleepiness scale (KSS) - stop, that
and one hundred other sleep scales. A. Shahid and K.
Wilkinson and S. Marcu and C. Shapiro (eds), pages
209–210.
Singh, H. and Kathuria, A. (2021). Profiling drivers to as-
sess safe and eco-driving behavior–a systematic re-
view of naturalistic driving studies. Accident Analysis
& Prevention, 161:106349.
Taubman-Ben-Ari, O., Mikulincer, M., and Gillath,
O. (2004). The multidimensional driving style
inventory–scale construct and validation. Accident
Analysis & Prevention, 36(3):323–332.
Witten, I., Frank, E., Hall, M., and Pal, C. (2016). Data min-
ing: practical machine learning tools and techniques.
Morgan Kauffmann, 4th edition.
DATA 2023 - 12th International Conference on Data Science, Technology and Applications
294