4 CONCLUSIONS
Crash prediction offers a proactive solution to
increase road safety and save lives. For this reason, it
has been a long-standing interest of researchers,
industry, and policymakers. However, crash
prediction remains complex and requires high
resolution and large data sets to develop powerful
models that effectively predict accidents. According
to the scientific literature, different approaches have
been used to address this topic. Although, as far as
the authors know, few works have been focused on
investigating crash prediction based on driver inputs
and ensemble methods.
In the present work, the performance of ensemble
methods machine learning algorithms has been
assessed in a classification case. in fact, the developed
models in this research consist of predicting the
occurrence of a crash from the simulator-based
dataset of some driver inputs as features.
Results show that the Random Forest outperforms
the gradient boosting machine and extreme gradient
boosting for the default parameters. But after the
optimization of hyperparameters, it was noticed that
even if the algorithms are not the same, their
performances are almost equal. Therefore, tuning
hyperparameters impacts the machine learning
developed model and offers the highest improvement
and benefit in accident prediction accuracy.
One of the limitations of this paper is that the
comparison is made only between ensemble
methods-based algorithms. Thus, future work should
include more algorithms, as well as integrating other
variables such as the vehicle kinematics and the
surrounding environment. In addition, the necessity
for having a large, and comprehensive training data
sets presents a clear challenge that should be
mitigated to ensure that machine learning
applications remain accurate.
ACKNOWLEDGEMENTS
The Moroccan Ministry of Equipment, Transport,
and Logistics; and the Moroccan National Centre for
Scientific and Technical Research (CNRST) are
gratefully acknowledged for their valuable support of
this research.
REFERENCES
Abdelwahab HT, Abdel-Aty MA. Development of
artificial neural network models to predict driver injury
severity in traffic accidents at signalized intersections.
Transp Res Rec 2001:6–13.
https://doi.org/10.3141/1746-02.
Aljanahi AAM, Rhodes AH, Metcalfe A V. Speed, speed
limits and road traffic accidents under free flow
conditions 1999;31:161–8.
Ameksa M, Mousannif H, Al Moatassime H, Elamrani
Abou Elassad Z. Toward Flexible Data Collection of
Driving Behaviour. ISPRS - Int Arch Photogramm
Remote Sens Spat Inf Sci 2020; XLIV-4/W3-2020:33–
43. https://doi.org/10.5194/isprs-archives-xliv-4-w3-
2020-33-2020.
Bari D, Ameksa M, Ouagabi A. A comparison of
datamining tools for geo-spatial estimation of visibility
from AROME-Morocco model outputs in regression
framework. Proc - 2020 IEEE Int Conf Moroccan
Geomatics, MORGEO 2020 2020.
https://doi.org/10.1109/Morgeo49228.2020.9121909.
BREIMAN L. Random forests. Random For 2001;45:5–
32. https://doi.org/10.1201/9780429469275-8.
Chen T, Guestrin C. XGBoost: A scalable tree boosting
system. Proc. ACM SIGKDD Int. Conf. Knowl.
Discov. Data Min., vol. 13-17- Augu, 2016, p. 785–94.
https://doi.org/10.1145/2939672.2939785.
Friedman JH. Greedy function approximation: A gradient
boosting machine. Ann Stat 2001;29:1189–232.
https://doi.org/10.1214/aos/1013203451.
Lagarde E. Road traffic injuries. Encycl Environ Heal
2020:572–80. https://doi.org/10.1016/B978-0-444-
63951-6.00623-9.
Osman OA, Hajij M, Bakhit PR, Ishak S. Prediction of
Near-Crashes from Observed Vehicle Kinematics
using Machine Learning. Transp Res Rec 2019.
https://doi.org/10.1177/0361198119862629.
scikit-learn. classification report n.d. https://scikit-
learn.org/stable/modules/generated/sklearn.metrics.cla
ssification_report.html.
Silva PB, Andrade M, Ferreira S. Machine learning applied
to road safety modeling: A systematic literature review.
J Traffic Transp Eng (English Ed 2020;7:775–90.
https://doi.org/10.1016/j.jtte.2020.07.004.
Treat JR, Tumba NS, McDonald ST, Shinar D, Hume RD,
Mayer RE, et al. Tri-Level Study of the Causes of
Traffic Accidents: An overview of final results. Proc
Am Assoc Automot Med Annu Conf 1979;21:391–
403.
Zouhair EAE, Mousannif H, Al Moatassime H. Towards
analyzing crash events for novice drivers under
reduced-visibility settings: A simulator study. ACM
Int. Conf. Proceeding Ser., 2020.
https://doi.org/10.1145/3386723.3387849.
Abdelwahab HT, Abdel-Aty MA. Development of
artificial neural network models to predict driver injury
severity in traffic accidents at signalized intersections.
Transp Res Rec 2001:6–13.
https://doi.org/10.3141/1746-02.
Aljanahi AAM, Rhodes AH, Metcalfe A V. Speed, speed
limits and road traffic accidents under free flow
conditions 1999;31:161–8.
Ameksa M, Mousannif H, Al Moatassime H, Elamrani
Abou Elassad Z. Toward Flexible Data Collection of