Second, although several common health factors
were considered in this study, other factors, such as
genetic factors, may still affect stroke. Further studies
could try to incorporate more potential influencing
factors into the model to improve the
comprehensiveness of the prediction.
In summary, our study provides some insights into
optimizing stroke probability prediction models. Still,
further in-depth studies are needed to overcome the
limitations to achieve more accurate and reliable
predictions.
5 CONCLUSION
This study delves into the two main algorithms, RF
and MLP, in machine learning for predicting stroke
probability. The main finding of our study is that RF
slightly outperforms MLP in several aspects, such as
accuracy, precision, recall, and F1 score, showing a
more comprehensive, stable, and superior
performance. By comparing the ROC-AUC scores, it
was demonstrated that RF has a higher ability than
MLP in recognizing stroke, which further solidifies
its superiority in stroke probability prediction.
Therefore, the random forest algorithm is a more
suitable optimization model for predicting stroke
probability. When constructing a stroke prediction
model, choosing an appropriate algorithm is crucial
to obtaining accurate and reliable prediction results.
This study bridges the gap of machine learning in
predicting stroke probability and provides a basis for
constructing more accurate models for assisted
medical decision-making. Second, this study
provides new ideas for actively preventing stroke by
explaining the key factors more strongly associated
with the probability of stroke. This study helps
medical professionals and data analysts study stroke
through machine learning algorithms, enabling them
to make accurate stroke probability predictions and
explore stroke risk factors.
This study still has shortcomings, such as the
limitation of the dataset. In the future, researchers can
explore deeply in the following aspects. First,
researchers can consider introducing more advanced
machine learning algorithms into the study to
improve the prediction performance further.
Meanwhile, researchers can conduct interdisciplinary
collaborative research, which can better understand
the mechanism and prediction methods of stroke by
combining the knowledge of clinical medicine and
data science.
In conclusion, despite the limitations of this study,
the stroke probability prediction model can be
improved through continuous efforts and in-depth
research to reduce the threat of stroke to human health.
REFERENCES
VL. Feigin, M. Brainin, B. Norrving, et al, World Stroke
Organization (WSO): Global Stroke Fact Sheet 2022.
International Journal of Stroke. 2022, vol.17, no.1,
pp.18-29.
J. Sidey-Gibbons, C. Sidey-Gibbons, Machine learning in
medicine: a practical introduction. BMC Med Res
Methodol, 2019, vol.19, pp.64.
Y. Wu, Y. Fang, Stroke Prediction with Machine Learning
Methods among Older Chinese. International Journal of
Environmental Research and Public Health, 2020,
vol.17, no.6, pp.1828.
Z. R. Wang, Application of Machine Learning Methods in
Stroke Risk Prediction (Master's Thesis, Guangzhou
University), 2023.
N. V. Chawla, K. W. Bowyer, L. O. Hall, and W. P.
Kegelmeyer, SMOTE: synthetic minority over-
sampling technique. Journal of artificial intelligence
research, 2002, vol.16, pp.321-357.
T. C. Wen, B. Y. Liu, Y. N. Zhang, A study of risk factors
for unplanned readmission within 31 days in ischaemic
stroke patients: a random forest model. Chinese Journal
of Evidence-Based Medicine, 2019, vol.19, no.5.
Z. Zhang, D. Zhou, J. Zhang, et al, Multilayer perceptron-
based prediction of stroke mimics in prehospital triage.
Scientific Reports, 2022, vol.12, no.1, pp.17994.
J. F. Torres, D. Hadjout, A. Sebaa, et al, Deep learning for
time series forecasting: a survey. Big Data, 2021, vol.9,
no.1, pp.3-21.
D. M. Belete, M. D. Huchaiah, Grid search in
hyperparameter optimization of machine learning
models for prediction of HIV/AIDS test results.
International Journal of Computers and Applications,
2022, vol.44, no.9, pp.875-886.
G. Assmann, P. Cullen, H. Schulte, Simple scoring scheme
for calculating the risk of acute coronary events based
on the 10-year follow-up of the prospective
cardiovascular Munster (PROCAM) study. Circulation,
2002, vol.105, no.3, pp.310-315.