Second, although several common health factors 
were considered in this study, other factors, such as 
genetic factors, may still affect stroke. Further studies 
could try to incorporate more potential influencing 
factors into the model to improve the 
comprehensiveness of the prediction. 
In summary, our study provides some insights into 
optimizing stroke probability prediction models. Still, 
further in-depth studies are needed to overcome the 
limitations to achieve more accurate and reliable 
predictions. 
5  CONCLUSION 
This study delves into the two main algorithms, RF 
and MLP, in machine learning for predicting stroke 
probability. The main finding of our study is that RF 
slightly outperforms MLP in several aspects, such as 
accuracy, precision, recall, and F1 score, showing a 
more comprehensive, stable, and superior 
performance. By comparing the ROC-AUC scores, it 
was demonstrated that RF has a higher ability than 
MLP in recognizing stroke, which further solidifies 
its superiority in stroke probability prediction. 
Therefore, the random forest algorithm is a more 
suitable optimization model for predicting stroke 
probability. When constructing a stroke prediction 
model, choosing an appropriate algorithm is crucial 
to obtaining accurate and reliable prediction results. 
This study bridges the gap of machine learning in 
predicting stroke probability and provides a basis for 
constructing more accurate models for assisted 
medical decision-making. Second, this study 
provides new ideas for actively preventing stroke by 
explaining the key factors more strongly associated 
with the probability of stroke. This study helps 
medical professionals and data analysts study stroke 
through machine learning algorithms, enabling them 
to make accurate stroke probability predictions and 
explore stroke risk factors. 
This study still has shortcomings, such as the 
limitation of the dataset. In the future, researchers can 
explore deeply in the following aspects. First, 
researchers can consider introducing more advanced 
machine learning algorithms into the study to 
improve the prediction performance further. 
Meanwhile, researchers can conduct interdisciplinary 
collaborative research, which can better understand 
the mechanism and prediction methods of stroke by 
combining the knowledge of clinical medicine and 
data science. 
In conclusion, despite the limitations of this study, 
the stroke probability prediction model can be 
improved through continuous efforts and in-depth 
research to reduce the threat of stroke to human health. 
REFERENCES 
VL. Feigin, M. Brainin, B. Norrving, et al, World Stroke 
Organization (WSO): Global Stroke Fact Sheet 2022. 
International Journal of Stroke. 2022, vol.17, no.1, 
pp.18-29.  
J. Sidey-Gibbons, C. Sidey-Gibbons, Machine learning in 
medicine: a practical introduction. BMC Med Res 
Methodol, 2019, vol.19, pp.64.  
Y. Wu, Y. Fang, Stroke Prediction with Machine Learning 
Methods among Older Chinese. International Journal of 
Environmental Research and Public Health, 2020, 
vol.17, no.6, pp.1828.  
Z. R. Wang, Application of Machine Learning Methods in 
Stroke Risk Prediction (Master's Thesis, Guangzhou 
University), 2023. 
N. V. Chawla, K. W. Bowyer, L. O. Hall, and W. P. 
Kegelmeyer, SMOTE: synthetic minority over-
sampling technique. Journal of artificial intelligence 
research, 2002, vol.16, pp.321-357. 
T. C. Wen, B. Y. Liu, Y. N. Zhang, A study of risk factors 
for unplanned readmission within 31 days in ischaemic 
stroke patients: a random forest model. Chinese Journal 
of Evidence-Based Medicine, 2019, vol.19, no.5. 
Z. Zhang, D. Zhou, J. Zhang, et al, Multilayer perceptron-
based prediction of stroke mimics in prehospital triage. 
Scientific Reports, 2022, vol.12, no.1, pp.17994. 
J. F. Torres, D. Hadjout, A. Sebaa, et al, Deep learning for 
time series forecasting: a survey. Big Data, 2021, vol.9, 
no.1, pp.3-21. 
D. M. Belete, M. D. Huchaiah, Grid search in 
hyperparameter optimization of machine learning 
models for prediction of HIV/AIDS test results. 
International Journal of Computers and Applications, 
2022, vol.44, no.9, pp.875-886. 
G. Assmann, P. Cullen, H. Schulte, Simple scoring scheme 
for calculating the risk of acute coronary events based 
on the 10-year follow-up of the prospective 
cardiovascular Munster (PROCAM) study. Circulation, 
2002, vol.105, no.3, pp.310-315.