decision-making mechanisms and placing significant
design pressure on decision-makers. Then it is also
very difficult to convince users and patients of the
reliability of the results. Therefore, it is necessary to
design for greater interpretability for decision-makers
and users. Second, privacy issues arise when training
models use personal sensitive data, raising concerns
about the potential exposure of users' private
information. In addition, the practicality of
implementing machine learning models in real-world
situations may be hindered by various factors such as
data quality issues, sparse labelling, or environmental
changes. Moreover, as models become more
complex, interpreting their predictions becomes more
challenging. Data quality and bias can significantly
affect the performance and robustness of machine
learning models, especially when faced with
unbalanced datasets or missing data. Sometimes, data
cannot be consistently achieved in machine learning
models. After changing scenarios or missing some
labels, obtaining the same data results and accurately
predicting outcomes becomes challenging.
Summarizing the necessary information and
achieving uniform results for diverse datasets
presents a significant challenge.
Looking ahead, potential solutions and avenues
for progress are emerging, for example, the Shapley
Addition Method of Interpretation (SHAP) approach,
designed as a novel and cutting-edge method, aims to
facilitate clinical interpretation and intuitive
comprehension of feature significance. It
accomplishes this by visualizing the relationship
between each feature and its associated predictive
power (Lundberg, 2020). The Federated Learning
(FL) approach provides a way to train models on
distributed data sources, improving model
performance while protecting user privacy. A wide
range of architectures based on Federated Learning,
as mentioned in (Yaqoob, 2023), have been
categorised as horizontal FL and vertical FL, and
many people have used diverse approaches to outline
the characteristics and results of some of the
optimisation strategies implemented by FL and to
discuss some of the expected business consequences
of federated learning. In addition, using the principles
of transfer learning, pre-trained models can be
migrated from one domain to another of interest,
thereby reducing data requirements and enhancing
model generalisation. comprehensively look to
compare and contrast several of the most widely
applicable machine learning methods, using a
combination of SHAP and FL to improve
interpretability and privacy and achieve optimal
solutions.
4 CONCLUSIONS
This work comprehensively discusses and compares
the advantages and disadvantages between various
traditional machine learning and deep learning on the
prediction of stroke in patients, obtaining the method
with the highest accuracy, and summarising the
relatively well-developed dataset available for
experiments. This paper mainly uses methods such as
RF, GB, and U-Net for screening to generate targeted
stroke prediction results synthetically. Some new
techniques have not been considered in this article,
such as large language models, time series models,
multimodal data fusion, and causal inference
methods, which will be added in the future to form a
more complete system for more thorough
consideration.
REFERENCES
Azam, M. S., Habibullah, M., & Rana, H. K. 2020.
Performance analysis of various machine learning
approaches in stroke prediction. International Journal
of Computer Applications, 175(21), 11-15.
Benzakoun, J., Charron, S., Turc, G., Hassen, W. B.,
Legrand, L., Boulouis, G., ... & Oppenheim, C. 2021.
Tissue outcome prediction in hyperacute ischemic
stroke: Comparison of machine learning models.
Journal of Cerebral Blood Flow & Metabolism, 41(11),
3085-3096.
Cheon, S., Kim, J., & Lim, J. 2019. The use of deep learning
to predict stroke patient mortality. International
journal of environmental research and public
health, 16(11), 1876.
Cox, A. P., Raluy-Callado, M., Wang, M., Bakheit, A. M.,
Moore, A. P., & Dinet, J. 2016. Predictive analysis for
identifying potentially undiagnosed post-stroke
spasticity patients in United Kingdom. Journal of
biomedical informatics, 60, 328-333.
Dev, S., Wang, H., Nwosu, C. S., Jain, N., Veeravalli, B.,
& John, D. 2022. A predictive analytics approach for
stroke prediction using machine learning and neural
networks. Healthcare Analytics, 2, 100032.
Fernandez-Lozano, C., Hervella, P., Mato-Abad, V.,
Rodríguez-Yáñez, M., Suárez-Garaboa, S., López-
Dequidt, I., ... & Iglesias-Rey, R. 2021. Random forest-
based prediction of stroke outcome. Scientific
reports, 11(1), 10071.
Kappelhof, N., Ramos, L. A., Kappelhof, M., van Os, H. J.,
Chalos, V., van Kranendonk, K. R., ... & Marquering,
H. A. 2021. Evolutionary algorithms and decision trees
for predicting poor outcome after endovascular
treatment for acute ischemic stroke. Computers in
Biology and Medicine, 133, 104414.
Li, S., Zheng, J., & Li, D. 2021. Precise segmentation of
non-enhanced computed tomography in patients with
ischemic stroke based on multi-scale U-Net deep