changes in the match environment. Therefore, it is an
ongoing challenge to continuously optimize and
update the models to improve their robustness and
adaptability. The continuous development of
technology will provide new data sources and
analysis methods, offering more possibilities for
further improvement of football match prediction
models.
4 CONCLUSIONS
In this article, a review of the field of Machine
Learning to analyse football data and predict
outcomes is provided. The review covers a range of
methods including Hybrid Machine Learning Models,
Binary Classification, Regression, TOPSIS, Expert
Systems, Ensemble Learning, and others. It was
found that while these models can be useful, there
may be some issues with their interpretability and
usefulness, as well as limitations in terms of fast
feedback. In order to establish trust with users, the
model's interpretability needs to be further enhanced
to clearly communicate the rationale and reasoning
behind each prediction. Additionally, the quality and
diversity of data can impact the models' effectiveness
when dealing with complex football match situations.
To improve the models, future researches are
recommended to focus on expanding the dataset to
include more information on leagues, teams, and
players. Researchers shall also explore emerging
technologies and methods to cope with the ever-
changing field of football match data analysis.
REFERENCES
Baboota, R. & Kaur, H. 2019. Predictive analysis and
modelling football results using machine learning
approach for English Premier League. International
Journal of Forecasting, 35(2), 741-755.
Bentéjac, C., Csörgő, A. & Martínez-Muñoz, G. 2021. A
comparative analysis of gradient boosting algorithms.
Artificial Intelligence Review, 54, 1937-1967.
Bunker, R. P. & Thabtah, F. 2019. A machine learning
framework for sport result prediction. Applied
computing and informatics, 15(1), 27-33.
Çelikbilek, Y. & Tüysüz, F. 2020. An in-depth review of
theory of the TOPSIS method: An experimental analysis.
Journal of Management Analytics, 7(2), 281-300.
Gu, W., Foster, K., Shang, J. & Wei, L. 2019. A game-
predicting expert system using big data and machine
learning. Expert Systems with Applications, 130, 293-
305.
Groll, A., Hvattum, L. M., Ley, C., Popp, F., Schauberger,
G., Van Eetvelde, H. & Zeileis, A. 2021. Hybrid
machine learning forecasts for the UEFA EURO 2020.
arXiv preprint arXiv:2106.05799.
Groll, A., Ley, C., Schauberger, G. & Van Eetvelde, H.
2019. A hybrid random forest to predict soccer matches
in international tournaments. Journal of Quantitative
Analysis in Sports, 15(4), 271-287.
Haruna, U., Maitama, J. Z., Mohammed, M. & Raj, R. G.
2021. Predicting the outcomes of football matches
using machine learning approach. In International
Conference on Informatics and Intelligent Applica-
tions. Cham: Springer International Publishing, 92-104.
Hvattum, L. M. 2019. A comprehensive review of plus-
minus ratings for evaluating individual players in team
sports. International Journal of Computer Science in
Sport, 18(1), 1-23.
Hwang, C. L., Yoon, K., Hwang, C. L. & Yoon, K. 1981.
Methods for multiple attribute decision making.
Multiple attribute decision making: methods and
applications a state-of-the-art survey, 58-191.
Li, Y., Wang, L. & Li, F. 2021. A data-driven prediction
approach for sports team performance and its
application to National Basketball Association. Omega,
98, 102123.
Liu, Y. & Wang, J. 2020. Research on post-match score
mechanism of players based on artificial intelligence
and clustering regression model. Journal of Intelligent
& Fuzzy Systems, 39(4), 4869-4879.
Lu, S., Liu, M., Yin, L., Yin, Z., Liu, X., & Zheng, W. 2023.
The multi-modal fusion in visual question answering: a
review of attention mechanisms. PeerJ Computer
Science, 9, e1400.
Mehmood, E. & Anees, T. 2020. Challenges and solutions
for processing real-time big data stream: a systematic
literature review. IEEE Access, 8, 119123-119143.
Mienye, I. D. & Sun, Y. 2022. A survey of ensemble
learning: Concepts, algorithms, applications, and
prospects. IEEE Access, 10, 99129-99149.
Özgüven, Y. M., Gönener, U. & Eken, S. 2021. A
Dockerized big data architecture for sports analytics.
Qader, M. A., Zaidan, B. B., Zaidan, A. A., Ali, S. K.,
Kamaluddin, M. A. & Radzi, W. B. 2017. A
methodology for football players selection problem
based on multi-measurements criteria analysis.
Measurement, 111, 38-50.
Qiu, Y., Chang, C. S., Yan, J. L., Ko, L., & Chang, T. S.
2019. Semantic segmentation of intracranial
hemorrhages in head CT scans. In 2019 IEEE 10th
International Conference on Software Engineering and
Service Science (ICSESS) (pp. 112-115). IEEE.
Salloum, S., Dautov, R., Chen, X., Peng, P. X. & Huang, J.
Z. 2016. Big data analytics on Apache Spark.
International Journal of Data Science and Analytics, 1,
145-164.
Sudhaman, K., Akuthota, M. & Chaurasiya, S. K. 2022. A
Review on the Different Regression Analysis in
Supervised Learning. Bayesian Reasoning and Gaussian
Processes for Machine Learning Applications, 15-32