
6 CONCLUSIONS
In the current paper, the use of machine learning tech-
niques for predicting students’ academic performance
in the Romanian pre-university education system was
investigated. This research has offered helpful in-
sights into the effectiveness and promise of predictive
analytics in enhancing educational results by looking
at several ML algorithms and taking a variety of as-
pects into account.
This paper underlines the significance of incor-
porating technology breakthroughs into educational
practices by demonstrating the potential of machine
learning approaches. Predictive analytics may consid-
erably assist the Romanian pre-university education
system by optimizing resource allocation, enhancing
teaching methods, and eventually improving educa-
tional achievements for all students.
It is essential to acknowledge this study’s limita-
tions. The quality and representativeness of the given
datasets determine how accurate and generalizable the
prediction models are. Additionally, constant updates
and improvements to the models are required due to
the changing nature of the educational system in order
to maintain their usefulness and efficacy.
Within this paper it was obtained satisfactory re-
sults, making a comparison with related work it can
be seen that the results obtained are good. The find-
ings of this research contribute to the growing body of
knowledge on ML applications in education and pro-
vide a foundation for future studies aimed at enhanc-
ing educational practices and improving student out-
comes. Within the paper, it was managed to demon-
strate the efficiency of the Random Forest method
in comparison with other machine learning methods
when it comes to modelling academic problems.
Considering the importance of the educational
system, an application that manages to predict stu-
dents’ grades would be of real help, its use could help
in the early identification of students with problems,
so that they could be supported and helped to develop.
Future work would consist of creating a bigger
data set and testing and validating the models cre-
ated in this paper on this new data set, respectively,
trying to check what performance could be obtained
with other ML approaches. Considering the studies
presented in related work, the performance of models
such as Extreme Gradient Boosting, Bayesian Net-
work or Support Vector Machine could be checked.
Also, as future research, there is the aim to create
a recommendation system for students, which would
suggest which high school to attend based on their
academic performance in middle school. Such a rec-
ommendation system would be extremely beneficial
to the academic environment, being intended for both
teachers and students or parents. Considering the
openness shown by society towards software appli-
cations based on machine learning, it is believed that
such a system would catch on well and be used in the
academic environment.
REFERENCES
Almarabeh, H. (2017). Analysis of students’ performance
by using different data mining classifiers. Interna-
tional Journal of Modern Education and Computer
Science, 9(8):9.
Bakhshinategh, B., Zaiane, O. R., ElAtia, S., and Ipperciel,
D. (2018). Educational data mining applications and
tasks: A survey of the last 10 years. Education and
Information Technologies, 23:537–553.
Francis, B. K. and Babu, S. S. (2019). Predicting academic
performance of students using a hybrid data mining
approach. Journal of medical systems, 43:1–15.
Grandini, M., Bagli, E., and Visani, G. (2020). Metrics for
multi-class classification: an overview. arXiv preprint
arXiv:2008.05756.
Hutter, F., Kotthoff, L., and Vanschoren, J. (2019). Au-
tomated machine learning: methods, systems, chal-
lenges. Springer Nature.
Mohamad, S. K. and Tasir, Z. (2013). Educational data
mining: A review. Procedia-Social and Behavioral
Sciences, 97:320–324.
Naser, M. and Alavi, A. (2020). Insights into performance
fitness and error metrics for machine learning. arXiv
preprint arXiv:2006.00887.
Oppong, S. O. (2023). Predicting students’ performance
using machine learning algorithms: A review. Asian
Journal of Research in Computer Science, 16(3):128–
148.
Pe
˜
na-Ayala, A. (2014). Educational data mining: A sur-
vey and a data mining-based analysis of recent works.
Expert systems with applications, 41(4):1432–1462.
Probst, P., Wright, M. N., and Boulesteix, A.-L. (2019). Hy-
perparameters and tuning strategies for random for-
est. Wiley Interdisciplinary Reviews: data mining and
knowledge discovery, 9(3):e1301.
Salloum, S. A., Alshurideh, M., Elnagar, A., and Shaalan,
K. (2020). Mining in educational data: review and fu-
ture directions. In Proceedings of the International
Conference on Artificial Intelligence and Computer
Vision (AICV2020), pages 92–102. Springer.
Siddiqui, I. F., Arain, Q. A., et al. (2019). Analyzing stu-
dents’academic performance through educational data
mining. 3C Tecnologia.
Sindhu Meena, K. and Suriya, S. (2020). A survey on su-
pervised and unsupervised learning techniques. In
Proceedings of international conference on artificial
intelligence, smart grid and smart city applications:
AISGSC 2019, pages 627–644. Springer.
Prediction in Pre-University Education System Using Machine Learning Methods
437