Bennedsen, J. and Caspersen, M. E. (2019). Failure rates
in introductory programming: 12 years later. ACM
Inroads, 10(2):30–36.
Cárdenas-Cobo, J., Puris, A., Novoa-Hernández, P.,
Parra-Jiménez, Á., Moreno-León, J., and Benavides, D.
(2021). Using Scratch to improve learning programming
in college students: A positive experience from
a non-WEIRD country. Electronics, 10(10):1180.
Carrillo, J. M. and Parraga-Alava, J. (2018). How predicting
the academic success of students of the ESPAM MFL?: A
preliminary decision trees based study. In 2018 IEEE
Third Ecuador Technical Chapters Meeting (ETCM),
pages 1–6.
Chawla, N. V., Bowyer, K. W., Hall, L. O., and Kegelmeyer,
W. P. (2002). SMOTE: Synthetic minority over-sampling
technique. Journal of Artificial Intelligence
Research, 16:321–357.
Chen, R.-C., Dewi, C., Huang, S.-W., and Caraka, R. E.
(2020). Selecting critical features for data classifica-
tion based on machine learning methods. Journal of
Big Data, 7(1):52.
Cilia, N. D., De Stefano, C., Fontanella, F., Raimondo, S.,
and Scotto di Freca, A. (2019). An experimental com-
parison of feature-selection and classification methods
for microarray datasets. Information, 10(3):109.
Farissi, A., Dahlan, H. M., et al. (2020). Genetic algo-
rithm based feature selection with ensemble methods
for student academic performance prediction. In Jour-
nal of Physics: Conference Series, volume 1500, page
012110. IOP Publishing.
Ghoneim, S. (2019). Accuracy, Recall, Precision, F-Score
and Specificity, which to optimize on.
Guyon, I., Weston, J., Barnhill, S., and Vapnik, V. (2002).
Gene selection for cancer classification using support
vector machines. Machine learning, 46:389–422.
Huynh-Cam, T.-T., Chen, L.-S., and Huynh, K.-V. (2022).
Learning performance of international students and
students with disabilities: Early prediction and feature
selection through educational data mining. Big Data
and Cognitive Computing, 6(3).
Huynh-Cam, T.-T., Chen, L.-S., and Le, H. (2021). Using
decision trees and random forest algorithms to predict
and determine factors contributing to first-year uni-
versity students’ learning performance. Algorithms,
14(11).
Kohonen, T. (2001). Learning Vector Quantization, pages
245–261. Springer Berlin Heidelberg, Berlin, Heidel-
berg.
Kuhn, M. and Wickham, H. (2020). Tidymodels: a collec-
tion of packages for modeling and machine learning
using tidyverse principles.
Kursa, M. B., Jankowski, A., and Rudnicki, W. R. (2010).
Boruta–a system for feature selection. Fundamenta
Informaticae, 101(4):271–285.
Liu, J., Peng, P., and Luo, L. (2020). The relation between
family socioeconomic status and academic achievement
in China: A meta-analysis. Educational Psychology
Review, 32:49–76.
Niyogisubizo, J., Liao, L., Nziyumva, E., Murwanashyaka,
E., and Nshimyumukiza, P. C. (2022). Predicting
student’s dropout in university classes using two-
layer ensemble machine learning approach: A novel
stacked generalization. Computers and Education:
Artificial Intelligence, 3:100066.
Phauk, S. and Okazaki, T. (2020). Study on dominant fac-
tor for academic performance prediction using feature
selection methods. International Journal of Advanced
Computer Science and Applications, 11:492–502.
Rahimi, S. and Shute, V. J. (2021). First inspire, then in-
struct to improve students’ creativity. Computers &
Education, 174:104312.
Ramaswami, G. S., Susnjak, T., Mathrani, A., and Umer,
R. (2020). Predicting students final academic perfor-
mance using feature selection approaches. In 2020
IEEE Asia-Pacific Conference on Computer Science
and Data Engineering (CSDE), pages 1–5.
RStudio Team (2018). RStudio: Integrated Development
Environment for R. RStudio, Inc.
Simeunović, V. and Preradović, L. (2014). Using data
mining to predict success in studying. Croatian Journal
of Education, 16(2):491–523.
Stoian, C. E., Fărcașiu, M. A., Dragomir, G.-M., and
Gherheș, V. (2022). Transition from online to
face-to-face education after COVID-19: The benefits of online
education from students’ perspective. Sustainability,
14(19).
Su, Y.-S., Lin, Y.-D., and Liu, T.-Q. (2022). Applying ma-
chine learning technologies to explore students’ learn-
ing features and performance prediction. Frontiers in
Neuroscience, 16.
R Core Team (2017). R: A Language and Environment for
Statistical Computing. R Foundation for Statistical
Computing.
Tharwat, A. (2018). Classification assessment methods. Ap-
plied Computing and Informatics.
Tomasevic, N., Gvozdenovic, N., and Vranes, S. (2020). An
overview and comparison of supervised data mining
techniques for student exam performance prediction.
Computers & Education, 143:103676.
Wickham, H., Averick, M., Bryan, J., Chang, W.,
McGowan, L. D., François, R., Grolemund, G., Hayes,
A., Henry, L., Hester, J., Kuhn, M., Pedersen, T. L.,
Miller, E., Bache, S. M., Müller, K., Ooms, J.,
Robinson, D., Seidel, D. P., Spinu, V., Takahashi, K.,
Vaughan, D., Wilke, C., Woo, K., and Yutani, H.
(2019). Welcome to the tidyverse. Journal of Open
Source Software, 4(43):1686.
Xavier, M. and Meneses, J. (2020). Dropout in Online
Higher Education: A scoping review from 2014 to
2018.
Xiao, W., Ji, P., and Hu, J. (2021). RnkHEU: A hybrid
feature selection method for predicting students’
performance. Scientific Programming, 2021:1–16.
Yağcı, M. (2022). Educational data mining: prediction
of students’ academic performance using machine
learning algorithms. Smart Learning Environments,
9(1):11.