
philosophical paradigm that aligns with this study is
positivism. Positivism emphasizes the importance of
empirical observation, experimentation, and apply-
ing scientific methods for understanding and solving
problems (da Silva et al., 2010).
Although there are studies in the literature that
assess machine learning algorithms for software ef-
fort prediction, they frequently employ distinct exper-
imental methodologies, datasets, and evaluation met-
rics. This lack of standardization makes it challenging
to compare results across different research efforts.
To contribute in this direction, this article con-
ducted a controlled experimental study on the appli-
cation of machine learning techniques to predict the
effort estimation of software development teams us-
ing seven machine learning algorithms: J48 (Decision
Tree), KNN (K-Nearest Neighbors), SVM (Support
Vector Machine), ANN (Artificial Neural Network),
Bagging (Bootstrap Aggregating), Stacking (Stacked
Generalization), and XGBoost (Extreme Gradient
Boosting). To optimize the hyperparameters, the PSO
(Particle Swarm Optimization) technique was used.
The analyzed data comes from eight datasets from the
PROMISE repository, covering various specifications
and characteristics. The evaluation metrics used in-
clude MMR (Modified McCabe’s Complexity), MAE
(Mean Absolute Error), MdMRE (Magnitude Rela-
tive Error), and R2 score.
The article is organized as follows: Section 2 cov-
ers the related works. Section 3 discusses the method-
ology, and Section 4 provides the results and discus-
sions. Finally, section 5 presents the final considera-
tions.
2 RELATED WORK
The section presents some related work conducted by
professionals, academics, and researchers in the field
of software effort estimation and its impact on soft-
ware companies.
The work developed by (Tiwari and Sharma,
2022) applied machine learning techniques such as
SVM (linear, polynomial, RBF, sigmoid), Random
Forest, Stochastic Gradient Boosting, Decision Tree,
KNN (K-Nearest Neighbors), Logistic Regression,
Naive Bayes, and MLP to estimate effort in soft-
ware projects across various datasets, including IS-
BSG Release 12, Albrecht, China, COCOMO81, De-
sharnais, Finnish, Kemmerer, Kitchenham, Maxwell,
Miyazaki, NASA18, NASA93, and Telecom. The
study utilized evaluation metrics such as Pred (25),
Pred (50), MAE, MMRE, MMER, MdMRE, R-
Squared, MSE, and RMSE. Support Vector Regres-
sion (SVR) with an RBF kernel stood out, providing
the best results among the employed techniques. The
study did not utilize any hyperparameter optimization
techniques. Crucial factors, such as the scalability of
the algorithms, the high computational cost, and the
feasibility of implementation in development teams
with limited resources, were not thoroughly discussed
in the study.
Subsequently, the research developed by (Alham-
dany and Ibrahim, 2022) proposed the LASSO ma-
chine learning technique as a promising approach
for software development effort estimation, stand-
ing out from other algorithms in performance. Var-
ious techniques were utilized, such as Random For-
est (RF), Neural Networks (Neuralnet), Ridge Re-
gression (Ridge), Elastic Net (ElasticNet), Deep Neu-
ral Networks (Deepnet), Support Vector Machines
(SVM), Decision Trees (DT), and LASSO, on the
following datasets: China, Kemerer, Cocomo81,
Albrecht, Maxwell, Desharnais, and Kitchenham.
LASSO is notable for its ability to simplify models,
improve interpretability, address issues like overfit-
ting and multicollinearity, and perform automatic fea-
ture selection. redThe study offers a quantitative as-
sessment using metrics such as MAE, RMSE, and R-
squared, but could benefit from a deeper exploration
of the models’ interpretability in different project con-
texts.
The work by (Shukla and Kumar, 2023) investi-
gated different machine learning techniques and en-
semble models to predict Use Case Points (UCP),
aiming to improve software effort estimation. While
traditional methods like linear regression and deci-
sion trees have been widely used in previous stud-
ies, this work proposes the application of ensem-
bles, such as Boosting and Bagging, with different
base models, including SVR, MLP, and KNN. Among
the evaluated models, Boost-SVR demonstrated the
best performance, surpassing previous approaches in
metrics such as MAE, MSE, MBRE, and Pred(25),
demonstrating the effectiveness of ensemble tech-
niques in predicting UCP. The study utilized two pub-
lic datasets for estimating Use Case Points (UCP), re-
ferred to as DS1 and DS2, to conduct the experimen-
tal analysis. The study used Grid Search for hyper-
parameter optimization of the machine learning al-
gorithms. A drawback of Grid Search is that it can
be computationally costly and inefficient, especially
with large datasets or when there is a high number of
hyperparameters and possible combinations.
In (Saqlain et al., 2023), the authors utilized
five public datasets: ISBSG, NASA93, COCOMO81,
Maxwell, and Desharnais. The work included data
cleaning and selecting relevant features using Pear-
ICEIS 2025 - 27th International Conference on Enterprise Information Systems
220