
Clark, B. and Zubrow, D. (2001). How good is a software:
a review on defect prediction techniques. In Software
Engineering Symposium, pages 1–35, Carneige Mel-
lon University.
Dam, H. K., Pham, T., Ng, S. W., Tran, T., Grundy, J.,
Ghose, A., Kim, T., and Kim, C. (2018). A deep tree-
based model for software defect prediction. CoRR,
abs/1802.00921:1–10.
Fawcett, T. (2006). An introduction to ROC analysis. Pat-
tern Recognition Letters, 27(8):861–874.
Feng, Z., Guo, D., Tang, D., Duan, N., Feng, X., Gong,
M., Shou, L., Qin, B., Liu, T., Jiang, D., and Zhou,
M. (2020). CodeBERT: A pre-trained model for pro-
gramming and natural languages. In EMNLP 2020,
pages 1536–1547, Online.
Graves, A., Fern
´
andez, S., and Schmidhuber, J. (2005).
Bidirectional LSTM networks for improved phoneme
classification and recognition. In Proceedings of Ar-
tificial Neural Networks: Formal Models and Their
Applications - ICANN 2005, pages 799–804.
Herbold, S., Trautsch, A., Trautsch, F., and Ledel, B.
(2022). Problems with szz and features: An empirical
study of the state of practice of defect prediction data
collection. Empirical Software Engineering, 27(2).
Hosseini, S., Turhan, B., and Gunarathna, D. (2019). A sys-
tematic literature review and meta-analysis on cross
project defect prediction. IEEE Transactions on Soft-
ware Engineering, 45(2):111–147.
Hryszko, J. and Madeyski, L. (2018). Cost effectiveness
of software defect prediction in an industrial project.
Foundations of Computing and Decision Sciences,
43(1):7 – 35.
hua Chang, R., Mu, X., and Zhang, L. (2011). Software
defect prediction using non-negative matrix factoriza-
tion. Journal of Software, 6(11):2114–2120.
Huo, X., Yang, Y., Li, M., and Zhan, D.-C. (2018). Learn-
ing semantic features for software defect prediction
by code comments embedding. In ICDM 2018, pages
1049–1054.
Jin, C. (2021). Cross-project software defect prediction
based on domain adaptation learning and optimiza-
tion. Expert Systems with Applications, 171:114637.
Jureczko, M. and Madeyski, L. (2010). Towards identifying
software project clusters with regard to defect predic-
tion. In The 6th International Conference on Predic-
tive Models in Software Engineering, page 1–10.
Le, Q. V. and Mikolov, T. (2014). Distributed represen-
tations of sentences and documents. Computing Re-
search Repository (CoRR), abs/1405.4:1–9.
Li, J., He, P., Zhu, J., and Lyu, M. R. (2017). Software
defect prediction via convolutional neural network.
In 2017 IEEE International Conference on Software
Quality, Reliability and Security, pages 318–328.
Liu, J., Ai, J., Lu, M., Wang, J., and Shi, H. (2023). Se-
mantic feature learning for software defect prediction
from source code and external knowledge. Journal of
Systems and Software, 204:111753.
Liu, Y., Ott, M., Goyal, N., Du, J., Joshi, M., Chen, D.,
Levy, O., Lewis, M., Zettlemoyer, L., and Stoyanov,
V. (2019). RoBERTa: A Robustly Optimized BERT
Pretraining Approach.
Malhotra, R. (2015). A systematic review of machine learn-
ing techniques for software fault prediction. Applied
Soft Computing, 27:504 – 518.
Marcus, A., Poshyvanyk, D., and Ferenc, R. (2008). Us-
ing the conceptual cohesion of classes for fault predic-
tion in object-oriented systems. IEEE Transactions on
Software Engineering, 34(2):287–300.
Mazzarella, V., Maiello, I., Capozzi, V., Budillon, G., and
Ferretti, R. (2017). Comparison between 3d-var and
4d-var data assimilation methods for the simulation of
a heavy rainfall case in central italy. Advances in Sci-
ence and Research, 14:271–278.
Miholca, D.-L., Tomescu, V.-I., and Czibula, G. (2022). An
in-depth analysis of the software features’ impact on
the performance of deep learning-based software de-
fect predictors. IEEE Access, 10:64801–64818.
Mohammed, A. H. and Ali, A. H. (2021). Survey of
BERT (Bidirectional Encoder Representation Trans-
former) types. Journal of Physics: Conference Series,
1963(1):012173.
Runeson, P. and H
¨
ost, M. (2009). Guidelines for conduct-
ing and reporting case study research in software en-
gineering. Empirical Softw. Engg., 14(2):131–164.
Uddin, M., Li, B., Ali, Z., Kefalas, P., Khan, I., and Zada,
I. (2022). Software defect prediction employing BiL-
STM and BERT-based semantic feature. Soft Com-
puting, 26:7877–7891.
Wang, S., Liu, T., and Tan, L. (2016). Automatically learn-
ing semantic features for defect prediction. In Pro-
ceedings of the 38th International Conference on Soft-
ware Engineering, pages 297–308. ACM.
Wilcoxon Signed-Rank Test (2023). Social science statis-
tics. http://www.socscistatistics.com/tests/.
Yang, X., Lo, D., Xia, X., Zhang, Y., and Sun, J. (2015).
Deep learning for just-in-time defect prediction. In
2015 IEEE International Conference on Software
Quality, Reliability and Security, pages 17–26.
Yao, W., Shafiq, M., Lin, X., and Yu, X. (2023). A Software
Defect Prediction Method Based on Program Seman-
tic Feature Mining. Electronics, 12(7).
Zhao, L., Shang, Z., Zhao, L., Zhang, T., and Tang,
Y. (2019). Software defect prediction via cost-
sensitive siamese parallel fully-connected neural net-
works. Neurocomputing, 352:64–74.
Zheng, J. (2009). Predicting software reliability with neu-
ral network ensembles. Expert Systems with Applica-
tions, 36(2):2116–2122.
Zhou, C., He, P., Zeng, C., and Ma, J. (2022). Software
defect prediction with semantic and structural infor-
mation of codes based on graph neural networks. In-
formation and Software Technology, 152:107057.
Zhou, S., Alon, U., Agarwal, S., and Neubig, G. (2023).
CodeBERTScore: Evaluating code generation with
pretrained models of code. In Proceedings of the 2023
Conf. on Empirical Methods in Natural Language
Processing, pages 13921–13937, Singapore. ACM.
Zhu, K., Zhang, N., Ying, S., and Wang, X. (2020). Within-
project and cross-project software defect prediction
based on improved transfer naive bayes algorithm.
Computers, Materials & Continua, 63(2):891–910.
Enriching the Semantic Representation of the Source Code with Natural Language-Based Features from Comments for Improving the
Performance of Software Defect Prediction
143