
In 2016 IEEE International Conference on Software
Quality, Reliability and Security, pages 214–221.
Chen, M. and Ma, Y. (2015). An empirical study on predict-
ing defect numbers. In International Conference on
Software Engineering and Knowledge Engineering.
Chen, X., Zhang, D., Zhao, Y., Cui, Z., and Ni, C. (2019).
Software defect number prediction: Unsupervised vs
supervised methods. Information and Software Tech-
nology, 106:161–181.
Cohen, M., Rokach, L., and Puzis, R. (2022). Cross version
defect prediction with class dependency embeddings.
ArXiv, abs/2212.14404.
Fan, G., Diao, X., Yu, H., Yang, K., Chen, L., and Vitiello,
A. (2019). Software defect prediction via attention-
based recurrent neural network. Sci. Program., 2019.
Harman, M., Islam, S., Jia, Y., Minku, L. L., Sarro, F., and
Srivisut, K. (2014). Less is more: Temporal fault pre-
dictive performance over multiple hadoop releases. In
Le Goues, C. and Yoo, S., editors, Search-Based Soft-
ware Engineering, pages 240–246. Springer.
Huo, X. and Li, M. (2019). On cost-effective software de-
fect prediction: Classification or ranking? Neurocom-
puting, 363:339–350.
Jureczko, M. and Madeyski, L. (2010). Towards identify-
ing software project clusters with regard to defect pre-
diction. In Proceedings of the 6th international con-
ference on predictive models in software engineering,
pages 1–10.
Kabir, M. A., Keung, J., Turhan, B., and Bennin, K. E.
(2021). Inter-release defect prediction with feature se-
lection using temporal chunk-based learning: An em-
pirical study. Appl. Soft Comput., 113(PA).
Kabir, M. A., Keung, J. W., Bennin, K. E., and Zhang, M.
(2020). A drift propensity detection technique to im-
prove the performance for cross-version software de-
fect prediction. In 2020 IEEE 44th Annual Computers,
Software, and Applications Conference (COMPSAC),
pages 882–891.
Kabir, M. A., Rehman, A. U., Islam, M. M. M., Ali, N., and
Baptista, M. L. (2023). Cross-version software defect
prediction considering concept drift and chronological
splitting. Symmetry, 15(10).
Li, Y., Su, J., and Yang, X. (2018). Multi-objective vs.
single-objective approaches for software defect pre-
diction. In Proceedings of the 2nd International Con-
ference on Management Engineering, Software Engi-
neering and Service Sciences, page 122–127. ACM.
Lu, H., Kocaguneli, E., and Cukic, B. (2014). Defect pre-
diction between software versions with active learn-
ing and dimensionality reduction. 2014 IEEE 25th
International Symposium on Software Reliability En-
gineering, pages 312–322.
Ouellet, A. and Badri, M. (2023). Combining object-
oriented metrics and centrality measures to predict
faults in object-oriented software: An empirical val-
idation. Journal of Software: Evolution and Process.
Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V.,
Thirion, B., Grisel, O., Blondel, M., Prettenhofer,
P., Weiss, R., Dubourg, V., Vanderplas, J., Passos,
A., Cournapeau, D., Brucher, M., Perrot, M., and
Duchesnay, E. (2011). Scikit-learn: Machine learning
in Python. Journal of Machine Learning Research,
12:2825–2830.
Shukla, S., Radhakrishnan, T., Kasinathan, M., and Neti,
L. B. M. (2016). Multi-objective cross-version defect
prediction. Soft Computing, 22:1959 – 1980.
Sun, B. (2024). Bert-based cross-project and cross-version
software defect prediction. Applied and Computa-
tional Engineering, 73:33–41.
Wang, S., Liu, T., and Tan, L. (2016). Automatically learn-
ing semantic features for defect prediction. In Pro-
ceedings of the 38th International Conference on Soft-
ware Engineering, page 297–308. ACM.
Wang, Z., Tong, W., Li, P., Ye, G., Chen, H., Gong, X.,
and Tang, Z. (2022). BugPre: an intelligent software
version-to-version bug prediction system using graph
convolutional neural networks. Complex & Intelligent
Systems, 9.
Xu, Z., Li, S., Luo, X., Liu, J., Zhang, T., Tang, Y., Xu, J.,
Yuan, P., and Keung, J. (2019). TSTSS: A two-stage
training subset selection framework for cross version
defect prediction. J. Syst. Softw., 154(C):59–78.
Xu, Z., Li, S., Tang, Y., Luo, X., Zhang, T., Liu, J., and
Xu, J. (2018a). Cross version defect prediction with
representative data via sparse subset selection. In Pro-
ceedings of the 26th Conference on Program Compre-
hension, page 132–143. ACM.
Xu, Z., Liu, J., Luo, X., and Zhang, T. (2018b). Cross-
version defect prediction via hybrid active learning
with kernel principal component analysis. In IEEE
25th International Conference on Software Analysis,
Evolution and Reengineering, pages 209–220.
Yang, X. and Wen, W. (2018). Ridge and lasso regres-
sion models for cross-version defect prediction. IEEE
Transactions on Reliability, 67:885–896.
Yu, X., Rao, J., Liu, L., Lin, G., Hu, W., Keung, J. W.,
Zhou, J., and Xiang, J. (2024). Improving effort-
aware defect prediction by directly learning to rank
software modules. Information and Software Technol-
ogy, 165:107250.
Zhang, J., Wu, J., Chen, C., Zheng, Z., and Lyu, M. R.
(2020). CDS: A cross–version software defect pre-
diction model with data selection. IEEE Access,
8:110059–110072.
Zhang, N., Ying, S., Ding, W., Zhu, K., and Zhu, D. (2021).
WGNCS: A robust hybrid cross-version defect model
via multi-objective optimization and deep enhanced
feature representation. Inf. Sci., 570(C):545–576.
Zhao, Y., Wang, Y., Zhang, D., and Gong, Y. (2022a). Elim-
inating the high false-positive rate in defect prediction
through bayesnet with adjustable weight. Expert Sys-
tems, 39.
Zhao, Y., Wang, Y., Zhang, Y., Zhang, D., Gong, Y., and Jin,
D. (2022b). ST-TLF: Cross-version defect prediction
framework based transfer learning. Information and
Software Technology, 149:106939.
Cross-Version Defect Prediction: Does Excessive Train-Test Similarity Affect the Reliability of Evaluation?
315