Investigating the Effect of Software Metrics Aggregation on Software Fault Prediction