observed that ensemble using differentiable function
based ELM as well as non- differentiable function ba-
sed ELM outperform as compared to other methods
across all datasets for both prediction scenarios within
project defect prediction as well as inter release pre-
diction. Overall, the proposed ensemble models have
consistent prediction accuracy across all datasets for
both prediction scenarios.
ACKNOWLEDGMENTS
The authors are thankful to Ministry of Electronics
and Information Technology, Government of India.
This publication is an outcome of the R&D work un-
dertaken in the project under the Visvesvaraya PhD
Scheme of Ministry of Electronics and Information
Technology, Government of India, being implemen-
ted by Digital India Corporation (formerly Media Lab
Asia).
REFERENCES
Ben-Hur, A., Horn, D., Siegelmann, H. T., and Vapnik, V.
(2001). Support vector clustering. Journal of machine
learning research, 2(Dec):125–137.
Bowes, D., Hall, T., and Petri
´
c, J. (2017). Software defect
prediction: do different classifiers find the same de-
fects? Software Quality Journal, pages 1–28.
Briand, L. C. and W
¨
ust, J. (2002). Empirical studies of qua-
lity models in object-oriented systems. In Advances in
computers, volume 56, pages 97–166. Elsevier.
D’Ambros, M., Lanza, M., and Robbes, R. (2010). An ex-
tensive comparison of bug prediction approaches. In
Mining Software Repositories (MSR), 2010 7th IEEE
Working Conference on, pages 31–41. IEEE.
Gao, K. and Khoshgoftaar, T. M. (2007). A comprehensive
empirical study of count models for software fault pre-
diction. IEEE Transactions on Reliability, 56(2):223–
236.
Golub, G. H. and Reinsch, C. (1970). Singular value de-
composition and least squares solutions. Numerische
mathematik, 14(5):403–420.
Graves, T. L., Karr, A. F., Marron, J. S., and Siy, H. (2000).
Predicting fault incidence using software change his-
tory. IEEE Transactions on software engineering,
26(7):653–661.
Greene, W. H. (2003). Econometric analysis. Pearson Edu-
cation India.
He, Z., Shu, F., Yang, Y., Li, M., and Wang, Q. (2012).
An investigation on the feasibility of cross-project
defect prediction. Automated Software Engineering,
19(2):167–199.
Higgins, J. J. (2003). Introduction to modern nonparametric
statistics.
Huang, G., Huang, G.-B., Song, S., and You, K. (2015).
Trends in extreme learning machines: A review. Neu-
ral Networks, 61:32–48.
Huang, G.-B., Zhou, H., Ding, X., and Zhang, R. (2012).
Extreme learning machine for regression and mul-
ticlass classification. IEEE Transactions on Sys-
tems, Man, and Cybernetics, Part B (Cybernetics),
42(2):513–529.
Huang, G.-B., Zhu, Q.-Y., Mao, K., Siew, C.-K., Sarat-
chandran, P., and Sundararajan, N. (2006a). Can thres-
hold networks be trained directly? IEEE Transactions
on Circuits and Systems II: Express Briefs, 53(3):187–
191.
Huang, G.-B., Zhu, Q.-Y., and Siew, C.-K. (2006b). Ex-
treme learning machine: theory and applications.
Neurocomputing, 70(1-3):489–501.
James, G., Witten, D., Hastie, T., and Tibshirani, R. (2013).
An introduction to statistical learning, volume 112.
Springer.
Kanmani, S., Uthariaraj, V. R., Sankaranarayanan, V., and
Thambidurai, P. (2007). Object-oriented software
fault prediction using neural networks. Information
and software technology, 49(5):483–492.
Lambert, D. (1992). Zero-inflated poisson regression, with
an application to defects in manufacturing. Techno-
metrics, 34(1):1–14.
Laradji, I. H., Alshayeb, M., and Ghouti, L. (2015). Soft-
ware defect prediction using ensemble learning on se-
lected features. Information and Software Technology,
58:388–402.
Li, W., Huang, Z., and Li, Q. (2016). Three-way decisions
based software defect prediction. Knowledge-Based
Systems, 91:263–274.
MacDonell, S. G. (1997). Establishing relationships bet-
ween specification size and software process effort in
case environments. Information and Software Techno-
logy, 39(1):35–45.
Menzies, T., Greenwald, J., and Frank, A. (2007). Data
mining static code attributes to learn defect predictors.
IEEE transactions on software engineering, 33(1):2–
13.
Menzies, T., Krishna, R., and Pryor, D. (2015). The
promise repository of empirical software engineering
data. http://openscience.us/repo. North Carolina State
University, Department of Computer Science.
Nam, J., Fu, W., Kim, S., Menzies, T., and Tan, L. (2017).
Heterogeneous defect prediction. IEEE Transactions
on Software Engineering.
Ostrand, T. J., Weyuker, E. J., and Bell, R. M. (2005). Pre-
dicting the location and number of faults in large soft-
ware systems. IEEE Transactions on Software Engi-
neering, 31(4):340–355.
Patro, S. and Sahu, K. K. (2015). Normalization: A prepro-
cessing stage. arXiv preprint arXiv:1503.06462.
Quinlan, J. R. (1987). Simplifying decision trees. Internati-
onal journal of man-machine studies, 27(3):221–234.
Quinlan, J. R. et al. (1992). Learning with continuous clas-
ses. In 5th Australian joint conference on artificial
intelligence, volume 92, pages 343–348. Singapore.
Extreme Learning Machine based Linear Homogeneous Ensemble for Software Fault Prediction
77