A Framework for Adoption of Machine Learning in Industry for Software Defect Prediction

Rakesh Rana, Miroslaw Staron, Jörgen Hansson, Martin Nilsson, Wilhelm Meding

Abstract

Machine learning algorithms are increasingly being used in a variety of application domains including software engineering. While their practical value have been outlined, demonstrated and highlighted in number of existing studies, their adoption in industry is still not widespread. The evaluations of machine learning algorithms in literature seem to focus on few attributes and mainly on predictive accuracy. On the other hand the decision space for adoption or acceptance of machine learning algorithms in industry encompasses much more factors. Companies looking to adopt such techniques want to know where such algorithms are most useful, if the new methods are reliable and cost effective. Further questions such as how much would it cost to setup, run and maintain systems based on such techniques are currently not fully investigated in the industry or in academia leading to difficulties in assessing the business case for adoption of these techniques in industry. In this paper we argue for the need of framework for adoption of machine learning in industry. We develop a framework for factors and attributes that contribute towards the decision of adoption of machine learning techniques in industry for the purpose of software defect predictions. The framework is developed in close collaboration within industry and thus provides useful insight for industry itself, academia and suppliers of tools and services.

References

  1. Ajzen, I. & Fishbein, M., 1980. Understanding attitudes and predicting social behaviour. Available at: http://www.citeulike.org/group/38/article/235626.
  2. Attewell, P., 1992. Technology diffusion and organizational learning: The case of business computing. Organization Science, 3(1), pp.1-19.
  3. Bertolino, A., 2007. Software testing research: Achievements, challenges, dreams. In 2007 Future of Software Engineering. IEEE Computer Society, pp. 85-103.
  4. Boehm, B., 1987. Industrial software metrics top 10 list, IEEE Computer Soc 10662 Los Vaqueros Circle, Po Box 3014, Los Alamitos, Ca 90720-1264.
  5. Chau, P.Y. & Tam, K.Y., 1997. Factors Affecting the Adoption of Open Systems: An Exploratory Study. Mis Quarterly, 21(1).
  6. Davis Jr, F.D., 1986. A technology acceptance model for empirically testing new end-user information systems: Theory and results. Massachusetts Institute of Technology.
  7. Edmondson, A.C. et al., 2003. Learning how and learning what: Effects of tacit and codified knowledge on performance improvement following technology adoption. Decision Sciences, 34(2), pp.197-224.
  8. Gondra, I., 2008. Applying machine learning to software fault-proneness prediction. Journal of Systems and Software, 81(2), pp.186-195.
  9. Günes Koru, A. & Tian, J., 2003. An empirical comparison and characterization of high defect and high complexity modules. Journal of Systems and Software, 67(3), pp.153-163.
  10. Harrold, M.J., 2000. Testing: a roadmap. In Proceedings of the Conference on the Future of Software Engineering. ACM, pp. 61-72.
  11. Van der Heijden, H., 2003. Factors influencing the usage of websites: the case of a generic portal in The Netherlands. Information & Management, 40(6), pp.541-549.
  12. Khoshgoftaar, T.M. et al., 1992. Predictive modeling techniques of software quality from software measures. Software Engineering, IEEE Transactions on, 18(11), pp.979-987.
  13. Khoshgoftaar, T.M. & Allen, E.B., 1999. Logistic regression modeling of software quality. International Journal of Reliability, Quality and Safety Engineering, 6(04), pp.303-317.
  14. Khoshgoftaar, T.M., Munson, J.C. & Lanning, D.L., 1993. A comparative study of predictive models for program changes during system testing and maintenance. In Software Maintenance, 1993. CSM-93, Proceedings., Conference on. IEEE, pp. 72-79.
  15. Legris, P., Ingham, J. & Collerette, P., 2003. Why do people use information technology? A critical review of the technology acceptance model. Information & management, 40(3), pp.191-204.
  16. Lessmann, S. et al., 2008. Benchmarking Classification Models for Software Defect Prediction: A Proposed Framework and Novel Findings. IEEE Transactions on Software Engineering, 34(4), pp.485-496.
  17. Menzies, T. et al., 2003. How simple is software defect detection. Submitted to the Emprical Software Engineering Journal.
  18. Mitchell, T.M., 1999. Machine learning and data mining. Communications of the ACM, 42(11), pp.30-36.
  19. Pijpers, G.G. et al., 2001. Senior executives' use of information technology. Information and Software Technology, 43(15), pp.959-971.
  20. Sonnenburg, S. et al., 2007. The need for open source software in machine learning. Journal of Machine Learning Research, 8(10).
  21. Tornatzky, L.G., Fleischer, M. & Chakrabarti, A.K., 1990. Processes of technological innovation.
  22. Yang, K.C., 2005. Exploring factors affecting the adoption of mobile commerce in Singapore. Telematics and informatics, 22(3), pp.257-277.
  23. Zhang, D. & Tsai, J.J., 2003. Machine learning and software engineering. Software Quality Journal, 11(2), pp.87-119.
  24. Zhang, S. & Zaki, M.J., 2006. Mining multiple data sources: local pattern analysis. Data Mining and Knowledge Discovery, 12(2-3), pp.121-125.
  25. Zimmermann, T., Premraj, R. & Zeller, A., 2007. Predicting defects for eclipse. In Predictor Models in Software Engineering, 2007. PROMISE'07: ICSE Workshops 2007. International Workshop on. IEEE, pp. 9-9.
Download


Paper Citation


in Harvard Style

Rana R., Staron M., Hansson J., Nilsson M. and Meding W. (2014). A Framework for Adoption of Machine Learning in Industry for Software Defect Prediction . In Proceedings of the 9th International Conference on Software Engineering and Applications - Volume 1: ICSOFT-EA, (ICSOFT 2014) ISBN 978-989-758-036-9, pages 383-392. DOI: 10.5220/0005099303830392


in Bibtex Style

@conference{icsoft-ea14,
author={Rakesh Rana and Miroslaw Staron and Jörgen Hansson and Martin Nilsson and Wilhelm Meding},
title={A Framework for Adoption of Machine Learning in Industry for Software Defect Prediction},
booktitle={Proceedings of the 9th International Conference on Software Engineering and Applications - Volume 1: ICSOFT-EA, (ICSOFT 2014)},
year={2014},
pages={383-392},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005099303830392},
isbn={978-989-758-036-9},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 9th International Conference on Software Engineering and Applications - Volume 1: ICSOFT-EA, (ICSOFT 2014)
TI - A Framework for Adoption of Machine Learning in Industry for Software Defect Prediction
SN - 978-989-758-036-9
AU - Rana R.
AU - Staron M.
AU - Hansson J.
AU - Nilsson M.
AU - Meding W.
PY - 2014
SP - 383
EP - 392
DO - 10.5220/0005099303830392