analyzed in this article due to the scope of the appli-
cation and the intended comparison with the existing
approach. Moreover, we intend to use NLP (Natu-
ral Language Processing) to investigate the commit
text based on maintenance activities. Thus, we could
generate a classification approachable to classify the
commits automatically based on the labels discussed
in this article. Finally, we intend to use other cat-
egories (i.e., activities) proposed in the literature to
generalize the results.
ACKNOWLEDGEMENTS
The present work was carried out with the support
of the Coordenac¸
˜
ao de Aperfeic¸oamento de Pessoal
de N
´
ıvel Superior - Brazil (CAPES) - Financing
Code 001. The authors thank the partial support of
the CNPq (Brazilian National Council for Scientific
and Technological Development), FAPEMIG (Foun-
dation for Research and Scientific and Technological
Development of Minas Gerais), and PUC Minas.
REFERENCES
Amor, J., Robles, G., Gonzalez-Barahona, J., Gsyc, A., Car-
los, J., and Madrid, S. (2006). Discriminating de-
velopment activities in versioning systems: A case
study. In Proceedings of the 2nd International Con-
ference on Predictor Models in Software Engineering,
PROMISE’06.
Bach, S., Binder, A., Montavon, G., Klauschen, F., M
¨
uller,
K., and Samek, W. (2015). On pixel-wise explana-
tions for non-linear classifier decisions by layer-wise
relevance propagation. PLoS ONE, 10.
Breiman, L. (2001). Random forests. Machine Learning,
45(1):5–32.
Caruana, R. and Niculescu-Mizil, A. (2006). An empiri-
cal comparison of supervised learning algorithms. In
Proceedings of the 23rd International Conference on
Machine Learning, ICML’06, pages 161–168.
Chen, T. and Guestrin, C. (2015). XGBoost : Reliable large-
scale tree boosting system. In Proceedings of the NIPS
2015 Workshop on Machine Learning Systems, Learn-
ingSys’15.
Chen, T. and Guestrin, C. (2016). XGBoost: A scalable tree
boosting system. In Proceedings of the 22nd Interna-
tional Conference on Knowledge Discovery and Data
Mining, KDD’16, pages 785–794.
Claesen, M. and De Moor, B. (2015). Hyperparameter
search in machine learning. In Proceedings of the 11th
Metaheuristics International Conference, MIC’15.
Cohen, J. (1960). A coefficient of agreement for nominal
scales. Educational and Psychological Measurement,
20(1):37–46.
Datta, A., Sen, S., and Zick, Y. (2016). Algorithmic trans-
parency via quantitative input influence: Theory and
experiments with learning systems. In Proceedings of
the IEEE Symposium on Security and Privacy, SP’16,
pages 598–617.
dos Santos, G. E. and Figueiredo, E. (2020). Commit
classification using natural language processing: Ex-
periments over labeled datasets. In Proceedings of
the 23rd Iberoamerican Conference on Software En-
gineering, CIbSE’20.
Fischer, M., Pinzger, M., and Gall, H. (2003). Populat-
ing a release history database from version control
and bug tracking systems. In Proceedings of the
International Conference on Software Maintenance,
ICSM’03, pages 23–32.
Fluri, B. and Gall, H. C. (2006). Classifying change types
for qualifying change couplings. In Proceedings of
the 14th IEEE International Conference on Program
Comprehension, ICPC’06, pages 35–45.
Friedman, J. (2001). Greedy function approximation:
A gradient boosting machine. Annals of Statistics,
29(5):1189–1232.
Gorla, N. and Lam, Y. W. (2004). Who should work with
whom?: Building effective software project teams.
Communications of the ACM, 47(6):79–82.
Gupta, M. (2017). Improving software maintenance using
process mining and predictive analytics. In Proceed-
ings of the IEEE International Conference on Soft-
ware Maintenance and Evolution, ICSME’17, pages
681–686.
Hattori, L. P. and Lanza, M. (2008). On the nature of com-
mits. In Proceedings of the 33rd ACM/IEEE Interna-
tional Conference on Automated Software Engineer-
ing, ASE’18, pages 63–71.
Herraiz, I., Robles, G., Gonzalez-Barahona, J. M.,
Capiluppi, A., and Ramil, J. F. (2006). Compari-
son between slocs and number of files as size met-
rics for software evolution analysis. In Proceedings of
the Conference on Software Maintenance and Reengi-
neering, CSMR’06, pages 8 pp.–213.
Hindle, A., German, D., Godfrey, M., and Holt, R. (2009).
Automatic classification of large changes into mainte-
nance categories. In Proceedings of the 17th IEEE In-
ternational Conference on Program Comprehension,
ICPC’09, pages 30–39.
Kohavi, R. (1995). A study of cross-validation and boot-
strap for accuracy estimation and model selection.
In Proceedings of the 14th International Joint Con-
ference on Artificial Intelligence, IJCAI’95, page
1137–1143.
Levin, S. and Yehudai, A. (2016). Using temporal and se-
mantic developer-level information to predict mainte-
nance activity profiles. In Proceedings of the IEEE In-
ternational Conference on Software Maintenance and
Evolution, ICSME’16, pages 463–467.
Levin, S. and Yehudai, A. (2017a). 1151 com-
mits with software maintenance activity labels
(corrective, perfective, adaptive). Available:
https://doi.org/10.5281/zenodo.835534.
Levin, S. and Yehudai, A. (2017b). Boosting automatic
commit classification into maintenance activities by
ICEIS 2021 - 23rd International Conference on Enterprise Information Systems
28