is one of the popular and most used open source plat-
form (Noten et al., 2017), hence the contribution of
this research is useful for fellow researchers in this
field.
Internal Threats. Internal risks of this study is that
considering UML files as the only UML model based
repository. We used UML files since they are defacto
standard (Ozkaya and Erata, 2020). There are many
other file formats are available that also present UML
modeling.
Risks exist in profiling information for the soft-
ware engineers who participate in software modeling.
It could be the case that the targeted software engi-
neers do not perform modeling frequently and this is
the reason, we find fewer updates on models.
Another risk is that the identified UML models are
created only for teaching or academic purpose for a
short period time. And these models are not repre-
sentative for the models in open source projects. To
mitigate this risk we manually checked the reposito-
ries and confirmed that the selected repositories are
non trivial.
7 CONCLUSION
In this paper, we investigate a sub set of projects that
has UML modeling and the usages of model files in
the open source. Model file commits provide use-
ful information on the updates and maintenance of
the UML models. Additionally, this study investigate
how often contributors update model files. Study re-
sults show that the models are created and maintained
by experienced software engineers.
Further, this study reveals that often UML model
files are created at the very first phase of software
development. Modification or updates are made on
the files mostly at the beginning and in the middle
of software development phase. Study results show
that the model life span is shorter compared to project
life span. Often, repositories with fewer model files
get updated frequently than repositories with higher
number of model files.
This study investigate 17 repositories which are
a small data set representing open source projects. In
the future, we have a plan to include more repositories
in our data set which will be a broad representation of
models usages in the open source.
REFERENCES
Badreddin, O., Lethbridge, T. C., and Elassar, M. (2013).
Modeling practices in open source software. In IFIP
International Conference on Open Source Systems,
pages 127–139. Springer.
France, R., Bieman, J., and Cheng, B. H. (2006). Repos-
itory for model driven development (remodd). In In-
ternational Conference on Model Driven Engineering
Languages and Systems, pages 311–317. Springer.
Gorschek, T., Tempero, E., and Angelis, L. (2014). On the
use of software design models in software develop-
ment practice: An empirical investigation. Journal of
Systems and Software, 95:176–193.
Hebig, R., Quang, T. H., Chaudron, M. R., Robles, G., and
Fernandez, M. A. (2016). The quest for open source
projects that use uml: mining github. In Proceedings
of the ACM/IEEE 19th International Conference on
Model Driven Engineering Languages and Systems,
pages 173–183.
Ho-Quang, T., Hebig, R., Robles, G., Chaudron, M. R., and
Fernandez, M. A. (2017). Practices and perceptions of
uml use in open source projects. In 2017 IEEE/ACM
39th International Conference on Software Engineer-
ing: Software Engineering in Practice Track (ICSE-
SEIP), pages 203–212. IEEE.
Khandoker Rahad (2020). Raw data: Characterization of
software design and collaborative modeling in open
source projects,.
Kobryn, C. (2002). Will uml 2.0 be agile or awkward?
Communications of the ACM, 45(1):107–110.
Noten, J., Mengerink, J. G., and Serebrenik, A. (2017).
A data set of ocl expressions on github. In 2017
IEEE/ACM 14th International Conference on Mining
Software Repositories (MSR), pages 531–534. IEEE.
Osman, M. H. and Chaudron, M. R. (2013). Uml usage in
open source software development: A field study. In
EESSMOD@ MoDELS, pages 23–32.
Ozkaya, M. and Erata, F. (2020). A survey on the prac-
tical use of uml for different software architecture
viewpoints. Information and Software Technology,
121:106275.
Podgorelec, V. and Heri
ˇ
cko, M. (2007). Estimating soft-
ware complexity from uml models. ACM SIGSOFT
Software Engineering Notes, 32(2):1–5.
Reza, S. M., Badreddin, O., and Rahad, K. (2020). Mod-
elmine: A tool to facilitate mining models from open
source repositories. In 2020 ACM/IEEE 23rd Interna-
tional Conference on Model Driven Engineering Lan-
guages and Systems (MODELS). ACM.
Torchiano, M., Tomassetti, F., Ricca, F., Tiso, A., and Reg-
gio, G. (2013). Relevance, benefits, and problems of
software modelling and model driven techniques—a
survey in the italian industry. Journal of Systems and
Software, 86(8):2110–2126.
Yatani, K., Chung, E., Jensen, C., and Truong, K. N. (2009).
Understanding how and why open source contributors
use diagrams in the development of ubuntu. In Pro-
ceedings of the SIGCHI Conference on Human Fac-
tors in Computing Systems, pages 995–1004.
Characterization of Software Design and Collaborative Modeling in Open Source Projects
261