to derive implicit knowledge from provenance data
related to software process. Its main goal is to assist
software managers to improve future executions and
reuse processes.
As a first step to OntoComplex specification, in
order to identify the deficiencies in the domain, a
systematic review was conducted to find relevant
works in the area, specially works which propose
techniques, frameworks, tools or procedures that
could support project managers to make decisions
based on data.
(Gencel et al, 2013) and (Naedele et al, 2015)
propose models to deals with metrics and
measurement programs. (Fonseca et al, 2017)
proposed an ontology-based approach for
measurement systems integration.
A systematic literature review was performed by
(Tahir et al, 2016) to answer questions about software
measurement programs. Challenges faced by
companies which adopt data-driven decision making
are presented by (Bosch, 2017).
This paper is organized as follows: Section 1
presents the introduction. Section 2 discusses the
research context. In Section 3 OntoComplex
architecture is presented and in Section 4 some
related works are detailed. Finally, in Section 5
conclusions are discussed.
2 RESEARCH CONTEXT
Software development is a complex and
unpredictable activity (Acuna et al, 2000). Through
software process monitoring and analysis, it is
possible to bring knowledge from previous process to
new processes, finding and understanding
vulnerabilities and deficiencies. One approach that
can help in that tasks is data provenance, a
“description of the origins of a piece of data and the
process by which it arrived in a
database” (Buneman et al., 2001).
Data provenance is a metadata that helps a better
understanding about data origin, which process have
generated or influenced it, its context and how it was
produced used until the moment it was stored.
Provenance describes which processes and steps are
related to a data lifecycle. Considering software
process context, a process uses a data and can
generate another data, which can be input in another
process or be relevant for a software manager
decision making, which, based on this information
can find the origin of errors, data accuracy, related
users, execution time, execution date, used
configuration parameters, among others. Benefits
from using data provenance can be even more
relevant when associated to technologies that help
finding implicit knowledge, improving decision
making strategies. Among these technologies, we can
cite ontologies (Prabhune et al., 2016) and complex
networks (Harary, 1969).
Ontologies are models that represent knowledge
about a domain, which are able to express entities,
relationships and rules related to that domain
(Prabhune et al., 2016). The Prov-O ontology, part of
PROV model from W3C, is specified using OWL2
language (Web Ontology Language). The Prov-O has
classes, properties and restrictions that can be used to
represent and acquire provenance information,
generated by different systems and in different
contexts. This model can also be specialized, adding
new classes and properties to express knowledge
from different applications and domains (Belhajjame
et al., 2012). In this context, ProvONE (Cuevas-
Vicenttín et al., 2015) is a Prov-O ontology
specialization, to be used in scientific workflows
domain.
ProvONE ontology can also be interpreted as a
graph, with nodes and edges. In this paper is proposed
a representation of this model as a complex network,
where nodes and edges are related to the model
entities and properties. With this new representation
it is possible to better understand data influence and
behaviour, which may generate strategic information
to help decision making. In this vein, OntoComplex
architecture was proposed considering different
technologies, as part of the process management
framework proposed in (Costa et al., 2016b) and
(Dalpra et al., 2015) with the main objective to derive
strategic knowledge to improve future software
process executions.
3 OntOComplex
Considering the ProvONE model, and the use of
ontologies, complex networks and visualization
techniques, this section details OntoComplex
architecture (Figure 1), which aims to provide
software process improvement. The main goal of this
architecture is to use software process and its
execution data analysis, to help managers to make
decisions based on acquired knowledge to improve
future executions.
A partnership with a medium size development
company was proposed with the goal of evaluating
OntoComplex functionalities and its architecture use.
A real dataset from error solution and new
functionalities requirements from an ERP (Enterprise
ICEIS 2018 - 20th International Conference on Enterprise Information Systems
62