Authors:
Antonio Bevacqua
1
;
Marco Carnuccio
1
;
Francesco Folino
2
;
Massimo Guarascio
2
and
Luigi Pontieri
2
Affiliations:
1
University of Calabria, Italy
;
2
National Research Council of Italy, Italy
Keyword(s):
Data Mining, Regression, Clustering, Business Process Analysis.
Related
Ontology
Subjects/Areas/Topics:
Artificial Intelligence
;
Data Mining
;
Databases and Information Systems Integration
;
Enterprise Information Systems
;
Sensor Networks
;
Signal Processing
;
Soft Computing
Abstract:
This paper presents a novel approach to the discovery of predictive process models, which are meant to support the run-time prediction of some performance indicator (e.g., the remaining processing time) on new ongoing process instances. To this purpose, we combine a series of data mining techniques (ranging from pattern mining, to non-parametric regression and to predictive clustering) with ad-hoc data transformation and abstraction mechanisms. As a result, a modular representation of the process is obtained, where different performance-relevant variants of it are provided with separate regression models, and discriminated on the basis of context information. Notably, the approach is capable to look at the given log traces at a proper level of abstraction, in a pretty automatic and transparent fashion, which reduces the need for heavy intervention by the analyst (which is, indeed, a major drawback of previous solutions in the literature). The approach has been validated on a real app
lication scenario, with satisfactory results, in terms of both prediction accuracy and robustness.
(More)