5 CONCLUSION AND FUTURE
WORK
Process-aware information systems record detailed
process execution information concerning the
processes they support. Typically, the assignment of
case identifiers to the process instances is missing,
due to data centric architecture of the underlying
information systems (van der Aalst, 2006). In this
paper, we introduce an approach to simulate
synthetic event log from scratch rather than repairing
incomplete unlabeled event log. We use as input the
process profile that defines the activity vocabulary,
i.e. a list of valid activity labels, types and occurrence
priority, and Petri net in tabular form. Proposed
approach synLogGen also takes both noise and
surprise effects into account at applying Petri net
firing rule. According to completeness and soundness
outcomes obtained at the experimental runs with
varying noise factor values, the combination of
proposed approach synLogGen with the process
discovery approach introduced in (Esgin, Senkul &
Cimenbicer, 2010) is robust to the corresponding
noise effect at distilling the process behaviors in a
relatively noisy process execution environment.
In the evaluation step, four process models that
are referenced in (Bayomie et al., 2016) are
determined as benchmark processes. Two prior
approaches, i.e. DCI given in (Bayomie et al., 2016)
and E-Max given in (Ferreira & Gillblad, 2009),
which priorly handled the corresponding benchmark
processes, are selected as candidate. According to the
accuracy aspect, DCI and the combination of
proposed approach synLogGen with the process
discovery approach introduced in (Esgin, Senkul &
Cimenbicer, 2010) are respectively more accurate
than E-Max in terms of dissimilarity metric (Esgin &
Senkul, 2011), which measures the discrepancies
between reference process model and mined process
behaviors on a graph-based structural similarity
measurement. Additionally, proposed approach
synLogGen generates more structured and lasagna-
like process models with respect to moderate
connectivity values and less variance with reference
process model.
As the future work, we intend to implement an
integer-programming (IP) based synthetic log
generation, in which the process behaviors at
reference process model and the activity frequencies
at the unlabeled event log act as the main constraints.
The corresponding IP-based approach will simulate
maximal process traces without any relaxations at
these constraints.
REFERENCES
Bayomie, D., Helal, I. M. A., Awad, A., Ezat, E., &
Elbastawissi, A. (2016). Deducing Case IDs for
Unlabeled Event Logs. International Conference on
Business Process Management, vol. 256, 242-254.
Becker, M., & Laue, R. (2012). A Comparative Survey of
Business Process Similarity Measures. Computers in
Industry, vol. 63(2), 148-167.
Buijs, J.C., & Reijers, H.A. (2014). Comparing Business
Process Variants Using Models and Event Logs.
Enterprise, Business-Process and Information Systems
Modeling, 154-168.
Burrett, S., & Geng, L. (2008). Bayesian Classification of
Events for Task Labeling Using Work-Flow Models.
Proceedings of the 4th Workshop on Business Process
Intelligence.
Desel, J. & Esparza, J. (1995). Free Choice Petri Nets.
Cambridge Tracts in Theoretical Computer Science,
vol. 40, Cambridge University Press, Cambridge, UK.
Doganata, Y. N. (2011). Designing Internal Control Points
in Partially Managed Processes by Using Business
Vocabulary. ICDE Workshops. 267–272.
Doganata, Y. N., & Curbera, F. (2009). Effect of Using
Automated Auditing Tools on Detecting Compliance
Failures in Unmanaged Processes. BPM. LNCS, vol.
5701, 310-326.
Dustdar, S., & Gombotz, R. (2006). Discovering Web
Service Workflows Using Web Services Interaction
Mining. International Journal of Business Process
Integration and Management, vol. 1(4), 256.
Esgin, E., & Senkul, P. (2009). Hybrid Approach to
Process Mining: Finding Immediate Successors of a
Process by Using From-to Chart. International
Conference on Machine Learning and Applications,
664-668.
Esgin, E., & Senkul, P. (2011). Delta Analysis: A Hybrid
Quantitative Approach for Measuring Discrepancies
between Business Process Models. 6th International
Conference, Hybrid Artificial Intelligence Systems
(HAIS), vol. 6678, 296-304.
Esgin, E., Senkul, P., & Cimenbicer, C. (2010). A Hybrid
Approach for Process Mining: Using From-to Chart
Arranged by Genetic Algorithms. 5th International
Conference, Hybrid Artificial Intelligence Systems
(HAIS), vol. 6076, 178-186.
Ferreira, D. R., & Gillblad, D. (2009). Discovering
Process Models from Unlabelled Event Logs. BPM
LNCS, vol. 5701, 143–158.
Gerke, K., Cardoso, J., & Claus, A. (2009). Measuring the
Compliance of Processes with Reference Models. On
the Move to Meaningful Internet Systems: OTM 2009,
Confederated International Conferences: CoopIS, IS,
DOA and ODBASE, vol. 5870, 76-93.
Gunther, C. W., & van der Aalst, W. M. P. (2006).
Process Mining in Case Handling Systems.
Multikonferenz Wirtschaftsinformatik 2006.
Jagadeesh Chandra Bose, R. P., & van der Aalst, W. M. P.
(2012). When Process Mining Meets Bioinformatics.