A REFERENCE ARCHITECTURE FOR MANAGING BUSINESS
PROCESS VARIANTS
Ruopeng Lu and Shazia Sadiq
School of Information Technology and Electrical Engineering, The University of Queensland, Brisbane, Australia
Keywords: Business process management, flexible workflows, process evolution, business process analysis.
Abstract: Business process management systems (BPMS) have been prevalent in business information systems, yet
still striving to cope with emerging demands from current business environments. It is particularly
challenging in managing knowledge intensive business processes, which has partially led to the demand for
more complex BPMS functionality such as instance adaptation and streamlined process evolution. On the
other hand, various process analysis and discovery techniques have been developed as an important
component in BPMS. In this paper, we present a technology framework that supports process discovery
from preferred work practices in a flexible process management system. The framework supports instance
adaptation and a systematic approach towards process evolution/improvement.
1 INTRODUCTION
In recent years, there have been many efforts
towards providing agile business process
management (BPM) support. Business process
management systems (BPMS) have been recognised
as a substantial extension to the legacy of workflow
management systems (WFMS). The BPM life cycle
(van der Aalst et al, 2003b) identifies that apart from
process design, deployment and enactment,
supporting process diagnosis differentiates BPMS
from traditional WFMS. The process diagnosis
phase which refers to a wide range of BPM
activities, has been the prime arena for both
academia and industry, especially for business
process analysis (BPA) and process discovery (van
der Aalst et al, 2003b; Casati, 2005). Generally,
these post-execution activities are to identify and
resolve operational process problems, discover
preferred work practices, and provide business
intelligence.
Nevertheless, in the dynamic environment of
collaborative and e-business today, it is essential that
BPM technology supports the business to adapt to
changing conditions, where different process models
could be derived from existing ones to tailor
individual process instances. It is evident that work
practices at the operational level are often diverse,
incorporating the creativity and individualism of
knowledge workers and potentially contributing to
the organization’s competitive advantage. This
diversity needs to be both encouraged and
controlled. A major difficulty in this regard is the
requisite knowledge, that drives the diverse practices
at an operational level, is only tacitly available. This
knowledge constitutes the corporate skill base and is
found in the experiences and practices of individual
workers, who are domain experts in a particular
aspect of the overall operations.
There have been proposals from both academia
(Sadiq & Orlowska, 2005; van der Aalst et al, 2005;
Rinderle & Reichert, 2006) and industry (Ultimus,
2004, ILOG, 2006, Tibco, 2006) to transfer process
modelling efforts from business owners and business
analysts, to domain experts who have the knowledge
in performing activities in the process. The common
practice in these proposals has been the support for
deploying partial process models, which contain
some repetitive procedures that require less or no
flexibility for execution, but also contain loosely
coupled process activities that warrant a high level
of customization. When an instance of such process
is instantiated, a complete process model is specified
by the domain expert, where the set of loosely-
coupled activities is given an execution plan
according to instance-specific conditions, possibly
some invariant process constraints, and their
expertise. We shall refer to such approaches as
instance adaptation.
We believe that current BPM solutions only
provide limited capability for instance adaptation.
416
Lu R. and Sadiq S. (2007).
A REFERENCE ARCHITECTURE FOR MANAGING BUSINESS PROCESS VARIANTS.
In Proceedings of the Ninth International Conference on Enterprise Information Systems - ISAS, pages 416-421
DOI: 10.5220/0002400304160421
Copyright
c
SciTePress
Techniques are required where part of the modelling
effort is transferred to domain experts who make
design decisions at runtime.
At the same time, instance adaptation also
imposes new problems and challenges on existing
techniques for process diagnosis activities. During
instance adaptation, the process model of each
process instance may be uniquely designed. An
executed process instance reflects a variant of
realising the same process goals, and is called a
process variant. One of many challenging problems
is how to avail the knowledge from a diversity of
process variants as an information resource. This can
be considered as a particular area of process
discovery.
Process discovery often refers to so-called best
practices, or the best way to perform a particular
type of business process (O'Leary & Selfridge,
2000). The measures of best practices are manifold.
For example, from the resource utilization
perspective, a process variant is regarded as the best
practice if the least number of performers are
required. While in some other perspective, the
process model with the largest number of instances
can be the best since it is the most trusted. These
measures correspond to business metrics (Casati,
2005). The way to capture, manage and diffuse
knowledge associated with the best practices can be
found in many knowledge-based systems.
The contribution of this paper is to present a
reference architecture that is in accordance with
these new demands, for the retainment and retrieval
of process variants from a flexible business process
modelling and execution framework that supports
instance adaptation.
The rest of this paper is as follows. Section 2
presents a brief overview on the proposed
framework supporting instance adaptation and
process discovery. In section 3, a reference
architecture for managing process variants is
introduced, and its major components and
functionality are presented in section 4. The related
work is reviewed in section 5, followed by the
conclusion in section 6.
2 FRAMEWORK OVERVIEW
We utilize a framework for business process
modelling and execution that attempts to achieve the
balance between flexibility and control (Lu et al,
2006a). The framework consists of two major
components: (1) A constraint-based process
modelling approach, called Business Process
Constraint Network (BPCN); and (2) a repository
for case specific process models, called Process
Variant Repository (PVR).
In BPCN, business process requirements are
extracted and transformed into a minimal set of
process constraints, which are represented in a way
that is readable by human and supporting analysis
and validation for correctness. Instance adaptation
takes place when a customised process model for
process instance is constructed (or completed) at
runtime (Lu et al, 2006a). The BPCN execution
environment allows for the generation of potentially
a large number of customized process variants.
Figure 1 shows the process models of four different
process variants, all of which satisfies the same set
of process constraints, namely, task T1 must perform
before tasks T2, T5 and T6.
Figure 1: Examples for process models of process variants
satisfying the same process constraints.
Although all these process variants achieve the
same process goals, they may vary significantly in
many aspects, such as process models and resource
allocation patterns. It is important to note that the
way that domain experts reason about the situation
during process modelling cannot be truly
reconstructed using computational techniques.
In our approach, the purpose of PVR is to
capture, structure and subsequently extract the
decisions that led to a particular design as far as
possible. These design decisions are embedded in
various process properties e.g., process data values.
Over time, the repository can build into an immense
corporate resource.
In the meantime, there are various occasions
when precedents of process variants need to be
retrieved. For example, during instance adaptation,
domain experts may refer to a list of past process
variants sharing common properties similar to
A REFERENCE ARCHITECTURE FOR MANAGING BUSINESS PROCESS VARIANTS
417
current situation. Using appropriate analysis
techniques, a collection of sufficiently similar
process variants can be generalized as the
preferred/successful work practice, and consequently
lead to process improvement/evolution. The retrieval
process in PVR according to a user specified
requirement is supported.
3 PROCESS VARIANT
REPOSITORY
PVR provides a well-formed structure to store past
process designs, as well as an instrument to utilize
process variants as an information resource. The
retainment of executed process variants in the
repository and the subsequent retrieval of preferred
process variants are the two major functions of PVR.
Figure 2 presents an overview of the PVR
reference architecture. In what follows, we present
an overview of various functions to manage PVR in
stepwise order (as annotated in Figure 2). The first
three steps correspond to the retainment process,
while steps 4 – 6 correspond to the retrieval process.
Step1: An executed process variant is received
from BPCN through the interface between PVR and
BPCN. The process variant is to be retained in the
repository according to the schema in PVR, and is
referred to as a case of the business process. A set of
features are extracted from the case in order to
formulate a formal case description that can identify
the new case (cf. Section 4). It is possible for a set of
similar cases to share the same description.
Step2: The feature index is updated according to
the description of the new case. The feature index in
PVR (Lu et al, 2006b) is an organised structure of
case descriptions, and is maintained by the index
management component.
Step3: The new case is stored in the repository.
This step builds the PVR population and with time is
expected to build into a valuable corporate resource.
Step4: At a later time, a query is formulated to
specify the case retrieval requirement. A query
expresses user requirements in terms of case
features. A query may represent a partial or
complete description for a case, or multiple cases
sharing same description. The query requirement is
formulated with the help of the query processing
component.
Step5: Case descriptions are searched utilizing the
feature index to find matching cases according to
query requirements. The goal is to return a set of
sufficiently similar cases. Similarity between case
descriptions and the query requirements is
determined by a predefined similarity measure (cf.
Section 4).
Figure 2: Reference Architecture for PVR.
Step6: The initial matching cases are retrieved
from the repository, and the best matches are
selected from the set of initial matches according to
the degree of similarity compared to the query case,
called the similarity score. The further selection
process involves a more detail comparison, e.g.,
comparing the process model of a case against the
query process model (if query requirements contain
structural features).
4 PVR FEATURES AND
COMPONENTS
At any time, the repository may be queried for
precedents under specified criteria. In this section,
we discuss the major components and functionality
of the PVR reference architecture that support the
above requirements.
Case Schema
A process variant is a complex object containing
various design and executional properties. The case
schema defines the structure and data content
according to which process variants are stored. In
PVR, a feature is an attribute-value pair used to
ICEIS 2007 - International Conference on Enterprise Information Systems
418
describe the property of a case. Dimension refers to
the attribute part of the feature. We have identified
the following dimensions of features that the schema
should cover:
Structural dimension contains the customized
process model (typically graphical), which is
customized for the process variant during instance
adaptation;
Behavioural dimension contains execution
properties of the process variant, such as temporal
properties of tasks involved in the process
execution, the performers and their roles of
process tasks, and the values of process-relevant
data for the variant;
Contextual dimension contains descriptive
information (annotations) from the process
modeller about the reasoning behind the design of
a particular process variant, its goals and
intentions. There is evidence that semantic
technologies (description logic, ontologies etc.)
may assist in the formalization of the contextual
dimension.
Feature Index
In PVR, the structured collection of case
descriptions reflects the concretized knowledge that
led to the designs of different cases during instance
adaptation. The feature index is used to differentiate
different cases. Furthermore, as the number of cases
can be potentially very large, the index also
facilitates effective case retrieval in subsequence.
Figure 3: PVR feature index structure.
PVR uses a three-layer hierarchy to manage the
feature index, as illustrated in Figure 3. The set of
unique features collected from all retained cases are
on the top layer, where each feature points to one or
more case descriptions. A case description contains
a minimal list of features required to identify a
particular case, which typically covers all three
dimensions. A case description points to one or more
cases, where a case is the collection of all required
features of a process variant according to the case
schema.
The process of inserting new cases and adjusting
the index structure constitutes the repository update
approach. PVR update determines the way the case
is indexed by extracting features from the case and
computing the index values. The description of the
new case is prepared by extracting relative
information form the process variant according to a
preference list of features designed by domain
experts, which covers the three dimensions of
features. A detail presentation of the feature index
management approach can be found in (Lu et al,
2006b).
Query Formulation
Query is the interface between users and PVR,
through which a wide range of search criteria for
case retrieval is specified. Examples can be features
in behavioural and contextual dimensions such as to
find all cases with execution duration of not more
than 3 days, or performers of the role senior
management were involved. Such queries can mostly
be satisfied using well established techniques. We
are specifically interested in providing a facility to
find process variants for queries that provide
complex (structural) criteria. For example, find all
case in which activities T2 was performed
immediately after T1, and T3 was performed in
parallel with T1 and T2 (cf. Figure 4(b)).
Figure 4: Queries containing (a) a complete process model
and (b) (c) partial process models.
Furthermore, a hybrid of features in different
dimensions (or so-called multi-aspect queries) also
can be provided, e.g., find all cases where activities
T1, T2 and T3 were performed by a senior
management role in sequence, and finished
A REFERENCE ARCHITECTURE FOR MANAGING BUSINESS PROCESS VARIANTS
419
execution within 1 - 2 days. The issue here is how to
characterize the structural features of process
variants such that queries that require searching the
repository on the structural aspect can be efficiently
satisfied. The structural features of the query can
resemble a complete process model (cf. Figure 4(a)),
which specifies the exact structure required for the
cases to be retrieved; or a partial process model (cf.
Figure 4(b), (c)), which contains a fragment of the
process model characterizing the desired structural
properties to be retrieved.
Case Similarity
An essential concept for case management is the
similarity measure for the cases to be compared,
i.e., how to determine the degree of match between
two cases, or between a case in repository and a
desired case described in a query. The similarity
measure provides quantitative measurement to
calculate how similar two cases are.
Recall that each case is characterised by a
selection of features in its case description,
representing the factors that lead to the design of
particular process variant. As a result, the case
matching problem is transferred into the comparison
of case features in each dimension. However,
different types of features (in different dimensions)
generally have specific semantics, the similarity
measure for each dimension is thus to be defined
separately with respect to its semantic
appropriateness. The overall approach is:
Define the similarity measure for each feature
type in every dimension, i.e., for each type of
feature, define a function which takes as input a
pair of features to be compared, and outputs a
similarity score between 0 (dissimilar) and 1
(complete match);
Calculate the overall similarity by aggregating the
similarity score from each dimension. A weighted
approach can be used if some dimension is
prioritized by assigning a higher weight than the
others.
Consider comparing features in structural
dimension as an example. If the process model in
Figure 4(a) (referred to as 4(a)) is to be compared
with those in Figure 1. 1(a) is the most similar to
4(a) since in 1(a) all structural constraints between
the tasks in 4(a) are preserved and thus having the
highest similarity score. 1(a) is said to be
subsuming 4(a) since it contains more tasks than
4(a). If it is not required to have the same set of tasks
in both models, the structural similarity score is 1
between 1(a) and 4(a). In addition, 1(a) is called a
complete match to 4(a).
On the other hand, 1(b) is a partial match to
4(a) for partially preserving process constraints
between the tasks in 4(a). To elaborate further, in
4(a), tasks T2 and T3 are in parallel branches and
can be executed in any order. While in 1(b), T3 can
only be executed after T2. As a result, 1(b) is a more
restrictive model than 4(a), which excludes certain
execution possibilities from 4(a), and hence is less
similar to 4(a) than 1(a). For the same reason, 1(d) is
a partial match to 4(a) and the similarity rank to 4(a)
is lower than 1(b) since there is only one execution
possibility in 1(d). Notably however, 1(c) is also a
partial match to 4(a) but does not preserve the
structural constraint between tasks T1, T2, T3 and
T4, while containing extra execution possibilities.
Specifically, we have developed a methodology
(Lu & Sadiq, 2006) for comparing both exact and
partial matching cases based on complex control
flow features, where a similarity function has been
defined to produce the similarity score between 0
and 1.
Case Retrieval
In our framework, a progressive refinement
approach is used for case retrieval (Steps 4 - 6 in
Figure 2 is repeated). Given a query of case features
(typically in multi-aspect queries), the retrieval
process starts with searching for matching cases by
using a subset of required features (e.g., structural
features). Relevant features in the feature index are
searched and compared with the query features, and
a set of initial matching cases is retrieved from the
repository including complete and partial match
cases, as directed by the feature index. For partial
match cases, we use a ranking mechanism to provide
a quantitative measure for the similarity. A
predefined threshold value by domain expert is used
to filter the best matches (e.g., similarity score
0.75). A more restrictive search is then performed,
where the query case and the best matches from the
set of initial match cases are further compared with
the remaining query features. This process can be
repeated until the preferred case is selected. PVR
users can be involved in this process, which is to
decide which features to be used as comparison and
in what order.
We take the structural matching as an example to
illustrate the retrieval process. Suppose a given
query contains the structural features in Figure 4(c)
and requires to retrieve exact matching cases (i.e.,
threshold value = 1.0). A set of cases, in which the
process models contain tasks T1, T4, T5 and T6 are
first retrieved. Cases containing process model 1(d)
(cf. Figure 1) are complete matches to the query case
and are retrieved for further comparison, since
process model 1(d) subsumes 4(c). Cases contain
ICEIS 2007 - International Conference on Enterprise Information Systems
420
process models 1(a), 1(b) or 1(c) are partial matches
to the query case and hence not to be retrieved.
5 RELATED WORK
Business process analysis (BPA) involves
monitoring and analysis of process execution
patterns and performance. Process mining (van der
Aalst, 2003a) is one such approach, which is to
diagnose operational processes by extracting
information from process execution logs. The
proposed PVR reference architecture is different
from process mining approach, with emphasis on
supporting knowledge acquisition and process
discovery in BPM. In particular, it targets the reuse
of past instances of process execution to achieve
new operational goals in similar situations.
Some approaches in existing literature such as
Case-Based Reasoning (CBR) can be relevant to
address the challenges in PVR management that we
have found to link closely to our problem. There are
a number of recent proposals for process
management approaches based on CBR techniques
(Madhusudan et al, 2004; Weber et al, 2005), which
have demonstrated the possibilities to apply CBR
techniques to achieve certain BPM goals.
6 CONCLUSIONS
Variations in work practice often represent the
competitive differentiation within enterprise
operations. In this paper, we have argued for the
value of variants in BPM platforms. We have
presented a reference architecture as a foundation for
designing and implementing a process variant
repository (PVR) that addresses advanced
requirements for instance adaptation, process
redesign and post-execution analysis. The presented
components in PVR provide effective means of
searching and matching process variants against a
given query, and generate result sets that can be
conveniently ranked. The results of the proposed
reference architecture can provide deep insights into
ongoing work practices, identify areas of process
improvement, and contribute to systematic and well-
informed process evolution.
REFERENCES
van der Aalst, W. M. P., van Dongen, B.F., Herbst, J.,
Maruster, L., Schimm, G., Weijters, A.J.M.M., 2003a.
Workflow Mining: A Survey of Issues and
Approaches. Data & Knowledge Engineering, vol. 47,
pp. 237 – 267.
van der Aalst, W. M. P., t. Hofstede, A. H. M., Weske, M.,
2003b. Business Process Management: A Survey. In
BPM 2003, Conference on Business Process
Management, Eindhoven, The Netherlands.
van der Aalst, W. M. P., Weske, M., 2005. Case
handling: a new paradigm for business process
support. Data & Knowledge Engineering, vol. 53, no.
2, pp. 129-162.
Casati, F., 2005. Industry Trends in Business Process
Management: Getting Ready for Prime Time. In
DEXA 2005, 16th International Workshop on
Database and Expert Systems Applications,
Copenhagen, Denmark.
ILOG 2006, Business Rule Management (BRM) with
ILOG JRules 6, BRMS Without Compromise. White
Paper, ILOG, www.ilog.com
Lu, R., Sadiq, S., Padmanabhan, V., Governatori, G.,
2006a. Using a Temporal Constraint Network for
increased flexibility in Business Process Execution. In
ADC 2006, Seventeenth Australasian Database
Conference, Hobart, Australia.
Lu, R., Sadiq, S., Governatori, G., 2006b. Utilizing
Successful Work Practice for Business Process
Evolution. In BIS 2006, 9th International Conference
on Business Information Systems, Klagenfurt, Austria.
Lu, R., Sadiq, S., 2006. Managing Process Variants as an
Information Resource. In BPM 2006, 4th International
Conference on Business Process Management.
Vienna, Austria, September 2006.
O'Leary, D., Selfridge, P., 2000. Knowledge Management
for Best Practices. Communication of ACM, vol.43,
no. 11.
Madhusudan, T., Zhao, L., Marshall, B., 2004. A Case-
Based Reasoning Framework for Workflow Model
Management. Data Knowledge Engineering,
vol.50(1), 87-115.
Rinderle, S., Reichert, M., 2006, Data-Driven Process
Control and Exception Handling in Process
Management Systems. In CAiSE 2006, 18th
International Conference on Advanced Information
Systems Engineering, pp. 273-287.
Sadiq, S., Sadiq, W., Orlowska, M., 2005. A Framework
for Constraint Specification and Validation in Flexible
Workflows. Information Systems, vol.30, no.5, pp.
349-378.
Tibco 2006, Enhancing BPM with a Business Rule
Engine. Tibco White Paper, Tibco, www.tibco.com
Ultimus 2004, Adaptive Discovery: Accelerating the
Deployment and Adaptation of Automated Business
Processes. White Paper, Ultimus Inc.
www.ultimus.com
Weber, B., Rinderle, S., Wild. W., Reichert, M., 2005.
CCBR-Driven Business Process Evolution. In ICCBR
2005, 6th International Conference on Case-Based
Reasoning. Chicago, USA.
A REFERENCE ARCHITECTURE FOR MANAGING BUSINESS PROCESS VARIANTS
421