5.4 Using LCPMG for Creation of a Bayesian Network for IT Governance
Performance Prediction
The LCPMG is suitable for generating the CPMs of nodes that are linearly related to
one another. In the case of IT governance maturity prediction, the Bayesian network
has three hierarchical levels. The first level contains the measureable, yet not control-
lable IT governance performance node. The second level contains 34 IT process
nodes that are controllable, but not measureable. On the third level, 136 measureable
and controllable IT governance maturity indicator nodes reside, cf. Fig. 1. The CPMs
of all nodes at all levels must be defined, and the LCPMG can be applied stepwise in
order for the network to learn the CPMs.
Calculate the regressions for all IT governance maturity indicator nodes and the IT
governance performance node. Use the regressions to assign normalized weights w
i
to
each of the four node types; activities, metrics, documents and responsibilities.
The maturity for an IT process, m
p
, can be calculated as
prpdpmpap
rmlwdmlwmmlwamlwm _*_*_*_* +++=
. Calculate the m
p
for each of the
N*34 IT processes, where N represents the number of different observations made.
Use LCPMG to determine the CPMs for each of the 34 IT process nodes, based on
the m
p
:s and the ITG_Performance node. Use LCPMG to determine the CPMs for
each of the 136 maturity indicator nodes, based on maturity levels for the maturity
indicators, and the m
p
:s. Finally, the prior of the ITG_Performance node is set by
analyzing the occurrence of each one of the possible levels pl0-pl5.
CPM for PO4 and ITG_Performance (rounded)
pl0 pl1 pl2 pl3 pl4 pl5
ml5 0,01 0,01 0,01 0,02 0,06 0,17
ml4 0,01 0,01 0,04 0,13 0,26 0,39
ml3 0,03 0,10 0,22 0,35 0,40 0,33
ml2 0,19 0,32 0,40 0,35 0,23 0,10
ml1 0,42 0,39 0,27 0,14 0,05 0,01
ml0 0,35 0,18 0,07 0,02 0,01 0,01
Fig. 4. Calculations of IT process PO4’s maturity and observations of ITG_Performance (small
dots), the linear approximation of the relation between them (ITG_Performance = 0.60*PO4 +
0.69, S = 0.96), and how these fit into the CPM (colored bubbles). The resulting CPM is shown
to the right.
S denotes the standard deviation of the residuals [1],[27]. A small S indicates a good
fit to the linear model. If only a limited amount of datasets have been used in order
for the Bayesian network to learn, all levels of ITG_Performance have perhaps not
been observed. This can be corrected for by using Laplace’s estimation, i.e. add 1 to
the number of observations assigned to each state [10]. In this way, no zeros will be
present in the resulting CPM and it is thus resulting in a better and more smoothly
predicting Bayesian network. Fig.
4 shows observations for Y = PO4 (Define the IT
processes, Organization and Relationships) and X = ITG_Performance, the linear
approximation and a graphic representation of the probability mass for each cell in
177