work (on the left) and an implicational relation rule of
propositional logic (A → B) (on the right).
Figure 2: Venn diagram on the left illustrates a causal re-
lationship according to a Bayesian network, and the one on
the right illustrates an implicational relation rule of propo-
sitional logic.
In a propositional logic, whether or not a causal
relationship holds is expressed in binary. On the other
hand,in a Bayesian network, the existence of a causal
relationship is not necessarily represented in binary
but instead by a probability. Figure 2 illustrates these
situations with Venn diagrams to clarify their differ-
ences. Realistically, whether or not a causal relation-
ship is established cannot be treated in binary as done
in propositional logic because there is no guarantee
that the causal relation infallibly holds true for any
case in an actual setting. In fact, in a case where the
probability of the occurrence of an event is extremely
high, we consider that the causal relationship holds
true. Therefore, we regard a Bayesian network as an
extension of propositional logic.
If we limit ourselves to a one-to-one relation be-
tween an antecedent and a consequent, as in Fig. 1,
the combinations of values that A and B can be are as
follows:
A is T ⇒ B is T
A is T ⇒ B is F
A is F ⇒ B is T
A is F ⇒ B is F
The traditional methods for constructing a
Bayesian network optimized all the combinations of
causal relationships. However, in the implication re-
lationship of propositional logic, if an antecedent is
true, a causal relationship can either be true or false,
and if an antecedent is false, the causal relationship
is true. In other words, when an antecedent is false,
a consequent has nothing to do with the establish-
ment of a causal relationship. Therefore, we regard
a Bayesian network as an extension of propositional
logic, and causal relationships with a false antecedent
state need not be taken into account when construct-
ing a Bayesian network.
Therefore, we have proposed a methodology for
constructing a Bayesian network whose causal rela-
tionships are removed if the antecedent state is false
using association rule mining.
2 METHODS
In order to construct a Bayesian network as an ex-
tension of propositional logic, we need to determine
the existence or non-existence of causal relationships
in all random variables. In addition, we must extract
only the causal relationships whose antecedent is T .
Therefore, we can effectively extract causal relation-
ship candidates using association rule mining.
Association rule mining is a typical method
for identifying relationships between variables in
large-scale transactional data (Piatetsky and Frawley,
1991), (R. Agrawal and Swami, 1993). This method
uses three indices called support, confidence, and
lift to evaluate a causal relationship, as illustrated in
Equations (1), (2) and (3) below. Let P(A) denote a
marginal probability of a random variable A, then
support(A ⇒ B) = P(A, B), (1)
confidence(A ⇒ B) = P(B|A), (2)
lift(A ⇒ B) =
P(A, B)
P(A)P(B)
. (3)
Equation 1 expresses the co-occurrence frequency be-
tween random variables and is defined as a percentage
of the transactions that contain both an antecedent and
a consequent in all data transactions. Equation 2 is a
conditional probability and is defined as a percentage
of the transactions that contain both an antecedent and
a consequent in all data transactions that include the
antecedent. Equation 3 is a measurement of the in-
terdependence between the random variables. If its
value is higher than 1.0, the antecedent and conse-
quent are regarded as having a dependency.
Apriori is a classic algorithm for use in associ-
ation rule mining. In order to extract candidates of
random variable combinations that have a correlation,
this algorithm focuses only on combinations of ran-
dom variables that frequently appear in a data store.
The Apriori algorithm is applied when either a ran-
dom variable or combination of random variables oc-
cur infrequently.
2.1 Bayesian Network and CPT
Relationships
Since the values in CPTs are probabilities, they range
from 0 to 1. However, the value assignments in
CPTs cannot be independent from the structure of the
KDIR 2015 - 7th International Conference on Knowledge Discovery and Information Retrieval
212