In this paper, an IRL approach for fuzzy rule
selection is presented in which the degree of
cooperation of each candidate rule with other rules
of the same class is estimated. In this approach, the
final rule-set for classification is constructed by
searching for an appropriate fuzzy rule and adding it
to the rule-set in each step. Then, a simple rule-
weighting mechanism is proposed to reach some
degrees of cooperation/competition among the rules
of different classes. Four UCI ML driven data sets
are then used to evaluate the proposed fuzzy
classification method.
2 FUZZY CLASSIFICATION
RULES
In the design of fuzzy rule-based systems, we face
two conflicting objectives: error minimization and
interpretability maximization. Error minimization
has been used in many applications of fuzzy rule-
based systems in the literature while the
interpretability was not usually taken into account in
those applications. Recently, the tradeoff between
these two objectives has been discussed in some
studies. When fuzzy rule-based systems are used for
two-dimensional problems, fuzzy rules can be
represented in a tabular form (Ishibuchi and
Yamamoto, 2004). Figure 1 shows an example of a
fuzzy rule table for a two-dimensional pattern
classification problem. In this figure, we have the
following four fuzzy rules:
If
1
is small and
2
is small then Class 1,
If
1
is small and
2
is large then Class 2,
If
1
is large and
2
is small then Class 3,
If
1
is large and
2
is large then Class 4,
where small and large are linguistic values defined
by triangular membership functions.
Figure 1: Four fuzzy rules in the 2-dimensional pattern
space
[0,1] [0,1]×
.
As shown in Figure 1, fuzzy rules for 2-dimensional
problems can be written in a human understandable
manner using the tabular form representation. When
fuzzy rule-based systems are applied to high-
dimensional problems, their interpretability is
significantly degraded due to the two difficulties: the
increase in the number of fuzzy rules and the
increase in the number of antecedent conditions of
each fuzzy rule.
Assume that we have m labeled patterns
X
p
=(x
p1
,…, x
pn
)
, p=1,2,…,m from M classes in an n-
dimensional continuous pattern space is given. For
classification problems with n number of attributes
,
as in (Ishibuchi and Yamamoto, 2004), we use fuzzy
rules of the following form:
Rule
i
R : If
1
is
1i
and … and
n
is
in
then class
i
C with
i
CF
(1)
where R
i
is the i-th rule, X=(x
1
,…,x
n
) is an n-
dimensional pattern vector, A
ij
is an antecedent fuzzy
set (i.e., linguistic value such as small or large in
Figure 1), C
i
is the class label of R
i
, and CF
i
is the
weight of R
i
. It should be noted that the consequent
part of our fuzzy rule for classification problems is
totally different from standard fuzzy rules for
function approximation problems. The consequent
of our fuzzy rule is a non-fuzzy class label and the
rule weight CF
i
is a real number in the unit interval
[0, 1]. The rule weight is used as the strength of each
fuzzy rule when a new pattern is classified by a
fuzzy rule-based classification system (see
(Ishibuchi and Nakashima, 2001) for details).
The compatibility grade of a training pattern X
p
with the antecedent part A
i
=(A
i1
,…,A
i,n
) of fuzzy rule
R
i
is calculated using product operator as,
1
1
() ( ) ( )
ii in
Ap A p A pn
XX X
μμ
×⋅⋅⋅×
(2)
where
()
ij
A
μ
is the membership function of the
antecedent fuzzy set
ij
.
3 CANDIDATE RULE
GENERATION
In our approach, fuzzy if-then rules are generated
from numerical data. Then, the generated rules are
used as candidate rules from which a small number
of fuzzy if-then rules are selected in an iterative
manner. The domain interval of each attribute x
i
is
discretized into K
i
fuzzy sets. Figure 2 shows some
examples of fuzzy discretization.
INDUCING COOPERATION IN FUZZY CLASSIFICATION RULES USING ITERATIVE RULE LEARNING AND
RULE-WEIGHTING
63