Knowledge Discovery and Modeling based on Conditional Fuzzy
Clustering with Interval Type-2 Fuzzy
Yeong-Hyeon Byeon and Keun-Chang Kwak
Department of Control and Instrumentation Engineering, Chosun University, Gwangju, Korea
Keywords: Knowledge Discovery, Linguistic Modelling, Conditional Fuzzy Clustering, Interval Type-2 Fuzzy.
Abstract: This paper is concerned with a method for designing improved Linguistic Model (LM) using Conditional
Fuzzy Clustering (CFC) with two different Interval Type-2 (IT2) fuzzy approaches. The fuzzification factor
and contexts with IT2 fuzzy approach are used to deal with uncertainty of clustering,. This proposed
clustering technique has characteristics that estimate the prototypes by preserving the homogeneity between
the clustered patterns from the IT2-based contexts, and controls the amount of fuzziness of fuzzy c-partition.
Thus, the proposed method can represent a nonlinear and complex characteristic more effectively than
conventional LM. The experimental partial results on coagulant dosing process in a water purification plant
revealed that the proposed method showed a better performance in comparison to the previous works.
1 INTRODUCTION
A considerable number of researches have been
performed on fuzzy models during the past few
decades. Such fuzzy models are simply divided into
two types depending on the particular structure of
the consequent part: linguistic fuzzy model and TSK
(Takagi-Sugeno-Kang) fuzzy model. In the
linguistic fuzzy model, Mamdani model was
proposed as the first attempt to control a steam
engine and boiler combination by a set of linguistic
control rules obtained from experienced human
operators. TSK fuzzy model is designed by a
systematic approach to generating fuzzy rules from a
given input-output data set.
On the other hand, we enhance a Linguistic
Model (LM) (Pedrycz, 1999) constructed by the use
of fuzzy granulation performed by Conditional
Fuzzy Clustering (CFC) (Pedrycz, 1996). For this
purpose, we develop the improved clustering
approach based on conventional LM. Although the
superiority of this model has demonstrated in the
previous literatures, this model has a poor
approximation and generalization capability. In
order to enhance this performance, we use Interval
Type-2 (IT2) fuzzy concept (Karnik and Mendel,
1998) to estimate efficient cluster centers.
Furthermore, we deal with knowledge discovery and
linguistic modeling based on three different
uncertainties; fuzzification factor, linguistic contexts,
and both. The proposed method is constructed by
conditional fuzzy clustering with three different
uncertainty types. Finally, we apply to coagulant
dosing process in a water purification plant. The
partial results produced by the proposed method
show a better performance in comparison with
conventional LM.
This paper is organized as follows. Section 2
describes the architecture and context-based fuzzy
clustering for LM. In Section 3, we present the three
different uncertainty types for IT2 fuzzy concept.
Experimental results and comments are covered in
Section 4. Finally, conclusion is given in Section 5.
2 CONVENTIONAL LINGUISTIC
MODEL
The conditional fuzzy clustering is realized by
individual contexts. Each context (fuzzy set) has
defined semantics that can be interpreted as a small,
medium, large in the design of LM. Let us consider
a certain fixed context W
j
described by some
membership function. The data point in the output
space is associated with the corresponding
membership value. Let us introduce a family of the
partition matrices induced by the context and denote
it by U as follows