EVOLVING TAKAGI-SUGENO-KANG FUZZY SYSTEMS USING

MULTI POPULATION GRAMMAR-GUIDED GENETIC

PROGRAMMING

Athanasios Tsakonas and Bogdan Gabrys

Smart Technology Research Center, Dept. of Design, Engineering and Computing, Bournemouth University, Poole, U.K.

Keywords:

Genetic programming, Fuzzy rule based systems, Evolutionary computation.

Abstract:

This work proposes a novel approach for the automatic generation and tuning of complete Takagi-Sugeno-

Kang fuzzy rule based systems. The examined system aims to explore the effects of a reduced search space

for a genetic programming framework by means of grammar guidance that describes candidate structures of

fuzzy rule based systems. The presented approach applies context-free grammars to generate individuals and

evolve solutions through the search process of the algorithm. A multi-population approach is adopted for the

genetic programming system, in order to increase the depth of the search process. Two candidate grammars

are examined in one regression problem and one system identiﬁcation task. Preliminary results are included

and discussion proposes further research directions.

1 INTRODUCTION

The application of fuzzy rule-based systems (FRBS)

has been proven effective in a wide area of domains

and problem tasks (Jang, 1997). Among their im-

plementations, Mamdani-FRBS have been used for

classiﬁcation tasks, while regression and system iden-

tiﬁcation application areas have been dominated by

the Takagi-Sugeno-Kang (TSK) -FRBS. In order to

exploit the power of an FRBS, commonly a train-

ing phase takes place with the neuro-fuzzy techniques

most often used. In their simplest approach, these sys-

tems require the pre-determination of the rule-base

size. Advanced approaches may adopt an ad-hoc or

heuristic incremental process to generate the fuzzy

rule base. Others make use of evolutionary techniques

aiming to provide efﬁcient rule-base generation.

Genetic programming (GP) is a successful branch

of evolutionary computation effectively applied in

a wide range of tasks such as symbolic regression

and network design (Koza, 1992). In the past, GP

has been successfully tested for the production of

Mamdani-FRBS (Alba et al., 1996). Consequently,

the perspective for generating TSK-FRBS within GP

framework has always been regarded as a promising

research ﬁeld, since such a system could combine the

attractive properties of both methods.

In response to the need for such a hybrid, GP

was effectively combined with TSK-FRBS in the past.

These approaches commonly involved partial contri-

bution of GP techniques, such as the assistance by

GP in locating proper membership functions for a

TSK-FRBS (Hoffman and Nelles, 2001) or the co-

evolution of several segments of a TSK-FRBS (Del-

gado et al., 2004). In this paper, we propose an

integrated approach, for the automatic design and

tuning of complete TSK-FRBS for regression and

system identiﬁcation. To accomplish this, we use

grammars to describe the complete structure of TSK-

FRBS within a GP individual, thus mapping one GP-

individual to exactly one TSK-FRBS. Using this ap-

proach we are able to generate arbitrarily sized TSK-

FRBS which can be tuned using easy to formulate and

implement integrated approach. We furthermore en-

hance our search procedure by incorporating multi-

population architectures to the GP solution pool. We

test two variants of this system in one prediction and

one system identiﬁcation task.

The content of this paper is organised as follows.

Next section describes the scientiﬁc background of

the methods to be used. In Section 3, the design of

the system and its implementation are presented. The

results and discussion related to the system effective-

ness are included in Section 4. The conclusions and

suggestions for further research are drawn in the Sec-

tion 5.

278

Tsakonas A. and Gabrys B..

EVOLVING TAKAGI-SUGENO-KANG FUZZY SYSTEMS USING MULTI POPULATION GRAMMAR-GUIDED GENETIC PROGRAMMING.

DOI: 10.5220/0003637702780281

In Proceedings of the International Conference on Evolutionary Computation Theory and Applications (ECTA-2011), pages 278-281

ISBN: 978-989-8425-83-6

 2011 SCITEPRESS (Science and Technology Publications, Lda.)

2 BACKGROUND

Fuzzy sets are an extension to the classic (crisp) sets,

where the transition for a value from belonging to a

set and not belonging to the set is gradual and quanti-

ﬁed by a suitably chosen membership function. When

performing fuzzy reasoning, we apply, in a general-

ized form, the traditional two-valued logic, the modus

ponens. More speciﬁcally, a fact can be more or less

true, based on the truth of another fact. The general

expression of a rule in a Takagi-Sugeno-Kang (TSK)

fuzzy rule based system has the following form (Jang,

1998):

: if x

is d

and ... then y = f(x

, ..., x

) withC (1)

(i = 1, 2, ..., m), (x

, ..., x

∈ X), C ∈ [0, 1]

In this formula, C is the certainty factor and X is the

input set.

Genetic programming - GP (Koza, 1992) is a

search methodology belonging to the evolutionary

computation family, which in its canonical form per-

mits the automatic generation of programs. Among

successful evolutionary computation approaches, GP

keeps a signiﬁcant position due to its beneﬁcial qual-

ities, such as the production of arbitrary-sized so-

lutions and the upholding of population diversity

throughout runs.

Multi population evolutionary models, or island

models, divide the population into sub-populations,

often called demes, and periodically exchange a num-

ber of individuals, simulating migration. In this pa-

per, we adopt a ﬁve island model, and we employ a

canonical setup, according to literature (Fern´andez et

al., 2003).

In canonical GP, the type of nodes is implicitly de-

ﬁned by the number of each node’s arguments (Koza,

1992). Hence, usually there are two types of nodes

in the standard GP: a) terminals - non-function nodes;

and b) functions - non-terminal nodes. In order to

produce in an efﬁcient way more complex structures

within GP, such as fuzzy rule based systems, we must

deﬁne a stricter hierarchy of the potential architec-

tures. To accomplish this, one of the most efﬁcient

ways is the application of a context-free grammar

(Montana, 1995). The standard notation to express

context-free grammars is in the Backus Naur Form

(BNF).

The idea to incorporate fuzzy inference into GP is

not new. In (Alba et al., 1996) a model that uses a

context-free grammar for the production of Mamdani

fuzzy rule bases has been examined. The system has

proven competitive in various classiﬁcation tasks. In

(Delgado et al., 2004), a genetic algorithm is proposed

for a co-evolutionary system that produces TSK fuzzy

systems. In (Chen, 2007), the generation of a speciﬁc

class of TSK fuzzy systems is accomplished using

evolutionary programming. In (Hoffman and Nelles,

2001), a genetic programming system has been used

to improve a greedy algorithm (LOLIMOT) in search

of data clusters by partitioning the search space.

Our approach aims at the production of TSK fuzzy

rule based systems in the context of multi popula-

tion grammar guided genetic programming. It has

several advantages in comparison to the previous at-

tempts. Firstly, it makes use of the genetic program-

ming framework which has a number of desirable

properties in terms of maintaining population diver-

sity and providing arbitrary solution size. Secondly,

our approach is comparatively simple to implement,

since only one population is maintained, and all fuzzy

rule base parameters are tuned within one system run.

Finally, it is an integrated approach, that generalizes

the applicability of GP for the production of fuzzy

systems by generating complete TSK rule bases for

regression and system identiﬁcation. The structure of

the TSK fuzzy system is fully described within the GP

functions, and every tuning parameter is computed by

evolutionary means.

3 DESIGN AND

IMPLEMENTATION

We have evaluated the proposed system within two

data domains. The ﬁrst domain is the Concrete Slump

Test (Yeh, 2008). We have retained 20% of the data

as test set with the remaining 80% used for learning.

The second problem addressed in this work is a sys-

tem identiﬁcation task. We considered the Mackey-

Glass non-linear chaotic time series, as this is com-

monly used as a test for TSK models (Chen, 2007).

We generated 400 data points out of which 80 were

were used as the test set with the remaining data used

for training.

We examined two system conﬁgurations. The ﬁrst

system evolves a rule base with the rules in the follow-

ing form:

: if x

is µ

[and x

is µ

...]

then y = f(x

) with C (2)

f(x

) = w

+ w

n−1

+ ... + w

n−1

+ w

(3)

(i = 1, ..., m), (x

, ..., x

∈ X), C ∈ [0, 1]

The second system evolves a rule base with the rules

in the following form:

: if x

is µ

[and x

is µ

...]

EVOLVING TAKAGI-SUGENO-KANG FUZZY SYSTEMS USING MULTI POPULATION GRAMMAR-GUIDED

GENETIC PROGRAMMING

279

then y =

f(x

) f(x

)

f(x

) f(x

)

+ f(x

) with C (4)

f(x

) = w

+ w

n−1

+ ... + w

n−1

+ w

(5)

(i = 1, ..., m), (x

, ..., x

∈ X), C ∈ [0, 1]

where C is the certainty factor, X is the input set, µ

are membership functions, and w

, ..., w

are arith-

metic expressions calculated during evolution. For

expressing the fuzzy relations, Gaussian membership

functions (MF) were selected. In this paper, we se-

lected to use 3 MFs per attribute. Tuning of these

MFs was also applied by evolutionary means. The se-

lection of the variables in each rule is also guided by

evolution. As ﬁtness function, the mean square er-

ror (MSE) was selected. Table 1 summarizes the GP

training parameters.

Table 1: Genetic programming training parameters.

Parameter Value

GP System Grammar-Guided GP

Islands 5

Island topology Ring

Isolation time 50 generations

Migrants number 10 individuals

Migrants type Elite individuals

Total population 1,000 individuals

Selection Tournament

Tournament size 7

Elitism Yes

Crossover rate 0.7

Mutation rate 0.3

Max.individual size 4,000 nodes

Max.generations 500

4 RESULTS AND DISCUSSION

Our test results for the P1 and P2 systems in the Con-

crete Slump Test problem are shown in Table 2. Exact

reconstruction of the test data set was not possible, as

there is not related information in the literature (Yeh,

2008), however we selected to apply the same num-

ber of randomly chosen test records. Our results, in

this example, are given in terms of Pearson correla-

tion coefﬁcient, to allow direct comparison with exist-

ing literature results. The highest reported accuracy in

previous research, for this data set is R

=0.922 (Yeh,

2008) where a neural network model was applied.

An example part of the derived rule base of the P1

system has the following form:

if Cement is Low then y = 1.4x

+ 1.5x − 9.3,

(x: Water)

if Cement is Medium then y = −2.3x

− 0.2x − 8.1,

Table 2: Results for the Concrete Slump Test.

Parameter P1 P2

Size (nodes) 2437 410

Size (rules) 202 9

0.9127 0.80794

(x: Coarse Aggr.)

if Water is Medium then y = −1.4x

− 1.7x − 3.3,

(x: Fly Ash)

As it can be seen, in this problem, the P1 system

compares well with the results reported in the liter-

ature, and in addition the output format allows for

easier interpretation of the results. Although the P2

system achieved similar to P1 scores in our training

and validation set, its performance for the test set was

lower.

The test results for P1 and P2 systems for the

Mackey-Glass data are summarised in Table 3.

Table 3: Results for the Mackey-Glass time series.

Parameter P1 P2

Size (nodes) 1867 457

Size (rules) 98 8

MSE 0.0033591 0.001336

As expected, the higher order P2 model was able

to provide a better approximation to this time-series.

As an example output, one of the rules of the acquired

solution by P2 system has the following form:

if x

is Low then y =

) f

)

) f

)

+ f

) where:

) = 1.14x

+ 0.15x

+ 0.44x

+ 0.24x

− 0.23,

) = −1.24x

+0.15x

+0.719x

+1.23x

+0.73,

) = 0.42x

− 0.36x

− 1.22x

− 0.74x

+ 1.11,

) = 4.61x

− 0.8x

+ 1.28x

− 0.63x

− 0.25,

) = −1.17x

+ 1.05x

+ 0.33x

− 0.63x

+ 0.37.

: Slag, x

: Fly Ash, x

: Water).

Although direct comparison with previous literature

results is not possible due to inability to reconstruct

the exact training data set, past research is shown, for

reference reasons, in Table 4. As it can be seen in

the table, our approach compares well to other models

that do not employ local search methods.

5 CONCLUSIONS AND FURTHER

RESEARCH

This paper presented a system for the generation of

Takagi-Sugeno-Kang fuzzy rule based systems for re-

gression and system identiﬁcation, by means of ge-

netic programming. The proposed approach carries

ECTA 2011 - International Conference on Evolutionary Computation Theory and Applications

280

Table 4: Results for the MacKey-Glass time series. Re-

gression scores from (Jang, 1997), (Kim and Kim, 1997),

(Wang, 1992) and (Lee and Kim, 1994).

System RMSE

Linear regression model 0.55

Auto regressive model 0.19

Sixth order polynomial 0.04

Back propagation NN 0.02

GA and fuzzy system (5 MFs) 0.049206

GA and fuzzy system (7 MFs) 0.042275

GA and fuzzy system (9 MFs) 0.037873

Wang Product T-norm 0.907

Wang Min T-norm 0.904

ANFIS 0.007

P2-TSK-GP (3 MFs) - this paper 0.036548

several advantages over past related research. Firstly,

the system is capable to automatically produce arbi-

trarily large and complex TSK fuzzy systems, accord-

ing to the needs of a speciﬁc problem. Secondly, it

provides ﬂexibility in the selection of non-linear func-

tions ﬁred per rule. Finally, the output model is inter-

pretable by humans in contrast to some other models

like MLPs, since it is in the form of fuzzy rules. In this

paper we have presented preliminary results of the ex-

periments which while focusing on certain character-

istics and capabilities of the GP algorithms produced

encouraging results warranting further investigations.

Further research will be primarily focused on an

advanced grammar design for efﬁcient combinations

of polynomials. Increasing the number of the mem-

bership functions is also expected to improve the ac-

curacy of the system. Integration with ensemble sys-

tems will also be considered.

ACKNOWLEDGEMENTS

The research leading to these results has received

funding from the European Commission within the

Marie Curie Industry and Academia Partnerships and

Pathways (IAPP) programme under grant agreement

n. 251617.

REFERENCES

Alba E., Cotta C., Troya J. M., Evolutionary Design of

Fuzzy Logic Controllers Using Strongly-Typed GP, In

Proc. 1996 IEEE Int’l Symposium on Intelligent Con-

trol New York, NY., 127-132, 1996.

Chen Y., Yang B., Abraham A., Peng L., Automatic Design

of Hierarchical Takagi-Sugeno Type Fuzzy Systems

Using Evolutionary Algorithms IEEE Trans. on Fuzzy

Systems , 15:3, 2007.

Delgado M. R., Zuben F. V., Gomide F. , Coevolutionary

genetic fuzzy systems: a hierarchical collaborative ap-

proach Fuzzy Sets and Systems , Elsevier, 141, 89-106,

2004.

Fern´andez F., Tommassini M.,Vanneschi L., An Empiri-

cal Study of Multipopulation Genetic Programming

Genetic Programming and Evolvable Machines 4:1,

Kluwer, 2003.

Hoffman F., Nelles O., Genetic Programming for Model Se-

lection of TSK-Fuzzy Systems Information Sciences,

Elsevier, 136, 7-28, 2001.

Jang J-S. R., Sun C. T., Mizutani E.,Neuro-Fuzzy and Soft

Computing , Prentice Hall, NJ, 1997.

Jang J-S. R., Neuro-fuzzy modeling for nonlinear dynamic

system identiﬁcation in: Ruspini E. H. , Bonissone P.

P. , Pedrycz W., eds., Handbook of fuzzy computation ,

Institute of Physics Publishing, Dirak House, Temple

Back, Bristol, UK 1998.

Kim D., Kim C., Forecasting time series with genetic fuzzy

predictor ensemble,IEEE Trans. Fuzzy Syst., 5:4, 523-

535, 1997.

Koza J. R. Genetic programming - On the programming of

computers by means of natural selection The MIT

Press, Cambridge, Massachussets, USA, 1992.

Lee S.-H., Kim I., Time series analysis using fuzzy learn-

ing, inProc. Int’l Conf. Neural Inform. Process., Vol.

6, Seoul, Korea, 1577-1582, 1994.

Montana D. J., Strongly Typed Genetic Programming, Evo-

lutionary Computation, 3:2, 1995.

Wang L. X., Mendel J. M.,Generating fuzzy rules by learn-

ing from examples,IEEE Trans. Systems, Man and Cy-

bernetics, 22:6, 1414-1427, 1992.

Yeh, I-C., Modeling slump of concrete with ﬂy ash and su-

perplasticizer, Computers and Concrete, 5:6, 559-572,

2008.

EVOLVING TAKAGI-SUGENO-KANG FUZZY SYSTEMS USING MULTI POPULATION GRAMMAR-GUIDED

GENETIC PROGRAMMING

281