A NOVEL STRUCTURE FOR REALIZING
GOAL-DIRECTED BEHAVIOR
Cem Yucelgen, Yusuf Kuyumcu and N. Serap Sengor
Electronics and Communication Engineering Department, Istanbul Technical University, Maslak, Istanbul, Turkey
Keywords: Adaptive resonance theory, Reinforcement learning, Goal-directed behaviour.
Abstract: Intelligent organisms complete goal-directed behaviour by accomplishing a series of cognitive process.
Inspired from these cognitive processes, in this work, a novel structure composed of Adaptive Resonance
Theory and an Action Selection module is introduced. This novel structure is capable of recognizing task
relevant patterns and choosing task relevant actions to complete goal-directed behavior. In order to construct
these task relevant choices the parameters of the system are modified by Reinforcement Learning. Thus the
proposed structure is capable of modifying its choices and evaluates the outcome of these choices. In order
to show the efficiency of the proposed structure word hunting task is solved.
1 INTRODUCTION
To suppress the irrelevant stimuli amongst similar
ones, to focus on the task relevant ones and to
perceive these and process them to reach a goal
requires accomplishment of a series of cognitive
processes. A system capable of realizing these
processes would be efficient in many intelligent
system applications. In this work, an integrated
structure composed of Adaptive Resonance Theory
(ART) and Action Selection module (AS) is
introduced. This novel structure named ART-AS is
capable of recognizing the changes in the
environment and is able to adapt itself to these
changes according to the rewards it obtains for its
choices. There are two different adaptation
procedures: (i) one corresponding to selective
attention where parameters of ART are modified to
recognize goal related patterns and (ii) a second
adaptation where parameters of AS module are
modified to choose task relevant actions. Both of
these adaptation procedures are accomplished by
Reinforcement Learning (RL).
In most of the applications, the differential
equations defining ART (Carpenter, Grossberg,
1987) are not considered. Instead an algorithm using
steady state behavior of these equations is utilized
(Tan, 2004). Here the overall ART-AS structure is
composed of nonlinear dynamical systems and the
behavior of each dynamical system is adapted by the
parameters governing their steady-state behavior.
So, to determine the parameters that are effective in
guiding ART’s behavior, the solution of the
differential equations are considered. Once, the
effective parameters are determined and their
interpretation are discussed they are used to guide
ART. These parameters of ART are modified by
reward expectation error and the task related patterns
are obtained in the Long Term Memory (LTM). To
order the patterns in LTM according to the task is
the last step in concluding goal-directed behavior.
This ordering of patterns in LTM is accomplished
with a set of difference equations realizing action
selection. Another dynamical system defines the AS
module which is developed considering the neural
substrates that are effective in action selection
(Sengor, Karabacak, Steinmetz, 2008). In (Sengor
et.al., 2008) it has been shown that this dynamical
system is capable of selecting task relevant actions
in a goal-directed behavior.Thus, in the proposed
ART-AS structure while ART part realizes
recognition of task relevant patterns, AS part
determines the task relevant actions.
A similar work is (Brohan, Gurney, Dudek,
2010), where a hybrid structure is proposed. In their
work Self-Organizing Maps (SOM) and RL are
incorporated. Their aim is to solve an action
selection problem while organizing SOM with
selected actions. Here, ART is considered instead of
SOM and the aim is not only to order features
according to actions but to show how pattern
249
Yucelgen C., Kuyumcu Y. and Serap Sengor N..
A NOVEL STRUCTURE FOR REALIZING GOAL-DIRECTED BEHAVIOR.
DOI: 10.5220/0003719902490255
In Proceedings of the 4th International Conference on Agents and Artificial Intelligence (ICAART-2012), pages 249-255
ISBN: 978-989-8425-95-9
Copyright
c
2012 SCITEPRESS (Science and Technology Publications, Lda.)