8 DISCUSSION
The work that we have presented here can be consid-
ered under three points of view.
Firstly, this work is original by its deep anchor-
ing in biological inspiration. The behavior of our au-
tonomous agent is elaborated thanks to a model built
on four loops described as playing an important role
in the brain of most animals (Alexander et al., 1986).
This inspiration is anatomical, considering the nature
of information flows brought by several sensory and
motor regions. This inspiration is also functional,
particularly considering mechanisms to select actions
and to sustain goals until they are achieved. A ma-
jor characteristic of this work is to consider similar
architectural and functional properties to build four
loops and to build all the considered behaviors only
by emergence, on the basis of the loops and their in-
teractions. This biological inspiration is also very pre-
cious because most of the questions and orientations
for future works we have evoked in the paper will be
addressed by going deeper into biological details.
Secondly, we have argued that, even if these loops
are mostly studied for the understanding of higher
cognitive functions like reward-based decision mak-
ing, considering them to implement survival scenar-
ios is very important to design autonomous systems.
Particularly, considering such basic scenarios is very
convenient to study all the loops together, which is
hardly addressed in the modeling literature. It is
also interesting to understand how the two basic pro-
cesses of signaling and regulation have evolved to al-
low for more abstract behaviors, which can still be
described as interactions between limbic and sensori-
motor loops, originally built for survival.
Thirdly, another major innovation of this paper is
to propose that Malmo, a platform originally designed
for experimentation in Artificial Intelligence, is a very
powerful tool to build basic autonomous systems per-
forming survival tasks. Not only it offers many inter-
esting characteristics for the simulation and the visu-
alization of a survival task, but it also eases the design
of the most critical part, corresponding to the interface
between the computational parts of the model and the
internal and bodily aspects of the agent. In addition,
this platform has another usefulness. To address the
critical questions we have evoked in this work, we are
currently seeking insights from biology, to improve
and augment our model. Some of these questions are
clearly unanswered by the current state of the art and
must be investigated jointly with neuroscientists. In
this perspective, Malmo offers a striking advantage to
describe our model and its behavior to them, who are
more accustomed to biological observations than al-
gorithms and equations.
REFERENCES
Alexander, G., DeLong, M., and Strick, P. (1986). Par-
allel organization of functionally segregated circuits
linking basal ganglia and cortex. Ann. Rev. Neurosci.,
9:357–381.
Craig, A. D. (2003). Interoception: the sense of the phys-
iological condition of the body. Current Opinion in
Neurobiology, 13(4):500–505.
Daw, N. D., Niv, Y., and Dayan, P. (2005). Uncertainty-
based competition between prefrontal and dorsolateral
striatal systems for behavioral control. Nature neuro-
science, 8(12):1704.
Floresco, S. B. (2015). The nucleus accumbens: an inter-
face between cognition, emotion, and action. Annual
review of psychology, 66:25–52.
Gurney, K., Prescott, T. J., and Redgrave, P. (2001). A com-
putational model of action selection in the basal gan-
glia. i. a new functional anatomy. Biological Cyber-
netics, 84(6):401–410.
Guthrie, M., Leblois, A., Garenne, A., and Boraud, T.
(2013). Interaction between cognitive and motor
cortico-basal ganglia loops during decision making:
A computational study. Journal of Neurophysiology.
Haber, S., Fudge, J., and McFarland, N. (2000). Striaton-
igrostriatal pathways in primates form an ascending
spiral from the shell to the dorsolateral striatum. J
Neurosci, 20(6):2369–2382.
Hazy, T. E., Frank, M. J., and O’Reilly, R. C. (2006).
Banishing the homunculus: making working memory
work. Neuroscience, 139(1):105–118.
Humphries, M., Khamassi, M., and Gurney, K. (2012).
Dopaminergic control of the exploration-exploitation
trade-off via the basal ganglia. Frontiers in Neuro-
science, 6:9.
Johnson, M., Hofmann, K., Hutton, T., and Bignell, D.
(2016). The malmo platform for artificial intelligence
experimentation. In IJCAI, pages 4246–4247.
Maturana, H. R. and Varela, F. J. (1991). Autopoiesis
and Cognition: The Realization of the Living (Boston
Studies in the Philosophy of Science, Vol. 42). D. Rei-
del Publishing Company, 1st edition.
Parent, A. and Hazrati, L. N. (1995). Functional anatomy
of the basal ganglia. I. The cortico-basal ganglia-
thalamo-cortical loop. Brain Res Brain Res Rev,
20(1):91–127.
Redgrave, P., Prescott, T. J., and Gurney, K. (1999). The
basal ganglia: A vertebrate solution to the selection
problem? Neuroscience, 89(4):1009–1023.
Stoeter, S. A. and Papanikolopoulos, N. (2005). Au-
tonomous stair-climbing with miniature jumping
robots. IEEE Transactions on Systems, Man, and Cy-
bernetics, Part B (Cybernetics), 35(2):313–325.
Strausfeld, N. J. and Hirth, F. (2013). Deep Homology
of Arthropod Central Complex and Vertebrate Basal
Ganglia. Science, 340(6129):157–161.
Varela, F. J., Thompson, E. T., and Rosch, E. (1992). The
Embodied Mind: Cognitive Science and Human Ex-
perience. The MIT Press.
Cognitive Architecture and Software Environment for the Design and Experimentation of Survival Behaviors in Artificial Agents
159