4 CONCLUSIONS
We presented a cognitive robot architecture for
SLAM which integrates active vision, dual memory
and hierarchical task management. The main contri-
butions are: (a) visual saliency is used for FoA and
object recognition; (b) SLAM is directly integrated
with short- and long-term memory and affected by
time, allowing the robot to filter important informa-
tion and adapt to changes in the environment; and (c)
the task management system can build complex tasks
from simpler ones.
Regarding vision, we verified that the use of
saliency and object recognition yields a more robust
exploration, navigation also being more robust. How-
ever, monocular vision with simple solutions for dis-
tance estimation is not very precise. Therefore, a bi-
ological model for stereo disparity (Farrajota et al.,
2011) is being integrated. In addition, since object
recognition using OpenSURF is not very robust in
case of untextured objects, OpenSURF is being sub-
stituted by a biological model for multi-scale keypoint
extraction (Rodrigues and du Buf, 2006), and supple-
mented by a biological model for multi-scale line and
edge extraction (Rodrigues and du Buf, 2009). The
biological keypoint model can also supplement Fast
Saliency (Butko et al., 2008), because it adds local
image complexity to color contrast, for obtaining a
better model for FoA (Rodrigues and du Buf, 2006).
The addition of such biological models leads to a vi-
sion model which resembles the human visual system.
The system was successfully tested by using a
rather small environment, i.e., a sandbox of 3× 3.5 m,
with objects on the floor, mainly because of the
small robot platform with limited battery capacity and
speed, and a footprint of 11 × 13 cm. For testing the
system in real environments like corridors and lab-
oratory spaces, it is being mounted on a faster plat-
form with a larger battery capacity and a bigger foot-
print, but still only using a stereo camera without any
other sensors nor odometry, the camera head being
mounted on a rod with a height of about 80 cm. These
modifications allow us to test the system with also ob-
jects attached to walls and on tables, but this also re-
quires implementing and dealing with 3D egocentric
and environment maps.
ACKNOWLEDGEMENTS
This work was partially supported by the Portuguese
Foundation for Science and Technology (FCT)
project PEst-OE/EEI/LA0009/2011, EC project Neu-
ralDynamics (NeFP7-ICT-2009-6 PN: 270247), FCT
project Blavigator (RIPD/ADA/109690/2009) and by
PhD FCT grant SFRH/BD/71831/2010.
REFERENCES
Alami, R., Clodic, A., Montreuil, V., Sisbot, E. A., and
Chatila, R. (2006). Toward human-aware robot task
planning. Association for the Advancement of Artifi-
cial Intelligence Spring Symposia, AAAI, page 8pp.
Brady, T., Konkle, T., Alvarez, G., and Oliva, A. (2008).
Visual long-term memory has a massive storage ca-
pacity for object details. Proc. Nat. Acad. Scie.,
105(38):14325–14329.
Butko, N., Zhang, L., Cottrell, G., and Movellan, J. (2008).
Visual salience model for robot cameras. Proc. 2008
IEEE. Int. Conf. on Rob. and Automation, pages 2398–
2403.
Evans, C. (2009). Notes on the OpenSURF Library.
Tech. Rep. CSTR-09-001. University of Bristol. URL
http://www.chrisevansdev.com.
Farrajota, M., Martins, J., Rodrigues, J., and du Buf, J.
(2011). Disparity energy model with keypoint dispar-
ity validation. Accepted for 17th Portuguese Conf. on
Pattern Recognition, Porto, Portugal.
Itti, L., Koch, C., and Niebur, E. (1998). A model of
saliency-based visual attention for rapid scene anal-
ysis. IEEE Trans. on Patt. Recog. and Mach. Intell.,
20(11):1254–1259.
Jos´e, J., Farrajota, M., Rodrigues, J., and du Buf, J. (2010).
A vision system for detecting paths and moving osta-
cles for the blind. Proc. Int. Conf. on Software De-
velopment for Enhancing Accessibility and Fighting
Info-exclusion, pages 175–182.
Kawamura, K., Koku, A., Wilkes, D., Peters II, R., and Sek-
men, A. (2002). Toward egocentric navigation. Int. J.
Robotics and Automation, 17(4):135–145.
Kleinmann, L. and Mertsching, B. (2011). Learning to
adapt: Cognitive architecture based on biologically
inspired memory. In Industrial Electronics and Appli-
cations (ICIEA), 2011 6th IEEE Conference on, pages
936 –941.
Meger, D., Forss´en, P., Lai, K., Helmer, S., McCann, S.,
Southey, T., Baumann, M., Little, J. J., and Lowe,
D. G. (2008). Curious George: An attentive seman-
tic robot. Rob. Aut. Sys., 56(6):503–511.
Meinert, P. (2008). The impact of previous life experience
on cognitive structure changes and knowledge acqui-
sition of nursing theory and clinical skills in nontradi-
tional nursing students. PhD Thesis, Kent State Univ.,
College of Education, Health and Human Services,
USA, page 173.
Milford, M. and Wyeth, G. (2010). Persistent navigation
and mapping using a biologically inspired slam sys-
tem. Int. J. Robotics Res., 29(9):1131–1153.
Montemerlo, M., Thrun, S., Koller, D., and Wegbreit, B.
(2002). Fastslam: A factored solution to the simulta-
neous localization and mapping problem. Proc. AAAI.
Nat. Conf. Art. Int., pages 593–598.
ICAART 2012 - International Conference on Agents and Artificial Intelligence
622