Table 1: Domain-independent Capabilities [column - ME
agent, DS: dialog systems, GDP: goal-directed planners,
RLA: RL agents, CVR: cognitive (virtual) robots, GA:
game agents].
Mental
modeling
Self
motivation
Dialogue
Exogenous
events
Logical
inference
Logical
planning
Incomplete
knowledge
Probabilities
Plan
hierarchies
Learning
ME DS GDP RLA CVR GA
general knowledge about itself and its environment,
including both introspective (autoepistemic) knowl-
edge and knowledge about mental states of other
agents.
Table 1 summarizes the distinctive features of our
framework vis-`a-vis five previously studied frame-
works.
2
A cell is green to indicate that the surveyed
systems of that kind all exhibit that capability, red to
indicate that none of them have that capability, or yel-
low to indicate that some of them have (some of) that
capability.
In our future work, we expect to allow for de-
grees of uncertainty in ME’s knowledge. As well,
we plan to implement learning by modification of
the agent’s anticipated utility for certain actions and
states, so as to favor rewarding action sequences and
avoid adverse sequences based on the agent’s past ex-
perience. Lastly, we expect to generalize the frame-
work to do hierarchical planning, which is essential
for dealing with the complexities of dialogue about
the real world.
REFERENCES
Allen, J. F., Chambers, N., Ferguson, G., Galescu, L., Jung,
H., Swift, M., and Taysom, W. (2007). PLOW:a col-
2
Sample references are (Ferguson and Allen, 1998;
Allen et al., 2007) for DS, (Fikeset al., 1972; Kautz and Sel-
man, 1992; Sacerdoti, 1975) for GDP, (Singh et al., 2002;
Walker, 2000; Singh et al., 1994) for RLA, (Vere and Bick-
more, 1990; Nilsson, 1984; Tacke et al., 2004; Ferrein et al.,
2004) for CVR, and (Dinerstein et al., 2008; Davies and
Mehdi, 2006) for GA.
laborative task learning agent. In Proceedings of the
22nd National Conference on Artificial Intelligence
(AAAI 2007).
Davies, N. and Mehdi, Q. (2006). BDI for intelligent agents
in computer games. In Proceedings of the 8th Interna-
tional Conference on Computer Games: AI and Mo-
bile Systems (CGAIMS 2006).
Dinerstein, J., Egbert, P., Ventura, D., and Goodrich, M.
(2008). Demonstration-based behavior programming
for embodied virtual agents. Computational Intelli-
gence, 24(4):235–256.
Ferguson, G. and Allen, J. F. (1998). TRIPS: An integrated
intelligent problem-solving assistant. In Proceedings
of the 15th National Conference on Artificial Intelli-
gence (AAAI 1998).
Ferrein, E., Fritz, C., and Lakemeyer, G. (2004). On-line
decision-theoretic Golog for unpredictable domains.
In Proceedings of the 27th German Conference on Ar-
tificial Intelligence (KI 2004).
Fikes, R., Hart, P., and Nilsson, N. (1972). Learning and
executing generalized robot plans. Artificial Intelli-
gence, 3(4):251–288.
Kaplan, A. N. and Schubert, L. K. (2000). A computational
model of belief. Artificial Intelligence, 120(1):119–
160.
Kautz, H. A. and Selman, B. (1992). Planning as satisfiabil-
ity. In Proceedings of the Tenth European Conference
on Artificial Intelligence (ECAI 1992).
Liu, D. H. and Schubert, L. K. (2009). Incorporating plan-
ning and reasoning into a self-motivated, communica-
tive agent. In Proceedings of the 2nd Conference on
Artificial General Intelligence (AGI 2009).
Morbini, F. and Schubert, L. (2008). Metareasoning as an
integral part of commonsense and autocognitive rea-
soning. In AAAI-08 Workshop on Metareasoning.
Nilsson, N. J. (1984). Shakey the robot. Technical Report
323, AI Center, SRI International.
Sacerdoti, E. D. (1975). A structure for plans and behavior.
Technical Report 109, AI Center, SRI International.
Singh, S., Litman, D., Kearns, M., and Walker, M. (2002).
Optimizing dialogue management with reinforcement
learning: Experiments with the njfun system. Journal
of Artificial Intelligence Research, 16:105–133.
Singh, S. P., Barto, A. G., Grupen, R., and Connolly, C.
(1994). Robust reinforcement learning in motion plan-
ning. In Advances in Neural Information Processing
Systems 6, pages 655–662. Morgan Kaufmann.
Tacke, M., Weigel, T., and Nebel, B. (2004). Decision-
theoretic planning for playing table soccer. In Pro-
ceedings of the 27th German Conference on Artificial
Intelligence (KI 2004).
Vere, S. and Bickmore, T. (1990). A basic agent. Computa-
tional Intelligence, 6(1):41–60.
Walker, M. A. (2000). An application of reinforcement
learning to dialogue strategy selection in a spoken di-
alogue system for email. Journal of Artificial Intelli-
gence Research, 12:387–416.
COMBINING SELF-MOTIVATION WITH LOGICAL PLANNING AND INFERENCE IN A REWARD-SEEKING
AGENT
263