the average and maximum out degree (avg(δ(s)) and
max(δ(s)), respectively) of the states. Then, we sum-
marise the correspondingcost-optimal strong plan SP,
as devised by the algorithm, giving its size (i.e., the
number of plans that can be extracted from SP), the
maximum cost (max(C (s)) of a cost-optimal strong
plan that can be extracted from SP and the total plan
synthesis time (in minutes). The complete data set is
available online at (Della Penna et al., 2010).
Note that, as discussed in Section 1, the size of the
explicit graph representation of even a simple prob-
lem is huge: |S| = 5· 10
7
nodes and |S
τ
| = 6.25 · 10
9
edges in our best case. By visiting only the reachable
states, our algorithm succeeds in effectively counter-
acting state explosion.
We may note that, as expected, the greater the size
of disturbances, the bigger the number of transitions,
the smaller the number of states for which a strong
plan is found. In particular, for the fourth instance,
no strong plan exists. Moreover, the size of the re-
sulting strong plan could be effectively compressed
making use of Ordered Binary Decision Diagrams, as
described in (Della Penna et al., 2009a).
5 CONCLUSIONS AND FUTURE
WORK
In this paper we described an algorithm to solve
the cost-optimal strong planning problem in non-
deterministic finite state systems.
The presented approach extends the strong plan-
ning methodology given in (Cimatti et al., 1998) by
introducing the concept of cost, and thus generating
cost-optimal strong plans, and by exploiting explicit
algorithms to extend the class of solvable problems.
The devised algorithm has been formally proved
as correct and complete, and its complexity, if the
number of transitions in the system or the range of
possible transition costs are reasonable (to say, a bil-
lion of transitions or different costs), is dominated
by the number of (visited) transitions in the system
graph, which is a good bound for such kind of prob-
lem.
Finally, the proposed methodology has been illus-
trated through a case study based on the well known
inverted pendulum on a cart problem. Future work
will include an extensive experimentation on differ-
ent case studies. However, the first results are very
promising and show how the algorithm is effective
and scalable.
REFERENCES
Albore, A., Palacios, H., and Geffner, H. (2010). Compil-
ing uncertainty away in non-deterministic conformant
planning. In ECAI, pages 465–470.
Behrends, E. (2000). Introduction to Markov Chains.
Vieweg.
Bertoli, P., Cimatti, A., Roveri, M., and Traverso, P. (2001).
Planning in nondeterministic domains under partial
observability via symbolic model checking. In Proc.
17th IJCAI, pages 473–478. Morgan Kaufmann.
Bertoli, P., Cimatti, A., Roveri, M., and Traverso, P. (2006).
Strong planning under partial observability. Artif. In-
tell., 170:337–384.
Bonet, B. and Geffner, H. (2000). Planning with incom-
plete information as heuristic search in belief space.
In Chien, S., Kambhampati, S., and Knoblock, C., ed-
itors, Proc. 6th ICAPS, pages 52–61. AAAI Press.
Bonet, B. and Geffner, H. (2005). mGPT: A probabilistic
planner based on heuristic search. JAIR, 24:933–944.
Boutilier, C., Dean, T., and Hanks, S. (1999). Decision-
theoretic planning: Structural assumptions and com-
putational leverage. JAIR, 11:1–94.
Burch, J. R., Clarke, E. M., McMillan, K. L., Dill, D. L.,
and Hwang, L. J. (1992). Symbolic model checking:
10
20
states and beyond. Inf. Comput., 98(2):142–170.
Chesi, G. and Hung, Y. (2007). Global path-planning for
constrained and optimal visual servoing. IEEE Trans.
on Robotics, 23(5):1050–1060.
Cimatti, A., Roveri, M., and Traverso, P. (1998). Strong
planning in non-deterministic domains via model
checking. In AIPS, pages 36–43.
Della Penna, G., Intrigila, B., Lauri, N., and Magazzeni,
D. (2009a). Fast and compact encoding of numerical
controllers using obdds. In Informatics in Control, Au-
tomation and Robotics: Selected Papers from ICINCO
2008, pages 75–87. Springer.
Della Penna, G., Intrigila, B., Magazzeni, D., and Mercorio,
F. (2009b). UPMurphi: a tool for universal planning
on PDDL+ problems. In Proc. ICAPS 2009, pages
106–113. AAAI Press.
Della Penna, G., Intrigila, B., Magazzeni, D., and Mercorio,
F. (2010). Non-deterministic inverted pendulum on a
cart data set.
Della Penna, G., Intrigila, B., Melatti, I., Tronci, E., and
Venturini Zilli, M. (2004). Exploiting transition local-
ity in automatic verification of finite state concurrent
systems. STTT, 6(4):320–341.
Della Penna, G., Intrigila, B., Melatti, I., Tronci, E., and
Zilli, M. V. (2006). Finite horizon analysis of markov
chains with the murphi verifier. STTT, 8(4-5):397–
409.
Fudenberg, D. and Tirole, J. (1991). Game theory. MIT
Press.
Huang, W., Wen, Z., Jiang, Y., and Wu, L. (2007). Observa-
tion reduction for strong plans. In Proc. 20th IJCAI,
pages 1930–1935. Morgan Kaufmann.
COST-OPTIMAL STRONG PLANNING IN NON-DETERMINISTIC DOMAINS
65