In the future, we would like to focus on modeling
more problem domains and extending the results to
a more domain-independent setting. We would also
like to create a system that would be able to create
the semantically layered representation solely from
the PDDL as its structure copies the structure of the
planning problem.
ACKNOWLEDGEMENTS
The work of Michaela Urbanovsk
´
a was
supported by the OP VVV funded project
CZ.02.1.01/0.0/0.0/16019/0000765 “Research
Center for Informatics” and by the Grant Agency
of the Czech Technical University in Prague, grant
No. SGS22/168/OHK3/3T/13. The work of Anton
´
ın
Komenda was supported by the Czech Science
Foundation (grant no. 22-30043S).
REFERENCES
Chrestien, L., Pevn
´
y, T., Komenda, A., and Edelkamp, S.
(2021). Heuristic search planning with deep neu-
ral networks using imitation, attention and curriculum
learning. CoRR, abs/2112.01918.
Culberson, J. (1997). Sokoban is pspace-complete.
Fikes, R. E. and Nilsson, N. J. (1971). Strips: A new ap-
proach to the application of theorem proving to prob-
lem solving. Artificial intelligence, 2(3-4):189–208.
Ghallab, M., Knoblock, C., Wilkins, D., Barrett, A., Chris-
tianson, D., Friedman, M., Kwok, C., Golden, K.,
Penberthy, S., Smith, D., Sun, Y., and Weld, D.
(1998). Pddl - the planning domain definition lan-
guage.
Groshev, E., Tamar, A., Goldstein, M., Srivastava, S., and
Abbeel, P. (2018). Learning generalized reactive poli-
cies using deep neural networks. In 2018 AAAI Spring
Symposium Series.
Guez, A., Mirza, M., Gregor, K., Kabra, R., Racaniere,
S., Weber, T., Raposo, D., Santoro, A., Orseau, L.,
Eccles, T., Wayne, G., Silver, D., Lillicrap, T., and
Valdes, V. (2018). An investigation of model-free
planning: boxoban levels.
https://github.com/deepmind/boxoban-levels/.
Hoffmann, J. (2001). Ff: The fast-forward planning system.
AI magazine, 22(3):57–57.
Ilin, R., Kozma, R., and Werbos, P. J. (2008). Beyond feed-
forward models trained by backpropagation: A prac-
tical training tool for a more efficient universal ap-
proximator. IEEE Transactions on Neural Networks,
19(6):929–937.
Kingma, D. P. and Ba, J. (2015). Adam: A method for
stochastic optimization. In Bengio, Y. and LeCun,
Y., editors, 3rd International Conference on Learn-
ing Representations, ICLR 2015, San Diego, CA, USA,
May 7-9, 2015, Conference Track Proceedings.
Pommerening, F. and Helmert, M. (2013). Incremental lm-
cut. In Twenty-Third International Conference on Au-
tomated Planning and Scheduling.
Pommerening, F., Helmert, M., R
¨
oger, G., and Seipp, J.
(2015). From non-negative to general operator cost
partitioning. In Bonet, B. and Koenig, S., editors,
Proceedings of the Twenty-Ninth AAAI Conference on
Artificial Intelligence, January 25-30, 2015, Austin,
Texas, USA, pages 3335–3341. AAAI Press.
Shen, W., Trevizan, F. W., and Thi
´
ebaux, S. (2020). Learn-
ing domain-independent planning heuristics with hy-
pergraph networks. In Beck, J. C., Buffet, O., Hoff-
mann, J., Karpas, E., and Sohrabi, S., editors, Pro-
ceedings of the Thirtieth International Conference on
Automated Planning and Scheduling, Nancy, France,
October 26-30, 2020, pages 574–584. AAAI Press.
St
˚
ahlberg, S., Bonet, B., and Geffner, H. (2022). Learning
general optimal policies with graph neural networks:
Expressive power, transparency, and limits. In Kumar,
A., Thi
´
ebaux, S., Varakantham, P., and Yeoh, W., ed-
itors, Proceedings of the Thirty-Second International
Conference on Automated Planning and Scheduling,
ICAPS 2022, Singapore (virtual), June 13-24, 2022,
pages 629–637. AAAI Press.
Toyer, S., Thi
´
ebaux, S., Trevizan, F. W., and Xie, L. (2020).
Asnets: Deep learning for generalised planning. J.
Artif. Intell. Res., 68:1–68.
Urbanovsk
´
a, M. and Komenda, A. (2021). Neural net-
works for model-free and scale-free automated plan-
ning. Knowledge and Information Systems, pages 1–
36.
Urbanovsk
´
a, M. and Komenda, A. (2022). Learning heuris-
tic estimates for planning in grid domains by cellu-
lar simultaneous recurrent networks. In Rocha, A. P.,
Steels, L., and van den Herik, H. J., editors, Proceed-
ings of the 14th International Conference on Agents
and Artificial Intelligence, ICAART 2022, Volume 2,
Online Streaming, February 3-5, 2022, pages 203–
213. SCITEPRESS.
Urbanovsk
´
a, M. and Komenda, A. (2022a). Grid represen-
tation in neural networks for automated planning. In
Rocha, A. P., Steels, L., and van den Herik, H. J., edi-
tors, Proceedings of the 14th International Conference
on Agents and Artificial Intelligence, ICAART 2022,
Volume 3, Online Streaming, February 3-5, 2022,
pages 871–880. SCITEPRESS.
ICAART 2023 - 15th International Conference on Agents and Artificial Intelligence
500