criteria for the convergence of NMCS/NRPA. While
as in most MCTS algorithms based on rollouts, we
have probabilistic completeness in the sense that an
optimal solution can always be found by chance.
However, through nesting and adapting policies the
success likelihood can become arbitrarily small, so
that for now we cannot say by certain, that the op-
timum will be reached.
REFERENCES
Bhat, U. (1986). Finite capacity assembly-like queues.
Queueing Systems, 1:85–101.
Bouzy, B. (2016). An experimental investigation on the
pancake problem. In Computer Games: Fourth
Workshop on Computer Games, pages 30–43, Cham.
Springer International Publishing.
Browne, C. B., Powley, E., Whitehouse, D., Lucas, S. M.,
Cowling, P., Rohlfshagen, P., Tavener, S., Perez, D.,
Samothrakis, S., and Colton, S. (2004). A survey of
Monte Carlo tree search methods. 4(1):1–43.
Bürckert, H.-J., Fischer, K., and Vierke, G. (2000). Holonic
transport scheduling with teletruck. Applied Artificial
Intelligence, 14(7):697–725.
Burman, M. (1995). New results in flow line analysis. PhD
thesis, MIT.
Cazenave, T. (2009). Nested monte–carlo search. In IJCAI,
pages 456–461.
Cazenave, T. and Teytaud, F. (2012a). Application of
the Nested Rollout Policy Adaptation Algorithm to
the Traveling Salesman Problem with Time Windows,
pages 42–54. Springer.
Cazenave, T. and Teytaud, F. (2012b). Beam nested rollout
policy adaptation. In ECAI-Workshop on Computer
Games, pages 1–12.
Dorer, K. and Calisti, M. (2005). An adaptive solution to
dynamic transport optimization. In Proceedings of the
fourth international joint conference on Autonomous
agents and multiagent systems, pages 45–51. ACM.
Edelkamp, S. and Cazenave, T. (2016). Improved diversity
in nested rollout policy adaptation. In German Con-
ference on AI (KI 2016).
Edelkamp, S. and Greulich, C. (2016). Using SPIN for
the optimized scheduling of discrete event systems in
manufacturing. In SPIN 2016, pages 57–77. Springer.
Fischer, K., Müller, J. R. P., and Pischel, M. (1996). Coop-
erative transportation scheduling: an application do-
main for dai. Applied Artificial Intelligence, 10(1):1–
34.
Ganji, F., Morales Kluge, E., and Scholz-Reiter, B. (2010).
Bringing Agents into Application: Intelligent Prod-
ucts in Autonomous Logistics. In Artificial intel-
ligence and Logistics (AiLog) - Workshop at ECAI
2010, pages 37–42.
Gomes, C. P., Selman, B., Crato, N., and Kautz, H.
(2000). Heavy-tailed phenomena in satisfiability and
constraint satisfaction problems. J. Autom. Reason.,
24(1-2):67–100.
Greulich, C. and Edelkamp, S. (2016). Branch-and-bound
optimization of a multiagent system for flow produc-
tion using model checking. In ICAART 2016.
Greulich, C., Edelkamp, S., and Eicke, N. (2015). Cyber-
physical multiagent simulation in production logistics.
In MATES 2015.
Harrison, J. (1973). Assembly-like queues. Journal of Ap-
plied Probability, 10:354–367.
Himoff, J., Rzevski, G., and Skobelev, P. (2006). Ma-
genta technology multi-agent logistics i-scheduler for
road transportation. In AAMAS 06, pages 1514–1521.
ACM.
Hopp, W. and Simon, J. (1989). Bounds and heuristics for
assembly-like queues. Queueing Systems, 4:137–156.
Huang, S.-C., Arneson, B., Hayward, R. B., Mueller, M.,
and Pawlewicz, J. (2013). Mohex 2.0: A pattern-based
MCTS Hex player. In Computers and Games, pages
60–71.
Kocsis, L. and Szepesvári, C. (2006). Bandit based Monte-
Carlo planning. In ECML, pages 282–293.
Lipper, E. and Sengupta, E. (1986). Assembly-like queues
with finite capacity: bounds, asymptotics and approx-
imations. Queueing Systems, pages 67–83.
Manitz, M. (2008). Queueing-model based analysis of as-
sembly lines with finite buffers and general service
times. Computers & Operations Research, 35(8):2520
– 2536.
Morales Kluge, E., Ganji, F., and Scholz-Reiter, B. (2010).
Intelligent products - towards autonomous logistic
processes - a work in progress paper. In Intern. PLM
Conf.
Parragh, S. N., Doerner, K. F., and Hartl, R. F. (2008).
A Survey on Pickup and Delivery Problems Part II:
Transportation between Pickup and Delivery Loca-
tions. Journal für Betriebswirtschaft, 58(2):81–117.
Rosin, C. D. (2011). Nested rollout policy adaptation for
monte carlo tree search. In IJCAI, pages 649–654.
Silver, D., Huang, A., Maddison, C. J., Guez, A., Sifre, L.,
van den Driessche, G., Schrittwieser, J., Antonoglou,
I., Panneershelvam, V., Lanctot, M., Dieleman, S.,
Grewe, D., Nham, J., Kalchbrenner, N., Sutskever, I.,
Lillicrap, T., Leach, M., Kavukcuoglu, K., Graepel,
T., and Hassabis, D. (2016). Mastering the game of
go with deep neural networks and tree search. Nature,
529:484–503.
ICAART 2017 - 9th International Conference on Agents and Artificial Intelligence
290