Figure 11: 3D Mapping of the rooms and corridor.
ing new tools based on this method, we will be able
to solve currently unmanageable complex multiagent
problems under uncertainty, and obtain good results.
REFERENCES
Becker, R., Zilberstein, S., Lesser, V., and Goldman, C. V.
(2004). Solving transition independent decentralized
Markov decision processes. Journal of Artificial Intel-
ligence Research, pages 423–455.
Bellman, R. (1954). The theory of dynamic programming.
Bull. Amer. Math. Soc. 60, no. 6, pages 503–515.
Bernstein, D. S., Zilberstein, S., and Immerman, N. (2000).
The complexity of decentralized control of markov
decision processes. In Proceedings of the Sixteenth
conference on Uncertainty in artificial intelligence,
pages 32–37. Morgan Kaufmann Publishers Inc.
Boutilier, C. (1996). Planning, learning and coordination in
multiagent decision processes. In Proceedings of the
6th conference on Theoretical aspects of rationality
and knowledge, pages 195–210. Morgan Kaufmann
Publishers Inc.
Boutilier, C. (1999). Sequential optimality and coordination
in multiagent systems. In IJCAI, volume 99, pages
478–485.
Claes, D., Robbel, P., Oliehoek, F. A., Tuyls, K., Hennes,
D., and van der Hoek, W. (2015). Effective approxi-
mations for multi-robot coordination in spatially dis-
tributed tasks. In Proceedings of the 2015 Interna-
tional Conference on Autonomous Agents and Multia-
gent Systems, AAMAS ’15, pages 881–890, Richland,
SC. International Foundation for Autonomous Agents
and Multiagent Systems.
Goldman, C. V. and Zilberstein, S. (2004). Decentral-
ized control of cooperative systems: Categorization
and complexity analysis. J. Artif. Intell. Res.(JAIR),
22:143–174.
Guestrin, C., Venkataraman, S., and Koller, D. (2002).
Context-specific multiagent coordination and plan-
ning with factored MDPs. In AAAI/IAAI, pages 253–
259.
Labbe, M. and Michaud, F. (2014). Online Global Loop
Closure Detection for Large-Scale Multi-Session
Graph-Based SLAM. In Proceedings of the IEEE/RSJ
International Conference on Intelligent Robots and
Systems, pages 2661–2666.
Littman, M., Dean, T., and Kaelbling, L. P. (1995). On the
complexity of solving markov decision problems. In
Uncertainty in Artificial Intelligence. Proceedings of
the 11th Conference, pages 394–402.
Matignon, L., Jeanpierre, L., and Mouaddib, A.-I. (2012).
Coordinated multi-robot exploration under communi-
cation constraints using decentralized markov deci-
sion processes. In AAAI, pages 2017–2023.
Melo, F. S. and Veloso, M. (2009). Learning of coordina-
tion: Exploiting sparse interactions in multiagent sys-
tems. In Proceedings of The 8th International Confer-
ence on Autonomous Agents and Multiagent Systems-
Volume 2, pages 773–780. International Foundation
for Autonomous Agents and Multiagent Systems.
Messias, J. V., Spaan, M. T., and Lima, P. U. (2013).
GSMDPs for Multi-Robot Sequential Decision-
Making. In AAAI.
Papadimitriou, C. H. and Tsitsiklis, J. N. (1987). The com-
plexity of markov decision processes. In Mathematics
of Operations Research 12(3), pages 441–450.
Parr, R. (1998). Flexible decomposition algorithms for
weakly coupled Markov decision problems. In Pro-
ceedings of the Fourteenth conference on Uncer-
tainty in artificial intelligence, pages 422–430. Mor-
gan Kaufmann Publishers Inc.
Pineau, J., Roy, N., and Thrun, S. (2001). A hierarchical ap-
proach to POMDP planning and execution. Workshop
on hierarchy and memory in reinforcement learning,
65(66):51–55.
Quigley, M., Conley, K., Gerkey, B. P., Faust, J., Foote, T.,
Leibs, J., Wheeler, R., and Ng, A. Y. (2009). Ros: an
open-source robot operating system. In ICRA Work-
shop on Open Source Software.
Witwicki, S. J. and Durfee, E. H. (2010). Influence-based
policy abstraction for weakly-coupled dec-POMDPs.
In ICAPS, pages 185–192.
Xuan, P., Lesser, V., and Zilberstein, S. (2001). Commu-
nication decisions in multi-agent cooperation: Model
and experiments. In Proceedings of the International
Joint Conference on Artificial Intelligence, pages
616–623.
ECTA 2016 - 8th International Conference on Evolutionary Computation Theory and Applications
58