Figure 9: The running time of the multiagent tiger problem
given different GPU’s thread block sizes.
8 CONCLUSION
We presented a method for optimal planning in multi-
agent settings under uncertainty that utilizes the paral-
lelism provided by a heterogeneous CPU-GPU com-
puting architecture. We focused on the interactive
dynamic influence diagrams, which are probabilis-
tic graphical models whose solution involves trans-
forming the I-DID into a flat DID and computing the
policy with the maximum expected utility. Opera-
tions involving probability and utility factors during
variable elimination are parallelized on GPUs. We
demonstrate speed ups close to an order of magnitude
on multiple problem domains and run times that are
less than 17 minutes for large numbers of models and
long horizons. To the best of our knowledge, these
are the fastest run times reported so far for exactly
solving I-DIDs and other related frameworks such
as I-POMDPs for multiagent planning, and represent
a significant step forward in making these complex
frameworks practical.
As aforementioned, lower level models can be
DIDs or I-DIDs with different initial beliefs. These
candidate models are differinghypotheses of the other
agent’s behavior, and therefore may be solved inde-
pendently in parallel. However, as solving I-DIDs
requires large amount of memory, we may not solve
these in parallel on a single GPU. Nevertheless, mod-
ern computing platforms may contain two or more
GPU units linked together and programmable using
CUDA.
2
Furthermore, multiple networked machines
with GPUs may be utilized using CUDA-MPI. How-
ever, as the factor operation is not computational in-
tensive, whether the saving from the parallel compu-
tation on the GPU side can compensate the cost of
transporting data between CPU and GPU is still an
2
NVIDIA promotes having multiple GPU units man-
aged by its scalable link interface.
open question. Comparisons based on different types
of GPUs will be our immediate future work as well.
ACKNOWLEDGEMENTS
This research is supported in part by an ONR Grant,
#N000141310870, and in part by an NSF CAREER
Grant, #IIS-0845036. We thank Alex Koslov for mak-
ing his implementation of a parallel Bayesian network
inference algorithm available to us for reference.
REFERENCES
Bernstein, D. S., Givan, R., Immerman, N., and Zilberstein,
S. (2002). The complexity of decentralized control of
Markov decision processes. Mathematics of Opera-
tions Research, 27(4):819–840.
Berstein, D. S., Hansen, E. A., and Zilberstein, S. (2005).
Bounded policy iteration for decentralized POMDPs.
In IJCAI, pages 1287–1292.
Chandrasekaran, M., Doshi, P., Zeng, Y., and Chen, Y.
(2014). Team behavior in interactive dynamic influ-
ence diagrams with applications to ad hoc teams. In
AAMAS, pages 1559–1560.
Chen, Y., Hong, J., Liu, W., Godo, L., Sierra, C., and
Loughlin, M. (2013). Incorporating PGMs into a BDI
architecture. In PRIMA, pages 54–69.
Doshi, P., Zeng, Y., and Chen, Q. (2009). Graphical models
for interactive POMDPs: Representations and solu-
tions. JAAMAS, 18(3):376–416.
Gal, K. and Pfeffer, A. (2008). Networks of influence di-
agrams: A formalism for representing agents’ beliefs
and decision-making processes. JAIR, 33:109–147.
Gmytrasiewicz, P. J. and Doshi, P. (2005). A framework
for sequential planning in multiagent settings. JAIR,
24:49–79.
Howard, R. A. and Matheson, J. E. (1984). Influence dia-
grams. In Howard, R. A. and Matheson, J. E., editors,
The Principles and Applications of Decision Analysis.
Strategic Decisions Group, Menlo Park, CA 94025.
Jeon, H., Xia, Y., and Prasanna, K. V. (2010). Parallel ex-
act inference on a cpu-gpgpu heterogenous system. In
ICPP, pages 61–70.
Koller, D. and Friedman, N. (2009). Probabilistic Graphi-
cal Models: Principles and Techniques. MIT Press.
Koller, D. and Milch, B. (2001). Multi-agent influence dia-
grams for representing and solving games. In IJCAI,
pages 1027–1034.
Luo, J., Yin, H., Li, B., and Wu, C. (2011). Path planning
for automated guided vehicles system via I-DIDs with
communication. In ICCA, pages 755 –759.
Pearl, J. (1998). Probabilistic Reasoning in Intelligent Sys-
tems: Networks of Plausible Inference. Morgan Kauf-
mann, Berlin, Germany.
Sano, Y., Kadono, Y., and Fukuta, N. (2014). A perfor-
mance optimization support framework for gpu-based
traffic simulations with negotiating agents. In ACAN.
ICAART2015-InternationalConferenceonAgentsandArtificialIntelligence
194