Multi-model Adaptive Learning for Robots Under Uncertainty

Michalis Smyrnakis, Hongyang Qu, Dario Bauso, Sandor Veres

2020

Abstract

This paper casts coordination of a team of robots within the framework of game theoretic learning algorithms. A novel variant of fictitious play is proposed, by considering multi-model adaptive filters as a method to estimate other players’ strategies. The proposed algorithm can be used as a coordination mechanism between players when they should take decisions under uncertainty. Each player chooses an action after taking into account the actions of the other players and also the uncertainty. In contrast, to other game-theoretic and heuristic algorithms for distributed optimisation, it is not necessary to find the optimal parameters of the algorithm for a specific problem a priori. Simulations are used to test the performance of the proposed methodology against other game-theoretic learning algorithms.

Download


Paper Citation


in Harvard Style

Smyrnakis M., Qu H., Bauso D. and Veres S. (2020). Multi-model Adaptive Learning for Robots Under Uncertainty. In Proceedings of the 12th International Conference on Agents and Artificial Intelligence - Volume 1: ICAART, ISBN 978-989-758-395-7, pages 50-61. DOI: 10.5220/0008927700500061


in Bibtex Style

@conference{icaart20,
author={Michalis Smyrnakis and Hongyang Qu and Dario Bauso and Sandor Veres},
title={Multi-model Adaptive Learning for Robots Under Uncertainty},
booktitle={Proceedings of the 12th International Conference on Agents and Artificial Intelligence - Volume 1: ICAART,},
year={2020},
pages={50-61},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0008927700500061},
isbn={978-989-758-395-7},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 12th International Conference on Agents and Artificial Intelligence - Volume 1: ICAART,
TI - Multi-model Adaptive Learning for Robots Under Uncertainty
SN - 978-989-758-395-7
AU - Smyrnakis M.
AU - Qu H.
AU - Bauso D.
AU - Veres S.
PY - 2020
SP - 50
EP - 61
DO - 10.5220/0008927700500061