loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Adrian Redder 1 ; Arunselvan Ramaswamy 1 and Holger Karl 2

Affiliations: 1 Department of Computer Science, Paderborn University, Germany ; 2 Hasso-Plattner-Institute, Potsdam University, Germany

Keyword(s): Policy Gradient Algorithms, Multi-agent Learning, Communication Networks, Distributed Optimisation, Age of Information, Continuous Control.

Abstract: Distributed online learning over delaying communication networks is a fundamental problem in multi-agent learning, since the convergence behaviour of interacting agents is distorted by their delayed communication. It is a priori unclear, how much communication delay can be allowed, such that the joint policies of multiple agents can still converge to a solution of a multi-agent learning problem. In this work, we present the decentralization of the well known deep deterministic policy gradient algorithm using a communication network. We illustrate the convergence of the algorithm and the effect of lossy communication on the rate of convergence for a two-agent flow control problem, where the agents exchange their local information over a delaying wireless network. Finally, we discuss theoretical implications for this algorithm using recent advances in the theory of age of information and deep reinforcement learning.

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 18.226.187.60

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Redder, A., Ramaswamy, A. and Karl, H. (2022). Multi-agent Policy Gradient Algorithms for Cyber-physical Systems with Lossy Communication. In Proceedings of the 14th International Conference on Agents and Artificial Intelligence - Volume 1: ICAART; ISBN 978-989-758-547-0; ISSN 2184-433X, SciTePress, pages 282-289. DOI: 10.5220/0010845400003116

@conference{icaart22,
author={Adrian Redder and Arunselvan Ramaswamy and Holger Karl},
title={Multi-agent Policy Gradient Algorithms for Cyber-physical Systems with Lossy Communication},
booktitle={Proceedings of the 14th International Conference on Agents and Artificial Intelligence - Volume 1: ICAART},
year={2022},
pages={282-289},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0010845400003116},
isbn={978-989-758-547-0},
issn={2184-433X},
}

TY - CONF

JO - Proceedings of the 14th International Conference on Agents and Artificial Intelligence - Volume 1: ICAART
TI - Multi-agent Policy Gradient Algorithms for Cyber-physical Systems with Lossy Communication
SN - 978-989-758-547-0
IS - 2184-433X
AU - Redder, A.
AU - Ramaswamy, A.
AU - Karl, H.
PY - 2022
SP - 282
EP - 289
DO - 10.5220/0010845400003116
PB - SciTePress