Authors: Abdel Rodríguez 1; Ricardo Grau 2 and Ann Nowé 3

Affiliations: 1 Central University of Las Villas and Vrije Universiteit Brussel, Cuba; 2 Central University of Las Villas, Cuba; 3 Vrije Universiteit Brussel, Belgium
Keyword(s): CARLA, Convergence, Performance.
Related Ontology Subjects/Areas/Topics: Agents; Artificial Intelligence; Artificial Intelligence and Decision Support Systems; Autonomous Systems; Distributed and Mobile Software Systems; Enterprise Information Systems; Knowledge Engineering and Ontology Development; Knowledge-Based Systems; Multi-Agent Systems; Software Engineering; Symbolic Systems
Abstract: Reinforcement learning is a powerful technique that allows agents to solve unknown Markov decision processes from the, possibly delayed, reward signals they receive. Most RL work, in particular in multi-agent settings, assumes a discrete action set. Learning automata are reinforcement learners, belonging to the category of policy iterators, that exhibit nice convergence properties in discrete action settings. Unfortunately, many applications require continuous actions. A formulation for a continuous action reinforcement learning automaton (CARLA) already exists, but it offers no guarantee of convergence to optimal decisions. This paper proposes an improvement to the performance of the method, together with a proof of its local convergence.
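To make the setting concrete, the following is a minimal sketch of a continuous action learning automaton in the general spirit of CARLA: a density over a bounded action interval is maintained on a discretized grid, actions are sampled from it, and the density is reinforced around rewarded actions with a Gaussian kernel. The hyperparameter names (`lambda_`, `sigma`), the grid discretization, and the toy reward function are illustrative assumptions, not the formulation or the improvement proposed in the paper.

```python
import numpy as np

class CARLASketch:
    """Illustrative continuous action reinforcement learning automaton.

    Keeps unnormalized weights over a discretized action interval;
    actions are drawn proportionally to the weights, and a Gaussian
    bump scaled by the reward reinforces the neighborhood of each
    selected action. Parameter names are hypothetical placeholders.
    """

    def __init__(self, a_min=0.0, a_max=1.0, n=200, lambda_=1.0, sigma=0.05):
        self.grid = np.linspace(a_min, a_max, n)  # discretized action set
        self.f = np.ones(n)                       # uniform initial weights
        self.lambda_ = lambda_                    # learning-rate-like gain
        self.sigma = sigma                        # spreading-kernel width

    def sample(self, rng):
        # Draw an action with probability proportional to its weight.
        p = self.f / self.f.sum()
        return rng.choice(self.grid, p=p)

    def update(self, action, reward):
        # Reinforce actions near `action`; reward is assumed in [0, 1].
        bump = np.exp(-0.5 * ((self.grid - action) / self.sigma) ** 2)
        self.f += reward * self.lambda_ * bump

# Toy bandit: reward peaks at action 0.7, so the learned density
# should concentrate its mass around that point.
rng = np.random.default_rng(0)
learner = CARLASketch()
for _ in range(2000):
    a = learner.sample(rng)
    r = np.exp(-((a - 0.7) ** 2) / 0.02)  # noisy-free reward signal
    learner.update(a, r)
best = learner.grid[np.argmax(learner.f)]
```

After training on this toy problem, `best` lies near the reward maximum at 0.7, illustrating the kind of local convergence behavior the paper analyzes.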