Authors:
Miroslav Kárný and František Hůla
Affiliation:
The Czech Academy of Sciences, Inst. of Inf. Theory and Automation, POB 18, 182 08 Prague 8, Czech Republic
Keyword(s):
Exploitation, Exploration, Bayesian Estimation, Adaptive Systems, Fully Probabilistic Design, Markov Decision Process.
Related Ontology Subjects/Areas/Topics:
Agents; Artificial Intelligence; Cognitive Robotics; Computational Intelligence; Evolutionary Computing; Informatics in Control, Automation and Robotics; Knowledge Discovery and Information Retrieval; Knowledge-Based Systems; Machine Learning; Model-Based Reasoning; Robotics and Automation; Soft Computing; Symbolic Systems; Uncertainty in AI
Abstract:
Adaptive decision making learns an environment model that serves the design of a decision policy. The policy-generated actions influence both the acquired reward and the future knowledge. The optimal policy properly balances exploitation with exploration. The inherent dimensionality curse of decision making under incomplete knowledge prevents the realisation of the optimal design. This has stimulated repeated attempts to reach this balance at least approximately. Usually, either (a) the exploitative reward is enriched by a term reflecting the exploration quality, and a feasible, approximate certainty-equivalent design is performed; or (b) explorative random noise is added to the purely exploitative actions. This paper avoids the inauspicious option (a) and improves option (b) by employing the non-standard fully probabilistic design (FPD) of decision policies, which naturally generates random actions. Monte-Carlo experiments confirm the achieved quality. The quality stems from methodological contributions, which include: (i) an improvement of the relation between FPD and standard Markov decision processes; (ii) an adaptive tuning scheme for an FPD parameter. The latter is also suited to tuning the temperature in both simulated annealing and the Boltzmann machine.
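To make the contrast between the two exploration styles concrete, the following minimal sketch illustrates the kind of inherently randomized policy the abstract alludes to: instead of perturbing a deterministic exploitative action with noise (option (b) in its usual form), the policy itself is a probability distribution over actions, here of Boltzmann (softmax) form. The parameter `beta` is a hypothetical stand-in for the temperature-like FPD parameter; its adaptive tuning is the paper's contribution and is not reproduced here.

```python
import numpy as np

rng = np.random.default_rng(0)

def randomized_policy(expected_rewards: np.ndarray, beta: float) -> np.ndarray:
    """Action probabilities proportional to exp(beta * expected reward).

    Low beta (high temperature) -> near-uniform, explorative actions;
    high beta (low temperature) -> near-greedy, exploitative actions.
    """
    logits = beta * expected_rewards
    logits -= logits.max()          # numerical stabilisation before exp()
    probs = np.exp(logits)
    return probs / probs.sum()

# Usage: sample an action for a toy 3-action environment model.
q_estimates = np.array([1.0, 1.2, 0.8])   # model-based reward estimates (illustrative)
probs = randomized_policy(q_estimates, beta=2.0)
action = rng.choice(len(q_estimates), p=probs)
print(probs, action)
```

Because every action retains positive probability, such a policy explores without an additive noise term; the exploration/exploitation balance is governed entirely by the single temperature-like parameter.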