Upper Confident Bound Fuzzy Q-learning and Its Application to a Video Game

Takahiro Morita; Hiroshi Hosobe

Topics: Fuzzy Systems; Machine Learning

In Proceedings of the 14th International Conference on Agents and Artificial Intelligence - Volume 3: ICAART, 454-461, 2022

Authors: Takahiro Morita ¹ and Hiroshi Hosobe ²

Affiliations: ¹ Graduate School of Computer and Information Sciences, Hosei University, Tokyo, Japan ; ² Faculty of Computer and Information Sciences, Hosei University, Tokyo, Japan

Keyword(s): Machine Learning, Fuzzy Q-learning, UCB Algorithm, Video Game.

Abstract: This paper proposes upper confident bound (UCB) fuzzy Q-learning, which combines fuzzy Q-learning with the UCBQ algorithm, and applies it to a video game. The UCBQ algorithm improves on the action selection method called the UCB algorithm by applying it to Q-learning. The UCB algorithm selects the action with the highest UCB value rather than the highest value estimate. Because the UCB algorithm presumes that every previously unselected action is selected so that its value estimate is obtained, the number of unselected actions becomes small, which helps prevent convergence to local optimal solutions. The proposed method aims to make learning more efficient by reducing unselected actions and preventing the Q value from settling on a local optimal solution in fuzzy Q-learning. The paper applies the proposed method to the video game Ms. PacMan and presents the result of an experiment on finding optimum values in the method. The method is evaluated by comparing its game scores with the scores obtained by a previous fuzzy Q-learning method. The result shows that the proposed method significantly reduced unselected actions.
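
The action selection idea described in the abstract can be illustrated with a minimal Python sketch. It assumes the textbook UCB1 bonus, c * sqrt(ln(total selections) / selections of the action), on top of plain per-action value estimates; it is not the paper's UCBQ or fuzzy formulation, and the function name `ucb_select` and the exploration constant `c` are hypothetical choices made for illustration.

```python
import math


def ucb_select(q_values, counts, c=2.0):
    """Return the index of the action with the highest UCB value.

    q_values: current value estimates, one per action
    counts:   number of times each action has been selected so far
    c:        exploration constant (hypothetical default, not from the paper)
    """
    # An action that has never been selected is chosen first,
    # so every action ends up with a value estimate.
    for action, n in enumerate(counts):
        if n == 0:
            return action

    total = sum(counts)
    # UCB value = value estimate + bonus that shrinks as the action is tried more.
    ucb = [
        q + c * math.sqrt(math.log(total) / n)
        for q, n in zip(q_values, counts)
    ]
    return max(range(len(ucb)), key=ucb.__getitem__)
```

With `c=0.0` the bonus vanishes and the function reduces to greedy selection on the value estimates; larger values of `c` favor rarely selected actions, which is the mechanism the abstract credits with reducing unselected actions.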

CC BY-NC-ND 4.0

Paper citation in several formats:

Morita, T. and Hosobe, H. (2022). Upper Confident Bound Fuzzy Q-learning and Its Application to a Video Game. In Proceedings of the 14th International Conference on Agents and Artificial Intelligence - Volume 3: ICAART; ISBN 978-989-758-547-0; ISSN 2184-433X, SciTePress, pages 454-461. DOI: 10.5220/0010835700003116

@conference{icaart22,
author={Takahiro Morita and Hiroshi Hosobe},
title={Upper Confident Bound Fuzzy Q-learning and Its Application to a Video Game},
booktitle={Proceedings of the 14th International Conference on Agents and Artificial Intelligence - Volume 3: ICAART},
year={2022},
pages={454-461},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0010835700003116},
isbn={978-989-758-547-0},
issn={2184-433X},
}

TY - CONF

JO - Proceedings of the 14th International Conference on Agents and Artificial Intelligence - Volume 3: ICAART
TI - Upper Confident Bound Fuzzy Q-learning and Its Application to a Video Game
SN - 978-989-758-547-0
IS - 2184-433X
AU - Morita, T.
AU - Hosobe, H.
PY - 2022
SP - 454
EP - 461
DO - 10.5220/0010835700003116
PB - SciTePress