Knowledge Gradient for Multi-objective Multi-armed Bandit Algorithms

Saba Q. Yahyaa; Madalina M. Drugan; Bernard Manderick

Topics: Machine Learning

In Proceedings of the 6th International Conference on Agents and Artificial Intelligence - Volume 1: ICAART, pages 74-83, 2014, ESEO, Angers, Loire Valley, France

Authors: Saba Q. Yahyaa; Madalina M. Drugan; Bernard Manderick

Affiliation: Vrije Universiteit Brussel, Belgium

Keyword(s): Multi-armed Bandit Problems, Multi-objective Optimization, Knowledge Gradient Policy.

Related Ontology Subjects/Areas/Topics: Artificial Intelligence; Computational Intelligence; Evolutionary Computing; Knowledge Discovery and Information Retrieval; Knowledge-Based Systems; Machine Learning; Soft Computing; Symbolic Systems

Abstract: We extend the knowledge gradient (KG) policy to the multi-objective multi-armed bandit problem in order to explore the Pareto optimal arms efficiently. We consider two partial order relations for ordering the mean vectors: Pareto dominance and scalarization functions. Pareto KG finds the optimal arms using Pareto search, while scalarization-KG transforms each multi-objective arm into a single-objective arm to find the optimal arms. To measure the performance of the proposed algorithms, we propose three regret measures. We compare the performance of the KG policy with that of UCB1 on a multi-objective multi-armed bandit problem, where KG outperforms UCB1.
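The two order relations named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes maximization, known mean vectors, and a linear (weighted-sum) scalarization, and it omits the KG exploration step entirely. The function names, the weight vectors, and the example mean vectors are all hypothetical.

```python
from typing import List, Sequence


def dominates(u: Sequence[float], v: Sequence[float]) -> bool:
    """True if mean vector u Pareto-dominates v (maximization assumed)."""
    no_worse = all(a >= b for a, b in zip(u, v))
    strictly_better = any(a > b for a, b in zip(u, v))
    return no_worse and strictly_better


def pareto_optimal_arms(means: List[Sequence[float]]) -> List[int]:
    """Indices of arms whose mean vector is not dominated by any other arm."""
    return [
        i
        for i, u in enumerate(means)
        if not any(dominates(v, u) for j, v in enumerate(means) if j != i)
    ]


def linear_scalarize(mean: Sequence[float], weights: Sequence[float]) -> float:
    """Collapse a mean vector to one value with a weighted sum (an assumption)."""
    return sum(w * m for w, m in zip(weights, mean))


def best_scalarized_arm(means: List[Sequence[float]], weights: Sequence[float]) -> int:
    """Index of the arm with the largest scalarized mean for the given weights."""
    return max(range(len(means)), key=lambda i: linear_scalarize(means[i], weights))


# Hypothetical two-objective mean vectors for six arms (not taken from the paper).
means = [
    (0.55, 0.50),
    (0.53, 0.51),
    (0.52, 0.54),
    (0.50, 0.57),
    (0.51, 0.51),
    (0.50, 0.50),
]

print(pareto_optimal_arms(means))              # arms not dominated by any other arm
print(best_scalarized_arm(means, (0.5, 0.5)))  # equal weight on both objectives
print(best_scalarized_arm(means, (1.0, 0.0)))  # only the first objective counts
```

In this sketch the Pareto relation returns a set of incomparable arms, whereas each weight vector picks a single arm, which is why a scalarized approach typically needs several weight vectors to cover the Pareto front.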

CC BY-NC-ND 4.0

Paper citation in several formats:

Yahyaa, S. Q., Drugan, M. M. and Manderick, B. (2014). Knowledge Gradient for Multi-objective Multi-armed Bandit Algorithms. In Proceedings of the 6th International Conference on Agents and Artificial Intelligence - Volume 1: ICAART; ISBN 978-989-758-015-4; ISSN 2184-433X, SciTePress, pages 74-83. DOI: 10.5220/0004796600740083

@conference{icaart14,
author={Saba Q. Yahyaa and Madalina M. Drugan and Bernard Manderick},
title={Knowledge Gradient for Multi-objective Multi-armed Bandit Algorithms},
booktitle={Proceedings of the 6th International Conference on Agents and Artificial Intelligence - Volume 1: ICAART},
year={2014},
pages={74--83},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004796600740083},
isbn={978-989-758-015-4},
issn={2184-433X},
}

TY - CONF

JO - Proceedings of the 6th International Conference on Agents and Artificial Intelligence - Volume 1: ICAART
TI - Knowledge Gradient for Multi-objective Multi-armed Bandit Algorithms
SN - 978-989-758-015-4
IS - 2184-433X
AU - Yahyaa, S. Q.
AU - Drugan, M. M.
AU - Manderick, B.
PY - 2014
SP - 74
EP - 83
DO - 10.5220/0004796600740083
PB - SciTePress