Authors: Hugo Tanzarella Teixeira and Celso Pascoli Bottura

Affiliation: State University of Campinas - UNICAMP, Brazil

Keyword(s): Machine Learning, Reinforcement Learning, Temporal Difference Learning, Value Function Approximation, Online Support Vector Machine.

Related Ontology Subjects/Areas/Topics: Informatics in Control, Automation and Robotics; Intelligent Control Systems and Optimization; Machine Learning in Control Applications

Abstract: This paper proposes a new algorithm for Temporal-Difference (TD) learning based on online support vector regression (SVR). The algorithm benefits from the good generalization properties of SVR, learns incrementally, and automatically tracks variations in environments with time-varying characteristics. With online SVR, good estimates of the value function can be obtained in TD learning for both linear and nonlinear prediction problems. Experimental results demonstrate the effectiveness of the proposed method in comparison with other methods.
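
To make the idea in the abstract concrete, the following is a minimal sketch of how a TD(0) prediction step can be posed as an online regression problem: each transition yields a bootstrapped target that is handed to an incrementally trained regressor. It is an illustration under stated assumptions, not the authors' algorithm. The names `td0_target` and `LinearValue` are hypothetical, and a plain linear approximator stands in for the online SVR, whose incremental update is not described on this page.

```python
from typing import List, Sequence


def td0_target(reward: float, next_value: float,
               gamma: float = 1.0, terminal: bool = False) -> float:
    """Bootstrapped TD(0) regression target: r + gamma * V(s')."""
    return reward if terminal else reward + gamma * next_value


class LinearValue:
    """Hypothetical stand-in for an online regressor.

    Assumption: the paper trains an online SVR here; a linear model
    with a gradient-style update is used only to keep the sketch
    self-contained.
    """

    def __init__(self, n_features: int, step_size: float = 0.1) -> None:
        self.w: List[float] = [0.0] * n_features
        self.step_size = step_size

    def predict(self, phi: Sequence[float]) -> float:
        # Value estimate as a dot product of weights and features.
        return sum(w * x for w, x in zip(self.w, phi))

    def update(self, phi: Sequence[float], target: float) -> float:
        # Move the estimate for phi toward the target; return the TD error.
        error = target - self.predict(phi)
        self.w = [w + self.step_size * error * x
                  for w, x in zip(self.w, phi)]
        return error


# One transition s -> s' with reward 1.0, using one-hot state features.
value = LinearValue(n_features=2, step_size=0.5)
phi_s, phi_next = [1.0, 0.0], [0.0, 1.0]
target = td0_target(1.0, value.predict(phi_next), gamma=0.9)
td_error = value.update(phi_s, target)
```

Replacing `LinearValue` with any regressor that exposes the same `predict` and `update` interface leaves the TD loop unchanged, which is the sense in which the value-function approximator is a pluggable component.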

CC BY-NC-ND 4.0

Paper citation in several formats:
Tanzarella Teixeira, H. and Pascoli Bottura, C. (2015). Temporal-Difference Learning - An Online Support Vector Regression Approach. In Proceedings of the 12th International Conference on Informatics in Control, Automation and Robotics - Volume 1: ICINCO; ISBN 978-989-758-122-9; ISSN 2184-2809, SciTePress, pages 318-323. DOI: 10.5220/0005572103180323

@conference{icinco15,
author={Hugo {Tanzarella Teixeira} and Celso {Pascoli Bottura}},
title={Temporal-Difference Learning - An Online Support Vector Regression Approach},
booktitle={Proceedings of the 12th International Conference on Informatics in Control, Automation and Robotics - Volume 1: ICINCO},
year={2015},
pages={318-323},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005572103180323},
isbn={978-989-758-122-9},
issn={2184-2809},
}

TY - CONF

JO - Proceedings of the 12th International Conference on Informatics in Control, Automation and Robotics - Volume 1: ICINCO
TI - Temporal-Difference Learning - An Online Support Vector Regression Approach
SN - 978-989-758-122-9
IS - 2184-2809
AU - Tanzarella Teixeira, H.
AU - Pascoli Bottura, C.
PY - 2015
SP - 318
EP - 323
DO - 10.5220/0005572103180323
PB - SciTePress