Contextual Action with Multiple Policies Inverse Reinforcement Learning for Behavior Simulation
Nahum Alvarez, Itsuki Noda
2019
Abstract
Machine learning is a discipline with many simulator-driven applications oriented to learn behavior. However, behavior simulation it comes with a number of associated difficulties, like the lack of a clear reward function, actions that depend of the state of the actor and the alternation of different policies. We present a method for behavior learning called Contextual Action Multiple Policy Inverse Reinforcement Learning (CAMP-IRL) that tackles those factors. Our method allows to extract multiple reward functions and generates different behavior profiles from them. We applied our method to a large scale crowd simulator using intelligent agents to imitate pedestrian behavior, making the virtual pedestrians able to switch between behaviors depending of the goal they have and navigating efficiently across unknown environments.
DownloadPaper Citation
in Harvard Style
Alvarez N. and Noda I. (2019). Contextual Action with Multiple Policies Inverse Reinforcement Learning for Behavior Simulation.In Proceedings of the 11th International Conference on Agents and Artificial Intelligence - Volume 2: ICAART, ISBN 978-989-758-350-6, pages 887-894. DOI: 10.5220/0007684908870894
in Bibtex Style
@conference{icaart19,
author={Nahum Alvarez and Itsuki Noda},
title={Contextual Action with Multiple Policies Inverse Reinforcement Learning for Behavior Simulation},
booktitle={Proceedings of the 11th International Conference on Agents and Artificial Intelligence - Volume 2: ICAART,},
year={2019},
pages={887-894},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0007684908870894},
isbn={978-989-758-350-6},
}
in EndNote Style
TY - CONF
JO - Proceedings of the 11th International Conference on Agents and Artificial Intelligence - Volume 2: ICAART,
TI - Contextual Action with Multiple Policies Inverse Reinforcement Learning for Behavior Simulation
SN - 978-989-758-350-6
AU - Alvarez N.
AU - Noda I.
PY - 2019
SP - 887
EP - 894
DO - 10.5220/0007684908870894