A Two-Phase Safe Reinforcement Learning Framework for Finding the Safe Policy Space
A. J. Westley, Gavin Rens
2025
Abstract
As reinforcement learning (RL) expands into safety-critical domains, ensuring agent adherence to safety constraints becomes crucial. This paper introduces a two-phase approach to safe RL, Violation-Guided Identification of Safety(ViGIS), which firstidentifies a safe policy space and then performs standard RL within this space. We present two variants: ViGIS-P, which precalculates the safe policy space given a known transition function, and ViGIS-L, which learns the safe policy space through exploration. We evaluate ViGIS in three environments: a multi-constraint taxi world, a deterministic bank robber game, and a continuous cart-pole problem. Results show that both variants significantly reduce constraint violations compared to standard and β-pessimistic Q-learning, sometimes at the cost of achieving a lower average reward. ViGIS-L consistently outperforms ViGIS-P in the taxi world, especially as constraints increase. In the bank robber environment, both achieve perfect safety. A Deep Q-Network (DQN) implementation of ViGIS-L in the cart-pole domain reduces violations compared to a standard DQN. This research contributes to safe RL by providing a flexible framework for incorporating safety constraints into the RL process. The two-phase approach allows for clear separation between safety consideration and task optimization, potentially easing application in various safety-critical domains.
DownloadPaper Citation
in Harvard Style
Westley A. and Rens G. (2025). A Two-Phase Safe Reinforcement Learning Framework for Finding the Safe Policy Space. In Proceedings of the 17th International Conference on Agents and Artificial Intelligence - Volume 2: ICAART; ISBN 978-989-758-737-5, SciTePress, pages 275-285. DOI: 10.5220/0013151600003890
in Bibtex Style
@conference{icaart25,
author={A. Westley and Gavin Rens},
title={A Two-Phase Safe Reinforcement Learning Framework for Finding the Safe Policy Space},
booktitle={Proceedings of the 17th International Conference on Agents and Artificial Intelligence - Volume 2: ICAART},
year={2025},
pages={275-285},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0013151600003890},
isbn={978-989-758-737-5},
}
in EndNote Style
TY - CONF
JO - Proceedings of the 17th International Conference on Agents and Artificial Intelligence - Volume 2: ICAART
TI - A Two-Phase Safe Reinforcement Learning Framework for Finding the Safe Policy Space
SN - 978-989-758-737-5
AU - Westley A.
AU - Rens G.
PY - 2025
SP - 275
EP - 285
DO - 10.5220/0013151600003890
PB - SciTePress