Model-Free versus Model-Based Reinforcement Learning for Fixed-Wing UAV Attitude Control Under Varying Wind Conditions

David Olivares; Pierre Fournier; Pavan Vasishta; Julien Marzat

doi:10.5220/0012946600003822

2024

Abstract

This paper evaluates and compares the performance of model-free and model-based reinforcement learning for the attitude control of fixed-wing unmanned aerial vehicles, using PID as a reference point. The comparison focuses on their ability to handle varying flight dynamics and wind disturbances in a simulated environment. Our results show that the Temporal Difference Model Predictive Control agent outperforms both the PID controller and the model-free reinforcement learning methods in terms of tracking accuracy and robustness over different reference difficulties, particularly in nonlinear flight regimes. Furthermore, we introduce actuation fluctuation as a key metric to assess energy efficiency and actuator wear, and we test two different approaches from the literature: action variation penalty and conditioning for action policy smoothness. We also evaluate all control methods when subject to stochastic turbulence and gusts separately, to measure their effects on tracking performance, observe their limitations and outline their implications for the Markov decision process formalism.

Paper Citation

in Harvard Style

Olivares D., Fournier P., Vasishta P. and Marzat J. (2024). Model-Free versus Model-Based Reinforcement Learning for Fixed-Wing UAV Attitude Control Under Varying Wind Conditions. In Proceedings of the 21st International Conference on Informatics in Control, Automation and Robotics - Volume 1: ICINCO; ISBN 978-989-758-717-7, SciTePress, pages 79-91. DOI: 10.5220/0012946600003822

in Bibtex Style

@conference{icinco24,
author={David Olivares and Pierre Fournier and Pavan Vasishta and Julien Marzat},
title={Model-Free versus Model-Based Reinforcement Learning for Fixed-Wing UAV Attitude Control Under Varying Wind Conditions},
booktitle={Proceedings of the 21st International Conference on Informatics in Control, Automation and Robotics - Volume 1: ICINCO},
year={2024},
pages={79--91},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0012946600003822},
isbn={978-989-758-717-7},
}

in EndNote Style

TY - CONF
JO - Proceedings of the 21st International Conference on Informatics in Control, Automation and Robotics - Volume 1: ICINCO
TI - Model-Free versus Model-Based Reinforcement Learning for Fixed-Wing UAV Attitude Control Under Varying Wind Conditions
SN - 978-989-758-717-7
AU - Olivares D.
AU - Fournier P.
AU - Vasishta P.
AU - Marzat J.
PY - 2024
SP - 79
EP - 91
DO - 10.5220/0012946600003822
PB - SciTePress