A Reward-driven Model of Darwinian Fitness

Jan Teichmann, Eduardo Alonso, Mark Broom


In this paper we present a model that, based on the principle of total energy balance (similar to energy conservation in physics), bridges the gap between Darwinian fitness theories and reward-driven theories of behaviour. Results show that it is possible to reconcile the reward maximization principle underlying modern approaches in behavioural reinforcement learning with traditional fitness approaches. Our framework, presented within a prey-predator model, may have important consequences for the study of behaviour.



Paper Citation

in Harvard Style

Teichmann J., Alonso E. and Broom M. (2015). A Reward-driven Model of Darwinian Fitness. In Proceedings of the 7th International Joint Conference on Computational Intelligence - Volume 1: ECTA, ISBN 978-989-758-157-1, pages 174-179. DOI: 10.5220/0005591501740179
