The application of temporal difference learning in optimal diet models

Teichmann, J., Broom, M. & Alonso, E. (2014). The application of temporal difference learning in optimal diet models. Journal of Theoretical Biology, 340, pp. 11-16. doi: 10.1016/j.jtbi.2013.08.036

[img]
Preview
PDF
Download (452kB) | Preview

Abstract

An experience-based aversive learning model of foraging behaviour in uncertain environments is presented. We use Q-learning as a model-free implementation of Temporal difference learning motivated by growing evidence for neural correlates in natural reinforcement settings. The predator has the choice of including an aposematic prey in its diet or to forage on alternative food sources. We show how the predator's foraging behaviour and energy intake depend on toxicity of the defended prey and the presence of Batesian mimics. We introduce the precondition of exploration of the action space for successful aversion formation and show how it predicts foraging behaviour in the presence of conflicting rewards which is conditionally suboptimal in a fixed environment but allows better adaptation in changing environments.

Item Type: Article
Uncontrolled Keywords: Batesian mimicry, Optimal diet, Predator–prey, Taste sampling, Temporal difference learning
Subjects: Q Science > QA Mathematics
Q Science > QH Natural history
Divisions: School of Engineering & Mathematical Sciences > Department of Mathematical Science
URI: http://openaccess.city.ac.uk/id/eprint/2787

Actions (login required)

View Item View Item

Downloads

Downloads per month over past year

View more statistics