City Research Online

The application of temporal difference learning in optimal diet models

Teichmann, J., Broom, M. & Alonso, E. (2014). The application of temporal difference learning in optimal diet models. Journal of Theoretical Biology, 340(7), pp. 11-16. doi: 10.1016/j.jtbi.2013.08.036


An experience-based aversive learning model of foraging behaviour in uncertain environments is presented. We use Q-learning as a model-free implementation of Temporal difference learning motivated by growing evidence for neural correlates in natural reinforcement settings. The predator has the choice of including an aposematic prey in its diet or to forage on alternative food sources. We show how the predator's foraging behaviour and energy intake depend on toxicity of the defended prey and the presence of Batesian mimics. We introduce the precondition of exploration of the action space for successful aversion formation and show how it predicts foraging behaviour in the presence of conflicting rewards which is conditionally suboptimal in a fixed environment but allows better adaptation in changing environments.

Publication Type: Article
Publisher Keywords: Batesian mimicry, Optimal diet, Predator–prey, Taste sampling, Temporal difference learning
Subjects: Q Science > QA Mathematics
Q Science > QH Natural history
Departments: School of Science & Technology > Mathematics
SWORD Depositor:
[thumbnail of optimalDiet (3).pdf]
Download (452kB) | Preview


Add to AnyAdd to TwitterAdd to FacebookAdd to LinkedinAdd to PinterestAdd to Email


Downloads per month over past year

View more statistics

Actions (login required)

Admin Login Admin Login