On the strategic learning of signal associations

Published:2022-09-02 Issue:6 Volume:33 Page:1058-1069
ISSN:1045-2249
Container-title:Behavioral Ecology
language:en

Author:

Sherratt Thomas N¹, Voll James¹

Affiliation:

1. Department of Biology, Carleton University, 1125 Colonel By Drive, Ottawa, ON K1S 5B6, Canada

Abstract

Signal detection theory (SDT) has been widely used to identify the optimal response of a receiver to a stimulus when it could be generated by more than one signaler type. While SDT assumes that the receiver adopts the optimal response at the outset, in reality, receivers often have to learn how to respond. We, therefore, recast a simple signal detection problem as a multi-armed bandit (MAB) in which inexperienced receivers choose between accepting a signaler (gaining information and an uncertain payoff) and rejecting it (gaining no information but a certain payoff). An exact solution to this exploration–exploitation dilemma can be identified by solving the relevant dynamic programming equation (DPE). However, to evaluate how the problem is solved in practice, we conducted an experiment. Here humans (n = 135) were repeatedly presented with four readily discriminable signaler types, some of which were on average profitable, and others unprofitable, to accept in the long term. We then compared the performance of SDT, DPE, and three candidate exploration–exploitation models (Softmax, Thompson, and Greedy) in explaining the observed sequences of acceptance and rejection. All of the models predicted volunteer behavior well when signalers were clearly profitable or clearly unprofitable to accept. Overall, however, the Softmax and Thompson sampling models, which predict the optimal (SDT) response towards signalers with borderline profitability only after extensive learning, explained the responses of volunteers significantly better. By highlighting the relationship between the MAB and SDT models, we encourage others to evaluate how receivers strategically learn about their environments.
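The accept/reject problem described in the abstract can be illustrated with a minimal sketch. This is not the authors' model or fitted code: the function names, the payoff values (`benefit`, `cost`), the uniform Beta(1, 1) prior, the zero payoff for rejecting, and the softmax `temperature` are all assumptions chosen for illustration. It contrasts a fixed SDT-style rule, which assumes the probability of a good outcome is known, with three learning rules that only update their beliefs when they accept.

```python
import math
import random


def expected_payoff(p, benefit, cost):
    """Expected payoff of accepting when a good outcome has probability p."""
    return p * benefit - (1.0 - p) * cost


def sdt_accepts(p_true, benefit, cost):
    """SDT-style rule (assumed form): accept iff the known expected payoff is positive."""
    return expected_payoff(p_true, benefit, cost) > 0


def choose_accept(policy, successes, failures, benefit, cost, rng, temperature=0.5):
    """Decide whether to accept, given outcomes observed on past acceptances."""
    a, b = 1 + successes, 1 + failures  # Beta posterior under a uniform prior (assumption)
    if policy == "thompson":
        # Thompson sampling: act on a single draw from the posterior.
        return expected_payoff(rng.betavariate(a, b), benefit, cost) > 0
    value = expected_payoff(a / (a + b), benefit, cost)  # posterior-mean estimate
    if policy == "greedy":
        return value > 0
    if policy == "softmax":
        # Two-option softmax: accept (estimated value) versus reject (payoff 0).
        p_accept = 1.0 / (1.0 + math.exp(-value / temperature))
        return rng.random() < p_accept
    raise ValueError(f"unknown policy: {policy}")


def simulate(policy, p_true, benefit=1.0, cost=1.0, trials=200, seed=0):
    """Return the sequence of accept (True) / reject (False) choices for one signaler type."""
    rng = random.Random(seed)
    successes = failures = 0
    choices = []
    for _ in range(trials):
        accept = choose_accept(policy, successes, failures, benefit, cost, rng)
        choices.append(accept)
        if accept:  # rejecting yields no information, so beliefs change only on acceptance
            if rng.random() < p_true:
                successes += 1
            else:
                failures += 1
    return choices


if __name__ == "__main__":
    for policy in ("greedy", "softmax", "thompson"):
        accepted = sum(simulate(policy, p_true=0.9))
        print(policy, "accepted", accepted, "of 200 trials")
```

Under these assumed settings (equal benefit and cost, uniform prior), the greedy rule's initial estimate of accepting is exactly zero, so it never accepts and never learns, even when the signaler is profitable. The softmax and Thompson rules keep sampling and can therefore move toward the SDT response over time.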

Funder

Natural Sciences and Engineering Research Council of Canada

Publisher

Oxford University Press (OUP)

Subject

Animal Science and Zoology; Ecology, Evolution, Behavior and Systematics

Link

https://academic.oup.com/beheco/article-pdf/33/6/1058/47771741/arac027.pdf

References: 70 articles.

1. Fitting linear mixed-effects models using lme4.;Bates;J Stat Softw,2015

2. Economic models of animal communication.;Bradbury;Anim Behav,2000

3. Boltzmann exploration done right.;Cesa-Bianchi,2017

Cited by 2 articles.

1. Signal detection models as contextual bandits;Royal Society Open Science;2023-06

2. On the strategic learning of signal associations;Behavioral Ecology;2022-09-02