Abstract
In two experiments, we used the simple zero-sum game Rock, Paper and Scissors to study the common reinforcement-based rules of repeating choices after winning (win-stay) and shifting from previous choice options after losing (lose-shift). Participants played the game against both computer opponents who could not be exploited and computer opponents who could be exploited by making choices that would at times conflict with reinforcement. Against unexploitable opponents, participants achieved an approximation of random behavior, contrary to previous research commonly finding reinforcement biases. Against exploitable opponents, the participants learned to exploit the opponent regardless of whether optimal choices conflicted with reinforcement or not. The data suggest that learning a rule that allows one to exploit was largely determined by the outcome of the previous trial.
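As an illustration of the two rules named above, the following minimal Python sketch computes how often a player repeats a choice after a win (win-stay) and changes choice after a loss (lose-shift) from a sequence of trials. It is not the authors' analysis code; the function names `outcome` and `stay_shift_rates`, the string labels for the moves, and the decision to ignore draws are all assumptions made for this example.

```python
# Hypothetical helper names; this is an illustrative sketch, not the paper's method.
BEATS = {"rock": "scissors", "paper": "rock", "scissors": "paper"}


def outcome(player, opponent):
    """Return 'win', 'lose', or 'draw' from the player's point of view."""
    if player == opponent:
        return "draw"
    return "win" if BEATS[player] == opponent else "lose"


def stay_shift_rates(player_moves, opponent_moves):
    """Return (win-stay rate, lose-shift rate) over consecutive trial pairs.

    A rate is None when the player never won (or never lost) on a trial
    that was followed by another trial. Draws are ignored (an assumption).
    """
    wins = stays_after_win = 0
    losses = shifts_after_loss = 0
    trials = list(zip(player_moves, opponent_moves))
    # Pair each trial with the one that follows it.
    for (prev_p, prev_o), (next_p, _) in zip(trials, trials[1:]):
        result = outcome(prev_p, prev_o)
        if result == "win":
            wins += 1
            stays_after_win += next_p == prev_p
        elif result == "lose":
            losses += 1
            shifts_after_loss += next_p != prev_p
    win_stay = stays_after_win / wins if wins else None
    lose_shift = shifts_after_loss / losses if losses else None
    return win_stay, lose_shift


player = ["rock", "rock", "paper", "scissors"]
opponent = ["scissors", "paper", "scissors", "paper"]
print(stay_shift_rates(player, opponent))
```

For a player who picks each of the three options uniformly at random and independently of the previous trial, the expected win-stay rate is 1/3 and the expected lose-shift rate is 2/3, so values well above those baselines would indicate a reinforcement bias in this sketch.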
Funder
University of Sussex, School of Psychology
Osk. Huttusen säätiö
Publisher
Public Library of Science (PLoS)