Stochasticity, Nonlinear Value Functions, and Update Rules in Learning Aesthetic Biases-Reference-Cited by-同舟云学术

Stochasticity, Nonlinear Value Functions, and Update Rules in Learning Aesthetic Biases

Published:2021-05-10 Issue: Volume:15 Page:
ISSN:1662-5161
Container-title:Frontiers in Human Neuroscience
language:
Short-container-title:Front. Hum. Neurosci.

Author:

Grzywacz Norberto M.

Abstract

A theoretical framework for the reinforcement learning of aesthetic biases was recently proposed based on brain circuitries revealed by neuroimaging. A model grounded on that framework accounted for interesting features of human aesthetic biases. These features included individuality, cultural predispositions, stochastic dynamics of learning and aesthetic biases, and the peak-shift effect. However, despite the success in explaining these features, a potential weakness was the linearity of the value function used to predict reward. This linearity meant that the learning process employed a value function that assumed a linear relationship between reward and sensory stimuli. Linearity is common in reinforcement learning in neuroscience. However, linearity can be problematic because neural mechanisms and the dependence of reward on sensory stimuli were typically nonlinear. Here, we analyze the learning performance with models including optimal nonlinear value functions. We also compare updating the free parameters of the value functions with the delta rule, which neuroscience models use frequently, vs. updating with a new Phi rule that considers the structure of the nonlinearities. Our computer simulations showed that optimal nonlinear value functions resulted in improvements of learning errors when the reward models were nonlinear. Similarly, the new Phi rule led to improvements in these errors. These improvements were accompanied by the straightening of the trajectories of the vector of free parameters in its phase space. This straightening meant that the process became more efficient in learning the prediction of reward. Surprisingly, however, this improved efficiency had a complex relationship with the rate of learning. Finally, the stochasticity arising from the probabilistic sampling of sensory stimuli, rewards, and motivations helped the learning process narrow the range of free parameters to nearly optimal outcomes. Therefore, we suggest that value functions and update rules optimized for social and ecological constraints are ideal for learning aesthetic biases.

Publisher

Frontiers Media SA

Subject

Behavioral Neuroscience,Biological Psychiatry,Psychiatry and Mental health,Neurology,Neuropsychology and Physiological Psychology

Reference132 articles.

1. Nonlinear dynamics of emotion-cognition interaction: when emotion does not destroy cognition?;Afraimovich;Bull. Math. Biol.,2011

2. Judgments of pleasingness and interestingness as functions of visual complexity;Aitken;J. Exp. Psychol.,1974

3. Inferring master painters’ esthetic biases from the statistics of portraits;Aleem;Front. Hum. Neurosci.,2017

4. A theoretical framework for how we learn aesthetic values;Aleem;Front. Hum. Neurosci.,2020

5. Is beauty in the eye of the beholder or an objective truth? A neuroscientific answer;Aleem,2019

Cited by 4 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Social groups and polarization of aesthetic values from symmetry and complexity;Scientific Reports;2023-12-06

2. Social Groups and Polarization of Aesthetic Values;2023-08-28

3. The temporal instability of aesthetic preferences.;Psychology of Aesthetics, Creativity, and the Arts;2023-04-06

4. Does Amount of Information Support Aesthetic Values?;Frontiers in Neuroscience;2022-03-22