Abstract
AbstractReflectance, lighting and geometry combine in complex ways to create images. How do we disentangle these to perceive individual properties, such as surface glossiness? We suggest that brains disentangle properties by learning to model statistical structure in proximal images. To test this hypothesis, we trained unsupervised generative neural networks on renderings of glossy surfaces and compared their representations with human gloss judgements. The networks spontaneously cluster images according to distal properties such as reflectance and illumination, despite receiving no explicit information about these properties. Intriguingly, the resulting representations also predict the specific patterns of ‘successes’ and ‘errors’ in human perception. Linearly decoding specular reflectance from the model’s internal code predicts human gloss perception better than ground truth, supervised networks or control models, and it predicts, on an image-by-image basis, illusions of gloss perception caused by interactions between material, shape and lighting. Unsupervised learning may underlie many perceptual dimensions in vision and beyond.
Publisher
Springer Science and Business Media LLC
Subject
Behavioral Neuroscience,Experimental and Cognitive Psychology,Social Psychology
Reference117 articles.
1. Adelson, E. H. Lightness perception and lightness illusions. in The New Cognitive Neurosciences (ed. Gazzaniga, M.S.) 339–351 (MIT Press, 2000).
2. Anderson, B. L. Mid-level vision. Curr. Biol. 30, R105–R109 (2020).
3. Anderson, B. L. The perceptual representation of transparency, lightness, and gloss. in Handbook of Perceptual Organization (ed. Wagemans, J.) 466–483 (Oxford University Press, 2015).
4. Barrow, H., Tenenbaum, J., Hanson, A. & Riseman, E. Recovering intrinsic scene characteristics. Comput. Vis. Syst. 2, 3–26 (1978).
5. Fleming, R. W. Material perception. Annu. Rev. Vis. Sci. 3, 365–388 (2017).
Cited by
52 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献