Disentangled deep generative models reveal coding principles of the human face processing network

Published:2024-02-26 Issue:2 Volume:20 Page:e1011887
ISSN:1553-7358
Container-title:PLOS Computational Biology
Language:en
Short-container-title:PLoS Comput Biol

Author:

Soulos Paul, Isik Leyla

Abstract

Despite decades of research, much is still unknown about the computations carried out in the human face processing network. Recently, deep networks have been proposed as a computational account of human visual processing, but while they provide a good match to neural data throughout visual cortex, they lack interpretability. We introduce a method for interpreting brain activity using a new class of deep generative models, disentangled representation learning models, which learn a low-dimensional latent space that “disentangles” different semantically meaningful dimensions of faces, such as rotation, lighting, or hairstyle, in an unsupervised manner by enforcing statistical independence between dimensions. We find that the majority of our model’s learned latent dimensions are interpretable by human raters. Further, these latent dimensions serve as a good encoding model for human fMRI data. We next investigate the representation of different latent dimensions across face-selective voxels. We find that low- and high-level face features are represented in posterior and anterior face-selective regions, respectively, corroborating prior models of human face recognition. Interestingly, though, we find identity-relevant and irrelevant face features across the face processing network. Finally, we provide new insight into the few “entangled” (uninterpretable) dimensions in our model by showing that they match responses in the ventral stream and carry information about facial identity. Disentangled face encoding models provide an exciting alternative to standard “black box” deep learning approaches for modeling and interpreting human brain data.

Funder

The Clare Boothe Luce Program for Women

Publisher

Public Library of Science (PLoS)
