Shared representations of human actions across vision and language-Reference-Cited by-同舟云学术

Shared representations of human actions across vision and language

Published:2023-11-06 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Dima Diana C.^ORCID,Janarthanan Sugitha,Culham Jody C.,Mohsenzadeh Yalda

Abstract

AbstractHumans can recognize and communicate about many actions performed by others. How are actions organized in the mind, and is this organization shared across vision and language? We collected similarity judgments of human actions depicted through naturalistic videos and sentences, and tested four models of action categorization, defining actions at different levels of abstraction ranging from specific (action verb) to broad (action target: whether an action is directed towards an object, another person, or the self). The similarity judgments reflected a shared semantic organization across videos and sentences, determined mainly by the target of actions, even after accounting for other semantic features. Large language model features predicted the behavioral similarity of action videos and sentences, and captured information about the target of actions alongside unique semantic information. Together, our results show how modality-invariant action concepts are organized in the human mind and in large language model representations.

Publisher

Cold Spring Harbor Laboratory

Reference107 articles.

1. The lateral occipitotemporal cortex in action

2. Leshinskaya, A. , Wurm, M. F. & Caramazza, A. Concepts of Actions and their Objects. in The Cognitive Neurosciences (eds. Gazzaniga, M. , Mangun, G. & Poeppel, D. ) 757–765 (MIT Press, 2020).

3. Action understanding as inverse planning

4. Thornton, M. A. & Tamir, D. I . The brain represents situations and mental states as sums of their action affordances. PsyArXiv 1–52 (2023).

5. The neural basis of conceptualizing the same action at different levels of abstraction

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Context Helps: Integrating Context Information with Videos in a Graph-Based HAR Framework;Lecture Notes in Computer Science;2024