Subword Representations Successfully Decode Brain Responses to Morphologically Complex Written Words

Author:

Hakala Tero12ORCID,Lindh-Knuutila Tiina1ORCID,Hultén Annika1ORCID,Lehtonen Minna34ORCID,Salmelin Riitta1ORCID

Affiliation:

1. Department of Neuroscience and Biomedical Engineering, Aalto University, Espoo, Finland

2. Aalto NeuroImaging, Aalto University, Espoo, Finland

3. Department of Psychology and Speech-Language Pathology, University of Turku, Turku, Finland

4. Centre for Multilingualism in Society Across the Lifespan, University of Oslo, Oslo, Norway

Abstract

Abstract This study extends the idea of decoding word-evoked brain activations using a corpus-semantic vector space to multimorphemic words in the agglutinative Finnish language. The corpus-semantic models are trained on word segments, and decoding is carried out with word vectors that are composed of these segments. We tested several alternative vector-space models using different segmentations: no segmentation (whole word), linguistic morphemes, statistical morphemes, random segmentation, and character-level 1-, 2- and 3-grams, and paired them with recorded MEG responses to multimorphemic words in a visual word recognition task. For all variants, the decoding accuracy exceeded the standard word-label permutation-based significance thresholds at 350–500 ms after stimulus onset. However, the critical segment-label permutation test revealed that only those segmentations that were morphologically aware reached significance in the brain decoding task. The results suggest that both whole-word forms and morphemes are represented in the brain and show that neural decoding using corpus-semantic word representations derived from compositional subword segments is applicable also for multimorphemic word forms. This is especially relevant for languages with complex morphology, because a large proportion of word forms are rare and it can be difficult to find statistically reliable surface representations for them in any large corpus.

Funder

Academy of Finland

Aalto Brain Center

Sigrid Juséliuksen Säätiö

Publisher

MIT Press

Reference62 articles.

1. A short history of morphological theory;Anderson,2019

2. An amorphous model for morphological processing in visual comprehension based on naive discriminative learning;Baayen;Psychological Review,2011

3. Enriching word vectors with subword information;Bojanowski;Transactions of the Association for Computational Linguistics,2017

4. The impact of word prevalence on lexical decision times: Evidence from the Dutch Lexicon Project 2;Brysbaert;Journal of Experimental Psychology: Human Perception and Performance,2016

5. Extracting semantic representations from word co-occurrence statistics: A computational study;Bullinaria;Behavior Research Methods,2007

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3