Human and computer estimations of Predictability of words in written language

Published:2020-03-10 Issue:1 Volume:10 Page:
ISSN:2045-2322
Container-title:Scientific Reports
language:en
Short-container-title:Sci Rep

Author:

Bianchi Bruno, Bengolea Monzón Gastón, Ferrer Luciana, Fernández Slezak Diego, Shalom Diego E., Kamienkowski Juan E.

Abstract

When we read printed text, we are continuously predicting upcoming words to integrate information and guide future eye movements. Thus, the Predictability of a given word has become one of the most important variables when explaining human behaviour and information processing during reading. In parallel, the Natural Language Processing (NLP) field evolved by developing a wide variety of applications. Here, we show that using different word embeddings techniques (like Latent Semantic Analysis, Word2Vec, and FastText) and N-gram-based language models we were able to estimate how humans predict words (cloze-task Predictability) and how to better understand eye movements in long Spanish texts. Both types of models partially captured aspects of predictability. On the one hand, our N-gram model performed well when added as a replacement for the cloze-task Predictability of the fixated word. On the other hand, word embeddings were useful to mimic Predictability of the following word. Our study joins efforts from neurolinguistic and NLP fields to understand human information processing during reading to potentially improve NLP algorithms.

Publisher

Springer Science and Business Media LLC

Subject

Multidisciplinary

Link

http://www.nature.com/articles/s41598-020-61353-z.pdf

References: 37 articles.

1. Rolfs, M. Attention in active vision: A perspective on perceptual continuity across saccades. Percept. 44, 900–919 (2015).

2. Yang, S. C.-H., Wolpert, D. M. & Lengyel, M. Theoretical perspectives on active sensing. Curr. Opin. Behav. Sci. 11, 100–108 (2016).

3. Gottlieb, J. & Oudeyer, P.-Y. Towards a neuroscience of active sampling and curiosity. Nat. Rev. Neurosci. 1 (2018).

4. Ehrlich, S. F. & Rayner, K. Contextual effects on word perception and eye movements during reading. J. Verbal Learn. Verbal Behav. 20, 641–655 (1981).

5. Inhoff, A. W. Two stages of word processing during eye fixations in the reading of prose. J. Verbal Learn. Verbal Behav. 23, 612–624 (1984).

Cited by 7 articles.

1. A discriminative information-theoretical analysis of the regularity gradient in inflectional morphology;Morphology;2023-08-02

2. Synthetic predictabilities from large language models explain reading eye movements;2023 Symposium on Eye Tracking Research and Applications;2023-05-30

3. Neural Bases of Predictions During Natural Reading of Known Statements: An Electroencephalography and Eye Movements Co-registration Study;Neuroscience;2023-05

4. Language Models Explain Word Reading Times Better Than Empirical Predictability;Frontiers in Artificial Intelligence;2022-02-02

5. An Implementation of Natural Language Processing and Text Mining in Stroke Research;Journal of the Korean Neurological Association;2021-08-01