Affiliation:
1. Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100094, China
2. School of Computer and Artificial Intelligence, Zhengzhou University, Zhengzhou 450001, China
Abstract
Semantic folding theory (SFT) is an emerging cognitive science theory that aims to explain how the human brain processes and organizes semantic information. The distribution of text into semantic grids is key to SFT. We propose a sentence-level semantic division baseline with 100 grids (SSDB-100), the only dataset we are currently aware of that performs a relevant validation of the sentence-level SFT algorithm, to evaluate the validity of text distribution in semantic grids and divide it using classical division algorithms on SSDB-100. In this article, we describe the construction of SSDB-100. First, a semantic division questionnaire with broad coverage was generated by limiting the uncertainty range of the topics and corpus. Subsequently, through an expert survey, 11 human experts provided feedback. Finally, we analyzed and processed the feedback; the average consistency index for the used feedback was 0.856 after eliminating the invalid feedback. SSDB-100 has 100 semantic grids with clear distinctions between the grids, allowing the dataset to be extended using semantic methods.
Funder
National Natural Science Foundation of China
Natural Science Foundation of Heilongjiang Province of China
National Key Research and Development Program of China
Reference28 articles.
1. Unity and diversity in human language;Fitch;Philos. Trans. R. Soc. Lond. B Biol. Sci.,2011
2. Language Representation in the Human Brain: Evidence from Cortical Mapping;Bhatnagar;Brain Lang.,2000
3. The neurobiology of language beyond single-word processing;Hagoort;Science,2019
4. Why Can’t a Computer be more Like a Brain?;Hawkins;IEEE Spectr.,2007
5. Hawkins, J., Ahmad, S., Purdy, S., and Lavin, A. (2023, November 08). Biological and Machine Intelligence. Release 0.4. 2016–2020. Available online: https://numenta.com/resources/biological-and-machine-intelligence/.