Affiliation:
1. Pushkin State Russian Language Institute
Abstract
The authors of this article identify distinctive features in texts written by humans and texts generated by the GPT-3 neural network. Texts generated by GPT-3 have not yet been subject to systematic in-depth study. In total, 160 texts were analyzed in the article, distributed across four topics (“Higher Education in My Eyes,” “How to Remain Human in Inhuman Conditions,” “How I Spent the Summer,” “Teacher of the Year”), with 80 texts generated by the neural network and 80 texts written by humans. The texts were analyzed using quantitative linguistic methods. A concordance was compiled for each text using the AntConc program, from which quantitative values were obtained for further analysis. The authors reached the following conclusions: (1) in the generated texts, words included in the title occur with the highest frequency; (2) the relative frequency of words included in the title is unreasonably inflated; (3) the list of the 20 most frequent words in all generated texts includes the highest number of full-fledged words; (4) the lexical diversity coefficient in the examined natural texts is significantly higher than that of the generated texts. The findings of this research can be useful for both educators and machine learning specialists.
Publisher
OOO Centr naucnyh i obrazovatelnyh proektov
Subject
General Economics, Econometrics and Finance
Reference21 articles.
1. Borunov, A. B. (2017). Diversity of speech and methods of measuring it in text (linguostatistical approach). Litera, 4: 81—86. (In Russ.).
2. Burnashev, R. F., Alamova, A. S. (2022). Quantitative linguistics and artificial intelligence. Science and Education, 3(2): 1390—1402. (In Russ.).
3. Burnashev, R. F., Alamova, A. S. (2023). The role of neural networks in linguistic research. Science and Education, 3: 258—269. (In Russ.).
4. Cohen, A., Mantegna, R., Havlin, S. (2011). Numerical Analysis of Word Frequencies in Artificial and Natural Language Texts. Fractals, 5 (1): 1—19. DOI: 10.1142/S0218348X97000103.
5. Dale, R. (2021). GPT-3: What’s it good for? Natural Language Engineering, 27 (1): 113— 118. DOI: 10.1017/S1351324920000601.
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献