Affiliation:
1. College of Liberal Arts and International Education, Xi’an Peihua University, Xi’an, Shaanxi 710125, China
2. College of Liberal Arts, Huaibei Normal University, Huaibei, Anhui 235000, China
3. College of Computer Science and Technology, Xi’an University of Technology, Xi’an, Shaanxi 710054, China
Abstract
Reported speech is an important linguistic phenomenon in verbal communication. It appears in a large number of news texts and is one of the most prominent features of the Chinese language. However, there are currently few systematic comparative studies on the recognition of reported speech in Chinese cross-language contexts, and existing methods are highly subjective. This paper presents a qualitative comparative analysis of reported speech in Chinese using the DE-BP model, which combines the differential evolution (DE) algorithm with a backpropagation (BP) neural network to recognize Chinese cross-language paraphrase. The analysis yields the following findings. In Chinese, indirect paraphrase is the most frequent category, followed by direct paraphrase, while the other categories, namely free indirect paraphrase, free direct paraphrase, and narrative paraphrase of speech acts, are relatively rare. After identifying and manually labeling reporting verbs and computing word-frequency statistics, the study finds that reporting verbs are generally more numerous in English newspapers, and that the difference is significant.
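The abstract names a DE-BP model but does not describe its internals. The sketch below illustrates one common way such a hybrid is assembled, under stated assumptions: differential evolution searches for a good starting weight vector, and plain gradient-descent backpropagation then refines it. The 2-3-1 network, the toy XOR data, every hyperparameter, and all function names are hypothetical stand-ins chosen for illustration; they are not taken from the paper.

```python
import math
import random

# Toy XOR data stands in for real features; the paper's dataset is not reproduced here.
DATA = [((0.0, 0.0), 0.0), ((0.0, 1.0), 1.0), ((1.0, 0.0), 1.0), ((1.0, 1.0), 0.0)]
N_IN, N_HID = 2, 3  # assumed network size
DIM = N_IN * N_HID + N_HID + N_HID + 1  # all weights and biases, flattened


def sigmoid(z):
    z = max(-60.0, min(60.0, z))  # clamp to avoid overflow in exp
    return 1.0 / (1.0 + math.exp(-z))


def forward(w, x):
    """Return (hidden activations, output) for a 2-3-1 sigmoid network."""
    hidden = []
    for j in range(N_HID):
        z = w[N_IN * N_HID + j]  # hidden bias
        for i in range(N_IN):
            z += w[j * N_IN + i] * x[i]
        hidden.append(sigmoid(z))
    off = N_IN * N_HID + N_HID
    z = w[off + N_HID]  # output bias
    for j in range(N_HID):
        z += w[off + j] * hidden[j]
    return hidden, sigmoid(z)


def loss(w):
    """Mean squared error over the toy data."""
    return sum((forward(w, x)[1] - y) ** 2 for x, y in DATA) / len(DATA)


def differential_evolution(rng, pop_size=20, generations=60, f=0.5, cr=0.9):
    """DE/rand/1/bin with greedy selection; returns the best weight vector found."""
    pop = [[rng.uniform(-1, 1) for _ in range(DIM)] for _ in range(pop_size)]
    fit = [loss(p) for p in pop]
    for _ in range(generations):
        for i in range(pop_size):
            a, b, c = rng.sample([k for k in range(pop_size) if k != i], 3)
            j_rand = rng.randrange(DIM)  # at least one gene comes from the mutant
            trial = [
                pop[a][d] + f * (pop[b][d] - pop[c][d])
                if (rng.random() < cr or d == j_rand) else pop[i][d]
                for d in range(DIM)
            ]
            trial_fit = loss(trial)
            if trial_fit <= fit[i]:  # keep the trial only if it is no worse
                pop[i], fit[i] = trial, trial_fit
    best = min(range(pop_size), key=lambda k: fit[k])
    return pop[best]


def backprop(w, epochs=2000, lr=0.5):
    """Plain gradient descent on the MSE; returns the best weights seen."""
    w = list(w)
    best_w, best_loss = list(w), loss(w)
    off = N_IN * N_HID + N_HID
    for _ in range(epochs):
        grad = [0.0] * DIM
        for x, y in DATA:
            hidden, out = forward(w, x)
            d_out = 2 * (out - y) * out * (1 - out) / len(DATA)
            for j in range(N_HID):
                grad[off + j] += d_out * hidden[j]
                d_hid = d_out * w[off + j] * hidden[j] * (1 - hidden[j])
                grad[N_IN * N_HID + j] += d_hid
                for i in range(N_IN):
                    grad[j * N_IN + i] += d_hid * x[i]
            grad[off + N_HID] += d_out
        w = [wi - lr * gi for wi, gi in zip(w, grad)]
        current = loss(w)
        if current < best_loss:
            best_w, best_loss = list(w), current
    return best_w


rng = random.Random(0)  # fixed seed so the run is repeatable
w_de = differential_evolution(rng)   # global search for a starting point
w_final = backprop(w_de)             # local refinement from that point
print(f"loss after DE: {loss(w_de):.4f}, after BP: {loss(w_final):.4f}")
```

In this arrangement DE serves only as an initializer, which is one plausible reading of "combines"; other hybrids interleave the two steps or let DE tune hyperparameters instead of weights.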
Subject
Computer Science Applications, Software