Abstract
Abstract
The traditional English part-of-speech analysis model fails to meet people’s actual needs due to the fact that the accuracy and other parameters are not up to standard. Facing large-scale English text data, quickly and accurately obtaining the key information needed and improv-ing the efficiency and accuracy of clustering have always been the focus of attention. However, the inherent characteristics of English text make it impossible to accurately calculate the traditional feature weight calculation method, and it’s part of speech is difficult to recognize. Moreover, in order to obtain a structure closer to the real data, this paper fuses the norm graph and the k-nearest neighbor graph, proposes a new composition framework, and combines it with two common propagation algorithms to complete the classification task. In addition, in order to obtain the improvement effect of the algorithm, the algorithm is tested on the English text classification corpus data set of the natural language processing open platform, and a control experiment is set to analyze the model performance. Finally, this article combines mathematical statistics to process data and draw corresponding charts.
Publisher
Research Square Platform LLC