Affiliation:
1. Universiti Putra Malaysia, Serdang 43400, Selangor, Malaysia
2. Universiti Sains Malaysia, 11800 USM, Penang, Malaysia
Abstract
With the advent of the information age, the massive increase of English text data puts forward higher requirements for text analysis and processing. The aim of this study is to accurately evaluate the semantic complexity of English text through an autoencoder structure based on bidirectional attention. This paper first analyzes the importance of automatic classification of semantic complexity in English text, and then builds an autoencoder structure based on bidirectional attention, which captures bidirectional information in text, and then uses the autoencoder structure for feature extraction and dimension reduction, which further strengthens the model’s ability to capture semantic complexity. Finally, A Bidirectional Attention Self-Encoding English Text Semantic Complexity Automatic Grading Model (BSETG) is established. This study conducted experimental verification based on semantic Evaluation (SemEval) dataset, convolutional neural network (CNN)/Daily Mail dataset and Penn Treebank dataset, and conducted a comparative analysis with existing semantic complexity evaluation methods. The experimental results show that the overall accuracy of BSETG algorithm is maintained between 70% and 90%, the response speed of BSETG algorithm is relatively fast, and the success rate of BSETG algorithm is relatively stable to a large extent.
Publisher
World Scientific Pub Co Pte Ltd