Abstract
Abstract
In this paper, a deep learning RNN model is used to classify Tibetan texts. The core idea is to first preprocess the Tibetan news corpus, and then use Tibetan syllables to construct a Tibetan syllable table based on the lexical and grammatical structure of Tibetan, embed the syllables in the sentence, and represent each syllable as a fixed Numerical vector. Secondly, the RNN cyclic neural network model is constructed. First, the text of different lengths is filled or truncated into a sequence length of uniform length. For each input text, the vector representation of text syllables is input on each time step of RNN to train the RNN model. The test samples were then used to evaluate the accuracy of model classification by introducing recall rate, precision rate and F-test. Finally, compared with traditional machine learning Logistic algorithm, polynomial naive Bayes algorithm and KNN algorithm, the results show that RNN model has better classification effect.
Subject
General Physics and Astronomy
Reference15 articles.
1. Text classification method based on BiLSTM-Attention-CNN hybrid neural network;Wan;J. Computer applications and software,2020
Cited by
4 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献