Affiliation:
1. Minzu University of China
Abstract
Text classification has been an active research topic in recent years. This paper reviews the history of text classification, summarizes some common classification methods, and focuses on semantics-based classification methods. In particular, it elaborates on text classification based on ontology, on similarity computation, and on latent semantic indexing.
Publisher
Trans Tech Publications, Ltd.