Review of short-text classification-Reference-Cited by-同舟云学术

Review of short-text classification

Published:2019-06-17 Issue:2 Volume:15 Page:155-182
ISSN:1744-0084
Container-title:International Journal of Web Information Systems
language:en
Short-container-title:IJWIS

Author:

Alsmadi Issa,Gan Keng Hoon

Abstract

PurposeRapid developments in social networks and their usage in everyday life have caused an explosion in the amount of short electronic documents. Thus, the need to classify this type of document based on their content has a significant implication in many applications. The need to classify these documents in relevant classes according to their text contents should be interested in many practical reasons. Short-text classification is an essential step in many applications, such as spam filtering, sentiment analysis, Twitter personalization, customer review and many other applications related to social networks. Reviews on short text and its application are limited. Thus, this paper aims to discuss the characteristics of short text, its challenges and difficulties in classification. The paper attempt to introduce all stages in principle classification, the technique used in each stage and the possible development trend in each stage.Design/methodology/approachThe paper as a review of the main aspect of short-text classification. The paper is structured based on the classification task stage.FindingsThis paper discusses related issues and approaches to these problems. Further research could be conducted to address the challenges in short texts and avoid poor accuracy in classification. Problems in low performance can be solved by using optimized solutions, such as genetic algorithms that are powerful in enhancing the quality of selected features. Soft computing solution has a fuzzy logic that makes short-text problems a promising area of research.Originality/valueUsing a powerful short-text classification method significantly affects many applications in terms of efficiency enhancement. Current solutions still have low performance, implying the need for improvement. This paper discusses related issues and approaches to these problems.

Publisher

Emerald

Subject

Computer Networks and Communications,Information Systems

Reference89 articles.

1. Text feature selection using ant colony optimization;Expert Systems with Applications,2009

2. A novel framework for termset selection and weighting in binary text classification;Engineering Applications of Artificial Intelligence,2014

3. Sentiment analysis system adaptation for multilingual processing: the case of tweets,2015

4. Bekkerman, R. and Allan, J. (2003), “Using bigrams in text categorization”, Technical Report IR-408, Center of Intelligent Information Retrieval, UMass Amherst, Vol. 1003, pp. 1-10, available at: http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.83.1999&rep=rep1&type=pdf

5. Hybrid dimension reduction by integrating feature selection with feature extraction method for text clustering;Expert Systems with Applications,2015

Cited by 39 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Cost-effective data classification storage through text seasonal features;Future Generation Computer Systems;2024-09

2. Response to critique of the paper: “Sentiment-topic dynamic collaborative analysis-based public opinion mapping in aviation disaster management: A case study of the MU5735 air crash”;International Journal of Disaster Risk Reduction;2024-07

3. An Industrial Short Text Classification Method Based on Large Language Model and Knowledge Base;2024 International Joint Conference on Neural Networks (IJCNN);2024-06-30

4. A Deep Learning Short Text Classification Model Integrating Part of Speech Features;2024 4th International Conference on Neural Networks, Information and Communication (NNICE);2024-01-19

5. Predicting the cause of seizures using features extracted from interactions with a virtual agent;Seizure: European Journal of Epilepsy;2024-01