Author:
Khalaf Zainab A.,Jawad Zainab M.
Abstract
Abstract
A huge amount of textual data is available on the web. These data need to be classified under labels or classes to make the search more efficient and easier. Achieved by using automatic classification is used for this task. Many factors impact on the performance of the classifier system, such as the amount of using dataset, the data dispersion degree, preprocessing tools, feature extraction methods, terms weighting, and data reduction. So, researchers constantly compete to build a robust classifier with good performance. This study focuses on the effect of using different tools in preprocessing and term weighting stages. The experimental results applied on two different languages (Arabic and English languages). Also, the experimental results were compared with the recent related works.
Subject
General Physics and Astronomy
Reference30 articles.
1. Improving Text Classification With Word Embedding;Ge,2018
2. Survey On Supervised Machine Learning Techniques For Automatic Text Classification;Ammar;Springer Artificial Intelligence Review,2019
3. Composite Feature Extraction And Selection For Text Classification;Chuan;IEEE Access,2019
4. Improved inverse gravity moment term weighting for text classification;Turgut;Elsevier, Expert Systems with Applications,2019