Authors:
Hamid Zeyad, Khafaji Hussein K
Abstract
Many data mining techniques and machine learning algorithms have been developed to classify textual data, including decision trees, support vector machines, and K-nearest neighbour, in addition to other machine learning-based algorithms. Association rules-based machine learning is accomplished in two phases, a training phase and a testing phase, and it may be reinforced to enhance classification accuracy according to new minimum support and confidence thresholds. Association rules mining, in its various applications, passes through two computationally intensive steps: frequent itemset mining and association rule extraction. This paper presents a general algorithm for association rules-based machine learning dedicated to text classification. To verify the efficiency of the algorithm, different text datasets were used, such as a tweets dataset for sentiment classification, PDF documents, and HTML documents. The sentiment classification experiments showed that the classifier constructed with minsup threshold =%700 and minconf threshold =50% gives the best performance, with F1 = 0.9861811, while the HTML and PDF experiments achieved a classification accuracy of 94%.
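The two steps named in the abstract, frequent itemset mining followed by rule extraction under minimum support and confidence thresholds, can be illustrated with a minimal sketch. This is not the algorithm proposed in the paper: it enumerates candidate itemsets by brute force rather than with Apriori-style pruning, and the function names (`mine_class_rules`, `classify`), the `max_len` cap, and the toy documents are assumptions made purely for illustration.

```python
from collections import Counter
from itertools import combinations


def mine_class_rules(docs, labels, minsup, minconf, max_len=2):
    """Mine rules of the form itemset -> label from tokenized documents.

    Hypothetical helper: support is the fraction of documents containing
    both the itemset and the label; confidence is that count divided by
    the number of documents containing the itemset.
    """
    n = len(docs)
    itemset_counts = Counter()  # documents containing each itemset
    rule_counts = Counter()     # documents containing itemset AND label

    # Step 1: count every itemset of up to max_len terms (brute force).
    for tokens, label in zip(docs, labels):
        unique = sorted(set(tokens))
        for k in range(1, max_len + 1):
            for combo in combinations(unique, k):
                itemset = frozenset(combo)
                itemset_counts[itemset] += 1
                rule_counts[(itemset, label)] += 1

    # Step 2: keep only rules that meet both thresholds.
    rules = []
    for (itemset, label), count in rule_counts.items():
        support = count / n
        confidence = count / itemset_counts[itemset]
        if support >= minsup and confidence >= minconf:
            rules.append((itemset, label, support, confidence))

    # Rank by confidence, then support, so the strongest rule fires first.
    rules.sort(key=lambda r: (-r[3], -r[2]))
    return rules


def classify(tokens, rules, default=None):
    """Return the label of the first rule whose itemset the document contains."""
    present = set(tokens)
    for itemset, label, _support, _confidence in rules:
        if itemset <= present:
            return label
    return default


# Toy training data (made up for this sketch, not from the paper's datasets).
train_docs = [["good", "movie"], ["good", "fun"], ["bad", "movie"], ["bad", "boring"]]
train_labels = ["pos", "pos", "neg", "neg"]

rules = mine_class_rules(train_docs, train_labels, minsup=0.5, minconf=0.5)
prediction = classify(["good", "film"], rules)
```

Raising `minsup` or `minconf` shrinks the rule set, which is the trade-off the reported thresholds tune. A practical implementation would prune infrequent itemsets level by level instead of enumerating every combination.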
Subject
General Physics and Astronomy
Cited by
3 articles.
1. Natural Language Processing System for Text Classification Corpus Based on Machine Learning;ACM Transactions on Asian and Low-Resource Language Information Processing;2024-08-08
2. Application of Association Rule Algorithm in Distributed New SQL Database Design;2023 IEEE International Conference on Integrated Circuits and Communication Systems (ICICACS);2023-02-24
3. Education Platform System on Account of Association Rule Algorithm;Proceedings of the 6th International Conference on Digital Technology in Education;2022-09-16