Abstract
Abstract
The purpose of this study is to build automated summation tools, especially in grouping methods such as K-Means and K-Medoids. Finding the best method between the two algorithms, this study focuses on comparing the two methods to summarize thesis report documents. This system is divided into Filtering, Tokenization, TF-IDF, Cosine Similarity, and Clustering. Based on 50 test documents, the average accuracy rate is 51.16% for K-Means and 63.35% for K-Medoids. K-Means has a smaller accuracy value than K-Medoids. The accuracy of the resulting K-Means also depends on the size and center of the initial cluster chosen. So, as the next stage of development, research needs to be done that compares the results of the combination of initial size and center cluster values for K-Means and continue with several other classifications.
Reference13 articles.
1. Multi-Document Text Summarization Using Clustering Techniques and Lexical Chaining;Saraswathi;ICTACT Journal on Soft Computing,2010
2. Evaluation of automatic text summarizations based on human summaries;Kiyoumarsi;Procedia-Social and Behavioral Sciences,2015
3. An approach to automatic text summarization using simplified lesk algorithm and wordnet;Pal;International Journal of Control Theory and Computer Modeling (IJCTCM),2013
4. Text Summarization Extraction System (TSES) Using Extracted Keywords;Al-Hashemi;Int. Arab J. e-Technol.,2010
Cited by
4 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献