Author:
Ramadhani Siti, Azzahra Dini, Z Tomi
Abstract
A thesis is a scientific work based on field research or observation, compiled by a student on a topic within their study program and submitted as a final project in the last stage of formal study. When the number of theses is large, searching them by topic category is difficult, because the titles differ in how relevant they are to one another. Grouping student theses by closely related topics makes it easier to find the topics relevant to each group. One application of machine learning is text processing (text mining), and one method used in text mining is clustering, which groups objects into a number of clusters. Several algorithms are available for clustering. This study compares two of them, K-Means and K-Medoids, to determine which gives the better evaluation result for thesis grouping, so that the relevant topics in the resulting groups are grouped more accurately. The clusters are evaluated with the Davies-Bouldin Index (DBI), a cluster validation technique in which a lower value indicates better-separated clusters. Computation time is a second indicator for comparison. In the DBI test of the two algorithms, K-Medoids is superior to K-Means: the average DBI is 1.56 for K-Medoids and 2.79 for K-Means. In the computation-time test for grouping 50 documents, K-Means is superior to K-Medoids: its average computation time is 1 second, against 26.7778 seconds for K-Medoids.
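As a rough illustration of the evaluation metric named in the abstract, the sketch below computes the Davies-Bouldin Index in plain Python using the standard centroid-based definition: for each cluster, take the worst ratio of summed within-cluster scatter to between-centroid distance, then average over clusters. The function names (`centroid`, `davies_bouldin`) and the four toy points are assumptions made for this example. They are not the authors' code or data, and the study's own preprocessing and distance measure may differ.

```python
import math


def centroid(points):
    """Coordinate-wise mean of a non-empty list of equal-length points."""
    n = len(points)
    return [sum(p[d] for p in points) / n for d in range(len(points[0]))]


def davies_bouldin(points, labels):
    """Davies-Bouldin Index with Euclidean distance (lower is better).

    Assumes at least two clusters and that no two centroids coincide
    (coinciding centroids would cause a division by zero).
    """
    clusters = {}
    for p, lab in zip(points, labels):
        clusters.setdefault(lab, []).append(p)
    if len(clusters) < 2:
        raise ValueError("DBI needs at least two clusters")

    centers = {lab: centroid(pts) for lab, pts in clusters.items()}
    # Scatter: mean distance from each member to its cluster centroid.
    scatter = {
        lab: sum(math.dist(p, centers[lab]) for p in pts) / len(pts)
        for lab, pts in clusters.items()
    }

    total = 0.0
    for i in clusters:
        # Worst-case similarity of cluster i to any other cluster j.
        total += max(
            (scatter[i] + scatter[j]) / math.dist(centers[i], centers[j])
            for j in clusters
            if j != i
        )
    return total / len(clusters)


# Hypothetical toy data: two tight groups far apart on the x-axis.
points = [(0, 0), (0, 2), (10, 0), (10, 2)]
good_labels = [0, 0, 1, 1]  # groups nearby points together
bad_labels = [0, 1, 0, 1]   # mixes the two groups

print(davies_bouldin(points, good_labels))
print(davies_bouldin(points, bad_labels))
```

On this toy data the labeling that keeps nearby points together scores far lower than the labeling that mixes them, which is the sense in which the smaller average DBI reported for K-Medoids counts as the better result.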
Publisher
Universitas Lancang Kuning
Cited by
2 articles.