Affiliation:
1. Department of Computer Science and Electronic, Universitas Gadjah Mada
2. Master of Computer Science, Universitas Gadjah Mada
Abstract
One algorithm to classify textual data in automatic organizing of documents application is KNN, by changing word representations into vectors. The distance calculation in the KNN algorithm becomes essential in measuring the closeness between data elements. This study compares four distance calculations commonly used in KNN, namely Euclidean, Chebyshev, Manhattan, and Minkowski. The dataset used data from Youtube Eminem’s comments which contain 448 data. This study showed that Euclidian or Minkowski on the KNN algorithm achieved the best result compared to Chebycev and Manhattan. The best results on KNN are obtained when the K value is 3.
Funder
Universitas Gadjah Mada, Indonesia
Publisher
Institute of Research and Community Services Diponegoro University (LPPM UNDIP)
Subject
General Earth and Planetary Sciences,General Environmental Science
Cited by
6 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献