Affiliation:
1. AFYON KOCATEPE ÜNİVERSİTESİ
Abstract
The aim of this study is to compare hierarchical clustering methods by Cophenetic Correlation Coefficient (CCC) when there is a big data. For this purpose, after giving information about big data, clustering methods and CCC, analyzes are carried out for the related data set. The 2015 air travel consumer report, which was used in the application part of the study and published by the US Ministry of Transport, was used as big data. Libraries of the Python programming language installed on the Amazon cloud server, which includes open-source big data technologies, were used for data analysis. Since there is big data in the study, in order to save time and economy, the variables used in the study were first reduced by feature selection method, standardized and analyzed over the final 4 different data sets. As a result of the clustering analysis, it was observed that the highest CCC was obtained with the Average clustering method for all of these four different data sets.
Publisher
Afyon Kocatepe Universitesi Fen Ve Muhendislik Bilimleri Dergisi
Reference27 articles.
1. Backer, E., 1995. Computer-Assisted Reasoning in Cluster Analysis. Prentice Hall, Hertfordshire, 214.
2. Budak, H., 2018. Özellik Seçim Yöntemleri ve Yeni Bir Yaklaşım: Süleyman Demirel Üniversitesi Fen Bilimleri Enstitüsü Dergisi, 22, 21-31.
3. Carvalho, P.R., Munita, C.S. and Lapolli, A.L., 2019. Validity Studies Among Hierarchical Methods of Cluster Analysis Using Cophenetic Correlation Coefficient. Brazilian Journal of Radiation Sciences, 7, 1-14.
4. Çelik, S., 2017. Büyük Veri Teknolojilerinin İşletmeler İçin Önemi. Social Sciences Studies Journal, 3(6), 873-883.
5. Demirtaş, B. and Argan, M., 2018. Büyük Veri ve Pazarlamadaki Dönüşüm: Kuramsal Bir Yaklaşım. Pazarlama ve Pazarlama Araştırmaları Dergisi, 8(15), 1-21.