Modified Euclidean-Canberra blend distance metric for kNN classifier

Author:

Sandhu Gaurav1,Singh Amandeep1,Lamba Puneet Singh2,Virmani Deepali2,Chaudhary Gopal2

Affiliation:

1. Guru Tegh Bahadur Institute of Technology, GGSIPU, New Delhi, India

2. VIPS-TC, School of Engineering and Technology, Pitampura, New Delhi, India

Abstract

In today’s world different data sets are available on which regression or classification algorithms of machine learning are applied. One of the classification algorithms is k-nearest neighbor (kNN) which computes distance amongst various rows in a dataset. The performance of kNN is evaluated based on K-value and distance metric used, where K is the total count of neighboring elements. Many different distance metrics have been used by researchers in literature, one of them is Canberra distance metric. In this paper the performance of kNN based on Canberra distance metric is measured on different datasets, further the proposed Canberra distance metric, namely, Modified Euclidean-Canberra Blend Distance (MECBD) metric has been applied to the kNN algorithm which led to improvement of class prediction efficiency on the same datasets measured in terms of accuracy, precision, recall, F1-score for different values of k. Further, this study depicts that MECBD metric use led to improvement in accuracy value 80.4% to 90.3%, 80.6% to 85.4% and 70.0% to 77.0% for various data sets used. Also, implementation of ROC curves and auc for k= 5 is done to show the improvement is kNN model prediction which showed increase in auc values for different data sets, for instance increase in auc values from 0.873 to 0.958 for Spine (2 Classes) dataset, 0.857 to 0.940, 0.983 to 0.983 (no change), 0.910 to 0.957 for DH, SL and NO class for Spine (3 Classes) data set and 0.651 to 0.742 for Haberman’s data set.

Publisher

IOS Press

Subject

Artificial Intelligence,Computer Vision and Pattern Recognition,Human-Computer Interaction,Software

Reference32 articles.

1. Analysis of distance measures using k-nearest neighbor algorithm on kdd dataset;Mulak;Int J Sci Res,2015

2. Algorithm Modified K-Nearest Neighbor (M-KNN) for Classification of Attention Deficit Hyperactive Disorder (ADHD) in Children;Sagala;Login: Jurnal Teknologi Komputer,2019

3. Surya VB, Haneen P, Ahmad AA, Omar BA, Ahmad L. Effects of Distance Measure Choice on KNN Classifier Performance-A Review. Mary Ann Liebert. 2019.

4. Comparative analysis of k-nearest neighbor and modified k-nearest neighbor algorithm for data classification;Gazalba;2017 2nd International conferences on Information Technology, Information Systems and Electrical Engineering (ICITISEE),2017

5. Analysis of braycurtis, canberra and euclidean distance in knn algorithm;Pulungan;Sinkron: jurnal dan penelitian teknik informatika,2019

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3