Optimization of Classifiers Performance for Node Embedding on Graph Based Data

Author:

Yadav Neha1,Gopinathan Dhanalekshmi1

Affiliation:

1. Jaypee Institute of Information Technology

Abstract

Abstract

The Publications regarding the research of embedding the large-scale information that helps in getting networks utilizing neighborhood-aware node representations and low-dimensional communities cover a wide area of research. In graph mining applications, these classification models, and embedding performed better than the conventional approaches. When using different conventional machine learning and data analysis approaches, the display of graphs and their relationship is highly useful in describing features present. Many different embedding approaches are used in machine learning, and a literature review was conducted to determine the best techniques for comparison. This study examines the accuracy scores of different classifiers using the approach on a single dataset. The dataset which is used in this study is CORA, and it is used to import it. After the network has been formed using the dataset, the nodes are embedded since the result of this node embedding will be used as a training set. The machine learns through training of model, for which the Node2vex method is applied in this work. The classifiers are used to train the model. Gradient Boosting, Logistic Regression, Random Forest, K-Neighbors, Decision Tree, Gaussian, and SVC are the classifiers utilized to solve this model's classification problem. To assess performance, the model makes use of two classifiers: Gradient Boosting, Logistic Regression, Random Forest, K-Neighbors, Decision Tree, Gaussian, and SVC. Through experimentation, the accuracy score is used to compare the classifier’s levels of efficiency. From the study, it was clearly observed that for the dataset, it was only the Support Vector Classifier that performed best in the testing and training of dataset for getting desired result. This was achieved by achieving an accuracy of 0.7706 and an MCC score of 0.7200. The optimum classifier for model training tasks and node classification can be chosen with the aid of this paper.

Publisher

Research Square Platform LLC

Reference27 articles.

1. Ahmed AA, Ayub A, Aljabouh PK, Donepudi, and Myung Suh Choi (2021). Detecting fake news using machine learning: A systematic literature review. arXiv preprint arXiv:2102.04458

2. Talukdar S, Singha P, Mahato S, Pal S, Liou YA, Rahman A (2020) Land-use land-cover classification by machine learning classifiers for satellite observations—A review. Remote Sensing, 12(7), p.1135

3. Zhang Z, Cao L, Chen X, Tang W, Xu Z, Meng Y (2020) Representation Learn Knowl graphs entity attributes IEEE Access 8:7435–7441

4. Combining domain knowledge extraction with graph long short-term memory for learning classification of chinese legal documents;Li G;IEEE Access,2019

5. Bloem P, Wilcke X, van Berkel L, Victor de Boer (2021) kgbench: A collection of knowledge graph datasets for evaluating relational and multimodal machine learning. In European Semantic Web Conference, pp. 614–630. Cham: Springer International Publishing

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3