A Novel Ensemble Stacking Classification of Genetic Variations Using Machine Learning Algorithms-Reference-Cited by-同舟云学术

A Novel Ensemble Stacking Classification of Genetic Variations Using Machine Learning Algorithms

Published:2021-12-31 Issue: Volume: Page:
ISSN:0219-4678
Container-title:International Journal of Image and Graphics
language:en
Short-container-title:Int. J. Image Grap.

Author:

Yeturu Jahnavi¹,Elango Poongothai²,Raja S. P.³,Nagendra Kumar P.⁴

Affiliation:

1. Department of Computer Science, Dr. V. S. Krishna Government Degree College (Autonomous), Visakhapatnam, Andhra Pradesh, India

2. Department of Computational Intelligence, SRM Institute of Science and Technology, Kattankulathur, Chennai, Tamil Nadu, India

3. School of Computer Science and Engineering, Vellore Institute of Technology, Vellore 632014, Tamil Nadu, India

4. Department of Computer Science and Engineering, Geethanjali Institute of Science and Technology, Nellore, Andhra Pradesh, India

Abstract

Genetics is the clinical review of congenital mutation, where the principal advantage of analyzing genetic mutation of humans is the exploration, analysis, interpretation and description of the genetic transmitted and inherited effect of several diseases such as cancer, diabetes and heart diseases. Cancer is the most troublesome and disordered affliction as the proportion of cancer sufferers is growing massively. Identification and discrimination of the mutations that impart to the enlargement of tumor from the unbiased mutations is difficult, as majority tumors of cancer are able to exercise genetic mutations. The genetic mutations are systematized and categorized to sort the cancer by way of medical observations and considering clinical studies. At the present time, genetic mutations are being annotated and these interpretations are being accomplished either manually or using the existing primary algorithms. Evaluation and classification of each and every individual genetic mutation was basically predicated on evidence from documented content built on medical literature. Consequently, as a means to build genetic mutations, basically, depending on the clinical evidences persists a challenging task. There exist various algorithms such as one hot encoding technique is used to derive features from genes and their variations, TF-IDF is used to extract features from the clinical text data. In order to increase the accuracy of the classification, machine learning algorithms such as support vector machine, logistic regression, Naive Bayes, etc., are experimented. A stacking model classifier has been developed to increase the accuracy. The proposed stacking model classifier has obtained the log loss 0.8436 and 0.8572 for cross-validation data set and test data set, respectively. By the experimentation, it has been proved that the proposed stacking model classifier outperforms the existing algorithms in terms of log loss. Basically, minimum log loss refers to the efficient model. Here the log loss has been reduced to less than 1 by using the proposed stacking model classifier. The performance of these algorithms can be gauged on the basis of the various measures like multi-class log loss.

Publisher

World Scientific Pub Co Pte Ltd

Subject

Computer Graphics and Computer-Aided Design,Computer Science Applications,Computer Vision and Pattern Recognition

Link

https://www.worldscientific.com/doi/pdf/10.1142/S0219467823500158

Cited by 7 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Analysis of Pose Estimation Based GLOGT Feature Extraction for Person Re-Identification in Surveillance Area Network;Wireless Personal Communications;2024-08-11

2. Argo data anomaly detection algorithm based on selective ensemble of fuzzy clustering;Journal of Physics: Conference Series;2024-08-01

3. Comparative analysis of supervised learning algorithms for prediction of cardiovascular diseases;Technology and Health Care;2024-05-31

4. Performance Analysis of Various Machine Learning Classifiers on Diverse Datasets;Proceedings of Congress on Control, Robotics, and Mechatronics;2023-11-10

5. Prediction and Evaluation of Cancer Using Machine Learning Techniques;Artificial Intelligence and Sustainable Computing;2023