ANFIS ve SBERT Yaklaşımlarının Hibrit Kullanımı ile DNA Dizilimleri Üzerinde Ekson ve İntron Bölgelerinin Sınıflandırılması-Reference-Cited by-同舟云学术

ANFIS ve SBERT Yaklaşımlarının Hibrit Kullanımı ile DNA Dizilimleri Üzerinde Ekson ve İntron Bölgelerinin Sınıflandırılması

Published:2023-03-12 Issue: Volume: Page:
ISSN:1302-0900
Container-title:Journal of Polytechnic
language:tr
Short-container-title:

Author:

AKALIN Fatma¹,YUMUŞAK Nejat¹^ORCID

Affiliation:

1. SAKARYA ÜNİVERSİTESİ

Abstract

DNA is the part of the genome that contains enormous amounts of information related to life. Amino acids are formed by coding three nucleotides in this genome part, and the encoded amino acids are called codes in DNA. The frequency of the triple nucleotide in the DNA sequence allows for the evaluation of protein-coding (exon) and non-protein-coding (intron) regions. Distinguishing these regions enables the analysis of vital functions related to life. This study provides the classification of exon and intron regions for BCR-ABL and MEFV genes obtained from NCBI and Ensemble datasets, respectively. Then, existing DNA sequences are clustered using pretrained models in the scope of the SBERT approach. In the clustering process, K-Means and Agglomerative Clustering approaches are used consecutively. The frequency of repetition of codes is calculated with a representative sample selected from each cluster. The matrix is created using the frequencies of 64 different codons that constitute genetic code. This matrix is given as input to the ANFIS structure. The %88.88 accuracy rate is obtained with the ANFIS approach to classify exon and intron DNA sequences. As a result of this study, a successful result was produced independently of DNA length.

Publisher

Politeknik Dergisi

Subject

Colloid and Surface Chemistry,Physical and Theoretical Chemistry

Reference43 articles.

1. [1] Raza K., ‘Fuzzy logic based approaches for gene regulatory network inference’, Artificial Intelligence in Medicine, 97: 189–203, (2019).

2. [2] Zheng P., Wang S., Wang X., and Zeng X., ‘Editorial: Artificial Intelligence in Bioinformatics and Drug Repurposing: Methods and Applications’, Frontiers in Genetics, 13: 1–4, (2022).

3. [3] Singh N., Nath R., and Singh D.B., ‘Splice-site identification for exon prediction using bidirectional LSTM-RNN approach’, Biochemistry and Biophysics Reports, 30, (2022).

4. [4] Kar S. and Ganguly M., ‘Study of effectiveness of FIR and IIR filters in Exon identification: A comparative approach’, Materials Today: Proceedings, 58: 437–444, (2022).

5. [5] Barman S., Saha S., Mandal A., and Roy M., ‘Prediction of protein coding regions of a DNA sequence through spectral analysis’, 2012 International Conference on Informatics, Electronics and Vision, ICIEV 2012, 12–16, (2012).

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. C.elegans coding and non-coding zones classification using multifractal features and support vector machine;2024 IEEE 7th International Conference on Advanced Technologies, Signal and Image Processing (ATSIP);2024-07-11

2. Splice site recognition - deciphering Exon-Intron transitions for genetic insights using Enhanced integrated Block-Level gated LSTM model;Gene;2024-07