Author:
Kamal Nor Ashikin Mohamad,Bakar Azuraliza Abu,Zainudin Suhaila
Abstract
Features play an important role in representing classes in the hierarchy structure, and using unsuitable features will affect classification performance. The discrete wavelet transform (DWT) approach provides the ability to create the appropriate features to represent data. DWT can produce global and local features using different wavelet families and decomposition levels. These two parameters are essential to obtain a suitable representation for classes in the hierarchy structure. This study proposes using a particle swarm optimisation (PSO) algorithm to select the suitable wavelet family and decomposition level for G-protein coupled receptor (GPCR) hierarchical class representation. The results indicate that the PSO algorithm mostly selects Biorthogonal wavelets and decomposition level 2 to represent GPCR protein. Concerning the performance, the proposed method achieved an accuracy of 97.9%, 85.9%, and 77.5% at the family, subfamily, and sub-subfamily levels, respectively.
Publisher
Academy and Industry Research Collaboration Center (AIRCC)
Subject
General Earth and Planetary Sciences,General Environmental Science
Reference55 articles.
1. [1] K. Alhosaini, A. Azhar, A. Alonazi, and F. Al-Zoghaibi, "GPCRs: The most promiscuous druggable receptor of the mankind," Saudi Pharm. J., no. May, 2021, doi: 10.1016/j.jsps.2021.04.015.
2. [2] M. Li, C. Ling, and J. Gao, "An Efficient CNN-based Classification on G-protein Coupled Receptors Using TF-IDF and N-gram," 2017 IEEE Symp. Comput. Commun., pp. 924-931, 2017.
3. [3] M. Davies, A. Secker, and A. Freitas, "Optimising amino acid groupings for GPCR classification, "Bioinformatics, vol. 24, no. 18, pp. 1980-1986, 2008, doi: 10.1093/bioinformatics/btn382.
4. [4] R. Karchin, K. Karplus, and D. Haussler, "Classifying G-protein coupled receptors with support vector machines,"Bioinformatics, vol. 18, no. 1, pp. 147-159, 2002, doi: 10.1093/bioinformatics/18.1.147.
5. [5] S. Saini and L. Dewan, "Comparison of Numerical Representations of Genomic Sequences: Choosing the Best Mapping for Wavelet Analysis, "Int. J. Appl. Comput. Math., vol.3, no.4, pp. 2943-2958, 2017, doi: 10.1007/s40819-016-0277-1.