Affiliation:
1. Department of Computer Science, Colorado State University, Fort Collins, CO, 80523, USA
Abstract
Protein function prediction is an active area of research in bioinformatics. Yet, the transfer of annotation on the basis of sequence or structural similarity remains widely used as an annotation method. Most of today's machine learning approaches reduce the problem to a collection of binary classification problems: whether a protein performs a particular function, sometimes with a post-processing step to combine the binary outputs. We propose a method that directly predicts a full functional annotation of a protein by modeling the structure of the Gene Ontology hierarchy in the framework of kernel methods for structured-output spaces. Our empirical results show improved performance over a BLAST nearest-neighbor method, and over algorithms that employ a collection of binary classifiers as measured on the Mousefunc benchmark dataset.
Publisher
World Scientific Pub Co Pte Lt
Subject
Computer Science Applications,Molecular Biology,Biochemistry
Cited by
59 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. ProtEx: A Retrieval-Augmented Approach for Protein Function Prediction;2024-06-02
2. Online Hierarchical Multi–label Classification;2023 IEEE International Conference on Big Data (BigData);2023-12-15
3. Hybrid Loss for Hierarchical Multi–label Classification Network;2023 IEEE International Conference on Big Data (BigData);2023-12-15
4. BERT-based classification of fungi protein sequences with multiple GO labels;Proceedings of the International Conference on Research in Adaptive and Convergent Systems;2023-08-06
5. Hierarchical MultiClass AdaBoost;2021 IEEE International Conference on Big Data (Big Data);2021-12-15