Isoform function prediction based on bi-random walks on a heterogeneous network

Author:

Yu Guoxian1ORCID,Wang Keyao1,Domeniconi Carlotta2,Guo Maozu34,Wang Jun1ORCID

Affiliation:

1. College of Computer and Information Science, Southwest University, Chongqing, China

2. Department of Computer Science, George Mason University, Fairfax, VA 22030, USA

3. School of Electrical and Information Engineering, Beijing University of Civil Engineering and Architecture, Beijing, China

4. Beijing Key Laboratory of Intelligent Processing for Building Big Data, Beijing, China

Abstract

Abstract Motivation Alternative splicing contributes to the functional diversity of protein species and the proteoforms translated from alternatively spliced isoforms of a gene actually execute the biological functions. Computationally predicting the functions of genes has been studied for decades. However, how to distinguish the functional annotations of isoforms, whose annotations are essential for understanding developmental abnormalities and cancers, is rarely explored. The main bottleneck is that functional annotations of isoforms are generally unavailable and functional genomic databases universally store the functional annotations at the gene level. Results We propose IsoFun to accomplish Isoform Function prediction based on bi-random walks on a heterogeneous network. IsoFun firstly constructs an isoform functional association network based on the expression profiles of isoforms derived from multiple RNA-seq datasets. Next, IsoFun uses the available Gene Ontology annotations of genes, gene–gene interactions and the relations between genes and isoforms to construct a heterogeneous network. After this, IsoFun performs a tailored bi-random walk on the heterogeneous network to predict the association between GO terms and isoforms, thus accomplishing the prediction of GO annotations of isoforms. Experimental results show that IsoFun significantly outperforms the state-of-the-art algorithms and improves the area under the receiver-operating curve (AUROC) and the area under the precision-recall curve (AUPRC) by 17% and 44% at the gene-level, respectively. We further validated the performance of IsoFun on the genes ADAM15 and BCL2L1. IsoFun accurately differentiates the functions of respective isoforms of these two genes. Availability and implementation The code of IsoFun is available at http://mlda.swu.edu.cn/codes.php? name=IsoFun. Supplementary information Supplementary data are available at Bioinformatics online.

Funder

Natural Science Foundation of China

Fundamental Research Funds for the Central Universities

National Key Research and Development Plan Task of China

Natural Science Foundation of CQ CSTC

Publisher

Oxford University Press (OUP)

Subject

Computational Mathematics,Computational Theory and Mathematics,Computer Science Applications,Molecular Biology,Biochemistry,Statistics and Probability

Reference35 articles.

1. Support vector machines for multiple-instance learning;Andrews;Adv. Neural Inf. Process. Syst,2003

2. bcl-x, a bcl-2-related gene that functions as a dominant regulator of apoptotic cell death;Boise;Cell,1993

3. The BioGRID interaction database: 2017 update;Chatr-Aryamontri;Nucleic Acids Res,2017

4. The functional impact of alternative splicing in cancer;Climente-Gonzalez;Cell Rep,2017

5. Random walk models in biology;Codling;J. R. Soc. Interface,2008

Cited by 24 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3