TACOS: a novel approach for accurate prediction of cell-specific long noncoding RNAs subcellular localization

Author:

Jeon Young-Jun12ORCID,Hasan Md Mehedi34ORCID,Park Hyun Woo12,Lee Ki Wook12,Manavalan Balachandran56ORCID

Affiliation:

1. Department of Integrative Biotechnology , College of Bioengineering and Biotechnology, , Suwon 16419, Korea

2. Sungkyunkwan University , College of Bioengineering and Biotechnology, , Suwon 16419, Korea

3. Tulane Center for Biomedical Informatics and Genomics , Division of Biomedical Informatics and Genomics, John W. Deming Department of Medicine, School of Medicine, , New Orleans, LA 70112, USA

4. Tulane University , Division of Biomedical Informatics and Genomics, John W. Deming Department of Medicine, School of Medicine, , New Orleans, LA 70112, USA

5. Computational Biology and Bioinformatics laboratory , Department of Integrative Biotechnology, College of Bioengineering and Biotechnology, , Suwon 16419, Korea

6. Sungkyunkwan University , Department of Integrative Biotechnology, College of Bioengineering and Biotechnology, , Suwon 16419, Korea

Abstract

AbstractLong noncoding RNAs (lncRNAs) are primarily regulated by their cellular localization, which is responsible for their molecular functions, including cell cycle regulation and genome rearrangements. Accurately identifying the subcellular location of lncRNAs from sequence information is crucial for a better understanding of their biological functions and mechanisms. In contrast to traditional experimental methods, bioinformatics or computational methods can be applied for the annotation of lncRNA subcellular locations in humans more effectively. In the past, several machine learning-based methods have been developed to identify lncRNA subcellular localization, but relevant work for identifying cell-specific localization of human lncRNA remains limited. In this study, we present the first application of the tree-based stacking approach, TACOS, which allows users to identify the subcellular localization of human lncRNA in 10 different cell types. Specifically, we conducted comprehensive evaluations of six tree-based classifiers with 10 different feature descriptors, using a newly constructed balanced training dataset for each cell type. Subsequently, the strengths of the AdaBoost baseline models were integrated via a stacking approach, with an appropriate tree-based classifier for the final prediction. TACOS displayed consistent performance in both the cross-validation and independent assessments compared with the other two approaches employed in this study. The user-friendly online TACOS web server can be accessed at https://balalab-skku.org/TACOS.

Funder

National Research Foundation of Korea

Publisher

Oxford University Press (OUP)

Subject

Molecular Biology,Information Systems

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3