CoRegNet: unraveling gene co-regulation networks from public RNA-Seq repositories using a beta-binomial statistical model

Author:

Wang Jiasheng123,Wan Ying-Wooi145,Al-Ouran Rami6,Huang Meichen17,Liu Zhandong123

Affiliation:

1. Jan and Dan Duncan Neurological Research Institute at Texas Children’s Hospital , Houston, TX 77030 , USA

2. Graduate Program in Quantitative and Computational Biosciences, Baylor College of Medicine , Houston, TX, 77030 , USA

3. Department of Pediatrics, Baylor College of Medicine , Houston, TX 77030 , USA

4. Department of Molecular and Human Genetics , Baylor College of Medicine, , Houston, TX 77030 , USA

5. Howard Hughes Medical Institute , Baylor College of Medicine, , Houston, TX 77030 , USA

6. Al Hussein Technical University , Amman , Jordan

7. Department of Neurology, Baylor College of Medicine , Houston, TX 77030 , USA

Abstract

Abstract Millions of RNA sequencing samples have been deposited into public databases, providing a rich resource for biological research. These datasets encompass tens of thousands of experiments and offer comprehensive insights into human cellular regulation. However, a major challenge is how to integrate these experiments that acquired at different conditions. We propose a new statistical tool based on beta-binomial distributions that can construct robust gene co-regulation network (CoRegNet) across tens of thousands of experiments. Our analysis of over 12 000 experiments involving human tissues and cells shows that CoRegNet significantly outperforms existing gene co-expression-based methods. Although the majority of the genes are linearly co-regulated, we did discover an interesting set of genes that are non-linearly co-regulated; half of the time they change in the same direction and the other half they change in the opposite direction. Additionally, we identified a set of gene pairs that follows the Simpson’s paradox. By utilizing public domain data, CoRegNet offers a powerful approach for identifying functionally related gene pairs, thereby revealing new biological insights.

Funder

Eunice Kennedy Shriver National Institute of Child Health and Human Development

National Institutes of Health

Bioinformatics Core facilities

Publisher

Oxford University Press (OUP)

Subject

Molecular Biology,Information Systems

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3