Biomarker Prioritisation and Power Estimation Using Ensemble Gene Regulatory Network Inference-Reference-Cited by-同舟云学术

Biomarker Prioritisation and Power Estimation Using Ensemble Gene Regulatory Network Inference

Published:2020-10-23 Issue:21 Volume:21 Page:7886
ISSN:1422-0067
Container-title:International Journal of Molecular Sciences
language:en
Short-container-title:IJMS

Author:

Aziz Furqan,Acharjee Animesh^ORCID,Williams John A.^ORCID,Russ Dominic,Bravo-Merodio Laura,Gkoutos Georgios V.

Abstract

Inferring the topology of a gene regulatory network (GRN) from gene expression data is a challenging but important undertaking for gaining a better understanding of gene regulation. Key challenges include working with noisy data and dealing with a higher number of genes than samples. Although a number of different methods have been proposed to infer the structure of a GRN, there are large discrepancies among the different inference algorithms they adopt, rendering their meaningful comparison challenging. In this study, we used two methods, namely the MIDER (Mutual Information Distance and Entropy Reduction) and the PLSNET (Partial least square based feature selection) methods, to infer the structure of a GRN directly from data and computationally validated our results. Both methods were applied to different gene expression datasets resulting from inflammatory bowel disease (IBD), pancreatic ductal adenocarcinoma (PDAC), and acute myeloid leukaemia (AML) studies. For each case, gene regulators were successfully identified. For example, for the case of the IBD dataset, the UGT1A family genes were identified as key regulators while upon analysing the PDAC dataset, the SULF1 and THBS2 genes were depicted. We further demonstrate that an ensemble-based approach, that combines the output of the MIDER and PLSNET algorithms, can infer the structure of a GRN from data with higher accuracy. We have also estimated the number of the samples required for potential future validation studies. Here, we presented our proposed analysis framework that caters not only to candidate regulator genes prediction for potential validation experiments but also an estimation of the number of samples required for these experiments.

Publisher

MDPI AG

Subject

Inorganic Chemistry,Organic Chemistry,Physical and Theoretical Chemistry,Computer Science Applications,Spectroscopy,Molecular Biology,General Medicine,Catalysis

Link

https://www.mdpi.com/1422-0067/21/21/7886/pdf

Reference47 articles.

1. Advantages and limitations of current network inference methods

2. MIDER: Network Inference with Mutual Information Distance and Entropy Reduction

3. Large-Scale Mapping and Validation of Escherichia coli Transcriptional Regulation from a Compendium of Expression Profiles

4. Inferring Regulatory Networks from Expression Data Using Tree-Based Methods

5. Cluster analysis and display of genome-wide expression patterns

Cited by 4 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A Consensus Gene Regulatory Network for Neurodegenerative Diseases Using Single-Cell RNA-Seq Data;Advances in Experimental Medicine and Biology;2023

2. Machine Learning-Based Identification of Potentially Novel Non-Alcoholic Fatty Liver Disease Biomarkers;Biomedicines;2021-11-07

3. Graph characterisation using graphlet-based entropies;Pattern Recognition Letters;2021-07

4. A Causal Web between Chronotype and Metabolic Health Traits;Genes;2021-07-01