Global transcriptional regulatory network forEscherichia colirobustly connects gene expression to transcription factor activities-Reference-Cited by-同舟云学术

Global transcriptional regulatory network forEscherichia colirobustly connects gene expression to transcription factor activities

Published:2017-09-05 Issue:38 Volume:114 Page:10286-10291
ISSN:0027-8424
Container-title:Proceedings of the National Academy of Sciences
language:en
Short-container-title:Proc Natl Acad Sci USA

Author:

Fang Xin,Sastry Anand,Mih Nathan,Kim Donghyuk,Tan Justin,Yurkovich James T.^ORCID,Lloyd Colton J.,Gao Ye,Yang Laurence,Palsson Bernhard O.

Abstract

Transcriptional regulatory networks (TRNs) have been studied intensely for >25 y. Yet, even for theEscherichia coliTRN—probably the best characterized TRN—several questions remain. Here, we address three questions: (i) How complete is our knowledge of theE. coliTRN; (ii) how well can we predict gene expression using this TRN; and (iii) how robust is our understanding of the TRN? First, we reconstructed a high-confidence TRN (hiTRN) consisting of 147 transcription factors (TFs) regulating 1,538 transcription units (TUs) encoding 1,764 genes. The 3,797 high-confidence regulatory interactions were collected from published, validated chromatin immunoprecipitation (ChIP) data and RegulonDB. For 21 different TF knockouts, up to 63% of the differentially expressed genes in the hiTRN were traced to the knocked-out TF through regulatory cascades. Second, we trained supervised machine learning algorithms to predict the expression of 1,364 TUs given TF activities using 441 samples. The algorithms accurately predicted condition-specific expression for 86% (1,174 of 1,364) of the TUs, while 193 TUs (14%) were predicted better than random TRNs. Third, we identified 10 regulatory modules whose definitions were robust against changes to the TRN or expression compendium. Using surrogate variable analysis, we also identified three unmodeled factors that systematically influenced gene expression. Our computational workflow comprehensively characterizes the predictive capabilities and systems-level functions of an organism’s TRN from disparate data types.

Funder

Novo Nordisk Foundation

HHS | NIH | National Institute of General Medical Sciences

U.S. Department of Energy

National Science Foundation

Publisher

Proceedings of the National Academy of Sciences

Subject

Multidisciplinary

Reference53 articles.

1. Functional organisation of Escherichia coli transcriptional regulatory network

2. Integrating high-throughput and computational data elucidates bacterial networks