Abstract
ABSTRACTGenome-wide phenotypic screens in the budding yeastSaccharomyces cerevisiaehave produced the largest, richest and most systematic phenotypic description of any organism. Such an achievement was enabled by the development of highly scalable phenotypic assays and construction of the yeast knock-out (YKO) collection, comprising ~5,000 isogenic strains each deleted for exactly one open reading frame. Systematic screening of the YKO collection led to ~500 publications describing ~14,500 phenotypes capturing nearly every aspect of yeast biology. Yet, integrative analyses of this rich data source have been virtually impossible due to the lack of a central repository and consistent meta-data annotations. Here, we describe the aggregation, harmonization and analysis of all published phenotypic screens of the YKO collection, which we refer to as the Yeast Phenome (www.yeastphenome.org). To demonstrate the power of data integration and illustrate how much it facilitates the generation of testable hypotheses, we present three discoveries uniquely enabled by Yeast Phenome. First, we use the variation in the number of phenotypes per gene to identify tryptophan homeostasis as a central point of vulnerability to a wide range of chemical compounds, including FDA-approved drugs. Second, using phenotypic profiles as a tool for predicting gene function, we identify and validate the role ofYHR045Was a novel regulator of ergosterol biosynthesis and DNA damage response, andYGL117Was a new member of the aromatic amino acid biosynthesis pathway. Finally, we describe a surprising exponential relationship between phenotypic similarity and intergenic distance in both yeast and human genomes. This relationship, which stretches as far as 380 kb in yeast and 100 Mb in humans, suggests that gene positions are optimized for function to a much greater extent than appreciated previously. Overall, we show that Yeast Phenome enables systematic enquiries into the nature of gene-gene and gene-phenotype relationships and is an important new resource for systems biology.
Publisher
Cold Spring Harbor Laboratory