Predicting essential genes in fungal genomes-Reference-Cited by-同舟云学术

Predicting essential genes in fungal genomes

Published:2006-08-09 Issue:9 Volume:16 Page:1126-1135
ISSN:1088-9051
Container-title:Genome Research
language:en
Short-container-title:Genome Res.

Author:

Seringhaus Michael,Paccanaro Alberto,Borneman Anthony,Snyder Michael,Gerstein Mark

Abstract

Essential genes are required for an organism's viability, and the ability to identify these genes in pathogens is crucial to directed drug development. Predicting essential genes through computational methods is appealing because it circumvents expensive and difficult experimental screens. Most such prediction is based on homology mapping to experimentally verified essential genes in model organisms. We present here a different approach, one that relies exclusively on sequence features of a gene to estimate essentiality and offers a promising way to identify essential genes in unstudied or uncultured organisms. We identified 14 characteristic sequence features potentially associated with essentiality, such as localization signals, codon adaptation, GC content, and overall hydrophobicity. Using the well-characterized baker's yeast Saccharomyces cerevisiae, we employed a simple Bayesian framework to measure the correlation of each of these features with essentiality. We then employed the 14 features to learn the parameters of a machine learning classifier capable of predicting essential genes. We trained our classifier on known essential genes in S. cerevisiae and applied it to the closely related and relatively unstudied yeast Saccharomyces mikatae. We assessed predictive success in two ways: First, we compared all of our predictions with those generated by homology mapping between these two species. Second, we verified a subset of our predictions with eight in vivo knockouts in S. mikatae, and we present here the first experimentally confirmed essential genes in this species.

Publisher

Cold Spring Harbor Laboratory

Subject

Genetics(clinical),Genetics

Reference39 articles.

1. A genome-based approach for the identification of essential bacterial genes

2. Random forests. Mach;Breiman;Learn.,2001

3. Concordance analysis of microbial genomes

4. Chalker A. Lunsford R. (2002) Rational identification of new antibacterial drug targets that are essential for viability using a genomics-based approach. Pharmacol. Ther. 95, 1.

Cited by 103 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. RFEM: A framework for essential microRNA identification in mice based on rotation forest and multiple feature fusion;Computers in Biology and Medicine;2024-03

2. Essential genes identification model based on sequence feature map and graph convolutional neural network;BMC Genomics;2024-01-10

3. Leveraging Random Forest and Graph-based Centralities to Predict Yeast Essential Genes;2023 IEEE International Conference on Bioinformatics and Biomedicine (BIBM);2023-12-05

4. Whole-genome sequencing and functional annotation of pathogenic Paraconiothyrium brasiliense causing human cellulitis;Human Genomics;2023-07-17

5. Essential Genes Identification Model Based on Sequence Feature Map and Graph Convolutional Neural Network;2023-07-03