Author:
Sperschneider Jana,Dodds Peter N.,Gardiner Donald M.,Singh Karam B.,Taylor Jennifer M.
Abstract
AbstractPlant-pathogenic fungi secrete effector proteins to facilitate infection. We describe extensive improvements to EffectorP, the first machine learning classifier for fungal effector prediction. EffectorP 2.0 is now trained on a larger set of effectors and utilizes a different approach based on an ensemble of classifiers trained on different subsets of negative data, offering different views on classification. EffectorP 2.0 achieves accuracy of 89%, compared to 82% for EffectorP 1.0 and 59.8% for a small size classifier. Important features for effector prediction appear to be protein size, protein net charge as well as the amino acids serine and cysteine. EffectorP 2.0 decreases the number of predicted effectors in secretomes of fungal plant symbionts and saprophytes by 40% when compared to EffectorP 1.0. However, EffectorP 1.0 retains value and combining EffectorP 1.0 and 2.0 results in a stringent classifier with low false positive rate of 9%. EffectorP 2.0 predicts significant enrichments of effectors in 12 out of 13 sets of infection-induced proteins from diverse fungal pathogens, whereas a small cysteine-rich classifier detects enrichment only in 7 out of 13. EffectorP 2.0 will fast-track prioritization of high-confidence effector candidates for functional validation and aid in improving our understanding of effector biology. EffectorP 2.0 is available at http://effectorp.csiro.au.
Publisher
Cold Spring Harbor Laboratory
Reference86 articles.
1. Genomic survey of the non-cultivatable opportunistic human pathogen, Enterocytozoon bieneusi;PLoSPathog,2009
2. Genomic Analysis of the Necrotrophic Fungal Pathogens Sclerotinia sclerotiorum and Botrytis cinerea
3. Comparative genomics of citric-acid-producing Aspergillus niger ATCC 1015 versus enzyme-producing CBS 513.88
4. Improved Prediction of Signal Peptides: SignalP 3.0
5. Bing L , Yang D , Li XL , Lee WS , Yu PS . 2003. Building text classifiers using positive and unlabeled examples. Third leee International Conference on Data Mining, Proceedings: 179–186.
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献