Comparison of codon usage measures and their applicability in prediction of microbial gene expressivity-Reference-Cited by-同舟云学术

Comparison of codon usage measures and their applicability in prediction of microbial gene expressivity

Published:2005-07-19 Issue:1 Volume:6 Page:
ISSN:1471-2105
Container-title:BMC Bioinformatics
language:en
Short-container-title:BMC Bioinformatics

Author:

Supek Fran,Vlahoviček Kristian

Abstract

Abstract Background There are a number of methods (also called: measures) currently in use that quantify codon usage in genes. These measures are often influenced by other sequence properties, such as length. This can introduce strong methodological bias into measurements; therefore we attempted to develop a method free from such dependencies. One of the common applications of codon usage analyses is to quantitatively predict gene expressivity. Results We compared the performance of several commonly used measures and a novel method we introduce in this paper – Measure Independent of Length and Composition (MILC). Large, randomly generated sequence sets were used to test for dependence on (i) sequence length, (ii) overall amount of codon bias and (iii) codon bias discrepancy in the sequences. A derivative of the method, named MELP (MILC-based Expression Level Predictor) can be used to quantitatively predict gene expression levels from genomic data. It was compared to other similar predictors by examining their correlation with actual, experimentally obtained mRNA or protein abundances. Conclusion We have established that MILC is a generally applicable measure, being resistant to changes in gene length and overall nucleotide composition, and introducing little noise into measurements. Other methods, however, may also be appropriate in certain applications. Our efforts to quantitatively predict gene expression levels in several prokaryotes and unicellular eukaryotes met with varying levels of success, depending on the experimental dataset and predictor used. Out of all methods, MELP and Rainer Merkl's GCB method had the most consistent behaviour. A 'reference set' containing known ribosomal protein genes appears to be a valid starting point for a codon usage-based expressivity prediction.

Publisher

Springer Science and Business Media LLC

Subject

Applied Mathematics,Computer Science Applications,Molecular Biology,Biochemistry,Structural Biology

Link

https://link.springer.com/content/pdf/10.1186/1471-2105-6-182.pdf

Reference62 articles.

1. Ikemura T: Correlation between the abundance of Escherichia coli transfer RNAs and the occurrence of the respective codons in its protein genes: a proposal for a synonymous codon choice that is optimal for the E. coli translational system. J Mol Biol 1981, 151(3):389–409. 10.1016/0022-2836(81)90003-6

2. Grantham R, Gautier C, Gouy M, Jacobzone M, Mercier R: Codon catalog usage is a genome strategy modulated for gene expressivity. Nucleic Acids Res 1981, 9(1):r43–74. 10.1093/nar/9.1.213-b

3. Gouy M, Gautier C: Codon usage in bacteria: correlation with gene expressivity. Nucleic Acids Res 1982, 10(22):7055–7074. 10.1093/nar/10.22.7055

4. Hooper SD, Berg OG: Gradients in nucleotide and codon usage along Escherichia coli genes. Nucleic Acids Res 2000, 28(18):3517–3523. 10.1093/nar/28.18.3517

5. Ikemura T: Codon usage and tRNA content in unicellular and multicellular organisms. Mol Biol Evol 1985, 2(1):13–34.

Cited by 112 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Environmental implications of codon usage bias in Crocus sativus and its impact on host pathogen interactions;Rhizosphere;2024-03

2. A Codon Usage Shift of Selecting A/T-Ending Optimal Codons in Hexaploid Actinidia Deliciosa is Likely from Environmental Adaptation;2024

3. Putative novel hydrogen- and iron-oxidizing sheath-producing Zetaproteobacteria thrive at the Fåvne deep-sea hydrothermal vent field;mSystems;2023-12-21

4. Exploring the relationship between codon usage and gene expression in the Meloidogyne incognita genome: Implications for environmental adaptability;Gene Reports;2023-12

5. System-wide analysis of RNA and protein subcellular localization dynamics;Nature Methods;2023-11-30