Improved DNA-versus-Protein Homology Search for Protein Fossils

Published: 2021-01-26

Author:

Yao Yin, Frith Martin C.

Abstract

Protein fossils, i.e. noncoding DNA descended from coding DNA, arise frequently from transposable elements (TEs), decayed genes, and viral integrations. They can reveal, and mislead about, evolutionary history and relationships. They have been detected by comparing DNA to protein sequences, but current methods are not optimized for this task. We describe a powerful DNA-protein homology search method. We use a 64×21 substitution matrix, which is fitted to sequence data, automatically learning the genetic code. We detect subtly homologous regions by considering alternative possible alignments between them, and calculate significance (probability of occurring by chance between random sequences). Our method detects TE protein fossils much more sensitively than blastx, and > 10× faster. Of the ~7 major categories of eukaryotic TE, three have not been found in mammals: we find two of them in the human genome, polinton and DIRS/Ngaro. This method increases our power to find ancient fossils, and perhaps to detect non-standard genetic codes. The alternative-alignments and significance paradigm is not specific to DNA-protein comparison, and could benefit homology search generally.
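To make the idea of a 64×21 substitution matrix concrete, the following is a minimal sketch of how codon-versus-protein-letter scores could be derived as log-odds ratios from counts of aligned pairs. It is not the authors' fitting procedure, which the abstract does not specify. The function name `log_odds_matrix`, the pseudocount, the toy counts, and the assumption that the 21 columns are the 20 amino acids plus a stop symbol `*` are all illustrative choices.

```python
import math
from itertools import product

BASES = "ACGT"
# All 64 codons: the rows of the matrix.
CODONS = ["".join(p) for p in product(BASES, repeat=3)]
# Assumption: 21 columns = 20 amino acids plus a stop symbol '*'.
AMINO = "ACDEFGHIKLMNPQRSTVWY*"


def log_odds_matrix(pair_counts, pseudocount=1.0):
    """Return {(codon, aa): log2 odds score} from aligned-pair counts.

    pair_counts maps (codon, aa) to how often that codon was seen
    aligned to that protein letter. The pseudocount (hypothetical
    smoothing choice) keeps unseen pairs from scoring minus infinity.
    """
    smoothed = {
        (c, a): pair_counts.get((c, a), 0) + pseudocount
        for c in CODONS
        for a in AMINO
    }
    total = sum(smoothed.values())
    joint = {pair: n / total for pair, n in smoothed.items()}

    # Marginal (background) frequencies of each codon and protein letter.
    codon_freq = {c: sum(joint[(c, a)] for a in AMINO) for c in CODONS}
    aa_freq = {a: sum(joint[(c, a)] for c in CODONS) for a in AMINO}

    # Score = log2( P(codon, aa) / (P(codon) * P(aa)) ).
    return {
        (c, a): math.log2(joint[(c, a)] / (codon_freq[c] * aa_freq[a]))
        for c in CODONS
        for a in AMINO
    }


# Toy counts, invented purely for illustration.
toy_counts = {("ATG", "M"): 50, ("ATG", "L"): 2}
scores = log_odds_matrix(toy_counts)
```

With counts like these, a codon that is frequently aligned to a particular amino acid receives a positive score for that pairing, which is the sense in which a matrix fitted to data can reflect the genetic code without being told it.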

Publisher

Cold Spring Harbor Laboratory

Cited by 1 article.

1. An immune-suppressing protein in human endogenous retroviruses;2022-11-04