Abstract
AbstractBased on the expression blueprint encoded in the genome, three groups of sequences have been identified – protein encoding, RNA encoding, and non-expressing. We asked: Why did nature choose a particular DNA sequence for expression? Did she sample every possibility, approving some for RNA synthesis, some for protein synthesis, and retiring/ignoring the rest. If evolution randomly selected sequences for metabolic trials, how much non-utilized (not-expressing) and under-utilized (only RNA encoding) information is currently available for innovations? These questions lead us to experimentally synthesizing functional proteins from intergenic sequences of E.coli (Dhar et al 2009). The current work is an extension of this original report and takes into consideration natural protein-coding sequences ‘read backward’ to generate a new possibility. Reverse proteins are full-length ‘translation equivalents’ of the existing protein-coding genes read in the -1 frame. The structural, functional and interaction predictions of reverse proteins inE.coli, S.cerevisiaeandD.melanogaster, open up a new opportunity of producing ‘first-in-the-class’ proteins towards functional endpoints. This study points to a large untapped genomic space from the fundamental biology and applications perspectives.
Publisher
Cold Spring Harbor Laboratory
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献