Affiliation:
1. Department of Biochemistry and Biophysics and Center for Advanced Biomolecular Research, Texas A&M University, College Station, Texas 77843-2128
Abstract
ABSTRACT
Self-assembling proteins and protein fragments encoded by the
Escherichia coli
genome were identified from
E. coli
K-12 strain MG1655. Libraries of random DNA fragments cloned into a series of λ repressor fusion vectors were subjected to selection for immunity to infection by phage λ. Survivors were identified by sequencing the ends of the inserts, and the fused protein sequence was inferred from the known genomic sequence. Four hundred sixty-three nonredundant open reading frame-encoded interacting sequence tags (ISTs) were recovered from sequencing 2,089 candidates. These ISTs, which range from 16 to 794 amino acids in length, were clustered into families of overlapping fragments, identifying potential homotypic interactions encoded by 232
E. coli
genes. Repressor fusions identified ISTs from genes in every protein-based functional category, but membrane proteins were underrepresented. The IST-containing genes were enriched for regulatory proteins and for proteins that form higher-order oligomers. Forty-eight (20.7%) homotypic proteins identified by ISTs are predicted to contain coiled coils. Although most of the IST-containing genes are identifiably related to proteins in other bacterial genomes, more than half of the ISTs do not have identifiable homologs in the Protein Data Bank, suggesting that they may include many novel structures. The data are available online at
http://oligomers.tamu.edu/
.
Publisher
American Society for Microbiology
Subject
Molecular Biology,Microbiology
Cited by
13 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献