Affiliation:
1. Toyota Technological Institute at Chicago and Massachusetts Institute of Technology, Chicago, IL
2. Massachusetts Institute of Technology, Cambridge, MA
Abstract
This article studies the protein side-chain packing problem using the tree-decomposition of a protein structure. To obtain fast and accurate protein side-chain packing, protein structures are modeled using a geometric neighborhood graph, which can be easily decomposed into smaller blocks. Therefore, the side-chain assignment of the whole protein can be assembled from the assignment of the small blocks. Although we will show that the side-chain packing problem is still
NP
-hard, we can achieve a tree-decomposition-based globally optimal algorithm with time complexity of
O
(
Nn
rot
tw
+ 1)
and several polynomial-time approximation schemes (PTAS), where
N
is the number of residues contained in the protein,
n
rot
the average number of rotamers for each residue, and
tw
=
O
(
N
2/3
log
N
) the treewidth of the protein structure graph. Experimental results indicate that after Goldstein dead-end elimination is conducted,
n
rot
is very small and
tw
is equal to 3 or 4 most of the time. Based on the globally optimal algorithm, we developed a protein side-chain assignment program TreePack, which runs up to 90 times faster than SCWRL 3.0, a widely-used side-chain packing program, on some large test proteins in the SCWRL benchmark database and an average of five times faster on all the test proteins in this database. There are also some real-world instances that TreePack can solve but that SCWRL 3.0 cannot. The TreePack program is available at http://ttic.uchicago.edu/~jinbo/TreePack.htm.
Publisher
Association for Computing Machinery (ACM)
Subject
Artificial Intelligence,Hardware and Architecture,Information Systems,Control and Systems Engineering,Software
Reference43 articles.
1. Akutsu T. 1997. NP-hardness results for protein side-chain packing. In Genome Informatics 8 S. Miyano and T. Takagi Eds. 180--186.]] Akutsu T. 1997. NP-hardness results for protein side-chain packing. In Genome Informatics 8 S. Miyano and T. Takagi Eds. 180--186.]]
2. Complexity of Finding Embeddings in a k-Tree
3. An optimal algorithm for approximate nearest neighbor searching fixed dimensions
Cited by
69 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献