Affiliation:
1. Université Grenoble Alpes, Inria, CNRS, Grenoble INP, LJK, Grenoble, France
Abstract
Abstract
Motivation
Protein model quality assessment (QA) is a crucial and yet open problem in structural bioinformatics. The current best methods for single-model QA typically combine results from different approaches, each based on different input features constructed by experts in the field. Then, the prediction model is trained using a machine-learning algorithm. Recently, with the development of convolutional neural networks (CNN), the training paradigm has changed. In computer vision, the expert-developed features have been significantly overpassed by automatically trained convolutional filters. This motivated us to apply a three-dimensional (3D) CNN to the problem of protein model QA.
Results
We developed Ornate (Oriented Routed Neural network with Automatic Typing)—a novel method for single-model QA. Ornate is a residue-wise scoring function that takes as input 3D density maps. It predicts the local (residue-wise) and the global model quality through a deep 3D CNN. Specifically, Ornate aligns the input density map, corresponding to each residue and its neighborhood, with the backbone topology of this residue. This circumvents the problem of ambiguous orientations of the initial models. Also, Ornate includes automatic identification of atom types and dynamic routing of the data in the network. Established benchmarks (CASP 11 and CASP 12) demonstrate the state-of-the-art performance of our approach among single-model QA methods.
Availability and implementation
The method is available at https://team.inria.fr/nano-d/software/Ornate/. It consists of a C++ executable that transforms molecular structures into volumetric density maps, and a Python code based on the TensorFlow framework for applying the Ornate model to these maps.
Supplementary information
Supplementary data are available at Bioinformatics online.
Funder
L’Agence Nationale de la Recherche
Publisher
Oxford University Press (OUP)
Subject
Computational Mathematics,Computational Theory and Mathematics,Computer Science Applications,Molecular Biology,Biochemistry,Statistics and Probability
Reference32 articles.
1. Tensorflow: a system for large-scale machine learning;Abadi,2016
2. Protein single-model quality assessment by feature-based probability density functions;Cao;Sci. Rep.,2016
3. DeepQA: improving the estimation of single protein model quality with deep belief networks;Cao;BMC Bioinform.,2016
4. Fast and accurate deep network learning by exponential linear units (elus);Clevert;International Conf. on Learning Representations,2016
5. Assessment of predictions in the model quality assessment category;Cozzetto;ProteinsStruct. Funct. Bioinform.,2007
Cited by
82 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献