Abstract
ABSTRACTMolecular docking excels at creating a plethora of potential models of protein-protein complexes. To correctly distinguish the favourable, native-like models from the remaining ones remains, however, a challenge. We assessed here if a protocol based on molecular dynamics (MD) simulations would allow to distinguish native from non-native models to complement scoring functions used in docking. To this end, first models for 25 protein-protein complexes were generated using HADDOCK. Next, MD simulations complemented with machine learning were used to discriminate between native and non-native complexes based on a combination of metrics reporting on the stability of the initial models. Native models showed higher stability in almost all measured properties, including the key ones used for scoring in the CAPRI competition, namely the positional root mean square deviations and fraction of native contacts from the initial docked model. A Random Forest classifier was trained, reaching 0.85 accuracy in correctly distinguishing native from non-native complexes. Reasonably modest simulation lengths in the order of 50 to 100 ns are already sufficient to reach this accuracy, which makes this approach applicable in practice.
Publisher
Cold Spring Harbor Laboratory
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献