Abstract
Structure prediction of protein complexes has improved significantly with AlphaFold2 and AlphaFold-multimer (AFM), but only 60% of dimers are accurately predicted. Here, we learn a bias to the MSA representation that improves the predictions by performing gradient descent through the AFM network. We demonstrate the performance on seven difficult targets from CASP15 and increase the average MMscore to 0.76 compared to 0.63 with AFM. We evaluate the procedure on 487 protein complexes where AFM fails and obtain an increased success rate (MMscore>0.75) of 33% on these difficult targets. Our protocol, AFProfile, provides a way to direct predictions towards a defined target function guided by the MSA. We expect gradient descent over the MSA to be useful for different tasks.
Funder
FP7 Ideas: European Research Council
Berlin Mathematics Research Center MATH+
Deutsche Forschungsgemeinschaft
SciLifeLab & Wallenberg Data Driven Life Science Program
Zentrum für Informationsdienste und Hochleistungsrechnen, Technische Universität Dresden
National Supercomputer Centre, Linköpings Universitet
Publisher
Public Library of Science (PLoS)