Training bias and sequence alignments shape protein-peptide docking by AlphaFold and related methods

训练偏差和序列比对会影响 AlphaFold 及相关方法的蛋白质-肽对接结果

阅读:1

Abstract

Protein-peptide interactions mediate many biological processes. Accurate structural models of protein-peptide complexes, determined by experiment or computational prediction, are essential for understanding function and designing interaction inhibitors. AlphaFold2-Multimer (AF2-Multimer), AlphaFold3 (AF3), and related models such as Boltz-1 and Chai-1 can predict protein-peptide binding geometry, often with high accuracy. Using a dataset of experimentally resolved structures, we analyzed the performance of these four structure prediction models to understand how they work. We found evidence of bias for previously seen structures, indicating that models struggle to generalize to novel proteins or binding sites. We tested how models use the protein and peptide multiple sequence alignments (MSAs), which are often shallow or of poor quality for peptide sequences. We found weak evidence that models use coevolutionary information from paired MSAs, but both the protein and peptide unpaired MSAs contribute to prediction accuracy. Our work highlights the promise of deep learning for peptide docking and the importance of diverse representation of interface geometries in the training data for optimal prediction performance.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。