ConforFold recovers alternative protein conformations beyond MSA subsampling

ConforFold能够恢复MSA亚采样之外的其他蛋白质构象

阅读:3

Abstract

Conformational changes underlie many aspects of protein function, yet current structure prediction tools remain limited in their ability to systematically sample structural ensembles. Here, we present ConforPSSP and ConforFold, a combined framework that integrates secondary-structure sampling into a deep learning-based prediction to recover multiple protein conformational states. ConforPSSP employs a transformer model trained on multi-residue fragments to generate diverse 8-state protein secondary structure predictions (PSSPs), which are then used to condition a retrained OpenFold model (ConforFold). ConforFold achieved state-of-the-art performance in conformer recovery. On our test dataset of protein samples with two alternative conformations, it correctly identified both conformers in 84% of cases at TM-scores ≥0.8, outperforming AlphaFlow (75.4%), which uses diffusion-based sampling, and Cfold, which relies on MSA clustering. It outperformed BioEmu, a novel method that emulates MD simulation results, in cases where secondary structures between conformers differ significantly (83% and 76% of cases for ConforFold and BioEmu, respectively). These results establish ConforFold as a broadly applicable framework for modeling structural ensembles. It recovers conformations inaccessible to MSA-based subsampling or diffusion models by explicitly integrating secondary structure, offering a new avenue for investigating conformational heterogeneity, mechanistic transitions, and the structural basis of protein function.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。