Increasing the fraction of correct solutions in ensembles of protein-protein docking models by an iterative consensus algorithm

通过迭代共识算法提高蛋白质-蛋白质对接模型集合中正确解的比例

阅读:1

Abstract

Protein-protein interactions play a pivotal role in numerous biological processes, making their structural prediction essential for molecular biology and medicine. Computational docking methods generate ensembles of models for protein complexes, but identifying correct solutions within these ensembles remains a challenge. This study introduces Iter-CONSRANK, an iterative consensus-based scoring algorithm designed to enrich the fraction of correct models in such ensembles. Building on the established CONSRANK algorithm, Iter-CONSRANK incorporates iterative filtering to progressively discard lower-ranked models based on inter-residue contact conservation, thus refining the ensemble. Performance evaluation using the 3K-BM5up dataset (~1.6 × 10(5) models), 30K-BM5 dataset (~6.4 × 10(6) models), and the CAPRI-derived Score_set (~2.0 × 10(4) models) demonstrated significant improvement in the proportion of correct solutions across model difficulty categories. Iter-CONSRANK effectively identified correct models within the top-ranking positions, outperforming over 150 other scoring functions. For moderately challenging targets, the algorithm enriched correct solutions by up to eightfold, making subsequent analyses more straightforward. The software is publicly available, enabling its application for pre-processing docking ensembles or independent scoring. Iter-CONSRANK is a promising tool for advancing the accuracy of protein-protein docking model evaluations.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。