Artificial intelligence-driven framework for discovering synthetic binding protein-like scaffolds from the entire protein universe

利用人工智能驱动的框架,从整个蛋白质宇宙中发现合成结合蛋白样支架

阅读:1

Abstract

Compared to traditional sequence-based methods, artificial intelligence (AI) approaches offer distinct advantages, such as significantly improved structural recognition efficiency and the ability to overcome inherent limitations of sequence alignment. Here, we introduce an AI-driven framework designed to discover synthetic binding proteins (SBPs)-like scaffolds from the entire known proteome. The framework integrates a deep learning-based FoldSeek with our in-house developed holistic protein attributes assessment (HP2A) algorithm, and enables subsequent protein function annotation and evolutionary analysis. As a proof-of-concept, four representative SBPs, including Affibody, Anticalin, DARPin, and Fynome, were used as query to discover SBP-like scaffolds. The results demonstrate that some of the identified SBP-like proteins, despite their low sequence similarity (identity ≤0.3), exhibit significant structural resemblance to the templates (template modeling score (TM-score) ≥ 0.5), highlighting the large sequence space available within specific protein scaffold. Statistical analysis identifies key biophysical properties that contribute to privileged scaffold functionality. Additionally, evolutionary insights derived from potential SBP-like scaffolds provide valuable guidance for protein binder design, as validated through targeted sequence analysis and in silico site-directed mutagenesis. This work highlights the potential of our framework to facilitate the discovery of high-quality engineered protein scaffolds, paving the way for the development of novel SBPs.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。