SAGE-Prot: scoring-assisted generative exploration for multi-objective protein design

Abstract

Designing proteins with multiple optimized properties remains a fundamental challenge in biotechnology, especially when design objectives exhibit trade-offs or when structural templates are unavailable. We present scoring-assisted generative exploration for proteins (SAGE-Prot), a modular and extensible protein design framework that integrates autoregressive sequence generation, genetic algorithm (GA)-based diversification, and scoring-guided property evaluation in a closed-loop optimization process. Unlike conventional approaches, SAGE-Prot performs optimization directly at the sequence level without relying on structural templates for generation, while still enabling structure-aware evaluation. Across rediscovery and similarity benchmarks involving 10 therapeutic proteins, the hybrid language model/GA strategies implemented in SAGE-Prot consistently outperformed language model-only and heuristic baselines. We applied SAGE-Prot to two design problems: optimization of protein G domain B1 for binding affinity and thermal stability, and optimization of TEM-1 β-lactamase for enzymatic activity and solubility. In both cases, SAGE-Prot effectively identified high-performing variants, guided by predictive models trained on diverse sequence- and structure-derived descriptors. A curriculum learning (CL) strategy further accelerated convergence and improved design quality. Notably, experimental validation of six SAGE-Prot-designed TEM-1 β-lactamase variants confirmed up to a 752-fold increase in catalytic activity, underscoring the practical utility of this generative framework. These results highlight how coupling deep generative modeling with structure-informed evaluation and iterative fine-tuning enables generalizable, data-driven protein engineering across diverse optimization landscapes.
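The closed loop described in the abstract (generate, diversify with a GA, score, then update the generator on the best candidates) can be illustrated with a toy sketch. This is not the SAGE-Prot implementation: the per-position weight table stands in for the autoregressive language model, `update` stands in for fine-tuning, and `toy_score`, its two objectives, and all parameter values are hypothetical choices made only so the example runs on the standard library.

```python
import random

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

def toy_score(seq, target):
    """Hypothetical two-objective score (assumption, not from the paper):
    identity to a target sequence plus a charged-residue balance term."""
    identity = sum(a == b for a, b in zip(seq, target)) / len(target)
    charged = sum(c in "DEKR" for c in seq) / len(seq)
    balance = 1.0 - abs(charged - 0.25)  # arbitrary illustrative objective
    return 0.5 * identity + 0.5 * balance

def sample(profile, rng):
    """Stand-in for the generator: draw each position from a weight table."""
    return "".join(rng.choices(AMINO_ACIDS, weights=w)[0] for w in profile)

def mutate(seq, rng):
    """GA point mutation at one random position."""
    pos = rng.randrange(len(seq))
    return seq[:pos] + rng.choice(AMINO_ACIDS) + seq[pos + 1:]

def crossover(a, b, rng):
    """GA single-point crossover between two equal-length parents."""
    cut = rng.randrange(1, len(a))
    return a[:cut] + b[cut:]

def update(profile, elites, lr=0.5):
    """Stand-in for fine-tuning: shift weights toward residues seen in elites."""
    for i, weights in enumerate(profile):
        for j, aa in enumerate(AMINO_ACIDS):
            freq = sum(s[i] == aa for s in elites) / len(elites)
            weights[j] = (1 - lr) * weights[j] + lr * freq + 1e-3
    return profile

def closed_loop(target, rounds=30, n_samples=40, n_elite=10, seed=0):
    rng = random.Random(seed)
    profile = [[1.0] * len(AMINO_ACIDS) for _ in target]
    elites, best_history = [], []
    for _ in range(rounds):
        # 1) generation
        pool = [sample(profile, rng) for _ in range(n_samples)]
        # 2) GA-based diversification of the current elites
        if elites:
            pool += [mutate(rng.choice(elites), rng) for _ in range(n_samples // 2)]
            pool += [crossover(rng.choice(elites), rng.choice(elites), rng)
                     for _ in range(n_samples // 2)]
        pool += elites  # keep prior elites so the best score never decreases
        # 3) scoring and selection (deduplicate while preserving order)
        unique = list(dict.fromkeys(pool))
        ranked = sorted(unique, key=lambda s: toy_score(s, target), reverse=True)
        elites = ranked[:n_elite]
        # 4) feed the elites back into the generator
        profile = update(profile, elites)
        best_history.append(toy_score(elites[0], target))
    return elites, best_history

if __name__ == "__main__":
    elites, history = closed_loop("MKTAYIAKQRQI")
    print(elites[0], round(history[-1], 3))
```

In this sketch the multi-objective score is collapsed into a single weighted sum for simplicity; a real pipeline would replace `toy_score` with trained property predictors and could rank candidates by Pareto dominance instead.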
