ESMStabP: A Regression Model for Protein Thermostability Prediction

Abstract

Accurately predicting protein thermostability is crucial for numerous applications in biotechnology, pharmaceuticals, and food science. Experimental methods for determining protein melting temperatures are often time-consuming and costly, driving the need for efficient computational alternatives. In this paper, we introduce ESMStabP, an enhanced regression model for predicting protein thermostability. To improve model performance and generalizability, we assembled a significantly larger dataset by combining and cleaning datasets previously used in other thermostability models. Building on DeepStabP, ESMStabP incorporates significant improvements, using embeddings from the ESM2 protein language model and thermophilic classifications. The predictions from ESMStabP outperform those of DeepStabP and other existing predictors, achieving an R² of 0.95 and a Pearson correlation coefficient (PCC) of 0.97. Despite these improvements, challenges such as dataset availability remain. This work underscores the critical role of specific layer identification for model development and outlines potential directions for future advancements in protein stability prediction.
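
For readers unfamiliar with the two reported metrics, the following is a minimal sketch of how R² and the Pearson correlation coefficient are conventionally computed from measured and predicted melting temperatures. The function names and the sample temperatures are hypothetical and chosen only for illustration; they are not taken from the paper, and this is not the authors' evaluation code.

```python
import math


def r_squared(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_true = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


def pearson_cc(y_true, y_pred):
    """Pearson correlation: covariance divided by the product of the spreads."""
    n = len(y_true)
    mean_t = sum(y_true) / n
    mean_p = sum(y_pred) / n
    cov = sum((t - mean_t) * (p - mean_p) for t, p in zip(y_true, y_pred))
    var_t = sum((t - mean_t) ** 2 for t in y_true)
    var_p = sum((p - mean_p) ** 2 for p in y_pred)
    return cov / math.sqrt(var_t * var_p)


# Hypothetical melting temperatures in degrees Celsius (illustrative only,
# not data from the paper).
tm_measured = [45.0, 52.5, 61.0, 70.5, 88.0]
tm_predicted = [47.0, 51.0, 63.5, 69.0, 85.5]

print("R^2:", round(r_squared(tm_measured, tm_predicted), 3))
print("PCC:", round(pearson_cc(tm_measured, tm_predicted), 3))
```

The two metrics answer different questions: PCC measures only how linearly related the predictions are to the measurements, while R² also penalizes systematic offset or scaling errors, so a model can have a high PCC and a noticeably lower R².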