Deep learning framework with interpretable feature selection for accurate SUMOylation site prediction

具有可解释特征选择的深度学习框架,用于精确预测SUMO化位点

阅读:1

Abstract

Small ubiquitin-like modifiers (SUMOs) are crucial protein regulators influencing diverse biological processes through covalent modifications or non-covalent interactions. SUMOylation, a key post-translational modification (PTM), plays a vital role in cellular regulation. This study presents Hybrid-Sumo, a deep learning-based model integrating protein structural and sequence features to predict SUMOylation sites. Hybrid-Sumo combines three advanced feature extraction techniques: Half-Sphere Exposure (HSE), Position-Specific Scoring Matrix with Discrete Wavelet Transform (PSSM-DWT), and Bidirectional Encoder Representations from Transformers (BERT). The SHapley Additive exPlanations (SHAP) algorithm is employed for optimal feature selection, while a Deep Neural Network (DNN) serves as the classification model. Extensive 10-fold cross-validation confirms the effectiveness of Hybrid-Sumo, achieving 99.74% accuracy on benchmark datasets and 96.15% and 95.83% on balanced and imbalanced independent datasets, respectively. These results surpass existing models, improving training accuracy by 1.45% and testing accuracy (both balanced and imbalanced) by 1.90% and 0.25%, respectively. These findings highlight Hybrid-Sumo as a robust computational tool for accurate prediction of SUMOylation sites, accelerating research on protein function and modification analysis.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。