DSS-PPI: a self-supervised graph learning framework for protein-protein interaction prediction via multimodal sequence semantics

Abstract

BACKGROUND: Reliable identification of protein-protein interactions (PPIs) is crucial for deciphering cellular functional networks. Current models remain limited in aligning heterogeneous features and in handling sparse supervision signals in graph learning. To address these issues, this study proposes a prediction framework named DSS-PPI. The framework aims to improve prediction performance by integrating multimodal sequence semantics with self-supervised graph learning, thereby transforming static protein sequence embeddings into dynamic, topology-aware representations.

RESULTS: DSS-PPI employs a dual-stream architecture that combines ProTrek’s cross-modal aligned embeddings with ProtT5’s deep sequence features. The study constructs a context encoder that uses Smith-Waterman sequence similarity as a quantitative edge feature to guide graph attention weights, and incorporates Deep Graph Infomax (DGI) for self-supervised pretraining. A gated fusion mechanism then lets the model adaptively integrate sequence semantics with network topological information. Experimental results indicate that the model achieves competitive performance compared with existing state-of-the-art algorithms on both human and multi-species benchmark datasets, with an accuracy of 0.73 on the rigorously designed Bernett test set.

CONCLUSIONS: This study demonstrates the synergistic effect of multimodal embeddings and self-supervised graph learning in PPI prediction. Ablation experiments and SHAP interpretability analysis further support the claim that DSS-PPI captures genuine physical interaction patterns. The framework provides a computational tool for understanding complex biological networks and has potential for biomedical applications.

SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12864-026-12762-3.
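Two ingredients named in the abstract, Smith-Waterman similarity as an edge feature and gated fusion of sequence and topology representations, can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation. The scoring scheme (simple match/mismatch with a linear gap penalty), the normalization by self-alignment scores, the per-dimension sigmoid gate, and all function and parameter names (`smith_waterman_score`, `edge_similarity`, `gated_fusion`, `w_seq`, `w_topo`, `bias`) are assumptions made for illustration; the abstract does not specify the substitution matrix, gap model, or gate parameterization used in DSS-PPI.

```python
import math


def smith_waterman_score(a, b, match=2, mismatch=-1, gap=-2):
    """Best local alignment score between sequences a and b.

    Assumption: simple match/mismatch scoring with a linear gap penalty,
    not a substitution matrix such as BLOSUM62.
    """
    prev = [0] * (len(b) + 1)  # previous row of the DP matrix
    best = 0
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            # Local alignment: a cell never drops below zero.
            curr[j] = max(0, prev[j - 1] + sub, prev[j] + gap, curr[j - 1] + gap)
            best = max(best, curr[j])
        prev = curr
    return best


def edge_similarity(a, b):
    """Hypothetical edge feature in [0, 1].

    Assumption: the raw score is normalized by the geometric mean of the
    two self-alignment scores.
    """
    self_a = smith_waterman_score(a, a)
    self_b = smith_waterman_score(b, b)
    if self_a == 0 or self_b == 0:
        return 0.0
    return smith_waterman_score(a, b) / math.sqrt(self_a * self_b)


def gated_fusion(seq_vec, topo_vec, w_seq, w_topo, bias):
    """Per-dimension gated mix of a sequence vector and a topology vector.

    Assumption: gate_i = sigmoid(w_seq_i * s_i + w_topo_i * t_i + bias_i),
    fused_i = gate_i * s_i + (1 - gate_i) * t_i. In a trained model the
    weights would be learned; here they are passed in by hand.
    """
    fused = []
    for s, t, ws, wt, b in zip(seq_vec, topo_vec, w_seq, w_topo, bias):
        gate = 1.0 / (1.0 + math.exp(-(ws * s + wt * t + b)))
        fused.append(gate * s + (1.0 - gate) * t)
    return fused
```

In a graph attention setting, a value such as `edge_similarity(p, q)` would be attached to the edge between proteins `p` and `q` so that attention can weigh neighbors by sequence similarity. With all gate weights and biases set to zero the gate is 0.5 everywhere, so `gated_fusion` reduces to a plain average of the two vectors.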
