SGAC: a graph neural network framework for imbalanced and structure-aware AMP classification

Abstract

Classifying antimicrobial peptides (AMPs) from the vast collection of peptides derived from metagenomic sequencing offers a promising avenue for combating antibiotic resistance. However, most existing AMP classification methods rely primarily on sequence-based representations and fail to capture the spatial structural information critical for accurate identification. Although recent graph-based approaches attempt to incorporate structural information, they typically construct residue- or atom-level graphs that introduce redundant atomic details and increase structural complexity. Furthermore, the class imbalance between the small number of known AMPs and the abundant non-AMPs significantly hinders predictive performance. To address these challenges, we employ the lightweight OmegaFold to predict the 3D structures of peptides and construct peptide graphs from C$_\alpha$ atoms to capture their backbone geometry and spatial topology. Building on this representation, we propose the spatial graph neural network (GNN)-based AMP classifier (SGAC), a novel framework that leverages GNNs to extract structural features and generate discriminative graph representations. To handle class imbalance, SGAC incorporates two components. Weight-enhanced contrastive learning uses adaptive weighting to cluster structurally similar peptides and separate dissimilar ones. Weight-enhanced pseudo-label distillation generates high-confidence pseudo labels for unlabeled samples. Together, they yield balanced and consistent representation learning. Experiments on publicly available AMP and non-AMP datasets demonstrate that SGAC achieves state-of-the-art performance compared with baseline methods. The complete code and dataset are available at: https://github.com/wyxwyx46941930/SGAC.
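As a rough illustration of the graph construction step described in the abstract, the sketch below builds a contact graph over C$_\alpha$ coordinates. It is not the authors' implementation: the abstract does not state how edges are defined, so the distance-cutoff rule, the 8 Å default, and the function name `build_ca_graph` are all assumptions made for this example. The coordinates would in practice come from an OmegaFold-predicted structure, and parsing that output is not shown here.

```python
import numpy as np


def build_ca_graph(ca_coords, cutoff=8.0):
    """Build a C-alpha contact graph (hypothetical helper, not from SGAC).

    ca_coords: array-like of shape (n_residues, 3), one C-alpha position
               per residue, in angstroms.
    cutoff:    assumed distance threshold in angstroms; the source does not
               specify the edge rule, so 8.0 is only an illustrative value.

    Returns (edge_index, edge_dist): a (2, n_edges) integer array of
    directed edges and the matching array of edge lengths.
    """
    coords = np.asarray(ca_coords, dtype=float)
    if coords.ndim != 2 or coords.shape[1] != 3:
        raise ValueError("ca_coords must have shape (n_residues, 3)")

    # Pairwise Euclidean distances between all C-alpha atoms.
    diff = coords[:, None, :] - coords[None, :, :]
    dist = np.linalg.norm(diff, axis=-1)

    # Connect residue pairs within the cutoff, excluding self-loops.
    mask = (dist <= cutoff) & ~np.eye(len(coords), dtype=bool)
    src, dst = np.nonzero(mask)

    edge_index = np.stack([src, dst])
    return edge_index, dist[src, dst]


# Toy input: four pseudo-residues on a straight line, 3.8 angstroms apart.
toy_coords = np.array([
    [0.0, 0.0, 0.0],
    [3.8, 0.0, 0.0],
    [7.6, 0.0, 0.0],
    [11.4, 0.0, 0.0],
])
edge_index, edge_dist = build_ca_graph(toy_coords)
```

Using one node per residue keeps the graph small compared with an all-atom graph, which matches the motivation given in the abstract. The resulting `edge_index` follows the (2, n_edges) layout that common GNN libraries accept, but how SGAC actually feeds the graph to its GNN is not described in the text above.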
